TY - JOUR
T1 - Inhalation Toxicity Screening of Consumer Products Chemicals using OECD Test Guideline Data-based Machine Learning Models
AU - Kim, Donghyeon
AU - Cho, Soyoung
AU - Jeon, Jong June
AU - Choi, Jinhee
N1 - Publisher Copyright:
© 2024 The Authors
PY - 2024/10/5
Y1 - 2024/10/5
N2 - This study aimed to screen the inhalation toxicity of chemicals found in consumer products such as air fresheners, fragrances, and anti-fogging agents submitted to K-REACH using machine learning models. We manually curated inhalation toxicity data based on OECD test guideline 403 (Acute inhalation), 412 (Sub-acute inhalation), and 413 (Sub-chronic inhalation) for 1709 chemicals from the OECD eChemPortal database. Machine learning models were trained using ten algorithms, along with four molecular fingerprints (MACCS, Morgan, Topo, RDKit) and molecular descriptors, achieving F1 scores ranging from 51 % to 91 % in test dataset. Leveraging the high-performing models, we conducted a virtual screening of chemicals, initially applying them to data-rich chemicals generally used in occupational settings to determine the prediction uncertainty. Results showed high sensitivity (75 %) but low specificity (23 %), suggesting that our models can contribute to conservative screening of chemicals. Subsequently, we applied the models to consumer product chemicals, identifying 79 as of high concern. Most of the prioritized chemicals lacked GHS classifications related to inhalation toxicity, even though they were predicted to be used in many consumer products. This study highlights a potential regulatory blind spot concerning the inhalation risk of consumer product chemicals while also indicating the potential of artificial intelligence (AI) models to aid in prioritizing chemicals at the screening level.
AB - This study aimed to screen the inhalation toxicity of chemicals found in consumer products such as air fresheners, fragrances, and anti-fogging agents submitted to K-REACH using machine learning models. We manually curated inhalation toxicity data based on OECD test guideline 403 (Acute inhalation), 412 (Sub-acute inhalation), and 413 (Sub-chronic inhalation) for 1709 chemicals from the OECD eChemPortal database. Machine learning models were trained using ten algorithms, along with four molecular fingerprints (MACCS, Morgan, Topo, RDKit) and molecular descriptors, achieving F1 scores ranging from 51 % to 91 % in test dataset. Leveraging the high-performing models, we conducted a virtual screening of chemicals, initially applying them to data-rich chemicals generally used in occupational settings to determine the prediction uncertainty. Results showed high sensitivity (75 %) but low specificity (23 %), suggesting that our models can contribute to conservative screening of chemicals. Subsequently, we applied the models to consumer product chemicals, identifying 79 as of high concern. Most of the prioritized chemicals lacked GHS classifications related to inhalation toxicity, even though they were predicted to be used in many consumer products. This study highlights a potential regulatory blind spot concerning the inhalation risk of consumer product chemicals while also indicating the potential of artificial intelligence (AI) models to aid in prioritizing chemicals at the screening level.
KW - Artificial Intelligence
KW - Consumer Products Chemicals
KW - Inhalation Toxicity
KW - Quantitative Structure Activity Relationship
KW - Toxicity Prediction
UR - http://www.scopus.com/inward/record.url?scp=85201514325&partnerID=8YFLogxK
U2 - 10.1016/j.jhazmat.2024.135446
DO - 10.1016/j.jhazmat.2024.135446
M3 - Article
C2 - 39154469
AN - SCOPUS:85201514325
SN - 0304-3894
VL - 478
JO - Journal of Hazardous Materials
JF - Journal of Hazardous Materials
M1 - 135446
ER -