GIG OpenIR  > 有机地球化学国家重点实验室
Han, Min1,2,3; Jin, Biao1,2,3; Liang, Jun4; Huang, Chen1,2,3; Arp, Hans Peter H.5,6
Developing machine learning approaches to identify candidate persistent, mobile and toxic (PMT) and very persistent and very mobile (vPvM) substances based on molecular structure
Source PublicationWATER RESEARCH
ISSN0043-1354
2023-10-01
Volume244Pages:12
DOI10.1016/j.watres.2023.120470
Language英语
WOS Research AreaEngineering ; Environmental Sciences & Ecology ; Water Resources
AbstractDetermining which substances on the global market could be classified as persistent, mobile and toxic (PMT) substances or very persistent, very mobile (vPvM) substances is essential to prevent or reduce drinking water contamination from them. This study developed machine learning models based on different molecular de-scriptors (MDs) and defined applicability domains for the screening of PMT/vPvM substances. The models were trained with 3111 substances with expert weight-of-evidence based PMT/vPvM hazard classifications that considered the highest quality data available. The model was based on the hypothesis that PMT/vPvM substances contain similar MDs, representative of chemical structures resistant to degradation, be associated with low sorption (or high-water solubility) and in some cases be associated with known toxic mechanisms. All possible model combinations were tested by integrating different molecular description methods, data balancing stra-tegies and machine learning algorithms. Our model allows one-step prediction of candidate PMT/vPvM sub-stances, and our method was compared with the approach predicting P, M and T separately (i.e. three-step prediction). The results showed that the one-step model achieved a higher accuracy of 92% for PMT/vPvM identification (i.e. positive samples) for an internal test set, and also resulted in a higher accuracy of 90% for an external test set of chemical pollutants detected in Taihu Lake, China. Furthermore, prediction mechanism of the model was interpreted by Shapley additive explanations (SHAP). This work presents an advance of big data in silico screening models for the identification of substances that potentially meet the PMT/vPvM criteria.
KeywordPollutants Water quality Machine learning Chemicals
WOS IDWOS:001063597800001
Indexed BySCI
Citation statistics
Cited Times:7[WOS]   [WOS Record]     [Related Records in WOS]
Document Type期刊论文
Identifierhttp://ir.gig.ac.cn/handle/344008/74805
Collection有机地球化学国家重点实验室
Corresponding AuthorJin, Biao
Affiliation1.Chinese Acad Sci, Guangzhou Inst Geochem, State Key Lab Organ Geochem, Guangzhou 510640, Peoples R China
2.CAS Ctr Excellence Deep Earth Sci, Guangzhou 510640, Peoples R China
3.Univ Chinese Acad Sci, Beijing 10069, Peoples R China
4.South China Normal Univ, Sch Software, Foshan 528225, Peoples R China
5.Norwegian Geotech Inst NGI, POB 3930 Ullevaal Stad, N-0806 Oslo, Norway
6.Norwegian Univ Sci & Technol NTNU, NO-7491 Trondheim, Norway
Recommended Citation
GB/T 7714
Han, Min,Jin, Biao,Liang, Jun,et al. Developing machine learning approaches to identify candidate persistent, mobile and toxic (PMT) and very persistent and very mobile (vPvM) substances based on molecular structure[J]. WATER RESEARCH,2023,244:12.
APA Han, Min,Jin, Biao,Liang, Jun,Huang, Chen,&Arp, Hans Peter H..(2023).Developing machine learning approaches to identify candidate persistent, mobile and toxic (PMT) and very persistent and very mobile (vPvM) substances based on molecular structure.WATER RESEARCH,244,12.
MLA Han, Min,et al."Developing machine learning approaches to identify candidate persistent, mobile and toxic (PMT) and very persistent and very mobile (vPvM) substances based on molecular structure".WATER RESEARCH 244(2023):12.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Han, Min]'s Articles
[Jin, Biao]'s Articles
[Liang, Jun]'s Articles
Baidu academic
Similar articles in Baidu academic
[Han, Min]'s Articles
[Jin, Biao]'s Articles
[Liang, Jun]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Han, Min]'s Articles
[Jin, Biao]'s Articles
[Liang, Jun]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.