反应性(心理学)
分子描述符
机器学习
化学
人工智能
特征(语言学)
生化工程
数量结构-活动关系
计算机科学
集合(抽象数据类型)
生物系统
卤素
表征(材料科学)
随机森林
化学信息学
羟基自由基
灵敏度(控制系统)
工作(物理)
电子结构
计算化学
稳健性(进化)
钥匙(锁)
接口(物质)
人工神经网络
数据挖掘
合理设计
分子
电子效应
数据集
天然有机质
作者
Shihua Zou,Zonglin Li,Yicen Dai,Jiaxing Miao,Zhiyu Zhao,Lili Hu,Hongying Zhao
标识
DOI:10.1021/acs.est.6c02173
摘要
Understanding hydroxyl radical (HO·) reactivity with organic pollutants is crucial for optimizing advanced oxidation processes in water purification. Machine learning (ML) models have been developed to predict HO· reactivity but often produce black-box results due to complex chemical interplay. Herein, we constructed an interpretable ML framework to unveil the intrinsic molecular factors governing the HO· reactivity with antibiotic contaminants. A comprehensive set of DFT-derived constitutional, quantum-chemical, and Abraham descriptors was first employed to characterize the intricate structural and electronic nature of antibiotics. An attention-driven feature interaction method then engineered feature representations to generate the optimal subset comprising initial and new features. SHapley Additive exPlanations (SHAP) analysis quantified the interpretable contribution of individual features for model output, revealing key molecular properties, such as volume-regulated electronic migration ability (V_VIP) and electronic attraction ability mediated by the number of halogen atoms (#X_ME). Finally, a causal interface ML model was developed to identify cause-and-effect relationships between target variables and intrinsic properties even within small data sets. The optimized random forest model demonstrated high accuracy in predicting HO· reactivity, with experimental validation showing relative errors of below 6%. This work establishes an applicable and robust causal discovery framework for enabling the more rational design of water purification strategies.
科研通智能强力驱动
Strongly Powered by AbleSci AI