不确定度量化
财产(哲学)
稳健性(进化)
标杆管理
计算机科学
人工智能
水准点(测量)
机器学习
化学空间
贝叶斯概率
数据集
集合(抽象数据类型)
大数据
数据挖掘
适用范围
测量不确定度
可靠性(半导体)
贝叶斯推理
合成数据
不确定度分析
均方预测误差
贝叶斯定理
实验数据
预测建模
计算模型
药物发现
财产价值
训练集
敏感性分析
作者
Raquel Parrondo-Pizarro,Jessica Lanini,Raquel Rodríguez-Pérez
标识
DOI:10.1021/acs.jcim.5c02381
摘要
Drug discovery and medicinal chemistry efforts are increasingly influenced by machine learning (ML), with compound property prediction as a central application. ML models have demonstrated strong performance in predicting various compound properties from chemical structure. However, these models can exhibit varying levels of prediction error, making uncertainty quantification (UQ) essential for informed decisions. Standard UQ metrics include the distance to the molecules in the training set and prediction variance, obtained through methods such as model ensembles or Bayesian modeling. Although several UQ methodologies have been developed in recent years, no single approach consistently outperformed others. Herein, we present a comprehensive benchmark of UQ strategies for ML-based prediction of absorption, distribution, metabolism, and excretion (ADME) properties, using both in-house and public data sets. We employed the recently introduced UNIQUE (UNcertaInty QUantification bEnchmarking) framework and evaluated UQ method performance under data shifts. Our findings indicate data-based UQ metrics (e.g., chemical distance), and model-based UQ metrics (e.g., predicted value and variance) may capture complementary aspects of uncertainty. Their combination through error models, designed to predict the original ML model's error, yielded higher-quality uncertainty estimates. These error models emerged as a promising strategy for enhancing UQ, showing robustness in under various degrees and types of data shift. Taken together, our work highlights the potential of combining diverse UQ metrics and error modeling to improve reliability in molecular property prediction. By establishing standardized evaluation setups and assessing UQ under data shifts, we provide a foundation for future UQ method development and benchmarking in the field.
科研通智能强力驱动
Strongly Powered by AbleSci AI