广告
机器学习
人工智能
决策树
药物发现
算法
计算机科学
药代动力学
随机森林
药理学
化学
生物
生物化学
作者
Cheng Fang,Ye Wang,Richard Grater,Sudarshan Kapadnis,Cheryl Black,Patrick Trapa,Simone Sciabola
标识
DOI:10.1021/acs.jcim.3c00160
摘要
Absorption, distribution, metabolism, and excretion (ADME), which collectively define the concentration profile of a drug at the site of action, are of critical importance to the success of a drug candidate. Recent advances in machine learning algorithms and the availability of larger proprietary as well as public ADME data sets have generated renewed interest within the academic and pharmaceutical science communities in predicting pharmacokinetic and physicochemical endpoints in early drug discovery. In this study, we collected 120 internal prospective data sets over 20 months across six ADME in vitro endpoints: human and rat liver microsomal stability, MDR1-MDCK efflux ratio, solubility, and human and rat plasma protein binding. A variety of machine learning algorithms in combination with different molecular representations were evaluated. Our results suggest that gradient boosting decision tree and deep learning models consistently outperformed random forest over time. We also observed better performance when models were retrained on a fixed schedule, and the more frequent retraining generally resulted in increased accuracy, while hyperparameters tuning only improved the prospective predictions marginally.
科研通智能强力驱动
Strongly Powered by AbleSci AI