计算机科学
直觉
化学空间
催化作用
学习迁移
人工智能
机器学习
共轭梯度法
组合化学
化学
有机化学
算法
药物发现
生物化学
认识论
哲学
作者
Katrina W. Lexa,Kevin M. Belyk,Jeremy Henle,Bangping Xiang,Robert P. Sheridan,Scott E. Denmark,Rebecca T. Ruck,R. Thomas Williamson
标识
DOI:10.1021/acs.oprd.1c00155
摘要
Molecular design benefits from a strong partnership between chemical intuition and machine learning. Given the proliferation of machine learning in the small molecule catalyst space, several ideas on how best to apply and interpret various 2D or 3D methods are under discussion across the field. We undertook an investigation of several methods for modeling a catalyst system with a large training set. The synthesis of the drug letermovir involves a key conjugate addition that is promoted asymmetrically by a cinchonidine-derived “bis-quat” phase transfer catalyst. An initial data set acquired from 177 catalysts was used to drive five additional rounds of optimization based on machine learning approaches. For this specific data set, random forest with 2D molecular descriptors outperformed all other 2D methods tested, alternative descriptor combinations, and 3D-based approaches. Improvement in the model performance was observed over time, and a high-throughput approach for the synthesis of new catalysts was key to iterating through larger rounds of optimization. Optimizing reaction conditions for one of the best catalysts identified during the machine learning work led to improvement of enantioselectivity to 89%.
科研通智能强力驱动
Strongly Powered by AbleSci AI