Enhancing Retrosynthetic Reaction Prediction with Deep Learning Using Multiscale Reaction Classification

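Retrosynthetic analysis, Computer science, Artificial intelligence, Set (abstract data type), Machine learning, Drug discovery, Chemistry, Total synthesis, Biochemistry, Organic chemistry, Programming language
Authors
Javier L. Baylon, Nicholas A. Cilfone, Jeffrey R. Gulcher, Thomas Chittenden
Source
Journal: Journal of Chemical Information and Modeling [American Chemical Society]
Volume (issue): 59 (2): 673-688 Cited by: 68
Identifier
DOI: 10.1021/acs.jcim.8b00801
Abstract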

Chemical synthesis planning is a key aspect of many fields of chemistry, especially drug discovery. Recent applications of machine learning and artificial intelligence techniques to retrosynthetic analysis have shown great potential to improve computational methods for synthesis planning. Herein, we present a multiscale, data-driven approach for retrosynthetic analysis with deep highway networks (DHN). We automatically extracted reaction rules (i.e., ways in which a molecule is produced) from a data set of chemical reactions derived from U.S. patents. We performed the retrosynthetic reaction prediction task in two steps. First, we built a DHN model to predict which group of reactions (consisting of chemically similar reaction rules) was employed to produce a molecule. Once a reaction group was identified, a DHN trained on the subset of reactions within that group was employed to predict the transformation rule used to produce the molecule. To validate our approach, we predicted the first retrosynthetic reaction step for 40 approved drugs using our multiscale model and compared its predictive performance with that of a control, a conventional model trained on all machine-extracted reaction rules. Our multiscale approach showed a success rate of 82.9% at generating valid reactants from retrosynthetic reaction predictions. By comparison, the control model yielded a success rate of 58.5% on the validation set of 40 pharmaceutical molecules, indicating a statistically significant improvement of our approach in matching the known first synthetic reaction of the drugs tested in this study. While our multiscale approach was unable to outperform state-of-the-art rule-based systems curated by expert chemists, multiscale classification represents a marked enhancement in retrosynthetic analysis and can be easily adapted for use in a range of artificial intelligence strategies.
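
The two-step prediction described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The highway layer follows the standard form y = T(x) * H(x) + (1 - T(x)) * x, and the ReLU candidate activation, the function names (`highway_layer`, `predict_rule`), and the group and rule labels in the usage note are all assumptions made for illustration. The trained models are stood in for by plain callables that map a feature vector to class probabilities.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

# A "model" here is any callable: molecular features -> class probabilities.
Scorer = Callable[[Sequence[float]], List[float]]


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def affine(x: Sequence[float], W: Sequence[Sequence[float]],
           b: Sequence[float]) -> List[float]:
    """Compute W @ x + b with plain lists."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]


def highway_layer(x, W_h, b_h, W_t, b_t) -> List[float]:
    """One highway layer: y = T(x) * H(x) + (1 - T(x)) * x."""
    h = [max(0.0, v) for v in affine(x, W_h, b_h)]  # candidate H(x); ReLU is an assumption
    t = [sigmoid(v) for v in affine(x, W_t, b_t)]   # transform gate T(x)
    return [ti * hi + (1.0 - ti) * xi for ti, hi, xi in zip(t, h, x)]


def argmax(probs: Sequence[float]) -> int:
    return max(range(len(probs)), key=probs.__getitem__)


def predict_rule(features: Sequence[float],
                 group_model: Scorer,
                 rule_models: Dict[str, Scorer],
                 group_names: Sequence[str],
                 rule_names: Dict[str, Sequence[str]]) -> Tuple[str, str]:
    """Two-step prediction: pick a reaction group, then a rule within it."""
    # Step 1: coarse classifier over reaction groups.
    group = group_names[argmax(group_model(features))]
    # Step 2: classifier trained only on rules belonging to that group.
    rule = rule_names[group][argmax(rule_models[group](features))]
    return group, rule
```

With hypothetical labels such as `group_names = ["acylation", "coupling"]` and one stand-in scorer per group, `predict_rule` returns a `(group, rule)` pair. In a sketch like this, the second-stage model only has to discriminate among the rules of one group, which is the motivation the abstract gives for the multiscale design.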
