化学
碳-13核磁共振
核磁共振谱数据库
鉴定(生物学)
谱线
人工智能
计算机科学
立体化学
物理
天文
植物
生物
作者
Zhuo Yang,Jianfei Song,Minjian Yang,Lin Yao,Jiahua Zhang,Hui Shi,Xiangyang Ji,Yafeng Deng,Xiaojian Wang
出处
期刊:Analytical Chemistry
[American Chemical Society]
日期:2021-11-29
卷期号:93 (50): 16947-16955
被引量:14
标识
DOI:10.1021/acs.analchem.1c04307
摘要
Library matching using carbon-13 nuclear magnetic resonance (13C NMR) spectra has been a popular method adopted in compound identification systems. However, the usability of existing approaches has been restricted as enlarging a library containing both a chemical structure and spectrum is a costly and time-consuming process. Therefore, we propose a fundamentally different, novel approach to match 13C NMR spectra directly against a molecular structure library. We develop a cross-modal retrieval between spectrum and structure (CReSS) system using deep contrastive learning, which allows us to search a molecular structure library using the 13C NMR spectrum of a compound. In the test of searching 41,494 13C NMR spectra against a reference structure library containing 10.4 million compounds, CReSS reached a recall@10 accuracy of 91.64% and a processing speed of 0.114 s per query spectrum. When further incorporating a filter with a molecular weight tolerance of 5 Da, CReSS achieved a new remarkable recall@10 of 98.39%. Furthermore, CReSS has potential in detecting scaffolds of novel structures and demonstrates great performance for the task of structural revision. CReSS is built and developed to bridge the gap between 13C NMR spectra and structures and could be generally applicable in compound identification.
科研通智能强力驱动
Strongly Powered by AbleSci AI