核磁共振波谱
化学
稳健性(进化)
谱线
光谱学
核磁共振谱数据库
化学位移
二维核磁共振波谱
卷积神经网络
生物系统
模式识别(心理学)
分析化学(期刊)
人工智能
计算机科学
色谱法
物理化学
有机化学
物理
立体化学
生物化学
量子力学
天文
生物
基因
作者
Weiwei Wei,Yuxuan Liao,Yufei Wang,Shaoqi Wang,Wen Du,Hongmei Lü,Bo Kong,Huawu Yang,Zhimin Zhang
出处
期刊:Molecules
[Multidisciplinary Digital Publishing Institute]
日期:2022-06-07
卷期号:27 (12): 3653-3653
被引量:30
标识
DOI:10.3390/molecules27123653
摘要
Nuclear magnetic resonance (NMR) spectroscopy is highly unbiased and reproducible, which provides us a powerful tool to analyze mixtures consisting of small molecules. However, the compound identification in NMR spectra of mixtures is highly challenging because of chemical shift variations of the same compound in different mixtures and peak overlapping among molecules. Here, we present a pseudo-Siamese convolutional neural network method (pSCNN) to identify compounds in mixtures for NMR spectroscopy. A data augmentation method was implemented for the superposition of several NMR spectra sampled from a spectral database with random noises. The augmented dataset was split and used to train, validate and test the pSCNN model. Two experimental NMR datasets (flavor mixtures and additional flavor mixture) were acquired to benchmark its performance in real applications. The results show that the proposed method can achieve good performances in the augmented test set (ACC = 99.80%, TPR = 99.70% and FPR = 0.10%), the flavor mixtures dataset (ACC = 97.62%, TPR = 96.44% and FPR = 2.29%) and the additional flavor mixture dataset (ACC = 91.67%, TPR = 100.00% and FPR = 10.53%). We have demonstrated that the translational invariance of convolutional neural networks can solve the chemical shift variation problem in NMR spectra. In summary, pSCNN is an off-the-shelf method to identify compounds in mixtures for NMR spectroscopy because of its accuracy in compound identification and robustness to chemical shift variation.
科研通智能强力驱动
Strongly Powered by AbleSci AI