代谢组学
代谢途径
小桶
化学
计算生物学
代谢网络
代谢物
苯丙素
人工智能
计算机科学
生物化学
新陈代谢
生物合成
色谱法
生物
转录组
基因表达
基因
酶
作者
Han Bao,Xiuqiong Zhang,Xinxin Wang,Jinhui Zhao,Xinjie Zhao,Chunxia Zhao,Xin Lu,Guowang Xu
标识
DOI:10.1021/acs.analchem.4c06875
摘要
MS/MS-based untargeted metabolomics generates complex data, but pathway enrichment analysis is constrained by the low annotation rates of metabolic features. Here, we propose MS2MP, a novel deep learning-based framework for KEGG pathway prediction directly from untargeted tandem mass spectrometry (MS2), eliminating the need for prior metabolite annotation. MS2MP utilizes a graph neural network architecture to learn the complex relationships between spectral features and metabolic pathways, representing MS2 spectra as fragmentation tree graphs. Trained on 33,221 experimental MS2 spectra, MS2MP achieves robust predictive performance with a balanced accuracy of 94.1% in cross-validation and 87.8%-91.2% on three independent test sets. Notably, MS2MP achieves an "exact match" for 97-98 out of 161 tested metabolite standards across diverse experimental conditions, underscoring its reliability and adaptability. Subsequently, a novel MS2-based pathway enrichment method was developed. The established methods were applied to identify significantly perturbed pathways in transgenic maize. The results uncovered disruptions in phenylpropanoid biosynthesis and related downstream pathways, including those involved in amino acid and secondary metabolite metabolism, which were overlooked by the conventional annotation-based enrichment analysis method. To the best of our knowledge, MS2MP is the first computational tool capable of directly predicting metabolic pathways from MS2 spectra. By linking MS2-based untargeted metabolomics data to metabolic pathways, MS2MP enables more efficient pathway enrichment analysis, thereby accelerating biological discoveries and enhancing our understanding of complex metabolic networks.
科研通智能强力驱动
Strongly Powered by AbleSci AI