工作流程
质谱法
鉴定(生物学)
匹配(统计)
计算机科学
串联质谱法
质谱
谱线
表征(材料科学)
分子
小分子
模式识别(心理学)
人工智能
化学
生物系统
数据挖掘
色谱法
纳米技术
物理
材料科学
数学
数据库
生物化学
植物
统计
有机化学
天文
生物
作者
Yuhui Hong,Yuzhen Ye,Haixu Tang
标识
DOI:10.1146/annurev-anchem-071224-082157
摘要
Tandem mass spectrometry (MS/MS) is crucial for small-molecule analysis; however, traditional computational methods are limited by incomplete reference libraries and complex data processing. Machine learning (ML) is transforming small-molecule mass spectrometry in three key directions: ( a ) predicting MS/MS spectra and related physicochemical properties to expand reference libraries, ( b ) improving spectral matching through automated pattern extraction, and ( c ) predicting molecular structures of compounds directly from their MS/MS spectra. We review ML approaches for molecular representations [descriptors, simplified molecular-input line-entry (SMILE) strings, and graphs] and MS/MS spectra representations (using binned vectors and peak lists) along with recent advances in spectra prediction, retention time, collision cross sections, and spectral matching. Finally, we discuss ML-integrated workflows for chemical formula identification. By addressing the limitations of current methods for compound identification, these ML approaches can greatly enhance the understanding of biological processes and the development of diagnostic and therapeutic tools.
科研通智能强力驱动
Strongly Powered by AbleSci AI