化学
深度学习
计算生物学
串联质谱法
质谱法
基因组
DNA测序
领域(数学)
人工智能
计算机科学
色谱法
基因
生物化学
生物
数学
纯数学
作者
Wout Bittremieux,Varun Ananth,William E. Fondrie,Carlo Melendez,Marina Pominova,Justin J. Sanders,Bo Wen,Melih Yilmaz,William Stafford Noble
摘要
ABSTRACT Protein tandem mass spectrometry data are most often interpreted by matching observed mass spectra to a protein database derived from the reference genome of the sample being analyzed. In many application domains, however, a relevant protein database is unavailable or incomplete, and in such settings de novo sequencing is required. Since the introduction of the DeepNovo algorithm in 2017, the field of de novo sequencing has been dominated by deep learning methods, which use large amounts of labeled mass spectrometry data to train multi‐layer neural networks to translate from observed mass spectra to corresponding peptide sequences. Here, we describe these deep learning methods, outline procedures for evaluating their performance, and discuss the challenges in the field, both in terms of methods development and evaluation protocols.
科研通智能强力驱动
Strongly Powered by AbleSci AI