Deep Learning Prediction of Glycopeptide Tandem Mass Spectra Powers Glycoproteomics

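Glycoproteomics; Glycosylation; Tandem mass spectrometry; Glycan; Glycopeptide; Cheminformatics; Computer science; Computational biology; Mass spectrometry; Chemistry; Glycoprotein; Chromatography; Biology; Biochemistry; Computational chemistry; Antibiotics
Authors
裕文 宗, Yuxin Wang, Xipeng Qiu, Xuanjing Huang, Liang Qiao
Identifier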
DOI:10.1101/2024.02.03.575604
Abstract

Protein glycosylation plays a significant role in numerous physiological and pathological cellular functions. Glycoproteomics based on liquid chromatography-tandem mass spectrometry (LC-MS/MS) studies protein glycosylation on a proteome-wide scale to obtain combined information on glycosylation sites, glycosylation levels, and glycan structures. However, current sequence searching-based methods for glycoproteomics often fall short in glycan structure determination because structure-determining ions occur only rarely. While spectral searching methods can use fragment intensity information to facilitate the identification of glycopeptides, their application is hindered by the difficulty of constructing spectral libraries. In this work, we present DeepGP, a hybrid deep learning framework based on Transformer and graph neural network (GNN), for the prediction of MS/MS spectra and retention times of glycopeptides. Two GNN modules are used, one to capture the branched glycan structure and one to predict glycan ion intensities. Additionally, a pre-training strategy is implemented to alleviate the scarcity of glycoproteomics data. In tests on multiple biological datasets, we demonstrate that DeepGP predicts MS/MS spectra and retention times of glycopeptides that closely align with the experimental results. Comprehensive benchmarking of DeepGP on synthetic and biological datasets validates its effectiveness in distinguishing similar glycoforms. Remarkably, DeepGP can differentiate isomeric glycopeptides using MS/MS spectra without diagnostic ions. Using various decoy methods, we demonstrate that DeepGP in combination with database searching can significantly increase the detection sensitivity of glycopeptides. We anticipate that DeepGP will inspire extensive future work in glycoproteomics.
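
The abstract does not say how predicted and experimental spectra are compared. As an illustrative sketch only, and not DeepGP's actual scoring, the Python below assumes that fragment intensities have already been aligned into equal-length vectors and uses cosine similarity to rank candidate glycoforms against one experimental spectrum. The function names, candidate labels, and intensity values are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length intensity vectors."""
    if len(a) != len(b):
        raise ValueError("intensity vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty spectrum matches nothing
    return dot / (norm_a * norm_b)


def rank_candidates(experimental, predicted_by_candidate):
    """Sort candidate glycoforms by similarity to the experimental spectrum."""
    scores = {
        name: cosine_similarity(experimental, predicted)
        for name, predicted in predicted_by_candidate.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: four aligned fragment-ion intensities per spectrum.
experimental = [0.9, 0.1, 0.4, 0.0]
predicted = {
    "glycoform_A": [1.0, 0.1, 0.5, 0.0],
    "glycoform_B": [0.1, 0.9, 0.0, 0.4],
}

ranking = rank_candidates(experimental, predicted)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In this toy case the candidate whose predicted intensity pattern follows the experimental one ranks first. This is the general idea behind using intensity predictions to separate glycoforms that share the same fragment masses.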