Reference-informed prediction of alternative splicing and splicing-altering mutations from sequences

RNA剪接 生物 选择性拼接 遗传学 基因 计算生物学 小基因 外显子跳跃 生物信息学 突变 剪接 外显子 核糖核酸
作者
Chencheng Xu,Suying Bao,Sheng Wang,Wenxing Li,Hao Chen,Yufeng Shen,Tao Jiang,Chaolin Zhang
出处
期刊:Genome Research [Cold Spring Harbor Laboratory Press]
被引量:1
标识
DOI:10.1101/gr.279044.124
摘要

Alternative splicing plays a crucial role in protein diversity and gene expression regulation in higher eukaryotes, and mutations causing dysregulated splicing underlie a range of genetic diseases. Computational prediction of alternative splicing from genomic sequences not only provides insight into gene-regulatory mechanisms but also helps identify disease-causing mutations and drug targets. However, the current methods for the quantitative prediction of splice site usage still have limited accuracy. Here, we present DeltaSplice, a deep neural network model optimized to learn the impact of mutations on quantitative changes in alternative splicing from the comparative analysis of homologous genes. The model architecture enables DeltaSplice to perform “reference-informed prediction” by incorporating the known splice site usage of a reference gene sequence to improve its prediction on splicing-altering mutations. We benchmarked DeltaSplice and several other state-of-the-art methods on various prediction tasks, including evolutionary sequence divergence on lineage-specific splicing and splicing-altering mutations in human populations and neurodevelopmental disorders, and demonstrated that DeltaSplice outperformed consistently. DeltaSplice predicted ∼15% of splicing quantitative trait loci (sQTLs) in the human brain as causal splicing-altering variants. It also predicted splicing-altering de novo mutations outside the splice sites in a subset of patients affected by autism and other neurodevelopmental disorders (NDDs), including 19 genes with recurrent splicing-altering mutations. Integration of splicing-altering mutations with other types of de novo mutation burdens allowed the prediction of eight novel NDD-risk genes. Our work expanded the capacity of in silico splicing models with potential applications in genetic diagnosis and the development of splicing-based precision medicine.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
彭于晏应助LuxuryQ采纳,获得10
刚刚
李子发布了新的文献求助10
1秒前
氢氧化发布了新的文献求助10
2秒前
3秒前
3秒前
852应助喔喔采纳,获得10
3秒前
5秒前
chetouhua发布了新的文献求助10
6秒前
小白完成签到,获得积分10
7秒前
扁扁xx发布了新的文献求助10
8秒前
棒棒冰完成签到,获得积分10
10秒前
king完成签到,获得积分10
11秒前
能干砖家发布了新的文献求助10
11秒前
Lost_Flight完成签到,获得积分10
12秒前
12秒前
14秒前
14秒前
14秒前
向会妍完成签到,获得积分10
15秒前
自然剑发布了新的文献求助10
15秒前
康康完成签到,获得积分10
15秒前
15秒前
矿泉水发布了新的文献求助10
16秒前
17秒前
科研通AI6.3应助sonya采纳,获得10
17秒前
uup发布了新的文献求助10
18秒前
18秒前
19秒前
陈欣发布了新的文献求助10
19秒前
1111关注了科研通微信公众号
19秒前
研友_VZG7GZ应助李子采纳,获得10
20秒前
啊咧咧完成签到 ,获得积分10
20秒前
21秒前
wangyu发布了新的文献求助10
21秒前
小罗黑的发布了新的文献求助10
22秒前
小思雅完成签到,获得积分10
23秒前
范奕恒完成签到,获得积分10
23秒前
23秒前
干净的琦应助renpp822采纳,获得200
24秒前
qqqwwwere完成签到 ,获得积分10
25秒前
高分求助中
Overcoming Stigma and Bias in Obesity Management 800
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Materials selection in mechanical design 500
Bounds for Statistical Estimation in Semiparametric Models 500
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6476181
求助须知:如何正确求助?哪些是违规求助? 8278638
关于积分的说明 17654558
捐赠科研通 5557600
什么是DOI,文献DOI怎么找? 2910513
邀请新用户注册赠送积分活动 1887382
关于科研通互助平台的介绍 1740454