增强子
计算生物学
生物
调节顺序
基因
突变
DNA
饱和突变
遗传学
人类基因组
报告基因
基因表达调控
DNA测序
基因组
基因表达
突变
突变体
作者
Žiga Avsec,Vikram Agarwal,Daniel Visentin,Joseph R. Ledsam,Agnieszka Grabska‐Barwińska,Kyle R. Taylor,Yannis Assael,John Jumper,Pushmeet Kohli,David R. Kelley
出处
期刊:Nature Methods
[Nature Portfolio]
日期:2021-10-01
卷期号:18 (10): 1196-1203
被引量:714
标识
DOI:10.1038/s41592-021-01252-x
摘要
Abstract How noncoding DNA determines gene expression in different cell types is a major unsolved problem, and critical downstream applications in human genetics depend on improved solutions. Here, we report substantially improved gene expression prediction accuracy from DNA sequences through the use of a deep learning architecture, called Enformer, that is able to integrate information from long-range interactions (up to 100 kb away) in the genome. This improvement yielded more accurate variant effect predictions on gene expression for both natural genetic variants and saturation mutagenesis measured by massively parallel reporter assays. Furthermore, Enformer learned to predict enhancer–promoter interactions directly from the DNA sequence competitively with methods that take direct experimental data as input. We expect that these advances will enable more effective fine-mapping of human disease associations and provide a framework to interpret cis -regulatory evolution.
科研通智能强力驱动
Strongly Powered by AbleSci AI