生物
生物信息学
遗传学
计算生物学
人类基因组
基因
突变
DNA微阵列
基因组
基因表达
作者
Jian Zhou,Chandra L. Theesfeld,Kevin Yao,Kathleen Chen,Aaron K. Wong,Olga G. Troyanskaya
出处
期刊:Nature Genetics
[Nature Portfolio]
日期:2018-07-13
卷期号:50 (8): 1171-1179
被引量:476
标识
DOI:10.1038/s41588-018-0160-6
摘要
Key challenges for human genetics, precision medicine and evolutionary biology include deciphering the regulatory code of gene expression and understanding the transcriptional effects of genome variation. However, this is extremely difficult because of the enormous scale of the noncoding mutation space. We developed a deep learning–based framework, ExPecto, that can accurately predict, ab initio from a DNA sequence, the tissue-specific transcriptional effects of mutations, including those that are rare or that have not been observed. We prioritized causal variants within disease- or trait-associated loci from all publicly available genome-wide association studies and experimentally validated predictions for four immune-related diseases. By exploiting the scalability of ExPecto, we characterized the regulatory mutation space for human RNA polymerase II–transcribed genes by in silico saturation mutagenesis and profiled > 140 million promoter-proximal mutations. This enables probing of evolutionary constraints on gene expression and ab initio prediction of mutation disease effects, making ExPecto an end-to-end computational framework for the in silico prediction of expression and disease risk. ExPecto is a deep learning–based framework that can predict the tissue-specific transcriptional effects of mutations on the basis of DNA sequence alone. ExPecto can prioritize causal variants from GWAS loci and be used to predict the disease risk of a variant.
科研通智能强力驱动
Strongly Powered by AbleSci AI