染色质
计算生物学
剪接
组蛋白
生物
序列(生物学)
人类基因组
DNA
计算机科学
遗传学
转录因子
基因组
DNA测序
限制
基因组学
调节顺序
抄写(语言学)
基因
人工智能
模态(人机交互)
深度学习
DNA结合位点
基因表达调控
机器学习
编码
选择性拼接
模式
功能基因组学
参考基因组
生物信息学
范围(计算机科学)
基因组计划
作者
Žiga Avsec,Natasha S. Latysheva,Jun Cheng,Guido Novati,Kyle R. Taylor,Tom Ward,Clare Bycroft,Lauren Nicolaisen,Eirini Arvaniti,Joshua Pan,Raina W. Thomas,Vincent Dutordoir,Matteo Perino,S. P. M. Boer De,Alexander Karollus,Adam Gayoso,Toby Sargeant,Anne Mottram,Lai Hong Wong,Pavol Drotár
出处
期刊:Nature
[Nature Portfolio]
日期:2026-01-28
卷期号:649 (8099): 1206-1218
被引量:86
标识
DOI:10.1038/s41586-025-10014-0
摘要
Abstract Deep learning models that predict functional genomic measurements from DNA sequences are powerful tools for deciphering the genetic regulatory code. Existing methods involve a trade-off between input sequence length and prediction resolution, thereby limiting their modality scope and performance 1–5 . We present AlphaGenome, a unified DNA sequence model, which takes as input 1 Mb of DNA sequence and predicts thousands of functional genomic tracks up to single-base-pair resolution across diverse modalities. The modalities include gene expression, transcription initiation, chromatin accessibility, histone modifications, transcription factor binding, chromatin contact maps, splice site usage and splice junction coordinates and strength. Trained on human and mouse genomes, AlphaGenome matches or exceeds the strongest available external models in 25 of 26 evaluations of variant effect prediction. The ability of AlphaGenome to simultaneously score variant effects across all modalities accurately recapitulates the mechanisms of clinically relevant variants near the TAL1 oncogene 6 . To facilitate broader use, we provide tools for making genome track and variant effect predictions from sequence.
科研通智能强力驱动
Strongly Powered by AbleSci AI