计算机科学
可扩展性
编码
基因组学
计算生物学
染色质
水准点(测量)
限制
表观遗传学
功能基因组学
人工智能
机器学习
DNA测序
基因组
卷积神经网络
深度学习
边距(机器学习)
表观基因组
学习迁移
CTCF公司
生物
标杆管理
重新调整用途
序列(生物学)
匹配(统计)
计算模型
监督学习
作者
Canzhuang Sun,Zhijie He,Shifei Zhang,Kang Xu,Yu Sun,Yue Wang,Pengzhen Hu,Xiaochen Bo,Mingzhi Liao,Hao Li,H CHEN
标识
DOI:10.1038/s41467-026-73129-6
摘要
Sequence-based deep learning has advanced genome interpretation, yet most models remain task-specific and rely on retraining, limiting scalability across biological contexts. Here we present SUCCEED, a supervised multi-task DNA foundation model pretrained on 6,389 ENCODE functional genomics tracks to learn transferable regulatory representations. By integrating convolutional layers with a Transformer architecture, SUCCEED captures both local sequence motifs and long-range regulatory dependencies, achieving performance comparable to or exceeding Enformer across benchmark tasks. Through transfer learning, it predicts cell-type-specific epigenomic profiles, denoises sparse chromatin accessibility signals, and predicts three-dimensional chromatin contacts without CTCF input across data scales and cell types. Across diverse genomics tasks, SUCCEED performs comparably to supervised foundation models such as Sei and outperforms self-supervised models trained solely on DNA sequence. Overall, SUCCEED is a transferable and scalable foundation model that provides a unified framework for genome-scale regulatory modeling in complex biological contexts.
科研通智能强力驱动
Strongly Powered by AbleSci AI