注释
从头算
基因
计算生物学
计算机科学
生物
遗传学
人工智能
物理
量子力学
作者
Kai Ye,Pengyu Zhang,Tun Xu,Songbo Wang,Xiaofei Yang,Peisen Sun,Jia‐Shi Peng,Bo Wang,Stephen J. Bush,Zemin Ning
出处
期刊:Research Square
日期:2025-04-24
被引量:7
标识
DOI:10.21203/rs.3.rs-6402260/v1
摘要
Abstract Accurate gene annotation is essential for deciphering the mapping from genomic sequences to their functional roles. However, current methods struggle to model complex gene transmission patterns and evolutionary variations in gene lengths. Here, we introduce ANNEVO, a mixture of experts (MoE)-based genomic language model that directly models distal sequence dependencies and joint evolutionary relationships from diverse genomes, enabling precise ab initio gene annotation. Through extensive benchmarking on 566 species, we demonstrate that ANNEVO significantly outperforms existing ab initio methods and achieves performance comparable to state-of-the-art annotation pipelines. Furthermore, ANNEVO’s independence from external evidence allows it to deliver more complete annotations than reference annotations for a broad range of species while correcting errors within them. These advancements will significantly advance genome sequence interpretation and provide a framework capable of integrating evolutionary insights.
科研通智能强力驱动
Strongly Powered by AbleSci AI