转录组
计算生物学
序列(生物学)
全基因组测序
基因组
生物
遗传学
基因
基因表达
作者
Guoliang Fu,Yubin Yan,Ye Chen,Bin Shao
标识
DOI:10.1101/2024.12.30.630741
摘要
Abstract We present TXpredict, a transformer-based framework for predicting microbial transcriptomes using annotated genome sequences. By leveraging information learned from a large protein language model, TXpredict achieves an average Spearman correlation of 0.53 and 0.62 in predicting gene expression for new bacterial and fungal genomes. We further extend this framework to predict transcriptomes for 2,685 additional microbial genomes spanning 1,744 genera, 82% of which remain uncharacterized at the transcriptional level. Our analysis highlights conserved and divergent transcriptional programs across understudied genera, providing a powerful resource for uncovering microbial adaptation strategies and metabolic potential across the tree of life.
科研通智能强力驱动
Strongly Powered by AbleSci AI