重要事件
基础(证据)
数据科学
钥匙(锁)
计算机科学
基因调控网络
计算生物学
生物
基因
生态学
遗传学
基因表达
政治学
地理
考古
法学
作者
Xiaodong Yang,Guole Liu,Guihai Feng,Dechao Bu,Pengfei Wang,Jie Jiang,Shubai Chen,Qinmeng Yang,Yiyang Zhang,Zhenpeng Man,Zhongming Liang,Zichen Wang,Yaning Li,Zheng Li,Yana Liu,Yao Tian,Ao Li,Jingxi Dong,Zhilong Hu,Fang Chen
标识
DOI:10.1101/2023.09.26.559542
摘要
Abstract Deciphering the universal gene regulatory mechanisms in diverse organisms holds great potential to advance our knowledge of fundamental life process and facilitate research on clinical applications. However, the traditional research paradigm primarily focuses on individual model organisms, resulting in limited collection and integration of complex features on various cell types across species. Recent breakthroughs in single-cell sequencing and advancements in deep learning techniques present an unprecedented opportunity to tackle this challenge. In this study, we developed GeneCompass, the first knowledge-informed, cross-species foundation model pre-trained on an extensive dataset of over 120 million single-cell transcriptomes from human and mouse. During pre-training, GeneCompass effectively integrates four types of biological prior knowledge to enhance the understanding of gene regulatory mechanisms in a self-supervised manner. Fine-tuning towards multiple downstream tasks, GeneCompass outperforms competing state-of-the-art models in multiple tasks on single species and unlocks new realms of cross-species biological investigation. Overall, GeneCompass marks a milestone in advancing knowledge of universal gene regulatory mechanisms and accelerating the discovery of key cell fate regulators and candidate targets for drug development.
科研通智能强力驱动
Strongly Powered by AbleSci AI