基础(证据)
计算生物学
组学
生物
计算机科学
生物信息学
数据科学
地理
考古
作者
Alejandro Tejada-Lapuerta,Anna C. Schaar,Robert M. Gutgesell,Giovanni Palla,Lennard Halle,Mariia Minaeva,Larsen Vornholz,Leander Dony,Francesca Drummer,Till Richter,Mojtaba Bahrami,Fabian J. Theis
标识
DOI:10.1038/s41592-025-02814-z
摘要
Tissue makeup depends on the local cellular microenvironment. Spatial single-cell genomics enables scalable and unbiased interrogation of these interactions. Here we introduce Nicheformer, a transformer-based foundation model trained on both human and mouse dissociated single-cell and targeted spatial transcriptomics data. Pretrained on SpatialCorpus-110M, a curated collection of over 57 million dissociated and 53 million spatially resolved cells across 73 tissues on cellular reconstruction, Nicheformer learns cell representations that capture spatial context. It excels in linear-probing and fine-tuning scenarios for a newly designed set of downstream tasks, in particular spatial composition prediction and spatial label prediction. Critically, we show that models trained only on dissociated data fail to recover the complexity of spatial microenvironments, underscoring the need for multiscale integration. Nicheformer enables the prediction of the spatial context of dissociated cells, allowing the transfer of rich spatial information to scRNA-seq datasets. Overall, Nicheformer sets the stage for the next generation of machine-learning models in spatial single-cell analysis.
科研通智能强力驱动
Strongly Powered by AbleSci AI