转录组
乳腺癌
组织学
计算生物学
生物
人工智能
癌症
基因表达谱
相关性
仿形(计算机编程)
基因
病理
生物信息学
基因表达
医学
计算机科学
遗传学
数学
操作系统
几何学
作者
Yuanning Zheng,Marija Pizurica,Francisco Carrillo‐Pérez,Humaira Noor,Wei Yao,Christian Wohlfart,Kathleen Marchal,Antoaneta Vladimirova,Olivier Gevaert
标识
DOI:10.1101/2023.09.28.560068
摘要
Abstract Cancer is a heterogeneous disease that demands precise molecular profiling for better understanding and management. Recently, deep learning has demonstrated potentials for cost-efficient prediction of molecular alterations from histology images. While transformer-based deep learning architectures have enabled significant progress in non-medical domains, their application to histology images remains limited due to small dataset sizes coupled with the explosion of trainable parameters. Here, we develop SEQUOIA , a transformer model to predict cancer transcriptomes from whole-slide histology images. To enable the full potential of transformers, we first pre-train the model using data from 1,802 normal tissues. Then, we fine-tune and evaluate the model in 4,331 tumor samples across nine cancer types. The prediction performance is assessed at individual gene levels and pathway levels through Pearson correlation analysis and root mean square error. The generalization capacity is validated across two independent cohorts comprising 1,305 tumors. In predicting the expression levels of 25,749 genes, the highest performance is observed in cancers from breast, kidney and lung, where SEQUOIA accurately predicts the expression of 11,069, 10,086 and 8,759 genes, respectively. The accurately predicted genes are associated with the regulation of inflammatory response, cell cycles and metabolisms. While the model is trained at the tissue level, we showcase its potential in predicting spatial gene expression patterns using spatial transcriptomics datasets. Leveraging the prediction performance, we develop a digital gene expression signature that predicts the risk of recurrence in breast cancer. SEQUOIA deciphers clinically relevant gene expression patterns from histology images, opening avenues for improved cancer management and personalized therapies.
科研通智能强力驱动
Strongly Powered by AbleSci AI