串扰
可解释性
计算机科学
液体活检
计算生物学
生物信息学
癌症
医学
机器学习
生物
工程类
内科学
电子工程
作者
Xiaofan Liu,Yuhuan Tao,Zexiang Cai,Pengfei Bao,Hongli Ma,Kexing Li,Yunping Zhu,Zhi John Lu
标识
DOI:10.1101/2023.05.23.541554
摘要
Abstract Multi-omics data provide a comprehensive view of gene regulation at multiple levels, which is helpful in achieving accurate diagnosis of complex diseases like cancer. To integrate various multi-omics data of tissue and liquid biopsies for disease diagnosis and prognosis, we developed a biological pathway informed Transformer, Pathformer. It embeds multi-omics input with a compacted multi-modal vector and a pathway-based sparse neural network. Pathformer also leverages criss-cross attention mechanism to capture the crosstalk between different pathways and modalities. We first benchmarked Pathformer with 18 comparable methods on multiple cancer datasets, where Pathformer outperformed all the other methods, with an average improvement of 6.3%-14.7% in F1 score for cancer survival prediction and 5.1%-12% for cancer stage prediction. Subsequently, for cancer prognosis prediction based on tissue multi-omics data, we used a case study to demonstrate the biological interpretability of Pathformer by identifying key pathways and their biological crosstalk. Then, for cancer early diagnosis based on liquid biopsy data, we used plasma and platelet datasets to demonstrate Pathformer’s potential of clinical applications in cancer screen. Moreover, we revealed deregulation of interesting pathways (e.g., scavenger receptor pathway) and their crosstalk in cancer patients’ blood, providing new candidate targets for cancer microenvironment study.
科研通智能强力驱动
Strongly Powered by AbleSci AI