Abstract 4970: Multi-modal machine learning approaches for predicting cancer type and Gleason grade leveraging public TCGA data

情态动词 医学 癌症 肿瘤科 内科学 计算机科学 化学 高分子化学
作者
Christian Wohlfart,Eldad Klaiman,Jacub Witkowski,Michael R. King,Jacob Gildenblat,Ofir Etz-Hadar,Mohammad Ashtari,Antoaneta Vladimirova
出处
期刊:Cancer Research [American Association for Cancer Research]
卷期号:84 (6_Supplement): 4970-4970
标识
DOI:10.1158/1538-7445.am2024-4970
摘要

Abstract Introduction: To better understand the complex and challenging nature of diseases such as cancer and for improved diagnosis, it may require the combination of multiple data modalities, such as histopathological images and omics data such as RNA-seq. By integrating these heterogeneous but complementary data, a multimodal approach unites both worlds and could achieve better synergistic results compared to using a single modality. The growing availability of large datasets such as The Cancer Genome Atlas (TCGA) with more than 10000 patients made it possible to combine different modalities to train machine learning algorithms which offers great potential to address challenging cancer related research. In this proof of concept initiative we use machine learning approaches within an open-source framework in order to leverage the potential of multimodality (Histopathology Whole Slide Images (WSI) and Genomics/RNA-seq) to build predictive AI models for cancer type and prostate Gleason score, and provide a potential to develop a quality control step. Method: We used matched WSI and RNA-Seq profiles from TCGA, including 11093 samples and 30 cancer types to develop a pancancer classification model using both modalities. For prostate Gleason score prediction 401 patients were available. Both datasets were split into a train (70%) and test (30%) components. We used a late fusion approach where we combined the RNA-seq model (linear SVM) with the WSI model (Resnet18) by multiplying the probability scores of each single-modality model. Model performance was measured with the F1 metric. Results: For cancer type prediction, the multimodality model achieved an F1 score of 0.95 on the test set. About 40% of the cancer types benefited from a synergistic effect by combining the two modalities. Cancer types and percent increase in F1 scores, respectively, that benefit most by combining modalities are: Cervical squamous cell carcinoma and endocervical adenocarcinoma (4.23%), Cholangiosarcoma (6.66%) and Uterine carcinosarcoma (4%). Interestingly, in other cancer types the combination did not result in improved predictive scores compared to a single modality model, e.g. in Rectum adenocarcinoma, Sarcoma or Stomach adenocarcinoma. For Prostate cancer grading, Gleason score prediction of patterns 3/4/5, combined multi modality model earned 0.73 F1 outperforming the single modality models. Conclusion: By combining histopathology imaging and omics modalities we demonstrated synergistic effects in predictive power for both cancer-related research questions. We show improved predictive performance in 40% of the classified cancer types by taking both modalities. Imaging or omics modalities alone can be sufficient in some cases and their strengths are very problem-specific. Citation Format: Christian Wohlfart, Eldad Klaiman, Jacub Witkowski, Michael King, Jacob Gildenblat, Ofir Etz-Hadar, Mohammad Ashtari, Antoaneta Vladimirova. Multi-modal machine learning approaches for predicting cancer type and Gleason grade leveraging public TCGA data [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2024; Part 1 (Regular Abstracts); 2024 Apr 5-10; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2024;84(6_Suppl):Abstract nr 4970.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
迷路小丸子完成签到,获得积分10
刚刚
充电宝应助熠迩采纳,获得30
刚刚
在下技能五完成签到,获得积分10
刚刚
研友_VZG7GZ应助sdl采纳,获得10
1秒前
lilili完成签到,获得积分10
1秒前
Lucas应助搞怪的牛爷爷采纳,获得10
1秒前
FashionBoy应助默默紊采纳,获得10
2秒前
edmund发布了新的文献求助10
2秒前
亚历山大完成签到,获得积分10
3秒前
慕青应助lulu采纳,获得10
3秒前
3秒前
3秒前
万能图书馆应助纳斯达克采纳,获得10
3秒前
4秒前
吐个泡泡完成签到,获得积分10
4秒前
阿宝发布了新的文献求助10
4秒前
5秒前
十三发布了新的文献求助10
5秒前
5秒前
6秒前
6秒前
6秒前
开心的又夏完成签到,获得积分10
6秒前
坦率的平安完成签到,获得积分10
6秒前
dahaoren发布了新的文献求助10
7秒前
222发布了新的文献求助10
7秒前
wythu16完成签到,获得积分10
7秒前
7秒前
7秒前
7秒前
小飞完成签到,获得积分10
8秒前
8秒前
张宇完成签到,获得积分10
8秒前
张朋乐关注了科研通微信公众号
8秒前
蔡思艺发布了新的文献求助10
9秒前
9秒前
小呼发布了新的文献求助10
9秒前
10秒前
10秒前
小飞发布了新的文献求助10
10秒前
高分求助中
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
久松真一著作集〈第5巻〉禅と芸術 500
Fundamentals of Modern Mathematics: A Practical Review (Dover Books on Mathematics) 500
Cold War Transcended: Australia's China Policy, 1949-1990 470
Cybercrime: The Transformation of Crime in the Information Age, 2nd Edition 400
Moore's Clinically Oriented Anatomy 10th Edition 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6617669
求助须知:如何正确求助?哪些是违规求助? 8381923
关于积分的说明 17932108
捐赠科研通 5787034
什么是DOI,文献DOI怎么找? 2959884
邀请新用户注册赠送积分活动 1935130
关于科研通互助平台的介绍 1839742