乳腺癌
计算机科学
人工智能
模态(人机交互)
深度学习
特征提取
机器学习
主成分分析
特征(语言学)
数据集
模式识别(心理学)
数据挖掘
癌症
医学
内科学
语言学
哲学
作者
Tao Liu,Junhao Huang,Tianchi Liao,Ruitao Pu,Shuanglin Liu,Yue Peng
出处
期刊:Irbm
[Elsevier BV]
日期:2021-01-10
卷期号:43 (1): 62-74
被引量:63
标识
DOI:10.1016/j.irbm.2020.12.002
摘要
The prediction of breast cancer subtypes plays a key role in the diagnosis and prognosis of breast cancer. In recent years, deep learning (DL) has shown good performance in the intelligent prediction of breast cancer subtypes. However, most of the traditional DL models use single modality data, which can just extract a few features, so it cannot establish a stable relationship between patient characteristics and breast cancer subtypes. We used the TCGA-BRCA dataset as a sample set for molecular subtype prediction of breast cancer. It is a public dataset that can be obtained through the following link: https://portal.gdc.cancer.gov/projects/TCGA-BRCA In this paper, a Hybrid DL model based on the multimodal data is proposed. We combine the patient's gene modality data with image modality data to construct a multimodal fusion framework. According to the different forms and states, we set up feature extraction networks respectively, and then we fuse the output of the two feature networks based on the idea of weighted linear aggregation. Finally, the fused features are used to predict breast cancer subtypes. In particular, we use the principal component analysis to reduce the dimensionality of high-dimensional data of gene modality and filter the data of image modality. Besides, we also improve the traditional feature extraction network to make it show better performance. The results show that compared with the traditional DL model, the Hybrid DL model proposed in this paper is more accurate and efficient in predicting breast cancer subtypes. Our model achieved a prediction accuracy of 88.07% in 10 times of 10-fold cross-validation. We did a separate AUC test for each subtype, and the average AUC value obtained was 0.9427. In terms of subtype prediction accuracy, our model is about 7.45% higher than the previous average.
科研通智能强力驱动
Strongly Powered by AbleSci AI