计算机科学
人工智能
特征(语言学)
一致性(知识库)
利用
深度学习
机器学习
自然语言处理
多模态
特征提取
医学影像学
面子(社会学概念)
钥匙(锁)
特征学习
模态(人机交互)
数据集成
模式识别(心理学)
数据挖掘
人工神经网络
特征选择
作者
Die Hu,Xuelei He,Fengjun Zhao,Xiaowei He
标识
DOI:10.1109/bibm66473.2025.11356903
摘要
Hepatocellular carcinoma is a highly heterogeneous and complex malignant tumor, posing significant challenges for precise diagnosis and treatment. Existing methods face limitations in multimodal data integration and multi-phase feature modeling, making it difficult to fully exploit the complementary information from multi-phase CT images, structured clinical data, and medical texts. On one hand, most methods rely solely on single-modal data (e.g. CT images or clinical reports), failing to effectively utilize the complementary characteristics of multimodal data. On the other hand, traditional approaches typically concatenate multi-phase CT images as input to the model, ignoring the dynamic evolution features between different phases, leading to information loss and limited predictive performance. To address these issues, we propose a novel Tri-modal Phase-aware Contrastive Learning Framework (TPCL), which incorporates a Phase-aware Attention Fusion Network (PAAF-Net) and a Phase-Conditional Prompt Network (PCPN) to achieve deep alignment and integration of multimodal features. Additionally, we design a Multi-modal Contrastive Loss to further optimize the consistency of feature distributions across different modalities. Experimental results on multiple public and private datasets demonstrate that TPCL significantly outperforms existing methods, achieving up to an $18.54\%$ improvement in ACC and a $10.09\%$ improvement in AUC.
科研通智能强力驱动
Strongly Powered by AbleSci AI