基础(证据)
疾病
医学
计算机科学
病理
历史
考古
作者
Guangqian Yang,Kangrui Du,Zhihan Yang,Ye Du,Eva YW Cheung,Yongping Zheng,Mo Yang,Zoe Kourtzi,Carola‐Bibiane Schönlieb,Shujun Wang
标识
DOI:10.1109/jbhi.2025.3576436
摘要
Alzheimer's disease (AD) is an incurable neurodegenerative disorder characterized by progressive cognitive and functional decline. Consequently, early diagnosis and accurate prediction of disease progression are of paramount importance and inherently complex, necessitating the integration of multi-modal data. While existing methods are typically task-specific and lack generalization, we present ADFound, the first multi-modal foundation model for AD capable of simultaneously addressing diagnosis and prognosis tasks through a unified framework. ADFound leverages a substantial amount of unlabeled 3D multi-modal neuroimaging, including paired and unpaired data, to achieve its objectives. Specifically, ADFound is developed upon the Multi-modal Vim encoder by Vision Mamba block to capture long-range dependencies inherent in 3D multi-modal medical images. To efficiently pre-train ADFound on unlabeled paired and upaired multi-modal neuroimaging data, we proposed a novel self-supervised learning framework that integrates multi-modal masked autoencoder (MAE) and contrastive learning. The multi-modal MAE aims to learn local relations among modalities by reconstructing images with unmasked image patches. Additionally, we introduce a Dual Contrastive Learning for Multi-modal Data to enhance the discriminative capabilities of multi-modal representations from intra-modal and inter-modal perspectives. Our experiments demonstrate that ADFound outperforms state-of-the-art methods across a wide range of downstream tasks relevant to the diagnosis and prognosis of AD. Furthermore, the results indicate that our foundation model can be extended to more modalities, such as non-image data, showing its versatility.
科研通智能强力驱动
Strongly Powered by AbleSci AI