计算机科学
人工智能
基础(证据)
数字化病理学
特征(语言学)
深度学习
生成语法
泰坦(火箭家族)
临床科学
病理
机器学习
领域(数学)
临床影像学
特征提取
编码
分子病理学
医学物理学
多模态
医学影像学
模式识别(心理学)
特征学习
疾病
计算模型
解剖病理学
医学诊断
词典学习
数据科学
临床诊断
作者
Tong Ding,Sophia J. Wagner,Andrew H. Song,Richard J. Chen,Ming Y. Lu,Andrew Zhang,Anurag Vaidya,Guillaume Jaume,Muhammad Shaban,Ahrong Kim,Drew F. K. Williamson,Harry Robertson,Bowen Chen,Cristina Almagro-Pérez,Paul Doucet,Sharifa Sahai,Chengkuan Chen,Christina S. Chen,Daisuke Komura,Akihiro Kawabe
出处
期刊:Nature Medicine
[Nature Portfolio]
日期:2025-11-01
卷期号:31 (11): 3749-3761
被引量:48
标识
DOI:10.1038/s41591-025-03982-3
摘要
The field of computational pathology has been transformed with recent advances in foundation models that encode histopathology region-of-interests (ROIs) into versatile and transferable feature representations via self-supervised learning. However, translating these advancements to address complex clinical challenges at the patient and slide level remains constrained by limited clinical data in disease-specific cohorts, especially for rare clinical conditions. We propose Transformer-based pathology Image and Text Alignment Network (TITAN), a multimodal whole-slide foundation model pretrained using 335,645 whole-slide images via visual self-supervised learning and vision-language alignment with corresponding pathology reports and 423,122 synthetic captions generated from a multimodal generative AI copilot for pathology. Without any fine-tuning or requiring clinical labels, TITAN can extract general-purpose slide representations and generate pathology reports that generalize to resource-limited clinical scenarios such as rare disease retrieval and cancer prognosis. We evaluate TITAN on diverse clinical tasks and find that it outperforms both ROI and slide foundation models across machine learning settings, including linear probing, few-shot and zero-shot classification, rare cancer retrieval, cross-modal retrieval and pathology report generation.
科研通智能强力驱动
Strongly Powered by AbleSci AI