医学诊断
工作流程
医学物理学
医学
计算机科学
医学影像学
阅读(过程)
水准点(测量)
磁共振成像
诊断准确性
放射性武器
放射科
人工智能
神经学
领域(数学分析)
神经影像学
范畴变量
机器学习
心理学
统一医学语言系统
自然语言生成
作者
Mingliang Wang,Rui-Peng Zhang,Wen-Juan Wu,Yu Lu,Xiao-Er Wei,Z. Y. Sun,Bao-Hui Guan,Jun-Jie Zhang,X. Wu,Lei Zhang,Tian-Le Wang,Yuehua Li
标识
DOI:10.1038/s41746-026-02380-4
摘要
Abstract Automatically deriving radiological diagnoses from brain MRI report findings is challenging due to high complexity and domain expertise. This study evaluated 10 large language models (LLMs) in generating diagnoses from brain MRI report findings, using 4293 reports (9973 diagnostic labels) covering 15 brain disease categories from three medical centers. DeepSeek-R1 achieved the highest performance among the evaluated models on the full dataset and across different clinical scenarios and subgroups, particularly when provided with structured report findings and clinical information. A top three differential-diagnosis prompting strategy achieved superior performance, with 97.6% patient-level accuracy versus 87.1% for single-diagnosis prompting. The diagnostic performance of six radiologists was assessed with and without DeepSeek-R1 assistance on 500 reports. Integration of DeepSeek-R1 significantly improved diagnostic accuracy (AUPRC: 0.774–0.893) and reduced reading time (from 61 to 53 s), with more pronounced benefits for junior radiologists. Our findings indicate that effective automated diagnostic impression generation in brain MRI reporting requires advanced large-scale LLMs like DeepSeek-R1. With optimized prompting and input strategies, this framework may serve as a supportive tool in drafting brain MRI reports and contribute to enhanced workflow efficiency in radiology practice.
科研通智能强力驱动
Strongly Powered by AbleSci AI