麦克内马尔试验
医学
乳房成像
考试(生物学)
双雷达
医学物理学
放射科
人工智能
乳腺癌
乳腺摄影术
计算机科学
统计
内科学
古生物学
数学
癌症
生物
作者
Jian Wang,Hongtian Tian,Xin Yang,Huaiyu Wu,Xiliang Zhu,Rusi Chen,Allan Chang,Ya‐Fang Chen,Haoran Dou,Ruobing Huang,Jun Cheng,Yongsong Zhou,Rui Gao,Keen Yang,Guoqiu Li,Jing Chen,Dong Ni,Fajin Dong,Jinfeng Xu,Ning Gu
出处
期刊:Radiology
[Radiological Society of North America]
日期:2025-06-18
摘要
“Just Accepted” papers have undergone full peer review and have been accepted for publication in Radiology: Artificial Intelligence. This article will undergo copyediting, layout, and proof review before it is published in its final version. Please note that during production of the final copyedited article, errors may be discovered which could affect the content. Purpose To develop and evaluate an artificial intelligence (AI) system for generating breast ultrasound (BUS) reports. Materials and Methods This retrospective study included 104,364 cases from three hospitals (January 2020–December 2022). The AI system was trained on 82,896 cases, validated on 10,385 cases, and tested on an internal set (10,383 cases) and two external sets (300 and 400 cases). Under blind review, three senior radiologists (> 10 years of experience) evaluated AI-generated reports and those written by one midlevel radiologist (7 years of experience), as well as reports from three junior radiologists (2–3 years of experience) with and without AI assistance. The primary outcomes included the acceptance rates of Breast Imaging Reporting and Data System (BI-RADS) categories and lesion characteristics. Statistical analysis included one-sided and two-sided McNemar tests for non-inferiority and significance testing. Results In external test set 1 (300 cases), the midlevel radiologist and AI system achieved BI-RADS acceptance rates of 95.00% [285/300] versus 92.33% [277/300] ( P < .001; non-inferiority test with a prespecified margin of 10%). In external test set 2 (400 cases), three junior radiologists had BI-RADS acceptance rates of 87.00% [348/400] versus 90.75% [363/400] ( P = .06), 86.50% [346/400] versus 92.00% [368/400] ( P = .007), and 84.75% [339/400] versus 90.25% [361/400] ( P = .02) with and without AI assistance, respectively. Conclusion The AI system performed comparably to a midlevel radiologist and aided junior radiologists in BI-RADS classification. ©RSNA, 2025
科研通智能强力驱动
Strongly Powered by AbleSci AI