亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Large Language Models for Diagnosing Focal Liver Lesions From CT/MRI Reports: A Comparative Study With Radiologists

医学 放射科
作者
Liuji Sheng,Yidi Chen,Hong Wei,Feng Che,Yingyi Wu,Qin Qin,Chongtu Yang,Yanshu Wang,Jingwen Peng,Mustafa R. Bashir,Maxime Ronot,Bin Song,Hanyu Jiang
出处
期刊:Liver International [Wiley]
卷期号:45 (6)
标识
DOI:10.1111/liv.70115
摘要

ABSTRACT Background & Aims Whether large language models (LLMs) could be integrated into the diagnostic workflow of focal liver lesions (FLLs) remains unclear. We aimed to investigate two generic LLMs (ChatGPT‐4o and Gemini) regarding their diagnostic accuracies referring to the CT/MRI reports, compared to and combined with radiologists of different experience levels. Methods From April 2022 to April 2024, this single‐center retrospective study included consecutive adult patients who underwent contrast‐enhanced CT/MRI for single FLL and subsequent histopathologic examination. The LLMs were prompted by clinical information and the “findings” section of radiology reports three times to provide differential diagnoses in the descending order of likelihood, with the first considered the final diagnosis. In the research setting, six radiologists (three junior and three middle‐level) independently reviewed the CT/MRI images and clinical information in two rounds (first alone, then with LLM assistance). In the clinical setting, diagnoses were retrieved from the “impressions” section of radiology reports. Diagnostic accuracy was investigated against histopathology. Results 228 patients (median age, 59 years; 155 males) with 228 FLLs (median size, 3.6 cm) were included. Regarding the final diagnosis, the accuracy of two‐step ChatGPT‐4o (78.9%) was higher than single‐step ChatGPT‐4o (68.0%, p < 0.001) and single‐step Gemini (73.2%, p = 0.004), similar to real‐world radiology reports (80.0%, p = 0.34) and junior radiologists (78.9%–82.0%; p ‐values, 0.21 to > 0.99), but lower than middle‐level radiologists (84.6%–85.5%; p ‐values, 0.001 to 0.02). No incremental diagnostic value of ChatGPT‐4o was observed for any radiologist ( p ‐values, 0.63 to > 0.99). Conclusion Two‐step ChatGPT‐4o showed matching accuracies to real‐world radiology reports and junior radiologists for diagnosing FLLs but was less accurate than middle‐level radiologists and demonstrated little incremental diagnostic value.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
haojiaolv完成签到,获得积分10
12秒前
snah完成签到 ,获得积分10
14秒前
不去明知山完成签到 ,获得积分10
15秒前
18秒前
末世完成签到,获得积分10
26秒前
rpe完成签到,获得积分10
33秒前
37秒前
lvsehx发布了新的文献求助10
46秒前
55秒前
夹心吉吉完成签到 ,获得积分10
1分钟前
所所应助RASH采纳,获得10
1分钟前
1分钟前
MchemG应助更深的蓝采纳,获得10
1分钟前
012发布了新的文献求助10
1分钟前
1分钟前
RASH完成签到,获得积分20
1分钟前
chenjzhuc应助科研通管家采纳,获得10
1分钟前
shimhjy应助科研通管家采纳,获得20
1分钟前
Hello应助科研通管家采纳,获得10
1分钟前
RASH发布了新的文献求助10
1分钟前
1分钟前
012完成签到 ,获得积分20
1分钟前
小兔叽发布了新的文献求助10
1分钟前
1分钟前
lcm完成签到,获得积分10
1分钟前
江小白完成签到,获得积分0
1分钟前
onion完成签到,获得积分10
1分钟前
RRRZZ完成签到 ,获得积分10
2分钟前
Moislad完成签到,获得积分20
2分钟前
隐形曼青应助李宁采纳,获得10
2分钟前
2分钟前
平淡如天完成签到,获得积分10
2分钟前
2分钟前
2分钟前
李宁发布了新的文献求助10
2分钟前
2分钟前
2分钟前
李宁完成签到,获得积分10
2分钟前
自由灰狼发布了新的文献求助10
2分钟前
大胆楷瑞发布了新的文献求助10
2分钟前
高分求助中
Encyclopedia of Mathematical Physics 2nd edition 888
Chinesen in Europa – Europäer in China: Journalisten, Spione, Studenten 500
Arthur Ewert: A Life for the Comintern 500
China's Relations With Japan 1945-83: The Role of Liao Chengzhi // Kurt Werner Radtke 500
Two Years in Peking 1965-1966: Book 1: Living and Teaching in Mao's China // Reginald Hunt 500
材料概论 周达飞 ppt 500
Nonrandom distribution of the endogenous retroviral regulatory elements HERV-K LTR on human chromosome 22 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3807998
求助须知:如何正确求助?哪些是违规求助? 3352680
关于积分的说明 10359930
捐赠科研通 3068677
什么是DOI,文献DOI怎么找? 1685232
邀请新用户注册赠送积分活动 810332
科研通“疑难数据库(出版商)”最低求助积分说明 766022