Large-Scale Assessment of ChatGPT's Performance in Benign and Malignant Bone Tumors Imaging Report Diagnosis and Its Potential for Clinical Applications

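Medicine; Gold standard (test); Radiology; Diagnostic accuracy; Medical imaging; Medical physics
Authors
Fan Yang, Dong Yan, Zhixiang Wang
Source
Journal: Journal of Bone Oncology [Elsevier BV]
Volume/issue: 44: 100525-100525 Cited by: 3
Identifier
DOI:10.1016/j.jbo.2024.100525
Abstract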

This study was designed to examine the complexities involved in diagnosing benign and malignant bone tumors and to assess the potential of AI technologies such as ChatGPT to improve diagnostic accuracy and efficiency. The study also explores few-shot learning as a method to optimize ChatGPT's performance in specialized medical domains such as the diagnosis of benign and malignant bone tumors. A total of 1366 imaging reports related to benign and malignant bone tumors were collected and diagnosed by 25 experienced physicians. The gold standard of diagnosis was established by combining clinical, imaging, and pathological principles. These reports were then input into the ChatGPT model, which underwent a few-shot learning method, to generate diagnostic results. The diagnostic results of the physicians and the AI model were compared to evaluate the performance of ChatGPT. An experiment was conducted to assess the influence of different radiologists' reporting styles on the model's diagnostic performance. Furthermore, an in-depth analysis of misdiagnosed cases was carried out, categorizing diagnostic errors and exploring possible causes. The diagnostic results generated by ChatGPT showed an accuracy of 0.73, a sensitivity of 0.95, and a specificity of 0.58. After few-shot learning, ChatGPT demonstrated significant improvement, achieving an accuracy of 0.87, a sensitivity of 0.99, and a specificity of 0.73, bringing it much closer to the level of physician diagnostics. In the experiment analyzing the influence of the radiologist's reporting style, the model demonstrated higher sensitivity when interpreting reports written by high-level radiologists. ChatGPT misdiagnosed 56 benign cases as malignant. Among these, 35 benign lesions (fibrous dysplasia and osteofibrous dysplasia) were incorrectly identified as metastatic tumors or osteosarcomas; 8 cases of myositis ossificans were wrongly diagnosed as extraosseous osteosarcoma; 7 cases of giant cell tumor of bone at the end of a long bone were misdiagnosed as osteosarcoma by intermediate doctors; and chondroblastoma was misdiagnosed as a malignant tumor in 6 cases (2 as osteosarcoma and 4 as chondrosarcoma). In this study, 23 osteosarcoma cases were misdiagnosed by ChatGPT as osteomyelitis, chondrosarcoma was misdiagnosed as fibrous dysplasia or aneurysmal bone cyst in 8 cases, and 4 cases of spinal chordoma were misdiagnosed as spinal tuberculosis. Our findings highlight the potential of ChatGPT in the diagnosis of benign and malignant bone tumors, offering advantages such as enhanced efficiency and a reduction in missed diagnoses. However, the findings also underscore the need for collaboration between physicians and ChatGPT in practical settings. By examining AI's capacity in the diagnosis of benign and malignant bone tumors, this study lays the groundwork for future AI advancements in medicine. It also demonstrates the benefits of few-shot learning in fine-tuning ChatGPT applications for specialized fields.
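The accuracy, sensitivity, and specificity figures quoted in the abstract are standard confusion-matrix ratios. As a minimal sketch of how such figures are typically computed, the Python below treats malignant as the positive class (1) and benign as the negative class (0). The function name `diagnostic_metrics` and the eight toy labels are assumptions made purely for illustration; they are not the study's data or code.

```python
import math
from typing import Dict, Sequence


def diagnostic_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Return accuracy, sensitivity, and specificity for binary labels.

    Convention assumed here: 1 = malignant (positive), 0 = benign (negative).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one case is required")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # malignant, called malignant
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # benign, called benign
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # benign, called malignant
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # malignant, called benign

    def ratio(numerator: int, denominator: int) -> float:
        # Undefined when a class is absent; report NaN instead of raising.
        return numerator / denominator if denominator else math.nan

    return {
        "accuracy": ratio(tp + tn, len(pairs)),
        "sensitivity": ratio(tp, tp + fn),  # share of malignant cases detected
        "specificity": ratio(tn, tn + fp),  # share of benign cases cleared
    }


if __name__ == "__main__":
    # Hypothetical toy labels, NOT taken from the study.
    gold = [1, 1, 1, 0, 0, 0, 0, 0]
    model = [1, 1, 1, 1, 1, 0, 0, 0]
    print(diagnostic_metrics(gold, model))
```

In the toy input every malignant case is caught while two benign cases are called malignant, the same direction of imbalance the abstract reports (high sensitivity, lower specificity).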