Accuracy and Reproducibility of ChatGPT Responses to Breast Cancer Tumor Board Patients

一致性 乳腺癌 医学 再现性 癌症 肿瘤科 内科学 医学物理学 统计 数学
作者
Ning Liao,Cheukfai Li,William J. Gradishar,V. Suzanne Klimberg,Joshua Roshal,Tai-Ze Yuan,Sanjiv S. Agarwala,V Valero,Sandra M. Swain,Julie A. Margenthaler,Isabel T. Rubio,Sara A. Hurvitz,Charles E. Geyer,Nancy U. Lin,Hope S. Rugo,Guochun Zhang,N. Liu,Charles M. Balch
出处
期刊:JCO clinical cancer informatics [Lippincott Williams & Wilkins]
卷期号: (9)
标识
DOI:10.1200/cci-25-00001
摘要

PURPOSE We assessed the accuracy and reproducibility of Chat Generative Pre-Trained Transformer's (ChatGPT) recommendations in response to breast cancer patients by comparing generated outputs with consensus expert opinions. METHODS 362 consecutive breast cancer patients sourced from a weekly international breast cancer webinar series were submitted to a tumor board of renowned experts. The same 362 clinical patients were also prompted to ChatGPT-4.0 three separate times to examine reproducibility. RESULTS Only 46% of ChatGPT-generated content was entirely concordant with the recommendations of breast cancer experts, and only 39% of ChatGPT's responses demonstrated inter-response similarity. ChatGPT's responses demonstrated higher concordance with CEN experts in earlier stages of breast cancer (0, I, II, III) compared to advanced (IV) patients ( P = .019). There were less accurate responses from ChatGPT when responding to patients involving molecular markers and genetic testing ( P = .025), and in patients involving antibody drug conjugates ( P = .006). ChatGPT's responses were not necessarily incorrect but often omitted specific details about clinical management. When the same prompt was independently sent to CEN into the model on three occasions, each time by difference users, ChatGPT's responses exhibited variable content and formatting in 68% (246 out of 362) of patients and were entirely consistent with one another in only 32% of responses. CONCLUSION Since this promising clinical decision-making support tool is widely used currently by physicians worldwide, it is important for the user to understand its limitations as currently constructed when responding to multidisciplinary breast cancer patients, and for researchers in the field to continue improving its ability with contemporary, accurate and complete breast cancer information. As currently constructed, ChatGPT is not engineered to generate identical outputs to the same input and was less likely to correctly interpret and recommend treatments for complex breast cancer patients.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
忐忑的迎蓉完成签到,获得积分10
刚刚
天天快乐应助紧张的惜梦采纳,获得50
刚刚
北杨发布了新的文献求助10
刚刚
小羊完成签到,获得积分10
刚刚
过时的哑铃应助怡然鹭洋采纳,获得10
刚刚
谢生婷发布了新的文献求助10
刚刚
林早上完成签到,获得积分20
1秒前
桃李发布了新的文献求助10
1秒前
在水一方应助wddfz采纳,获得10
2秒前
3秒前
cici完成签到,获得积分10
4秒前
左丘绝山发布了新的文献求助10
4秒前
研小玥发布了新的文献求助10
4秒前
罗红豆发布了新的文献求助10
4秒前
脚踏实滴完成签到 ,获得积分10
5秒前
随性完成签到,获得积分10
5秒前
6秒前
北杨完成签到,获得积分10
7秒前
落微完成签到,获得积分10
7秒前
8秒前
aaaaa小柴应助HHHH采纳,获得10
9秒前
思源应助imemorizedpi采纳,获得10
9秒前
慕青应助谢生婷采纳,获得10
10秒前
10秒前
汉堡包应助Tinweng采纳,获得10
11秒前
白小白完成签到,获得积分10
11秒前
鑫光熠熠发布了新的文献求助10
11秒前
11秒前
11秒前
满意语风发布了新的文献求助10
12秒前
飘逸绿海完成签到 ,获得积分10
13秒前
15秒前
zhangyidian完成签到,获得积分10
15秒前
15秒前
小白发布了新的文献求助10
16秒前
16秒前
ren完成签到 ,获得积分10
16秒前
17秒前
完美世界应助kkk采纳,获得10
17秒前
18秒前
高分求助中
【重要!!请各位用户详细阅读此贴】科研通的精品贴汇总(请勿应助) 10000
Genomic signature of non-random mating in human complex traits 2000
Semantics for Latin: An Introduction 1155
Plutonium Handbook 1000
Three plays : drama 1000
Robot-supported joining of reinforcement textiles with one-sided sewing heads 640
北师大毕业论文 基于可调谐半导体激光吸收光谱技术泄漏气体检测系统的研究 530
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 冶金 细胞生物学 免疫学
热门帖子
关注 科研通微信公众号,转发送积分 4108399
求助须知:如何正确求助?哪些是违规求助? 3646573
关于积分的说明 11550892
捐赠科研通 3352494
什么是DOI,文献DOI怎么找? 1842097
邀请新用户注册赠送积分活动 908390
科研通“疑难数据库(出版商)”最低求助积分说明 825506