Use of artificial intelligence chatbots in clinical management of immune-related adverse events

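Completeness (order theory), Likert scale, Adverse effect, Medicine, Artificial intelligence, Rating scale, Natural language processing, Computer science, Machine learning, Statistics, Internal medicine, Mathematics, Mathematical analysis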
Authors
Hannah Burnette, Aliyah Pabani, Mitchell S. von Itzstein, Benjamin Switzer, Run Fan, Fei Ye, Igor Puzanov, Jarushka Naidoo, Paolo A. Ascierto, David E. Gerber, Marc S. Ernstoff, Douglas B. Johnson
Source
Journal: Journal for ImmunoTherapy of Cancer [BMJ]
Volume (issue): 12 (5): e008599-e008599 Cited by: 10
Identifier
DOI: 10.1136/jitc-2023-008599
Abstract

Background Artificial intelligence (AI) chatbots have become a major source of general and medical information, though their accuracy and completeness are still being assessed. Their utility in answering questions about immune-related adverse events (irAEs), which are common and potentially dangerous toxicities of cancer immunotherapy, is not well defined.

Methods We developed 50 distinct questions, with answers available in guidelines, covering 10 irAE categories and queried two AI chatbots (ChatGPT and Bard), along with an additional 20 patient-specific scenarios. Experts in irAE management scored answers for accuracy and completeness using a Likert scale ranging from 1 (least accurate/complete) to 4 (most accurate/complete). Answers were compared across categories and across engines.

Results Overall, both engines scored highly for accuracy (mean scores for ChatGPT and Bard were 3.87 vs 3.5, p<0.01) and completeness (3.83 vs 3.46, p<0.01). Scores of 1–2 (completely or mostly inaccurate or incomplete) were particularly rare for ChatGPT (6/800 answer-ratings, 0.75%). Of the 50 questions, all eight physician raters gave ChatGPT a rating of 4 (fully accurate or complete) for 22 questions (for accuracy) and 16 questions (for completeness). In the 20 patient scenarios, the average accuracy score was 3.725 (median 4) and the average completeness score was 3.61 (median 4).

Conclusions AI chatbots provided largely accurate and complete information regarding irAEs, and wildly inaccurate information (“hallucinations”) was uncommon. However, until accuracy and completeness increase further, appropriate guidelines remain the gold standard to follow.
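The counts reported in the Results can be cross-checked with a few lines of arithmetic. The sketch below assumes that the 800 answer-ratings are the product of 50 questions, eight raters, and two rated dimensions (accuracy and completeness); the abstract reports the total but does not state this decomposition, and all variable names are illustrative.

```python
# Figures taken from the abstract.
QUESTIONS = 50          # guideline-based questions
RATERS = 8              # physician raters
LOW_RATINGS = 6         # ChatGPT ratings of 1-2
UNANIMOUS_ACCURACY = 22      # questions rated 4 by all raters (accuracy)
UNANIMOUS_COMPLETENESS = 16  # questions rated 4 by all raters (completeness)

# Assumption (not stated in the abstract): accuracy and completeness
# ratings are pooled, giving two ratings per rater per question.
DIMENSIONS = 2

total_ratings = QUESTIONS * RATERS * DIMENSIONS
low_share_pct = 100 * LOW_RATINGS / total_ratings
unanimous_accuracy_pct = 100 * UNANIMOUS_ACCURACY / QUESTIONS
unanimous_completeness_pct = 100 * UNANIMOUS_COMPLETENESS / QUESTIONS

print(total_ratings)                          # 800
print(f"{low_share_pct:.2f}%")                # 0.75%
print(f"{unanimous_accuracy_pct:.0f}%")       # 44%
print(f"{unanimous_completeness_pct:.0f}%")   # 32%
```

Under that assumption the total matches the reported 800 and the low-score share matches the reported 0.75%. The last two lines restate the unanimous-rating counts as shares of the 50 questions, which the abstract gives only as raw counts.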