发布文献求助

清晨好，您是今天最早来到科研通的研友！由于当前在线用户较少，发布求助请尽量完整地填写文献信息，科研通机器人24小时在线，伴您科研之路漫漫前行！

Comparison of Large Language Models in Answering Immuno-Oncology Questions: A Cross-Sectional Study

可读性医学肿瘤科心理学医学教育内科学家庭医学计算机科学程序设计语言

作者

Giovanni Maria Iannantuono,Dara Bracken-Clarke,Fatima Karzai,Hyoyoung Choo‐Wosoba,James L. Gulley,Charalampos S. Floudas

出处

期刊：Oncologist [AlphaMed Press]
日期：2024-02-03 卷期号：29 (5): 407-414 被引量：14

链接

oup.com oup.com nih.gov nih.gov nih.govdoi.org

标识

DOI：10.1093/oncolo/oyae009

摘要

Abstract Background The capability of large language models (LLMs) to understand and generate human-readable text has prompted the investigation of their potential as educational and management tools for patients with cancer and healthcare providers. Materials and Methods We conducted a cross-sectional study aimed at evaluating the ability of ChatGPT-4, ChatGPT-3.5, and Google Bard to answer questions related to 4 domains of immuno-oncology (Mechanisms, Indications, Toxicities, and Prognosis). We generated 60 open-ended questions (15 for each section). Questions were manually submitted to LLMs, and responses were collected on June 30, 2023. Two reviewers evaluated the answers independently. Results ChatGPT-4 and ChatGPT-3.5 answered all questions, whereas Google Bard answered only 53.3% (P < .0001). The number of questions with reproducible answers was higher for ChatGPT-4 (95%) and ChatGPT3.5 (88.3%) than for Google Bard (50%) (P < .0001). In terms of accuracy, the number of answers deemed fully correct were 75.4%, 58.5%, and 43.8% for ChatGPT-4, ChatGPT-3.5, and Google Bard, respectively (P = .03). Furthermore, the number of responses deemed highly relevant was 71.9%, 77.4%, and 43.8% for ChatGPT-4, ChatGPT-3.5, and Google Bard, respectively (P = .04). Regarding readability, the number of highly readable was higher for ChatGPT-4 and ChatGPT-3.5 (98.1%) and (100%) compared to Google Bard (87.5%) (P = .02). Conclusion ChatGPT-4 and ChatGPT-3.5 are potentially powerful tools in immuno-oncology, whereas Google Bard demonstrated relatively poorer performance. However, the risk of inaccuracy or incompleteness in the responses was evident in all 3 LLMs, highlighting the importance of expert-driven verification of the outputs returned by these technologies.

求助该文献

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

⚡ 2026年影响因子、分区 已更新！ (2026-6-17)

更新

📰 新增『新锐期刊分区』 (2026-3-24)

更新

💬 新增更精细的自定义提醒设置 (2026-1-4)

新增

🕒 每天60秒读懂世界·精选全球要闻 (2026-1-2)

新增

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 心想柿橙完成签到，获得积分10

13秒前; 庄海棠完成签到，获得积分10

14秒前; 科研通AI2S上传了应助文件

29秒前; 简奥斯汀完成签到，获得积分10

32秒前; 卡卡完成签到，获得积分10

34秒前; sevenhill完成签到，获得积分0

37秒前; kkdg完成签到，获得积分10

39秒前; 千帆完成签到，获得积分10

43秒前; KKDG完成签到，获得积分10

48秒前; kaka完成签到，获得积分10

52秒前; 笨笨完成签到，获得积分10

1分钟前; 科研通AI2S上传了应助文件

1分钟前; 晴空万里完成签到，获得积分10

1分钟前; Lucas上传了应助文件

1分钟前; tyh发布了新的文献求助10

1分钟前; boymin2015完成签到，获得积分10

1分钟前; 汉堡包的应助被tyh采纳，获得10

2分钟前; tyh完成签到，获得积分20

2分钟前; 思源上传了应助文件

2分钟前; Copyright上传了应助文件

2分钟前; 香蕉不言发布了新的文献求助10

2分钟前; Hanoi347完成签到，获得积分0

2分钟前; 香蕉不言完成签到，获得积分10

2分钟前; Bo完成签到，获得积分10

2分钟前; 迷茫的一代完成签到，获得积分10

2分钟前; 吃的饱饱呀完成签到，获得积分10

2分钟前; Bo发布了新的文献求助10

3分钟前; obaica完成签到，获得积分10

3分钟前; feifei完成签到，获得积分10

3分钟前; 和谐的夏岚完成签到，获得积分10

4分钟前; 梅川库子完成签到，获得积分10

4分钟前; 小白完成签到，获得积分0

4分钟前; 爆米花上传了应助文件

4分钟前; 曾经的朝雪完成签到，获得积分10

4分钟前; 大方绿蕊发布了新的文献求助10

4分钟前; Language完成签到，获得积分10

4分钟前; 姚芭蕉完成签到，获得积分0

4分钟前; 欧耶完成签到，获得积分10

4分钟前; 四氧化三铁完成签到，获得积分10

5分钟前; 可爱的函函上传了应助文件

5分钟前

高分求助中: Principles of Economics, 11th Edition 10000; University Physics with Modern Physics, 16th edition 10000; (应助此贴封号)【重要！！请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000; Matrix Methods in Data Mining and Pattern Recognition 510; Social Skills Improvement System-Rating Scales--Chinese Version 500; Dynamische Polarisation von H-1 und B-11 in (CH-3)-3NBH-3 500; CLSI M07 2024 500

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 7247783; 求助须知：如何正确求助？哪些是违规求助？ 8870711; 关于积分的说明 18712314; 捐赠科研通 6926252; 什么是DOI，文献DOI怎么找？ 3197998; 关于科研通互助平台的介绍 2373776; 邀请新用户注册赠送积分活动 2172899

今日热心研友

潇洒的惋清

紫色水晶之恋

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2026 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：821889395【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通