ChatGPT in medical school: how successful is AI in progress testing?

考试(生物学) 多项选择 相关性 计算机科学 统计 医学 数学 显著性差异 几何学 生物 古生物学
作者
Hendrik Friederichs,Wolf Jonas Friederichs,Maren März
出处
期刊:Medical Education Online [Informa]
卷期号:28 (1) 被引量:17
标识
DOI:10.1080/10872981.2023.2220920
摘要

Background As generative artificial intelligence (AI), ChatGPT provides easy access to a wide range of information, including factual knowledge in the field of medicine. Given that knowledge acquisition is a basic determinant of physicians’ performance, teaching and testing different levels of medical knowledge is a central task of medical schools. To measure the factual knowledge level of the ChatGPT responses, we compared the performance of ChatGPT with that of medical students in a progress test.Methods A total of 400 multiple-choice questions (MCQs) from the progress test in German-speaking countries were entered into ChatGPT’s user interface to obtain the percentage of correctly answered questions. We calculated the correlations of the correctness of ChatGPT responses with behavior in terms of response time, word count, and difficulty of a progress test question.Results Of the 395 responses evaluated, 65.5% of the progress test questions answered by ChatGPT were correct. On average, ChatGPT required 22.8 s (SD 17.5) for a complete response, containing 36.2 (SD 28.1) words. There was no correlation between the time used and word count with the accuracy of the ChatGPT response (correlation coefficient for time rho = −0.08, 95% CI [−0.18, 0.02], t(393) = −1.55, p = 0.121; for word count rho = −0.03, 95% CI [−0.13, 0.07], t(393) = −0.54, p = 0.592). There was a significant correlation between the difficulty index of the MCQs and the accuracy of the ChatGPT response (correlation coefficient for difficulty: rho = 0.16, 95% CI [0.06, 0.25], t(393) = 3.19, p = 0.002).Conclusion ChatGPT was able to correctly answer two-thirds of all MCQs at the German state licensing exam level in Progress Test Medicine and outperformed almost all medical students in years 1–3. The ChatGPT answers can be compared with the performance of medical students in the second half of their studies.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
Gy完成签到,获得积分10
1秒前
1秒前
城南饭饭发布了新的文献求助30
1秒前
Gy发布了新的文献求助30
4秒前
爱鱼人士应助熙熙采纳,获得10
5秒前
Akim应助温水云采纳,获得10
6秒前
6秒前
221发布了新的文献求助10
10秒前
10秒前
拉格朗日发布了新的文献求助10
13秒前
14秒前
Hao应助小福星采纳,获得10
14秒前
14秒前
舒适思烟发布了新的文献求助10
15秒前
16秒前
尹冰之完成签到,获得积分10
16秒前
18秒前
18秒前
温水云发布了新的文献求助10
19秒前
李健应助无助采纳,获得10
20秒前
yoyo发布了新的文献求助10
21秒前
香蕉觅云应助隐形芹采纳,获得10
21秒前
小布发布了新的文献求助10
21秒前
爱鱼人士应助JcZuk采纳,获得10
22秒前
22秒前
susu完成签到,获得积分10
23秒前
jinghai发布了新的文献求助10
23秒前
城南饭饭完成签到,获得积分10
23秒前
无奈梦岚发布了新的文献求助10
23秒前
Owen应助柔之采纳,获得10
27秒前
28秒前
情怀应助舒适思烟采纳,获得10
29秒前
隐形曼青应助无奈梦岚采纳,获得50
30秒前
31秒前
31秒前
33秒前
无助完成签到,获得积分20
35秒前
Mipaa发布了新的文献求助10
36秒前
36秒前
高分求助中
【本贴是提醒信息,请勿应助】请在求助之前详细阅读求助说明!!!! 20000
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
The Three Stars Each: The Astrolabes and Related Texts 900
Yuwu Song, Biographical Dictionary of the People's Republic of China 800
Multifunctional Agriculture, A New Paradigm for European Agriculture and Rural Development 600
Challenges, Strategies, and Resiliency in Disaster and Risk Management 500
Bernd Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2482857
求助须知:如何正确求助?哪些是违规求助? 2145091
关于积分的说明 5472237
捐赠科研通 1867418
什么是DOI,文献DOI怎么找? 928239
版权声明 563073
科研通“疑难数据库(出版商)”最低求助积分说明 496633