How does ChatGPT-4 preform on non-English national medical licensing examination? An evaluation in Chinese language

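Concordance; Consistency (knowledge base); Context (archaeology); Computer science; Natural language processing; Reliability (semiconductor); Medical education; Psychology; Artificial intelligence; Linguistics; Medicine; Quantum mechanics; Biology; Physics; Internal medicine; Philosophy; Paleontology; Power (physics)
Authors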
Changchang Fang,Yuting Wu,Wanying Fu,Jitao Ling,Yue Wang,Xiaolin Liu,Yuan Jiang,Yifan Wu,Yixuan Chen,Jing Zhou,Zhiwei Zhu,Zhiwei Yan,Peng Yu,Xiao Liu
Source
Journal: PLOS Digital Health [Public Library of Science]
Volume (issue): 2 (12): e0000397-e0000397 Citations: 4
Identifier
DOI:10.1371/journal.pdig.0000397
Abstract

ChatGPT, an artificial intelligence (AI) system powered by large-scale language models, has garnered significant interest in healthcare. Its performance depends on the quality and quantity of training data available for a specific language, most of which is in English. Therefore, its effectiveness in processing the Chinese language, for which fewer data are available, warrants further investigation. This study aims to assess ChatGPT's ability in medical education and clinical decision-making within the Chinese context. We utilized a dataset from the Chinese National Medical Licensing Examination (NMLE) to assess ChatGPT-4's proficiency in medical knowledge in Chinese. Performance indicators, including score, accuracy, and concordance (confirmation of answers through explanation), were employed to evaluate ChatGPT's effectiveness in both original and encoded medical questions. Additionally, we translated the original Chinese questions into English to explore potential avenues for improvement. ChatGPT scored 442/600 for original questions in Chinese, surpassing the passing threshold of 360/600. However, ChatGPT demonstrated reduced accuracy in addressing open-ended questions, with an overall accuracy rate of 47.7%. Despite this, ChatGPT displayed commendable consistency, achieving a 75% concordance rate across all case analysis questions. Moreover, translating Chinese case analysis questions into English yielded only marginal improvements in ChatGPT's performance (p = 0.728). ChatGPT exhibits remarkable precision and reliability when handling the NMLE in Chinese. Translation of NMLE questions from Chinese to English does not yield an improvement in ChatGPT's performance.
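To put the reported score in proportion, the short sketch below converts the figures quoted in the abstract (442/600 against a 360/600 pass mark) into percentages. The variable and function names are illustrative only and do not come from the paper; it is plain arithmetic on the stated numbers, not a reproduction of the authors' scoring procedure.

```python
# Figures quoted in the abstract.
SCORE = 442       # ChatGPT-4's score on the original Chinese questions
MAX_SCORE = 600   # total marks available
PASS_MARK = 360   # passing threshold


def percent(numerator: int, denominator: int) -> float:
    """Return numerator/denominator as a percentage, rounded to one decimal."""
    return round(100 * numerator / denominator, 1)


score_pct = percent(SCORE, MAX_SCORE)     # share of total marks obtained
pass_pct = percent(PASS_MARK, MAX_SCORE)  # pass mark as a share of total marks
margin = SCORE - PASS_MARK                # marks above the threshold
passed = SCORE >= PASS_MARK

print(f"score: {score_pct}%, pass mark: {pass_pct}%, margin: {margin}, passed: {passed}")
```

On these numbers the score is roughly 74% of the available marks against a 60% pass mark, a margin of 82 marks.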