Performance of artificial intelligence on Turkish dental specialization exam: can ChatGPT-4.0 and gemini advanced achieve comparable results to humans?

土耳其 医学 考试(生物学) 医学物理学 牙科教育 医学教育 人工智能 计算机科学 哲学 语言学 古生物学 生物
作者
Soner Şişmanoğlu,Belen Şirinoğlu Çapan
出处
期刊:BMC Medical Education [BioMed Central]
卷期号:25 (1) 被引量:1
标识
DOI:10.1186/s12909-024-06389-9
摘要

AI-powered chatbots have spread to various fields including dental education and clinical assistance to treatment planning. The aim of this study is to assess and compare leading AI-powered chatbot performances in dental specialization exam (DUS) administered in Turkey and compare it with the best performer of that year. DUS questions for 2020 and 2021 were directed to ChatGPT-4.0 and Gemini Advanced individually. DUS questions were manually entered into AI-powered chatbot in their original form, in Turkish. The results obtained were compared with each other and the year's best performers. Candidates who score at least 45 points on this centralized exam are deemed to have passed and are eligible to select their preferred department and institution. The data was statistically analyzed using Pearson's chi-squared test (p < 0.05). ChatGPT-4.0 received 83.3% correct response rate on the 2020 exam, while Gemini Advanced received 65% correct response rate. On the 2021 exam, ChatGPT-4.0 received 80.5% correct response rate, whereas Gemini Advanced received 60.2% correct response rate. ChatGPT-4.0 outperformed Gemini Advanced in both exams (p < 0.05). AI-powered chatbots performed worse in overall score (for 2020: ChatGPT-4.0, 65,5 and Gemini Advanced, 50.1; for 2021: ChatGPT-4.0, 65,6 and Gemini Advanced, 48.6) when compared to overall scores of the best performer of that year (68.5 points for year 2020 and 72.3 points for year 2021). This poor performance also includes the basic sciences and clinical sciences sections (p < 0.001). Additionally, periodontology was the clinical specialty in which both AI-powered chatbots achieved the best results, the lowest performance was determined in the endodontics and orthodontics. AI-powered chatbots, namely ChatGPT-4.0 and Gemini Advanced, passed the DUS by exceeding the threshold score of 45. However, they still lagged behind the top performers of that year, particularly in basic sciences, clinical sciences, and overall score. Additionally, they exhibited lower performance in some clinical specialties such as endodontics and orthodontics.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研通AI5应助科研通管家采纳,获得30
刚刚
852应助科研通管家采纳,获得10
刚刚
刚刚
SYLH应助科研通管家采纳,获得15
刚刚
科研通AI5应助科研通管家采纳,获得10
刚刚
科目三应助科研通管家采纳,获得30
刚刚
华仔应助科研通管家采纳,获得10
1秒前
脑洞疼应助科研通管家采纳,获得10
1秒前
科研通AI5应助科研通管家采纳,获得10
1秒前
Hello应助科研通管家采纳,获得10
1秒前
SYLH应助科研通管家采纳,获得10
1秒前
1秒前
1秒前
3秒前
itachi完成签到,获得积分10
3秒前
西瓜霜发布了新的文献求助10
3秒前
慕青应助MR_Z采纳,获得10
3秒前
yvonne发布了新的文献求助20
3秒前
李健应助合适的落落采纳,获得10
4秒前
黄建林发布了新的文献求助10
4秒前
Helium完成签到,获得积分10
5秒前
5秒前
科研通AI5应助害羞聋五采纳,获得10
6秒前
土豆子发布了新的文献求助10
6秒前
打打应助Raizel采纳,获得10
6秒前
sss发布了新的文献求助10
6秒前
chahun发布了新的文献求助10
6秒前
甜酒汤圆发布了新的文献求助10
8秒前
QxQ完成签到,获得积分20
9秒前
小马哥完成签到,获得积分10
9秒前
安年发布了新的文献求助10
10秒前
10秒前
12秒前
12秒前
研友_p完成签到,获得积分10
13秒前
13秒前
慕青应助Rishel_Li采纳,获得10
13秒前
传奇3应助一夜暴富采纳,获得10
14秒前
14秒前
honda完成签到,获得积分10
15秒前
高分求助中
Algorithmic Mathematics in Machine Learning 500
Advances in Underwater Acoustics, Structural Acoustics, and Computational Methodologies 400
Getting Published in SSCI Journals: 200+ Questions and Answers for Absolute Beginners 300
Fatigue of Materials and Structures 260
The Monocyte-to-HDL ratio (MHR) as a prognostic and diagnostic biomarker in Acute Ischemic Stroke: A systematic review with meta-analysis (P9-14.010) 240
The Burge and Minnechaduza Clarendonian mammalian faunas of north-central Nebraska 206
An Integrated Solution for Application of Next-Generation Sequencing in Newborn Screening 200
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3831948
求助须知:如何正确求助?哪些是违规求助? 3374282
关于积分的说明 10484141
捐赠科研通 3094156
什么是DOI,文献DOI怎么找? 1703342
邀请新用户注册赠送积分活动 819390
科研通“疑难数据库(出版商)”最低求助积分说明 771472