Generative pretrained transformer models can function as highly reliable second screeners of titles and abstracts in systematic reviews: A proof of concept and common guidelines.

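Generative grammar, Transformer, Computer science, Proof of concept, Machine learning, Artificial intelligence, Natural language processing, Engineering, Voltage, Electrical engineering, Operating system
Authors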
Mikkel Helding Vembye,Julian Christensen,Anja Bondebjerg Mølgaard,Frederikke Lykke Witthöft Schytt
Source
Journal: Psychological Methods [American Psychological Association]
Citations: 1
Identifier
DOI:10.1037/met0000769
Abstract

Independent human double screening of titles and abstracts is a critical step in ensuring the quality of systematic reviews and the meta-analyses within them. However, double screening is a resource-demanding procedure that slows the review process. To alleviate this issue, we evaluated the use of OpenAI's generative pretrained transformer (GPT) application programming interface (API) models as an alternative to human second screeners of titles and abstracts. We did so by developing a new benchmark scheme for interpreting the performance of automated screening tools against typical human screening performance in high-quality systematic reviews, and by conducting three large-scale experiments on three psychological systematic reviews with different levels of complexity. Across all experiments, we show that the GPT API models can perform on par with, and in some cases even better than, typical human screening performance in terms of detecting relevant studies, while also showing high exclusion performance. In addition, we introduce multiprompt screening, that is, writing one concise prompt per inclusion/exclusion criterion in a review, and show that it can be a valuable tool for supporting screening in highly complex review settings. To consolidate future implementation, we develop a reproducible workflow and a set of tentative guidelines for when and when not to use GPT API models as independent second screeners of titles and abstracts. Moreover, we present the R package AIscreenR to standardize the suggested application. Our aim is ultimately to make GPT API models acceptable as independent second screeners within high-quality systematic reviews, such as the ones published in Psychological Bulletin. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
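
The multiprompt idea and the two performance notions named in the abstract (detecting relevant studies, excluding irrelevant ones) can be illustrated with a minimal Python sketch. It is not the AIscreenR interface and not the authors' implementation. The function names, the prompt wording, the `ask_model` callable (a stand-in for any model call that returns 1 or 0), and the rule that a record is included only when every criterion is answered with 1 are all assumptions made for illustration.

```python
from typing import Callable, Dict, List, Tuple


def build_prompts(criteria: List[str], title: str, abstract: str) -> List[str]:
    """Build one concise prompt per inclusion/exclusion criterion.

    The prompt wording is hypothetical, not taken from the paper.
    """
    return [
        f"Criterion: {criterion}\n"
        f"Title: {title}\n"
        f"Abstract: {abstract}\n"
        "Does the study meet this criterion? Answer 1 (yes) or 0 (no)."
        for criterion in criteria
    ]


def screen_record(
    criteria: List[str],
    title: str,
    abstract: str,
    ask_model: Callable[[str], int],
) -> Dict[str, object]:
    """Screen one record by asking the model once per criterion.

    ask_model is a hypothetical callable wrapping whatever model is used.
    Assumed aggregation rule: include only if every answer equals 1.
    """
    answers = [ask_model(p) for p in build_prompts(criteria, title, abstract)]
    return {"answers": answers, "include": all(a == 1 for a in answers)}


def recall_and_specificity(
    human: List[bool], model: List[bool]
) -> Tuple[float, float]:
    """Compare model decisions with human decisions (True = include).

    Recall: share of human-included records the model also included.
    Specificity: share of human-excluded records the model also excluded.
    """
    pairs = list(zip(human, model))
    tp = sum(1 for h, m in pairs if h and m)
    fn = sum(1 for h, m in pairs if h and not m)
    tn = sum(1 for h, m in pairs if not h and not m)
    fp = sum(1 for h, m in pairs if not h and m)
    recall = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return recall, specificity
```

Passing the model call in as a function keeps the sketch independent of any particular API client. A stricter or looser aggregation rule could be swapped in at the `all(...)` line.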