Artificial Intelligence–Assisted Data Extraction With a Large Language Model: A Study Within Reviews

医学 自然语言处理 数据提取 人工智能 质量(理念) 医疗保健 数据收集 代理(哲学) 数据质量 梅德林 健康数据 数据科学 医疗保健质量 卫生服务研究 数据挖掘 情报检索 质量管理 系统回顾
作者
Gerald Gartlehner,Shannon Kugley,Karen Crotty,Meera Viswanathan,Andreea Dobrescu,Barbara Nußbaumer-Streit,Graham Booth,Jonathan Treadwell,JN Han,Jesse Wagner,Eric Apaydin,Erin L. Coppola,Margaret Maglione,Rainer Hilscher,Robert Chew,Meagan Pilar,Bryan Swanton,Leila C. Kahwati
出处
期刊:Annals of Internal Medicine [American College of Physicians]
卷期号:178 (12): 1763-1771 被引量:11
标识
DOI:10.7326/annals-25-00739
摘要

BACKGROUND: Data extraction is a critical but error-prone and labor-intensive task in evidence synthesis. Unlike other artificial intelligence (AI) technologies, large language models (LLMs) do not require labeled training data for data extraction. OBJECTIVE: To compare an AI-assisted versus a traditional, human-only data extraction process. DESIGN: Study within reviews (SWAR) using a prospective, parallel-group comparison with blinded data adjudicators. SETTING: Workflow validation within 6 ongoing systematic reviews of interventions under real-world conditions. INTERVENTION: Initial data extraction using an LLM (Claude, versions 2.1, 3.0 Opus, and 3.5 Sonnet) verified by a human reviewer. MEASUREMENTS: Concordance, time on task, accuracy, sensitivity, positive predictive value, and error analysis. RESULTS: The 6 systematic reviews in the SWAR yielded 9341 data elements from 63 studies. Concordance between the 2 methods was 77.2% (95% CI, 76.3% to 78.0%). Compared with the reference standard, the AI-assisted approach had an accuracy of 91.0% (CI, 90.4% to 91.6%) and the human-only approach an accuracy of 89.0% (CI, 88.3% to 89.6%). Sensitivities were 89.4% (CI, 88.6% to 90.1%) and 86.5% (CI, 85.7% to 87.3%), respectively, with positive predictive values of 99.2% (CI, 99.0% to 99.4%) and 98.9% (CI, 98.6% to 99.1%). Incorrect data were extracted in 9.0% (CI, 8.4% to 9.6%) of AI-assisted cases and 11.0% (CI, 10.4% to 11.7%) of human-only cases, with corresponding proportions of major errors of 2.5% (CI, 2.2% to 2.8%) versus 2.7% (CI, 2.4% to 3.1%). Missed data items were the most frequent error type in both approaches. The AI-assisted method reduced data extraction time by a median of 41 minutes per study. LIMITATIONS: Assessing concordance and classifying errors required subjective judgment. Consistently tracking time on task was challenging. CONCLUSION: Data extraction assisted by AI may offer a viable, more efficient alternative to human-only methods. PRIMARY FUNDING SOURCE: Agency for Healthcare Research and Quality and RTI International.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
12121发布了新的文献求助10
刚刚
小白发布了新的文献求助10
刚刚
clyhg完成签到,获得积分10
1秒前
只道寻常完成签到,获得积分10
1秒前
tga发布了新的文献求助10
2秒前
2秒前
tga发布了新的文献求助10
2秒前
ji发布了新的文献求助10
3秒前
4秒前
4秒前
tga发布了新的文献求助10
5秒前
sql发布了新的文献求助10
5秒前
tga发布了新的文献求助10
5秒前
tga发布了新的文献求助10
5秒前
tga发布了新的文献求助10
5秒前
酷波er应助机灵的成协采纳,获得10
5秒前
6秒前
maguodrgon发布了新的文献求助30
6秒前
帮我求你完成签到,获得积分10
7秒前
科研通AI6.4应助好晒采纳,获得10
8秒前
小菀儿发布了新的文献求助10
9秒前
zhaozihao完成签到,获得积分10
9秒前
斯文败类应助tom采纳,获得10
9秒前
科研通AI6.2应助小白采纳,获得10
10秒前
10秒前
molihuakai应助个性的饼干采纳,获得10
11秒前
Kao应助ji采纳,获得10
11秒前
Kao应助ji采纳,获得10
11秒前
cdercder应助ji采纳,获得10
12秒前
12秒前
无辜的翠安完成签到,获得积分10
14秒前
搜集达人应助lizhiqing采纳,获得10
14秒前
15秒前
16秒前
万能图书馆应助maguodrgon采纳,获得30
16秒前
宴究生完成签到,获得积分10
18秒前
橙汁发布了新的文献求助10
18秒前
王文艺完成签到,获得积分10
20秒前
科研通AI6.2应助迷路荧采纳,获得10
20秒前
追寻鞋垫发布了新的文献求助10
21秒前
高分求助中
Principles of Economics, 11th Edition 10000
Prescott's Microbiology: 2026 Release ISE 10000
University Physics with Modern Physics, 16th edition 10000
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Environmental Leverage in Times of Climate Crisis: Product Standards, Carbon Border Measures and Preferential Trade Agreements 1000
Interactions of Vowel Quality and Prosody in East Slavic 1000
Erwählung und Berufung bei Paulus: Bedeutung, Entwicklung und Funktion einer Vorstellung in ihrem frühjüdischen und griechisch-römischen Kontext 850
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 内科学 物理 复合材料 催化作用 细胞生物学 无机化学 光电子学 物理化学 电极 基因
热门帖子
关注 科研通微信公众号,转发送积分 7193855
求助须知:如何正确求助?哪些是违规求助? 8829784
关于积分的说明 18642555
捐赠科研通 6830283
什么是DOI,文献DOI怎么找? 3176146
关于科研通互助平台的介绍 2328568
邀请新用户注册赠送积分活动 2150622