Syndromic Analysis of Sepsis Cohorts Using Large Language Models

作者
Theodore R. Pak,Sanjat Kanjilal,Caroline McKenna,Alexander Hoffner-Heinike,Chanu Rhee,Michael Klompas
出处
期刊:JAMA network open [American Medical Association]
卷期号:8 (10): e2539267-e2539267
标识
DOI:10.1001/jamanetworkopen.2025.39267
摘要

Importance Presenting signs and symptoms affect the care of patients with possible sepsis. However, signs and symptoms are not incorporated into most large observational studies because they are difficult to extract from clinical notes at scale. Objective To assess the use of large language models (LLMs) to extract presenting signs and symptoms from admission notes and characterize their associations with infectious diagnoses, multidrug-resistant infections, and mortality. Design, Setting, and Participants This retrospective cohort study obtained data from 5 Massachusetts hospitals within 1 health care system between June 1, 2015, and August 1, 2022. Participants were hospitalized adult patients with possible infection (determined by blood culture drawn and intravenous antibiotics administered within 24 hours of arrival). An LLM (LLaMA 3 8B; Meta) was used to extract up to 10 presenting signs and symptoms from each patient’s history-and-physical admission notes. LLM-generated labels were validated by blinded review of 303 random admission notes. Data analyses were performed from July 2023 to August 2025. Exposures Thirty most common signs and symptoms were retained as exposures, and unsupervised clustering was used to create syndromes, which were compared with infection sources derived from the International Statistical Classification of Diseases, Tenth Revision, Clinical Modification discharge codes. Main Outcomes and Measures Outcomes included positive cultures for methicillin-resistant Staphylococcus aureus (MRSA), positive cultures for multidrug-resistant gram-negative (MDRGN) organisms, and in-hospital mortality. Multivariable logistic regression was used to adjust for demographics, comorbidities, physiologic markers of severity of illness, and time to antibiotics. Results Among the 104 248 patients (median [IQR] age, 66 [52-78] years; 54 137 males [51.9%]) included, 23 619 (22.7%) had sepsis without shock, 25 990 (24.9%) had septic shock, and 94 913 (91.0%) had 1 or more admission note within 24 hours. The LLM labeled the notes of 93 674 of 94 913 patients (98.7%). On manual validation, LLM labels had an accuracy of 99.3% (95% CI, 99.2%-99.3%), balanced accuracy of 84.6% (95% CI, 83.5%-85.8%), positive predictive value of 68.4% (95% CI, 66.0%-70.7%), sensitivity of 69.7% (95% CI, 67.3%-72.0%), and specificity of 99.6% (95% CI, 99.6%-99.6%) compared with the physician medical record reviewer. The 30 most common signs and symptoms were clustered into syndromes that correlated with infection sources. Presence of skin and soft tissue symptoms (adjusted odds ratio [AOR], 1.73; 95% CI, 1.49-2.00) and absence of gastrointestinal (AOR, 0.63; 95% CI, 0.54-0.73) or urinary tract symptoms (AOR, 0.34; 95% CI, 0.22-0.50) were associated with MRSA culture positivity; inverse associations were seen for MDRGN organisms. Cardiopulmonary symptoms were associated with increased mortality (AOR, 1.30; 95% CI, 1.17-1.45). Conclusions and Relevance This cohort study found that an LLM accurately extracted presenting signs and symptoms from admission notes that clustered into syndromes differentially correlated with infection sources, multidrug-resistant infections, and mortality. Further research is warranted to evaluate the value of large-scale sign-and-symptom data in models of antibiotic choice, effectiveness, and outcomes in patients with possible sepsis.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
紫麒麟发布了新的文献求助10
1秒前
饺子完成签到,获得积分10
1秒前
bkagyin应助yu采纳,获得10
1秒前
Lynn发布了新的文献求助10
2秒前
风起人散发布了新的文献求助10
2秒前
Pwrry发布了新的文献求助10
2秒前
Orange应助笑点低的发夹采纳,获得10
2秒前
3秒前
An发布了新的文献求助10
3秒前
3秒前
万能图书馆应助孑孑采纳,获得30
3秒前
4秒前
梦见鲸鱼岛完成签到,获得积分10
4秒前
Orange应助Yolen LI采纳,获得10
4秒前
完美世界应助123456采纳,获得10
5秒前
5秒前
6秒前
6秒前
小吴发布了新的文献求助10
6秒前
7秒前
klony发布了新的文献求助10
7秒前
科目三应助卷卷采纳,获得10
7秒前
史文韬发布了新的文献求助30
7秒前
善学以致用应助诸葛一笑采纳,获得10
7秒前
小菜狗发布了新的文献求助10
8秒前
干鞅完成签到,获得积分10
9秒前
9秒前
kingwhitewing发布了新的文献求助10
9秒前
罗莹发布了新的文献求助10
10秒前
量子星尘发布了新的文献求助10
10秒前
10秒前
Crazy_Runner发布了新的文献求助10
10秒前
10秒前
大模型应助纯真的盼柳采纳,获得10
11秒前
12秒前
斯文败类应助王俊采纳,获得10
12秒前
123456完成签到,获得积分10
13秒前
小蘑菇应助任任任采纳,获得10
13秒前
13秒前
传奇3应助任任任采纳,获得10
13秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Predation in the Hymenoptera: An Evolutionary Perspective 1800
List of 1,091 Public Pension Profiles by Region 1561
Binary Alloy Phase Diagrams, 2nd Edition 1400
Specialist Periodical Reports - Organometallic Chemistry Organometallic Chemistry: Volume 46 1000
Holistic Discourse Analysis 600
Beyond the sentence: discourse and sentential form / edited by Jessica R. Wirth 600
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 纳米技术 计算机科学 内科学 化学工程 复合材料 物理化学 基因 遗传学 催化作用 冶金 量子力学 光电子学
热门帖子
关注 科研通微信公众号,转发送积分 5513050
求助须知:如何正确求助?哪些是违规求助? 4607382
关于积分的说明 14504952
捐赠科研通 4542911
什么是DOI,文献DOI怎么找? 2489237
邀请新用户注册赠送积分活动 1471256
关于科研通互助平台的介绍 1443307