亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Applications of Natural Language Processing and Large Language Models for Social Determinants of Health: Systematic Review

计算机科学 自然语言处理 语言识别 自然语言 语言模型 人工智能 语言学 通用网络语言 论语言 心理学 健康的社会决定因素 计算语言学 社会化媒体 上下文模型 语言理解 领域(数学分析) 自然(考古学) 数据科学 自然语言理解 主题模型 社交网络(社会语言学) 语用学 语言技术 建模语言 互联网 认知科学 多样性(控制论)
作者
Swati Rajwal,Avinash Kumar Pandey,Ziyuan Zhang,Yankai Chen,Michael X. Liu,Sudeshna Das,Hannah Rogers,Abeed Sarker,Yunyu Xiao
出处
期刊:Journal of Medical Internet Research [JMIR Publications]
卷期号:28: e83793-e83793
标识
DOI:10.2196/83793
摘要

Background: Social determinants of health (SDOH) are the social, economic, and environmental conditions that influence health outcomes. SDOH information is often embedded in unstructured text, such as notes in electronic health records and social media posts. Advances in natural language processing (NLP), including emergent large language models (LLMs), offer opportunities to extract, analyze, and interpret SDOH expressions from free text for inclusion in downstream analyses. Existing literature on NLP applications for SDOH is dispersed across disciplines and characterized by methodological heterogeneity and variability in study quality and scope, complicating synthesis and cross-study comparison. Objective: This study aimed to examine the use of NLP, including LLMs, in SDOH research, and highlight gaps and future research directions. Methods: We conducted a systematic review following PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, searching 7 major databases for publications between 2014 and November 2025. We included journal and conference proceedings papers that applied NLP methods to identify, classify, extract, or predict SDOH from text. Three reviewers independently screened studies and extracted data; conflicts were resolved by two senior reviewers. We abstracted study metadata, dataset characteristics, NLP approaches, SDOH domains addressed, and NLP performance metrics. We also conducted risk-of-bias analyses and identified influential studies based on relative citation counts. Results: 142 studies met the inclusion criteria. Nearly two-thirds (89/142, 62.7%) were published between 2023 and 2025, reflecting rapid recent growth. Most studies relied on electronic health records (93/142, 65.5%) and private datasets (81/142, 57.0%), while only 20.4% (29/142) used publicly available data. Commonly studied SDOH domains were housing instability (72/142, 50.7%), employment (65/142, 45.8%), and financial conditions (63/142, 44.4%); structural factors, such as immigration status (5/142, 3.5%), were rarely examined. Of studies that reported evaluation metrics, most focused on classification (26/83, 31.32%) or extraction (38/83, 45.7%), and used cross-sectional designs. Reported model performances were typically strong, with median F1-scores ranging roughly from 0.75 to 0.85 across model categories. Only 49 studies shared code, and fewer than half clearly described model interpretability or reproducibility practices. LLMs (including encoder-decoder models) appeared in 19.7% (28/142) of studies, highlighting emerging interest but also raising new concerns around transparency and governance. Conclusions: This review provides a timely synthesis of NLP and LLM applications across the SDOH research spectrum, addressing an important gap in a topic receiving increasing research attention. By comparing task formulations, data sources, and performance patterns, the review clarifies the research readiness of current approaches and reveals critical gaps. Our findings advance the field by highlighting the absence of a unified SDOH framework, uneven availability of public benchmarks, and limited evaluation of real-world deployment. Addressing these gaps through transparent, inclusive dataset development and implementation-focused evaluation is essential for translating NLP advances into equitable, real-world health impact.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
坦率如之完成签到,获得积分10
13秒前
14秒前
lkx发布了新的文献求助10
20秒前
Copyright应助冷静幻嫣采纳,获得10
29秒前
郑林发布了新的文献求助30
55秒前
耕牛热完成签到,获得积分10
59秒前
愉快惜儿完成签到 ,获得积分10
59秒前
大医仁心完成签到 ,获得积分10
1分钟前
lkx完成签到,获得积分10
1分钟前
1分钟前
小蘑菇应助科研通管家采纳,获得10
1分钟前
CodeCraft应助科研通管家采纳,获得10
1分钟前
大模型应助科研通管家采纳,获得10
1分钟前
Owen应助科研通管家采纳,获得10
1分钟前
桐桐应助科研通管家采纳,获得10
1分钟前
友好灵阳完成签到 ,获得积分10
1分钟前
香蕉觅云应助科研通管家采纳,获得10
1分钟前
1分钟前
彭于晏应助科研通管家采纳,获得10
1分钟前
完美世界应助科研通管家采纳,获得10
1分钟前
领导范儿应助科研通管家采纳,获得10
1分钟前
打打应助科研通管家采纳,获得10
1分钟前
JamesPei应助科研通管家采纳,获得10
1分钟前
斯文败类应助科研通管家采纳,获得10
1分钟前
脑洞疼应助科研通管家采纳,获得10
1分钟前
1分钟前
汉堡包应助科研通管家采纳,获得10
1分钟前
yy完成签到 ,获得积分10
1分钟前
郑林完成签到,获得积分10
1分钟前
舒心思山完成签到,获得积分10
1分钟前
2分钟前
朴实的新柔完成签到,获得积分10
2分钟前
Aletheia完成签到 ,获得积分10
2分钟前
2分钟前
feiyafei完成签到 ,获得积分10
3分钟前
慕青应助科研通管家采纳,获得20
3分钟前
3分钟前
纯真天荷完成签到,获得积分10
3分钟前
FeelingUnreal完成签到,获得积分10
4分钟前
GHOSTagw完成签到,获得积分10
4分钟前
高分求助中
Principles of Economics, 11th Edition 10000
Prescott's Microbiology: 2026 Release ISE 10000
University Physics with Modern Physics, 16th edition 10000
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Environmental Leverage in Times of Climate Crisis: Product Standards, Carbon Border Measures and Preferential Trade Agreements 1000
Interactions of Vowel Quality and Prosody in East Slavic 1000
Erwählung und Berufung bei Paulus: Bedeutung, Entwicklung und Funktion einer Vorstellung in ihrem frühjüdischen und griechisch-römischen Kontext 850
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 内科学 物理 复合材料 催化作用 细胞生物学 无机化学 光电子学 物理化学 电极 基因
热门帖子
关注 科研通微信公众号,转发送积分 7183616
求助须知:如何正确求助?哪些是违规求助? 8822287
关于积分的说明 18631237
捐赠科研通 6810450
什么是DOI,文献DOI怎么找? 3172423
关于科研通互助平台的介绍 2320129
邀请新用户注册赠送积分活动 2146977