Development and validation of parsimonious algorithms to classify acute respiratory distress syndrome phenotypes: a secondary analysis of randomised controlled trials

医学 急性呼吸窘迫综合征 逻辑回归 特征选择 随机对照试验 随机森林 机器学习 人工智能 急性呼吸窘迫 算法 内科学 计算机科学
作者
Pratik Sinha,Kevin Delucchi,Daniel F. McAuley,Cecilia O’Kane,Michael A. Matthay,Carolyn S. Calfee
出处
期刊:The Lancet Respiratory Medicine [Elsevier BV]
卷期号:8 (3): 247-257 被引量:290
标识
DOI:10.1016/s2213-2600(19)30369-8
摘要

Using latent class analysis (LCA) in five randomised controlled trial (RCT) cohorts, two distinct phenotypes of acute respiratory distress syndrome (ARDS) have been identified: hypoinflammatory and hyperinflammatory. The phenotypes are associated with differential outcomes and treatment response. The objective of this study was to develop parsimonious models for phenotype identification that could be accurate and feasible to use in the clinical setting.In this retrospective study, three RCT cohorts from the National Lung, Heart, and Blood Institute ARDS Network (ARMA, ALVEOLI, and FACTT) were used as the derivation dataset (n=2022), from which the machine learning and logistic regression classifer models were derived, and a fourth (SAILS; n=715) from the same network was used as the validation test set. LCA-derived phenotypes in all of these cohorts served as the reference standard. Machine-learning algorithms (random forest, bootstrapped aggregating, and least absolute shrinkage and selection operator) were used to select a maximum of six important classifier variables, which were then used to develop nested logistic regression models. Only cases with complete biomarker data in the derivation dataset were used for variable selection. The best logistic regression models based on parsimony and predictive accuracy were then evaluated in the validation test set. Finally, the models' prognostic validity was tested in two external ARDS clinical trial datasets (START and HARP-2) by assessing mortality at days 28, 60, and 90 and ventilator-free days to day 28.The six most important classifier variables were interleukin (IL)-8, IL-6, protein C, soluble tumour necrosis factor receptor 1, bicarbonate, and vasopressor use. From the nested models, three-variable (IL-8, bicarbonate, and protein C) and four-variable (3-variable plus vasopressor use) models were adjudicated to be the best performing. In the validation test set, both models showed good accuracy (AUC 0·94 [95% CI 0·92-0·95] for the three-variable model and 0·95 [95% CI 0·93-0·96] for the four-variable model) against LCA classifications. As with LCA-derived phenotypes, the hyperinflammatory phenotype as identified by the classifier model was associated with higher mortality at day 90 (87 [39%] of 223 patients vs 112 [23%] of 492 patients; p<0·0001) and fewer ventilator-free days (median 14 days [IQR 0-22] vs 22 days [0-25]; p<0·0001). In the external validation datasets, three-variable models developed in the derivation dataset identified two phenotypes with distinct clinical features and outcomes consistent with previous findings, including differential survival with simvastatin versus placebo in HARP-2 (p=0·023 for survival at 28 days).ARDS phenotypes can be accurately identified with parsimonious classifier models using three or four variables. Pending the development of real-time testing for key biomarkers and prospective validation, these models could facilitate identification of ARDS phenotypes to enable their application in clinical trials and practice.National Institutes of Health.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
温软完成签到 ,获得积分10
1秒前
LXN发布了新的文献求助10
1秒前
打打应助AAA采纳,获得10
1秒前
NEO发布了新的文献求助10
1秒前
4秒前
无极微光应助奋斗的桐采纳,获得20
5秒前
我是老大应助俏皮代丝采纳,获得10
5秒前
6秒前
CodeCraft应助你泽采纳,获得10
6秒前
张小圆完成签到,获得积分10
7秒前
laurina完成签到 ,获得积分10
8秒前
8秒前
zhz发布了新的文献求助10
11秒前
gorgeous完成签到,获得积分10
12秒前
科目三应助19079405053采纳,获得10
12秒前
蔡一完成签到,获得积分10
12秒前
充电宝应助troyqiujing采纳,获得10
13秒前
顾矜应助靳元逵采纳,获得10
13秒前
刘娇发布了新的文献求助10
13秒前
LXN发布了新的文献求助10
14秒前
慢慢完成签到,获得积分10
14秒前
cyf完成签到,获得积分10
15秒前
CodeCraft应助落泪静殇采纳,获得10
17秒前
17秒前
SciGPT应助路远采纳,获得10
17秒前
18秒前
19秒前
rasmus完成签到,获得积分10
19秒前
倪妮完成签到,获得积分10
21秒前
21秒前
23秒前
23秒前
23秒前
23秒前
LXN发布了新的文献求助10
23秒前
23秒前
24秒前
积极的皮卡丘完成签到 ,获得积分10
24秒前
25秒前
陈乔发布了新的文献求助10
26秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Les Mantodea de Guyane Insecta, Polyneoptera 2000
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
Signals, Systems, and Signal Processing 610
Research Methods for Business: A Skill Building Approach, 9th Edition 500
Research Methods for Applied Linguistics 500
Picture Books with Same-sex Parented Families Unintentional Censorship 444
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6415074
求助须知:如何正确求助?哪些是违规求助? 8233974
关于积分的说明 17484690
捐赠科研通 5467925
什么是DOI,文献DOI怎么找? 2888960
邀请新用户注册赠送积分活动 1865828
关于科研通互助平台的介绍 1703506