A simulation study of the number of events per variable in logistic regression analysis

逻辑回归 统计 统计的 数学 回归分析 变量 样本量测定 样品(材料) 差异(会计) 置信区间 瓦尔德试验 蒙特卡罗方法 计量经济学 变量(数学) 统计假设检验 业务 数学分析 会计 化学 色谱法
作者
Peter Peduzzi,John Concato,Elizabeth Kemper,Theodore R. Holford,Alvan R. Feinstein
出处
期刊:Journal of Clinical Epidemiology [Elsevier BV]
卷期号:49 (12): 1373-1379 被引量:8974
标识
DOI:10.1016/s0895-4356(96)00236-3
摘要

We performed a Monte Carlo study to evaluate the effect of the number of events per variable (EPV) analyzed in logistic regression analysis. The simulations were based on data from a cardiac trial of 673 patients in which 252 deaths occurred and seven variables were cogent predictors of mortality; the number of events per predictive variable was (252/7 =) 36 for the full sample. For the simulations, at values of EPV = 2, 5, 10, 15, 20, and 25, we randomly generated 500 samples of the 673 patients, chosen with replacement, according to a logistic model derived from the full sample. Simulation results for the regression coefficients for each variable in each group of 500 samples were compared for bias, precision, and significance testing against the results of the model fitted to the original sample. For EPV values of 10 or greater, no major problems occurred. For EPV values less than 10, however, the regression coefficients were biased in both positive and negative directions; the large sample variance estimates from the logistic model both overestimated and underestimated the sample variance of the regression coefficients; the 90% confidence limits about the estimated values did not have proper coverage; the Wald statistic was conservative under the null hypothesis; and paradoxical associations (significance in the wrong direction) were increased. Although other factors (such as the total number of events, or sample size) may influence the validity of the logistic model, our findings indicate that low EPV can lead to major problems.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
2秒前
hhh发布了新的文献求助10
2秒前
燕晓啸完成签到 ,获得积分10
3秒前
JYZ发布了新的文献求助10
6秒前
啦啦啦啦发布了新的文献求助10
7秒前
星辰大海应助123456采纳,获得10
7秒前
10秒前
11秒前
领导范儿应助奋斗的牛马采纳,获得10
13秒前
Miamia发布了新的文献求助10
14秒前
16秒前
执念发布了新的文献求助10
17秒前
17秒前
华仔应助明理的幻梦采纳,获得10
20秒前
洛莫完成签到,获得积分10
20秒前
舒心砖头发布了新的文献求助10
21秒前
m(_._)m完成签到 ,获得积分0
21秒前
21秒前
aluan发布了新的文献求助30
21秒前
123456发布了新的文献求助10
22秒前
22秒前
23秒前
大模型应助猪八戒采纳,获得10
23秒前
程程完成签到 ,获得积分10
24秒前
吴彦祖完成签到,获得积分10
24秒前
小春卷完成签到,获得积分10
24秒前
25秒前
25秒前
Yuuuuuuun完成签到 ,获得积分10
25秒前
25秒前
陈可男发布了新的文献求助10
25秒前
26秒前
27秒前
大道希言完成签到,获得积分10
27秒前
27秒前
28秒前
义气幼珊发布了新的文献求助10
28秒前
专注白昼发布了新的文献求助10
29秒前
M1有光完成签到,获得积分10
30秒前
高分求助中
液晶指向矢仿真分析数据集 8888
Invited Discussant 63O and 64O 1000
Ideology and Meaning-Making under the Putin Regime 750
Advanced Memory Technology 500
Petrology and Plate Tectonics 500
Writing Systems 500
A Handbook of User Experience Research & Design in Libraries 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 计算机科学 化学工程 生物化学 物理 内科学 复合材料 催化作用 光电子学 物理化学 电极 细胞生物学 基因 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6866417
求助须知:如何正确求助?哪些是违规求助? 8569000
关于积分的说明 18219196
捐赠科研通 6237050
什么是DOI,文献DOI怎么找? 3049651
关于科研通互助平台的介绍 2052239
邀请新用户注册赠送积分活动 2027452