缺少数据
协变量
范畴变量
插补(统计学)
统计
逻辑回归
计量经济学
回归分析
数学
人口
回归
医学
环境卫生
作者
Chao-Ying Joanne Peng,Jin Zhu
标识
DOI:10.1177/0013164407305582
摘要
For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the expectation-maximization (EM) method of weights and multiple imputation (MI). Sample data are drawn randomly from a population with known characteristics. Missing data on covariates are simulated under two conditions: missing completely at random and missing at random with different missing rates. A logistic regression model was fit to each sample using either the EM or MI approach. The performance of these two approaches is compared on four criteria: bias, efficiency, coverage, and rejection rate. Results generally favored MI over EM. Practical issues such as implementation, inclusion of continuous covariates, and interactions between covariates are discussed.
科研通智能强力驱动
Strongly Powered by AbleSci AI