协变量
范畴变量
标准化
选择(遗传算法)
统计
计算机科学
特征选择
计量经济学
逻辑回归
数据挖掘
数学
人工智能
操作系统
作者
Xiang Li,Yong Ma,Qing Pan
标识
DOI:10.1177/09622802221129042
摘要
In sparse penalized regressions, candidate covariates of different units need to be standardized beforehand so that the coefficient sizes are directly comparable and reflect their relative impacts, which leads to fairer variable selection. However, when covariates of mixed data types (e.g. continuous, binary or categorical) exist in the same dataset, the commonly used standardization methods may lead to different selection probabilities even when the covariates have the same impact on or level of association with the outcome. In this paper, we propose a novel standardization method that targets at generating comparable selection probabilities in sparse penalized regressions for continuous, binary or categorical covariates with the same impact. We illustrate the advantages of the proposed method in simulation studies, and apply it to the National Ambulatory Medical Care Survey data to select factors related to the opioid prescription in the US.
科研通智能强力驱动
Strongly Powered by AbleSci AI