特征选择
Lasso(编程语言)
人工智能
逻辑回归
特征(语言学)
计算机科学
回归
逻辑模型树
模式识别(心理学)
相关性
机器学习
弹性网正则化
数据挖掘
回归分析
噪音(视频)
数学
统计
几何学
万维网
哲学
图像(数学)
语言学
作者
Yadi Wang,Wenbo Zhang,Minghu Fan,Qiang Ge,Baojun Qiao,Xianyu Zuo,Bingbing Jiang
标识
DOI:10.1016/j.apm.2021.12.016
摘要
Feature selection for high-dimensional data is an important issue in machine learning, pattern recognition and bioinformatics fields. Feature selection algorithms are proposed to select the relevant feature subset from the original features. To adaptively identify the important highly correlated features from high-dimensional data which often beneficial to improve classification accuracy is a challenge. In this paper, we propose a regularized logistic regression with adaptive Lasso and correlation based penalty model to select informative highly correlated features adaptively. To incorporate significance of features into regression model, we first measure significance of each feature based on mutual information, and propose an adaptive weight construction strategy. Based on the adaptive weight construction strategy, the proposed adaptive logistic regression can impose a large amount of penalty on irrelevant features, and thus noise features are easily removed from the model and remain the informative features. The experimental results on the simulation and real-world datasets demonstrate the effectiveness and the superiority the proposed model by comparing it to existing competing regularized logistic regression models.
科研通智能强力驱动
Strongly Powered by AbleSci AI