特征选择
离群值
模式识别(心理学)
子空间拓扑
计算机科学
人工智能
预处理器
正规化(语言学)
异常检测
特征(语言学)
维数之咒
约束(计算机辅助设计)
降维
稀疏矩阵
特征向量
数据挖掘
稳健性(进化)
转化(遗传学)
缩小
数学
数据预处理
算法
选择(遗传算法)
稀疏PCA
期限(时间)
特征提取
协方差矩阵
趋同(经济学)
机器学习
矩阵分解
稀疏逼近
作者
Sisi Wang,Feiping Nie,Zheng Wang,Rong Wang,Zhensheng Sun,Xuelong Li
标识
DOI:10.1109/tcyb.2026.3656518
摘要
Feature selection is one of the hot issues in machine learning. It reduces storage pressure by effectively screening features and has become a very practical data preprocessing method. At present, most feature selection algorithms apply $\ell _{2,1}$ -norm on the transformation matrix to calculate the scores for all features and then select appropriate features according to these scores. But their sparsity is limited, and meaningless regularization parameters increase the cost, making it prone to falling into local optimum. To solve the above difficulties, this article proposes a novel max-min robust unsupervised feature selection via sparse subspace (MMRUFS), which considers both the reconstruction term and variance term of data, so that the model can not only fully retain the original information of data, but also make the data more dispersed. Second, $\ell _{2,0}$ -norm constraint is used on the transformation matrix to directly select the optimal feature subset, avoiding the fine-tuning of regularization parameters. To enhance the robustness, MMRUFS carefully designs mark weight vector to make the model treat normal samples and outliers differently and achieves the effect of anomaly detection. Finally, MMRUFS is solved by designing the surrogate matrix, and its convergence is strictly guaranteed, experimental results reveal that MMRUFS outperforms other feature selection algorithms on multiple real-world datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI