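Feature selection
Computer science
Feature (linguistics)
Weighting
Minimum redundancy feature selection
Data mining
Heuristic
Artificial intelligence
Pattern recognition (psychology)
Multi-label classification
Dimensionality reduction
Selection (genetic algorithm)
Ranking (information retrieval)
Process (computing)
Similarity (geometry)
Machine learning
Metric (unit)
Curse of dimensionality
Engineering
Operating system
Image (mathematics)
Radiology
Philosophy
Medicine
Linguistics
Operations management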
Authors
Jintao Huang, Wenbin Qian, Chi-Man Vong, Weiping Ding, Wenhao Shu, Qin Huang
Identifier
DOI:10.1109/tetci.2022.3231655
Abstract
Multi-label feature selection can effectively address the challenges of high or even ultra-high dimensionality in multi-label data. However, most existing multi-label feature selection algorithms can handle only a single data type, assume that all labels are equally significant, and rely on heuristic search strategies, which leads to low efficiency and relatively unsatisfactory classification accuracy. To address these shortcomings, this paper proposes a new multi-label feature selection algorithm built on three procedures. First, a new similarity relation metric is proposed to deal with hybrid data types effectively. Second, a label enhancement algorithm is designed to transform the logical labels into a label distribution by using the analytic hierarchy process (AHP) embedded with label correlation, which can automatically identify the significance of different labels. Third, a feature weighting evaluation is redesigned in the feature selection process so that the optimal feature subset is obtained directly through feature ranking. With these procedures, multi-label feature selection can exploit the rich semantic information carried by label significance and can improve accuracy and efficiency at the same time. Comparative experiments are conducted on 20 real multi-label datasets against seven state-of-the-art multi-label feature selection algorithms. Experimental results show that the proposed algorithm performs about 5–10% better than the compared algorithms on 80% of the datasets.
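To make the label enhancement idea more concrete, the sketch below shows one generic way that AHP-style weights can turn a logical (0/1) label vector into a label distribution. It is an illustration only, not the authors' algorithm: the abstract does not say how the pairwise comparison matrix is built from label correlation, so the matrix here is hypothetical, the row geometric-mean approximation of AHP priorities is a swapped-in standard technique, and the function names `ahp_weights` and `enhance_labels` are invented for this example.

```python
from math import prod


def ahp_weights(pairwise):
    """Approximate AHP priority weights with the row geometric-mean method.

    `pairwise[i][j]` states how much more significant label i is than label j.
    """
    n = len(pairwise)
    geo_means = [prod(row) ** (1.0 / n) for row in pairwise]
    total = sum(geo_means)
    return [g / total for g in geo_means]


def enhance_labels(logical_labels, weights):
    """Spread unit mass over the relevant (1-valued) labels by their weights."""
    scores = [y * w for y, w in zip(logical_labels, weights)]
    total = sum(scores)
    if total == 0:
        # No relevant label: return an all-zero vector instead of dividing by 0.
        return [0.0] * len(scores)
    return [s / total for s in scores]


# Hypothetical pairwise comparison matrix for three labels (assumption:
# in the paper this information would come from label correlation).
pairwise = [
    [1.0, 2.0, 4.0],
    [0.5, 1.0, 2.0],
    [0.25, 0.5, 1.0],
]

weights = ahp_weights(pairwise)
distribution = enhance_labels([1, 0, 1], weights)
print(weights)
print(distribution)
```

In this sketch an instance tagged with the first and third labels no longer treats them as equally significant: the more heavily weighted label receives the larger share of the distribution, which is the kind of information a downstream feature weighting step could then use.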