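Feature selection
Computer science
Pattern recognition (psychology)
Selection (genetic algorithm)
Granularity
Feature (linguistics)
Artificial intelligence
Set (abstract data type)
Rough set
Multi-label classification
Preprocessor
Data mining
Machine learning
Philosophy
Operating system
Programming language
Linguistics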
Authors
Jinghua Liu,Yaojin Lin,Weiping Ding,Hongbo Zhang,Cheng Wang,Ji‐Xiang Du
Source
Journal: Neurocomputing
[Elsevier BV]
Date: 2022-12-15
Volume/Issue: 524: 142-157
Citations: 34
Identifier
DOI:10.1016/j.neucom.2022.11.096
Abstract
Multi-label feature selection is an indispensable technique for preprocessing multi-semantic high-dimensional data and has attracted considerable attention in recent years. However, most existing methods explicitly assume that all relevant labels are equally significant for every instance, ignoring that in real scenarios the significance of the available labels usually differs from instance to instance. In this paper, we propose a novel multi-label feature selection method based on label distribution and neighborhood rough sets, known as LDRS. Specifically, we first construct a label enhancement method based on instance information distribution to convert the logical labels of multi-label data into label distributions, thereby capturing label significance to provide additional information for learning tasks. Then, we extend the neighborhood rough set model to label distribution learning and discuss its properties in detail. This extended model effectively avoids the selection of neighborhood granularity and applies seamlessly to label distribution data. After that, two feature significance measures are established to evaluate feature quality and to fuse label-specific features. Finally, a novel feature selection framework is designed that simultaneously takes into account feature significance, label significance, and label-specific features. Experiments on both public and real-world datasets demonstrate the advantages of the proposed method.
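The abstract does not give the details of the label enhancement step, so the following is only a rough illustration of the general idea of turning logical (0/1) labels into a label distribution. It is not the LDRS procedure: it swaps in a plain nearest-neighbour smoothing, and the function name `enhance_labels`, the parameter `k`, and the toy data are all hypothetical.

```python
import numpy as np

def enhance_labels(X, Y, k=3):
    """Sketch: turn logical labels Y (n x q, entries 0/1) into label distributions.

    Assumption: generic neighbour-based smoothing, NOT the method from the paper.
    """
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=float)
    # Pairwise Euclidean distances between instances.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    # Force every instance to rank first among its own neighbours.
    np.fill_diagonal(dist, -1.0)
    # Indices of the k nearest instances (the instance itself included).
    nn = np.argsort(dist, axis=1)[:, :k]
    # Fraction of those neighbours that carry each label.
    support = Y[nn].mean(axis=1)
    # Keep only the labels the instance actually has, then normalise each row.
    scores = Y * support
    totals = scores.sum(axis=1, keepdims=True)
    return np.divide(scores, totals, out=np.zeros_like(scores), where=totals > 0)

# Toy data: four instances, two features, two labels (hypothetical values).
X = [[0, 0], [0, 1], [5, 5], [5, 6]]
Y = [[1, 1], [1, 0], [0, 1], [1, 1]]
D = enhance_labels(X, Y, k=2)
print(D)
```

In this sketch each row of `D` sums to 1 for any instance with at least one label, and a label shared with more neighbours receives a larger share, which is one simple way to express that labels differ in significance from instance to instance.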