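Computer science
Feature selection
Discriminant
Multi-label classification
Artificial intelligence
Pattern recognition (psychology)
Feature (linguistics)
Machine learning
Feature vector
Context (archaeology)
Data mining
Paleontology
Philosophy
Linguistics
Biology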
Authors
Mohammad Jamil Faraji, Seyed Amjad Seyedi, Fardin Akhlaghian Tab, Reza Mahmoodi
Identifier
DOI:10.1016/j.eswa.2024.123198
Abstract
High-dimensional multi-label data has become increasingly prevalent across application domains, presenting two significant challenges: instances with high-dimensional features and a large number of labels. Multi-label feature selection aims to choose, from a given feature set, a subset that is highly relevant for predicting the multiple labels or categories associated with each instance. However, certain characteristics of multi-label classification, such as label dependencies and imbalanced label distributions, have often been overlooked, although they hold valuable insights for designing effective multi-label feature selection algorithms. In this paper, we propose a feature selection model that exploits explicit global and local label correlations to select discriminative features across multiple labels. In addition, by representing the feature matrix and label matrix in a shared latent space, the model aims to capture the underlying correlations between features and labels. The shared representation can reveal common patterns or relationships that exist across multiple labels and features. An objective function involving L2,1-norm regularization is formulated, and an alternating-optimization-based iterative algorithm is designed to obtain the sparse coefficients for multi-label feature selection. In comprehensive experiments, the proposed method was evaluated on 14 real-world multi-label datasets using six evaluation metrics. The results indicate its effectiveness, surpassing that of several representative methods.
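The abstract mentions L2,1-norm regularization and sparse coefficients without giving the formulation. As a minimal sketch, and not the authors' model, the snippet below shows only the generic idea behind L2,1-based selection: the L2,1 norm of a coefficient matrix is the sum of the Euclidean norms of its rows, so penalizing it tends to drive whole rows (features) to zero, and features are then commonly ranked by row norm. The matrix `W`, its rows-as-features layout, and the helper names `l21_norm` and `rank_features` are assumptions made for illustration.

```python
import numpy as np

def l21_norm(W):
    """L2,1 norm: sum of the Euclidean norms of the rows of W."""
    return float(np.sum(np.linalg.norm(W, axis=1)))

def rank_features(W, k):
    """Return the indices of the k rows of W with the largest Euclidean norm."""
    row_norms = np.linalg.norm(W, axis=1)
    # Stable sort on negated norms gives descending order with ties kept in index order.
    order = np.argsort(-row_norms, kind="stable")
    return order[:k]

# Hypothetical coefficient matrix: 4 features (rows) x 2 labels (columns).
W = np.array([[3.0, 4.0],
              [0.0, 0.0],
              [1.0, 0.0],
              [0.0, 2.0]])

print(l21_norm(W))          # row norms are 5, 0, 1, 2, so this prints 8.0
print(rank_features(W, 2))  # the two largest row norms belong to rows 0 and 3
```

In this toy matrix, the all-zero second row corresponds to a feature that contributes to no label and would be discarded.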