特征选择
计算机科学
特征(语言学)
滤波器(信号处理)
数据挖掘
人工智能
进化算法
维数之咒
领域(数学)
机器学习
选择(遗传算法)
分类
降维
进化计算
模式识别(心理学)
数学
哲学
语言学
纯数学
计算机视觉
作者
Emrah Hançer,Bing Xue,Mengjie Zhang
标识
DOI:10.1016/j.knosys.2023.111008
摘要
The high-dimensional datasets in various domains, such as text categorization, information retrieval and bioinformatics, have highlighted the importance of feature selection in data mining. Despite the numerous existing approaches to feature selection, there is still a need for further research in this field. In this paper, we propose an evolutionary filter feature selection approach that can be used for both single- and multi-objective scenarios by introducing an objective function inspired by Neighborhood Component Analysis (NCA)-based method and then integrating it into the differential evolution framework. The proposed approach applicable to two scenarios aims to identify an optimal feature subset through an evolutionary search process that maximizes class separation while minimizing the dimensionality. Through comprehensive experimental studies conducted on diverse datasets, the results show that the proposed approach outperforms recently proposed evolutionary information-theoretic, rough set-based and state-of-the-art feature selection approaches in both scenarios. Notably, this study is the first to integrate an NCA-based strategy into an evolutionary feature selection approach.
科研通智能强力驱动
Strongly Powered by AbleSci AI