计算机科学
特征选择
排名(信息检索)
数据挖掘
聚类分析
模式识别(心理学)
滤波器(信号处理)
人工智能
特征(语言学)
支持向量机
随机森林
k-最近邻算法
特征向量
机器学习
高维数据聚类
树(集合论)
数学
哲学
语言学
计算机视觉
数学分析
作者
Mohammad Ahmadi Ganjei,Reza Boostani
标识
DOI:10.1016/j.engappai.2022.104894
摘要
There is a growing interest in developing feature subset selection schemes for high-dimensional datasets by filter, wrapper, embedded, and hybrid manners. In this paper, we propose a new hybrid (filter-wrapper) feature selection approach. At first, in the filter step, we rank input features according to their relevance with the class label. Afterwards, we apply different clustering methods for the classification of the selected features. We perform an inner and outer cluster ranking based on the primary feature ranking in the next step. Then, different search strategies are performed on the best cluster of features in the wrapper phase. Moreover, we add some of them to the feature set based on the classifiers (nearest neighbor, decision tree, support vector machine, and random forests) feedback. Then, the algorithm goes to the next cluster, and this process is continued till all clusters are met. Finally, we compare the results of the proposed method to the state-of-the-art schemes. Comparison results imply the superiority of the proposed method to the counterparts on eight high-dimensional datasets in terms of accuracy and computational complexity.
科研通智能强力驱动
Strongly Powered by AbleSci AI