A hybrid feature selection approach based on information theory and dynamic butterfly optimization algorithm for data classification

特征选择 计算机科学 元启发式 数据挖掘 相互信息 最大化 分类器(UML) 特征(语言学) 人工智能 冗余(工程) 机器学习 算法 模式识别(心理学) 数学优化 数学 语言学 哲学 操作系统
作者
Anurag Tiwari,Amrita Chaturvedi
出处
期刊:Expert Systems With Applications [Elsevier BV]
卷期号:196: 116621-116621 被引量:34
标识
DOI:10.1016/j.eswa.2022.116621
摘要

The ubiquitous usage of feature selection in search space optimization, information retrieval, data mining, signal processing, software fault prediction, and bioinformatics is paramount to expert and intelligent systems. Most of the conventional feature selection methods implemented are based on filter and wrapper approaches that suffer from poor classification accuracy, high computational cost, and selection of irrelevant and redundant features. This is due to the limitations of the employed objective functions leading to overestimation of the feature significance. On the contrary, hybrid feature selection methods formulated from information theory and nature-inspired metaheuristic algorithms are preferred because of their high computational efficiency, scalability in avoiding redundant and less informative features, and independence from the classifier. However, these methods have three common drawbacks: (1) poor trade-off between exploration and exploitation phase, (2) getting stuck into an optimal local solution, and (3) avoiding irrelevancy and redundancy of selected features. The first and the second drawback is related to metaheuristic algorithm implementation, while the third is concerned with applied information-theoretic paradigms. To address the aforementioned problems, we developed a new hybrid feature selection method, namely, the Iterative Feature Selection using Dynamic Butterfly Optimization Algorithm based Interaction Maximization (IFS-DBOIM) that combines Dynamic Butterfly Optimization Algorithm (DBOA) with a mutual information-based Feature Interaction Maximization (FIM) scheme for selecting the optimal feature subset. There is evidence that DBOA performs better in exploration, exploitation, and avoidance of local optima entrapment, and FIM comparatively scores the maximum relevancy with minimum redundancy of the new features with previously selected ones. The performance of the proposed method is compared using twenty publicly available datasets with ten baseline feature selection approaches. The results revealed that IFS-DBOIM outperforms other approaches on most datasets, maximizing the percent classification accuracy with the least number of features. The nonparametric Wilcoxon rank test confirms the statistical significance of these outcomes. Moreover, this method realizes the best trade-off between accuracy and stability.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
lys发布了新的文献求助10
刚刚
duo完成签到,获得积分10
2秒前
2秒前
2秒前
SC30发布了新的文献求助10
4秒前
共享精神应助猫尔儿采纳,获得10
4秒前
orixero应助大白兔味薯片采纳,获得10
4秒前
4秒前
4秒前
5秒前
动人的乾发布了新的文献求助10
5秒前
安静店员发布了新的文献求助10
5秒前
白嫖论文完成签到 ,获得积分10
5秒前
热情的戾发布了新的文献求助10
5秒前
安静的兔子完成签到,获得积分10
6秒前
共享精神应助shi采纳,获得10
7秒前
科研通AI2S应助shi采纳,获得10
7秒前
幼稚园扛把子完成签到,获得积分10
7秒前
123发布了新的文献求助10
7秒前
8秒前
Owen应助七个丸子采纳,获得10
9秒前
Chemistry发布了新的文献求助10
9秒前
舒心迎曼发布了新的文献求助10
10秒前
10秒前
汤圆发布了新的文献求助10
10秒前
在水一方应助Fine采纳,获得10
10秒前
10秒前
10秒前
隐形曼青应助安静的兔子采纳,获得10
10秒前
muzian完成签到 ,获得积分10
11秒前
12秒前
13秒前
纯真自行车关注了科研通微信公众号
14秒前
飞虎应助文龙之子采纳,获得10
15秒前
cherrywxc发布了新的文献求助10
15秒前
15秒前
北鸢完成签到,获得积分10
15秒前
正直沧海发布了新的文献求助10
15秒前
15秒前
Owen应助zhouye采纳,获得10
16秒前
高分求助中
Principles of Economics, 11th Edition 10000
Prescott's Microbiology: 2026 Release ISE 10000
University Physics with Modern Physics, 16th edition 10000
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Environmental Leverage in Times of Climate Crisis: Product Standards, Carbon Border Measures and Preferential Trade Agreements 1000
Interactions of Vowel Quality and Prosody in East Slavic 1000
Erwählung und Berufung bei Paulus: Bedeutung, Entwicklung und Funktion einer Vorstellung in ihrem frühjüdischen und griechisch-römischen Kontext 850
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 内科学 物理 复合材料 催化作用 细胞生物学 无机化学 光电子学 物理化学 电极 基因
热门帖子
关注 科研通微信公众号,转发送积分 7192549
求助须知:如何正确求助?哪些是违规求助? 8829007
关于积分的说明 18640550
捐赠科研通 6828053
什么是DOI,文献DOI怎么找? 3175774
关于科研通互助平台的介绍 2327685
邀请新用户注册赠送积分活动 2150240