粗集
特征选择
计算机科学
数据挖掘
相互信息
k-最近邻算法
班级(哲学)
熵(时间箭头)
特征(语言学)
集合(抽象数据类型)
人工智能
模式识别(心理学)
语言学
哲学
物理
量子力学
程序设计语言
作者
Weihua Xu,Ziting Yuan,Zheng Liu
出处
期刊:IEEE transactions on artificial intelligence
[Institute of Electrical and Electronics Engineers]
日期:2023-01-16
卷期号:5 (1): 229-243
被引量:24
标识
DOI:10.1109/tai.2023.3237203
摘要
Neighborhood rough sets are now widely used to process numerical data. Nevertheless, most of the existing neighborhood rough sets are not able to distinguish class mixture samples well when dealing with classification problems. That is, it cannot effectively classify categories when dealing with data with an unbalanced distribution. Because of this, in this article, we propose a new feature selection method that takes into consideration both heterogeneous data and feature interaction. The proposed model well integrates the ascendancy of ${\delta }$ -neighborhood and ${k}$ -nearest neighbor. Such heterogeneous data can be handled better than existing neighborhood models. We utilize information entropy theories such as mutual information and conditional mutual information and employ an iterative strategy to define the importance of each feature in decision making. Furthermore, we design a feature extraction algorithm based on the above idea. Experimental results display that the raised algorithm has superior effect than some existing algorithms, particularly the ${\delta }$ -neighborhood rough set model and the ${k}$ -nearest neighborhood rough set model.
科研通智能强力驱动
Strongly Powered by AbleSci AI