计算机科学
特征选择
数据挖掘
粗集
大数据
特征(语言学)
选择(遗传算法)
树(集合论)
人工智能
数学
语言学
数学分析
哲学
作者
Wanli Huang,Yanhong She,Xin He,Weiping Ding
出处
期刊:IEEE Transactions on Fuzzy Systems
[Institute of Electrical and Electronics Engineers]
日期:2023-10-01
卷期号:31 (10): 3721-3733
被引量:1
标识
DOI:10.1109/tfuzz.2023.3300913
摘要
In the era of big data, both the size and the number of features, samples, and classes continue to increase, resulting in high-dimensional classification tasks. One characteristic, among others, of big data is there exist complex structures between different classes. Hierarchical structure may be treated as the most representative one, which is mathematically depicted as a tree-like structure or directed acyclic graph. In this article, considering data in the real world may arrive dynamically, we propose an incremental feature selection approach in hierarchical classification by employing fuzzy rough set technique. First, we use the sibling strategy to reduce the scope of negative samples. Second, we present a theoretical analysis of the incremental updating of the lower approximation, positive region and dependency degree at the arrival of new samples, respectively. Third, we perform the algorithmic design of the incremental approaches. To do that, we first present two improved versions (NIDC and NIFS for short) of the existing nonincremental methods, based on NIDC, NIFS, and the aforementioned theoretical analysis, two incremental algorithms (IDU and IFS for short) are then designed to perform incremental feature selection. Finally, a numerical experiment is conducted on some commonly used datasets for hierarchical classification tasks, whose true classes are distributed to both leaf nodes and internal nodes. A comparative study is further performed to show that our approach is effective and feasible.
科研通智能强力驱动
Strongly Powered by AbleSci AI