特征选择
粒度
计算机科学
相互信息
人工智能
选择(遗传算法)
数据挖掘
程序设计语言
作者
Hengrong Ju,Tao Yin,Jiashuang Huang,Weiping Ding,Xibei Yang
标识
DOI:10.1109/tetci.2023.3314548
摘要
K -nearest neighborhood information granularity-based feature selection is derived from the well-known k -nearest neighbor ( k NN) classification technique, which is widely employed in data mining. However, the current k -nearest neighborhood-based information granules cannot effectively handle data with different density distributions. To address this problem, a sparse mutual granularity-based feature selection approach is developed. First, a personalized information granule is constructed based on the optimal k values for each sample. The optimal k value is obtained through the number of correlated samples, where the correlations between the samples are learned by the sparse constraint function. The achieved optimal k can define the size of the granularity-based model and increase the classification accuracy. Second, a mutual-information strategy is introduced in the granularity process. Irrelevant samples in the granularity-based model are removed, which improves the classification performance. Third, an improved heuristic feature selection algorithm is developed to address the nonmonotonic problem. Compared with the classical heuristic method, the proposed feature selection method can improve the performance of the obtained subset and avoid degradation caused by non-monotonicity. The experimental results on the UCI datasets show that the sparse mutual granularity-based feature selection approach is effective for managing data with different density distributions. Finally, the proposed feature selection approach is applied to select significant brain regions in several schizophrenia datasets. It contributes to the prediction of schizophrenia and also provides a new direction for the improvement of medical-image analysis.
科研通智能强力驱动
Strongly Powered by AbleSci AI