特征选择
聚类分析
计算机科学
冗余(工程)
人工智能
模式识别(心理学)
Boosting(机器学习)
特征(语言学)
特征提取
数据挖掘
机器学习
哲学
语言学
操作系统
作者
Liwei Du,Zhihong Xu,Hongda Chen,Duan-Yu Chen
标识
DOI:10.1109/tim.2023.3322486
摘要
The performance of the machine learning-based arc fault diagnosis method is significantly influenced by the effectiveness of features. This paper proposes the use of the feature selection (FS) method to optimize the feature quality and classification result for arc fault diagnosis. An experimental system is constructed to generate arc fault current signals under various conditions. The limitations of the predetermined threshold-based methods in time, frequency, and time-frequency domains are outlined, and the dependence/redundancy issues present in the feature dataset are analyzed. To mitigate the impact of these issues, a Light Gradient Boosting Machine (LightGBM)-based FS method, which establishes the feature importance evaluation criterion for the selected optimal feature subset based on permutation importance, is proposed in this article. The fitness proportionate sharing-based feature clustering (FPS-FC) method searches for potential feature clusters and selects a feature subset with low redundancy to accommodate unlabeled data. Finally, both proposed FS methods are compared with 6 popular supervised/unsupervised methods using 2 classifiers across 7 datasets. The results validate the effectiveness of the LightGBM-based and FPS-FC-based FS methods in enhancing the performance of arc fault diagnosis. In addition, a designed case on Raspberry Pi verifies the feasibility of the proposed methods in real-world applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI