过度拟合
极限学习机
降维
模式识别(心理学)
维数之咒
计算机科学
人工智能
分类器(UML)
特征(语言学)
特征提取
集成学习
数据挖掘
机器学习
人工神经网络
语言学
哲学
作者
Leilei Zhao,Fengchun Tian,Junhui Qian,Hantao Li,Zhiyuan Wu
标识
DOI:10.1109/tim.2023.3251416
摘要
Gas sensor array (GSA) data usually has high-dimensional features and a small sample size. When a classifier is directly used for GSA data classification, it is prone to overfitting and has a high time cost. The traditional solution is to perform feature dimensionality reduction before classification. However, selecting a suitable dimensionality reduction method is time-consuming and laborious, and some features useful for classification may be lost after dimensionality reduction, especially for the weak sensor response data to low-concentration gases. In this article, we proposed a feature ensemble-based extreme learning machine framework (FE-ELM) for GSA data classification. In FE-ELM, downsampling is first performed on the time series of each sensor, and then the downsampled features of different sensors are combined to obtain fused feature subsets. Next, a base ELM is trained independently on each fused feature subset with all training samples by solving the least-squares problem. The final FE-ELM predictions for input samples are obtained by voting the prediction results of all base ELMs. Compared with traditional methods, the proposed method solves the overfitting problem and can be directly used for GSA data classification without prior feature dimension reduction. Furthermore, the ensemble of all base classifiers with little loss of original features enables the proposed FE-ELM to have a more efficient and robust classification performance. Experimental results on data from both homemade GSA under low-concentration gases (ppb) and publicly available confirm that the proposed FE-ELM exceeds traditional methods and extends the detection limit of the sensor array.
科研通智能强力驱动
Strongly Powered by AbleSci AI