自编码
聚类分析
计算机科学
异常检测
模式识别(心理学)
人工智能
数据挖掘
机器学习
人工神经网络
作者
Van Quân Nguyen,Viet Hung Nguyen,Nhien‐An Le‐Khac,Van Loi Cao
标识
DOI:10.1109/rivf51545.2021.9642120
摘要
In a previous work, a clustering-based method had been incorporated with the latent feature space of an autoencoder to discover sub-classes of normal data for anomaly detection. However, the work has the limitation in manually setting up the numbers of clusters in the normal training data. Finding a proper number of clusters in datasets is often ambiguous and highly depends on the characteristics of datasets. This paper proposes a novel data-driven empirical approach for automatically identifying the number of normal sub-classes (clusters) without human intervention. This clustering-based method, afterward, is co-trained with an autoencoder to automatically discover the appreciated number of clusters of normal training data in the middle hidden layer of the autoencoder. The resulting clustering centers are then used to identify anomalies in querying data. Our approach is tested on four scenarios from the CTU13 datasets, and the experimental results show that the proposed model often perform better than those of the model in the previous work on almost scenarios.
科研通智能强力驱动
Strongly Powered by AbleSci AI