An Iterative Method for Unsupervised Robust Anomaly Detection Under Data Contamination

异常检测正态性计算机科学稳健性（进化）异常（物理）水准点（测量）人工智能离群值数据挖掘机器学习超参数模式识别（心理学）数学统计凝聚态物理基因物理化学大地测量学地理生物化学

作者

Minkyung Kim,Jongmin Yu,Junsik Kim,Tae-Hyun Oh,Jun Kyun Choi

出处

期刊：IEEE transactions on neural networks and learning systems [Institute of Electrical and Electronics Engineers]
日期：2023-05-10 卷期号：35 (10): 13327-13339 被引量：17

链接

nih.gov arxiv.org arxiv.org ac.krdoi.org

标识

DOI：10.1109/tnnls.2023.3267028

摘要

Most deep anomaly detection models are based on learning normality from datasets due to the difficulty of defining abnormality by its diverse and inconsistent nature. Therefore, it has been a common practice to learn normality under the assumption that anomalous data are absent in a training dataset, which we call normality assumption. However, in practice, the normality assumption is often violated due to the nature of real data distributions that includes anomalous tails, i.e., a contaminated dataset. Thereby, the gap between the assumption and actual training data affects detrimentally in learning of an anomaly detection model. In this work, we propose a learning framework to reduce this gap and achieve better normality representation. Our key idea is to identify sample-wise normality and utilize it as an importance weight, which is updated iteratively during the training. Our framework is designed to be model-agnostic and hyperparameter insensitive so that it applies to a wide range of existing methods without careful parameter tuning. We apply our framework to three different representative approaches of deep anomaly detection that are classified into one-class classification-, probabilistic model-, and reconstruction-based approaches. In addition, we address the importance of a termination condition for iterative methods and propose a termination criterion inspired by the anomaly detection objective. We validate that our framework improves the robustness of the anomaly detection models under different levels of contamination ratios on five anomaly detection benchmark datasets and two image datasets. On various contaminated datasets, our framework improves the performance of three representative anomaly detection methods, measured by area under the ROC curve.

求助该文献

最长约 10秒，即可获得该文献文件

An Iterative Method for Unsupervised Robust Anomaly Detection Under Data Contamination

今日热心研友