计算机科学
数据流挖掘
维数之咒
离群值
数据科学
背景(考古学)
异常检测
数据挖掘
任务(项目管理)
航程(航空)
集合(抽象数据类型)
概念漂移
领域(数学)
空格(标点符号)
机器学习
人工智能
系统工程
古生物学
材料科学
生物
工程类
复合材料
程序设计语言
数学
纯数学
操作系统
作者
Imen Souiden,Mohamed Nazih Omri,Zaki Brahmi
标识
DOI:10.1016/j.cosrev.2022.100463
摘要
The rapid evolution of technology has led to the generation of high dimensional data streams in a wide range of fields, such as genomics, signal processing, and finance. The combination of the streaming scenario and high dimensionality is particularly challenging especially for the outlier detection task. This is due to the special characteristics of the data stream such as the concept drift, the limited time and space requirements, in addition to the impact of the well-known curse of dimensionality in high dimensional space. To the best of our knowledge, few studies have addressed these challenges simultaneously, and therefore detecting anomalies in this context requires a great deal of attention. The main objective of this work is to study the main approaches existing in the literature, to identify a set of comparison criteria, such as the computational cost and the interpretation of outliers, which will help us to reveal the different challenges and additional research directions associated with this problem. At the end of this study, we will draw up a summary report which summarizes the main limits identified and we will detail the different directions of research related to this issue in order to promote research for this community.
科研通智能强力驱动
Strongly Powered by AbleSci AI