维数之咒
异常检测
相似性(几何)
欧几里德距离
差异(会计)
离群值
计算机科学
代表(政治)
秩(图论)
欧几里得空间
空格(标点符号)
入侵检测系统
模式识别(心理学)
数据点
维数(图论)
人工智能
数据挖掘
数学
图像(数学)
业务
会计
组合数学
操作系统
法学
纯数学
政治
政治学
作者
Xiaojie Li,Jiancheng Lv,Yi Zhang
标识
DOI:10.1109/tcyb.2018.2876615
摘要
Outlier detection has drawn significant interest from both academia and industry, such as network intrusion detection. Most existing methods implicitly or explicitly rely on distances in Euclidean space. However, the Euclidean distance may be incapable of measuring the similarity among high-dimensional data due to the curse of dimensionality, thus leading to inferior performance in practice. This paper presents an innovative approach for outlier detection from the view of meaningful structure scores. If two points have similar features, the difference between their structural scores is small and vice versa. The scores are calculated by measuring the variance of angles weighted by data representation, which takes the global data structure into the measurement. Thus, it could consistently rank more similar points. Compared with existing methods, our structural scores could be better to reflect the characteristics of data in a high-dimensional space. The proposed method consistently ranks more similar points. Experiments on synthetic and several real-world datasets have demonstrated the effectiveness and efficiency of our proposed methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI