跟踪(心理语言学)
计算机科学
异常检测
追踪
人工智能
图形
数据挖掘
异常(物理)
集合(抽象数据类型)
节点(物理)
机器学习
模式识别(心理学)
理论计算机科学
工程类
程序设计语言
哲学
语言学
物理
结构工程
凝聚态物理
作者
Ke Zhang,Chenxi Zhang,Xin Peng,Chaofeng Sha
标识
DOI:10.1109/issre55969.2022.00032
摘要
Distributed tracing has been an important part of microservice infrastructure and learning-based trace analysis has been used to detect anomalies in microservice systems. Existing learning-based trace anomaly detection approaches ei-ther assume that trace patterns can be learned from normal execution or rely on fault injection to produce labeled traces (i.e., normal/anomalous ones). However, in practice it is often difficult to ensure that the normal execution does not involve anomalous traces or obtain a large variety of normal and anomalous traces through fault injection. In this paper, we propose PUTraceAD, a trace anomaly detection approach that can alleviate the above problems. PUTraceAD represents a trace as a span causal graph with node features such as operation name, response code, duration time. Based on the graph representation, PUTraceAD trains a GNN- and PU learning-based trace anomaly detection model. During the process, PU (Positive and Unlabeled) learning optimizes model parameters through estimating the data distribution. Therefore, PUTraceAD can train the model based on a small set of labeled anomalous traces and a large set of unlabeled traces. Our evaluation shows that PUTraceAD outperforms existing unsupervised trace anomaly detection approaches and only slightly underperforms a supervised learning-based approach that takes full advantage of labeled traces.
科研通智能强力驱动
Strongly Powered by AbleSci AI