计算机科学
异常检测
故障排除
人工智能
自编码
精确性和召回率
决策树
异常(物理)
模式识别(心理学)
深度学习
支持向量机
机器学习
数据挖掘
人工神经网络
卷积神经网络
作者
Marta Catillo,Antonio Pecchia,Umberto Villano
标识
DOI:10.1016/j.eswa.2021.116263
摘要
The use of system logs for detecting and troubleshooting anomalies of production systems has been known since the early days of computers. In spite of the advances in the area, the analysis of log files emitted by real-life systems poses many peculiar challenges. Up-to-date tools, such as log management and Security Information and Event Management (SIEM) products, capitalize on standard data formats, logging protocols and dictionaries of threat signatures, which hardly fit to logs of industrial and proprietary systems. This paper addresses the analysis of logs emitted by computer systems with a focus on anomaly detection. The proposed approach, named AutoLog, consists in sampling the logs at regular intervals and to compute numeric scores. Scores collected under normative operations are used to train a semi-supervised deep autoencoder, which serves as a baseline to classify future scores. The approach is not constrained by the structure of underlying logs and does not need for anomalies at training time. The results obtained in detecting anomalies of two industrial systems and the public BG/L and Hadoop datasets widely used as benchmarks, indicate that the recall of AutoLog ranges between 0.96 and 0.99, while the precision is within 0.93 and 0.98. A comparative study with isolation forest, one-class SVM, decision tree, vanilla autoencoder and variational autoencoder is conducted to demonstrate the validity of the proposal. • A semi-supervised learning technique for anomaly detection. • Automatic knowledge extraction from system logs. • Deep autoencoding and non-linear data transformations. • Measurements with real-life computer systems and logs.
科研通智能强力驱动
Strongly Powered by AbleSci AI