异常检测
部分可观测马尔可夫决策过程
计算机科学
强化学习
异常(物理)
人工智能
马尔可夫决策过程
贝叶斯概率
机器学习
马尔可夫过程
数据挖掘
马尔可夫链
马尔可夫模型
数学
统计
物理
凝聚态物理
作者
Jeremy Lee Watts,Franco van Wyk,Shahrbanoo Rezaei,Yiyang Wang,Neda Masoud,Anahita Khojandi
出处
期刊:IEEE Transactions on Intelligent Transportation Systems
[Institute of Electrical and Electronics Engineers]
日期:2022-12-01
卷期号:23 (12): 22884-22894
被引量:1
标识
DOI:10.1109/tits.2022.3200906
摘要
To assure the successful operation of connected and automated vehicles, it is critical to detect and isolate anomalous and/or faulty information in a timely manner. To do so, anomaly detection techniques should be implemented in real-time where if the probability of anomalous information exceeds a certain threshold, the information is dealt with accordingly. Traditionally, the threshold for judging whether the data is anomalous is fixed and determined a priori. However, not only does this approach fail to account for the feedback obtained during a trip on the performance of the algorithms, but it also fails to respond to potential changes in rates of anomalies. Hence, it is important to develop an approach that can dynamically alter this threshold in response to exogenous factors to assure reliable and robust system operation. We develop a mathematical framework which utilizes a dynamic threshold for an anomaly classification algorithm in order to maximize the safety of a trip. Specifically, we develop and pair an anomaly classification algorithm based on convolutional neural networks (CNN), with a partially observable Markov decision process (POMDP) model. We solve the resulting POMDP model using the asynchronous advantage actor critic (A3C) deep reinforcement learning algorithm. The prescribed policy determines the anomaly classification threshold in real-time that maximizes the performance. Our numerical experiments show that the POMDP model outperforms state-of-the-art benchmarks, especially under more difficult to detect anomaly profiles.
科研通智能强力驱动
Strongly Powered by AbleSci AI