主观性
云计算
强化学习
计算机科学
云存储
计算机安全
人工智能
操作系统
认识论
哲学
作者
Zahra Aref,Narayan B. Mandayam
标识
DOI:10.1109/infocomwkshps54753.2022.9798283
摘要
Cloud storage is a target of advanced persistent threats (APTs), where a sophisticated adversary attempts to steal sensitive data in a continuous manner. Human monitoring and intervention is an integral part of the reinforcement learning (RL) approaches to defend against APTs. In this paper, prospect theory (PT) is used to model the subjective behavior of the cloud storage defender in assigning computing resources (processing units) to scan and monitor the cloud storage system against an APT attacker bot, which attempts to steal information from the cloud. Under a constraint on the total number of processing units and a lack of knowledge of the opponent's resource allocation strategy, we study the defense performance of a federated maximum-likelihood deep Q-network (FMLDQ) RL algorithm against a sophisticated branching dueling deep Q-network (BDQ) RL attack algorithm. Specifically, the RL strategy for the defender is affected by subjective decisions in estimating the processing units of the attacker. Simulation results show that when the defender has more resources than the attacker, an EUT-based defense strategy (without human intervention) yields better data protection. On the other hand, when the defender has fewer resources, a PT-based defense strategy (with human intervention) is better.
科研通智能强力驱动
Strongly Powered by AbleSci AI