强化学习
计算机科学
利用
部分可观测马尔可夫决策过程
马尔可夫决策过程
解算器
人工智能
机器学习
过程(计算)
马尔可夫过程
马尔可夫链
马尔可夫模型
计算机安全
统计
操作系统
程序设计语言
数学
作者
Mohamed Chahine Ghanem,Tom Chen
标识
DOI:10.1109/worlds4.2018.8611595
摘要
Penetration testing (PT) is an active method for assessing and evaluating the security of digital assets by planning, generating and executing all possible attacks that can exploit existing vulnerabilities. Current PT practice is becoming repetitive, complex and resource consuming despite the use of automated tools. The goal of this paper is to design an intelligent PT approach using reinforcement learning (RL) that will allow regular and systematic testing, saving human resources. The system is modelled as a partially observed Markov decision process (POMDP), and tested using an external POMDP-solver with different algorithms. Although this paper is limited to only the planning phase and not the entire PT process, the results support the hypothesis that reinforcement learning can enhance PT beyond the capabilities of any human expert in terms of accurate and reliable outputs.
科研通智能强力驱动
Strongly Powered by AbleSci AI