强化学习
计算机科学
人工智能
钢筋
工程类
结构工程
作者
Jiebang Xing,Xianlin Zeng
出处
期刊:Chinese Control Conference
日期:2021-07-26
卷期号:: 8366-8371
标识
DOI:10.23919/ccc52363.2021.9550113
摘要
David Gale's lion and man problem, which is a fundamental pursuit-evasion game, attracts the interest of many scholars. We propose a deep reinforcement learning method based on Deep Deterministic Policy Gradient (DDPG) to solve this problem. We first improve the exploration strategy by adding guided exploration and dynamic spaces exploration strategies to the greedy algorithm. Then we introduce a learning reset mechanism to help the agents escape the traps in the learning process. With these improvements, our method achieves better performance than the classic DDPG algorithm in the lion and man problem. The simulation result shows that deep reinforcement learning method may be promising to solve this problem.
科研通智能强力驱动
Strongly Powered by AbleSci AI