群体行为
强化学习
计算机科学
过程(计算)
群机器人
人工智能
纳什均衡
数学优化
数学
操作系统
作者
Zhi Wang,Fan Liu,Guo Jing,Hong Chen,Ming Chen,Ershen Wang,Yun‐Bo Zhao
标识
DOI:10.23919/ccc55666.2022.9902663
摘要
Multi-agent deep reinforcement learning (MADRL) has attracted a tremendous amount of interest in recent years. In this paper, we introduce MADRL into the confrontation scene of Unmanned Aerial Vehicle (UAV) swarm. To analysis the dynamic game process of UAV swarm confrontation, we build two non-cooperative game models based on MADRL paradigm. By using the multi-agent deep deterministic policy gradient (MADDPG) and the centralized training with decentralized execution method, we achieve the Nash equilibrium under 5 vs. 5 UAV confrontation scenes. We also introduce multi-agent soft actor critic (MASAC) method into the UAV swarm confrontation, simulation results indicate that the MASAC-based model outperforms the MADDPG-based model on exploring the UAV swarm combat environment, and more effectively converges to the Nash equilibrium. Our work will provide new insights into the confrontation of UAV swarm.
科研通智能强力驱动
Strongly Powered by AbleSci AI