强化学习
群体行为
计算机科学
人工智能
群机器人
作者
Shuzhe Xuan,Liangjun Ke
出处
期刊:Lecture notes in electrical engineering
日期:2021-10-30
卷期号:: 5599-5608
被引量:5
标识
DOI:10.1007/978-981-15-8155-7_464
摘要
This paper studies the problem of UAV swarm attack-defense confrontation, which can be viewed as an extension of defending territory game. In this problem, a swarm of intruder UAVs attempt to invade into a territory, which is guarded by a swarm of defender UAVs. This problem is a great challenge to traditional methods. To deal with it, a multi-agent deep reinforcement learning approach is proposed, which is based on the Multi-Agent Deep Deterministic Policy Gradient algorithm (MADDPG). A simulation platform is developed which takes account of UAV flight constraints and simulates a real flight environment. To study the performance of the proposed algorithm, we compare it with DDPG. Experimental results show that the UAVs using the MADDPG algorithm can learn better strategies and achieve better performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI