无人地面车辆
计算机科学
遥感
环境科学
人工智能
地质学
作者
Wei Li,Xin Chen,Wei Yu,Mingyang Xie
标识
DOI:10.1177/09596518241291185
摘要
With the development of swarm intelligence, multi-unmanned aerial vehicle (UAV) cooperation in uncertain dynamic environments has been able to safely and efficiently realize all-round strikes on ground targets. However, the limited life span of UAV onboard batteries challenges the cooperative strikes against the ground targets. To address this challenge, this paper proposes an improved multi-agent deep deterministic policy gradient (i-MADDPG) algorithm for energy-saving cooperative strikes of ground targets. Introducing a sensor fusion layer and self-attention mechanism to the actor network helps the UAVs collect more comprehensive information and filter the collected data, resulting in more accurate environmental perception. In addition, importing an egocentric state representation mechanism into the critic network contributes to the computation of different Q-values for different UAV observation states, which ensures each UAV can make appropriate decisions based on its own state. Simulation experiments are conducted to validate the performance of the improved MADDPG algorithm. The results show that the success rate of ground target strikes of the proposed algorithm is improved by 29.5%, and the collision rate is reduced compared with the traditional MADDPG algorithm.
科研通智能强力驱动
Strongly Powered by AbleSci AI