计算机科学
水准点(测量)
强化学习
可扩展性
无人机
光学(聚焦)
人工智能
人工神经网络
实时计算
分布式计算
物理
大地测量学
数据库
生物
光学
遗传学
地理
作者
Jiehong Wu,Danyang Li,Yuanzhe Yu,Lijun Gao,Jinsong Wu,Guangjie Han
标识
DOI:10.1109/tits.2024.3379508
摘要
With the further development of Unmanned Aerial Vehicle (UAV) technologies, research on multi-UAV formations have also received more attention. Unmanned Aerial Vehicles (UAVs) cooperate with each other to form a formation group, which can give full play to the advantages that a single UAV does not have, and more capable of working in multi-task scenarios. Based on the Sierpinski fractal structure, this paper proposes a hybrid formation control architecture. To address the aggregation problem of UAV swarms, a multi-agent deep reinforcement learning (MADRL) method is used for aggregation control. To improve the efficiency, accuracy, effectiveness and scalability of MADRL in environments with a large number of UAVs, a multi-intelligent deep deterministic policy gradient method based on attention mechanism and adaptive accuracy (3A-MADDPG) is proposed. The method enables each agent to selectively focus on the information of other agents, with adaptive learning rate to dynamically learn its own critic network. The algorithm has a large learning rate in the early stage and converges quickly. The later stage of the algorithm has small learning rate and high accuracy, and combined with the fixed variable hybrid reward function designed in the paper, so the UAV can fly to the center of the expected region more accurately, making the whole cluster more accurate. Experiments show that the 3A algorithm proposed in the paper converges faster and achieves higher accuracy than the benchmark algorithm, whether it is the cluster aggregation formation or the separation and aggregation of sub-formations.
科研通智能强力驱动
Strongly Powered by AbleSci AI