强化学习
群体行为
计算机科学
图形
比例(比率)
分布式计算
控制(管理)
多智能体系统
人工智能
理论计算机科学
物理
量子力学
作者
Bocheng Zhao,Mingying Huo,Zheng Li,Ze Yu,Naiming Qi
标识
DOI:10.1016/j.ast.2024.109166
摘要
In this study, a novel graph-embedding technique based on a graph neural network (GNN) is proposed to identify the topology in the motion of a unmanned aerial vehicles (UAV) swarm and quickly obtain local information around each agent. We also propose a model reference reinforcement learning method to learn the potential field function and determine an appropriate strategy for each agent that can satisfy the requirements of collaborative motion and obstacle avoidance for large-scale UAV swarms. First, a new swarm structure is proposed to provide reserved maneuvering space for UAVs during flight. In addition, a method was proposed to encode the obstacle avoidance behavior of multiple UAVs in a continuous space into spatial maps. A graph attention mechanism (GAT) structure based on local information was proposed to obtain dynamic graph information, and each individual output action was obtained according to the current state information. To improve the training effect, this method can restrain the UAV group while maintaining the formation and preventing collisions among the UAV. Second, a new distributed control algorithm based on multi-agent reinforcement learning (MARL) is proposed by learning the potential field function using local information obtained by a GNN. Each individual can repel and cooperate with the target within a short range and attract objects over a long distance. Finally, simulation results demonstrate the effectiveness and superiority of the proposed method, which has great potential for application in online autonomous collaboration.
科研通智能强力驱动
Strongly Powered by AbleSci AI