Graph-based multi-agent reinforcement learning for large-scale UAVs swarm system control

强化学习 群体行为 计算机科学 图形 比例(比率) 分布式计算 控制(管理) 多智能体系统 人工智能 理论计算机科学 物理 量子力学
作者
Bocheng Zhao,Mingying Huo,Zheng Li,Ze Yu,Naiming Qi
出处
期刊:Aerospace Science and Technology [Elsevier BV]
卷期号:150: 109166-109166 被引量:41
标识
DOI:10.1016/j.ast.2024.109166
摘要

In this study, a novel graph-embedding technique based on a graph neural network (GNN) is proposed to identify the topology in the motion of a unmanned aerial vehicles (UAV) swarm and quickly obtain local information around each agent. We also propose a model reference reinforcement learning method to learn the potential field function and determine an appropriate strategy for each agent that can satisfy the requirements of collaborative motion and obstacle avoidance for large-scale UAV swarms. First, a new swarm structure is proposed to provide reserved maneuvering space for UAVs during flight. In addition, a method was proposed to encode the obstacle avoidance behavior of multiple UAVs in a continuous space into spatial maps. A graph attention mechanism (GAT) structure based on local information was proposed to obtain dynamic graph information, and each individual output action was obtained according to the current state information. To improve the training effect, this method can restrain the UAV group while maintaining the formation and preventing collisions among the UAV. Second, a new distributed control algorithm based on multi-agent reinforcement learning (MARL) is proposed by learning the potential field function using local information obtained by a GNN. Each individual can repel and cooperate with the target within a short range and attract objects over a long distance. Finally, simulation results demonstrate the effectiveness and superiority of the proposed method, which has great potential for application in online autonomous collaboration.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
xW12123完成签到,获得积分10
刚刚
刚刚
刚刚
1秒前
YYY完成签到,获得积分20
2秒前
打打应助听话的亦云采纳,获得10
2秒前
义气豌豆完成签到 ,获得积分10
2秒前
二愣子完成签到,获得积分10
3秒前
我喝白开水完成签到,获得积分10
3秒前
jessicaw完成签到,获得积分10
3秒前
来了完成签到,获得积分10
3秒前
Orange应助puzhongjiMiQ采纳,获得10
3秒前
伶俐茗茗应助puzhongjiMiQ采纳,获得10
3秒前
情怀应助puzhongjiMiQ采纳,获得10
3秒前
dxftx应助puzhongjiMiQ采纳,获得10
3秒前
星辰大海应助puzhongjiMiQ采纳,获得10
4秒前
4秒前
星星完成签到,获得积分10
4秒前
4秒前
研友_VZG7GZ应助快乐的夜云采纳,获得10
4秒前
细心的岩完成签到,获得积分10
4秒前
小橙子完成签到,获得积分10
5秒前
5秒前
无水乙醚完成签到,获得积分10
6秒前
YYY发布了新的文献求助20
6秒前
6秒前
心落失完成签到,获得积分10
7秒前
顾矜应助旺旺采纳,获得10
7秒前
啵啵奶冻完成签到 ,获得积分10
7秒前
哎哟可爱发布了新的文献求助10
7秒前
啦啦啦发布了新的文献求助10
7秒前
阔达惮完成签到,获得积分10
7秒前
眼睛大的百褶裙完成签到,获得积分10
7秒前
安赛虫完成签到,获得积分10
8秒前
有魅力强炫完成签到,获得积分10
8秒前
巫霸完成签到,获得积分10
9秒前
Orange应助东单的单车采纳,获得10
9秒前
张钰完成签到,获得积分10
9秒前
9秒前
cdercder应助oleskarabach采纳,获得10
10秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Prompt Engineering for Clinicians: Harnessing AI in Everyday Medical Practice 600
University Physics for the Life Sciences 500
REAL-WORLD EFFICACY AND GENOMIC LANDSCAPE OF POLATUZUMA VEDOTIN-BASED FIRST-LINE THERAPY IN DIFFUSE LARGE B-CELL LYMPHOMA: A FOCUS ON TP53 MUTATIONS AND TREATMENT RESPONSE 500
Handbook of Luminescence Dating 500
Safety Pharmacology 500
《KNN基无铅压电陶瓷电学性能优化与物理机理研究》 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 计算机科学 化学工程 生物化学 物理 内科学 复合材料 催化作用 光电子学 物理化学 电极 细胞生物学 基因 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6952022
求助须知:如何正确求助?哪些是违规求助? 8636246
关于积分的说明 18312339
捐赠科研通 6394755
什么是DOI,文献DOI怎么找? 3082285
关于科研通互助平台的介绍 2127728
邀请新用户注册赠送积分活动 2059159