Multi-target tracking for unmanned aerial vehicle swarms using deep reinforcement learning

强化学习 计算机科学 可扩展性 人工智能 维数之咒 马尔可夫决策过程 交叉口(航空) 群体行为 机器学习 马尔可夫过程 数学 工程类 统计 数据库 航空航天工程
作者
Wenhong Zhou,Zhihong Liu,Jie Li,Xin Xu,Lincheng Shen
出处
期刊:Neurocomputing [Elsevier BV]
卷期号:466: 285-297 被引量:74
标识
DOI:10.1016/j.neucom.2021.09.044
摘要

In recent years, deep reinforcement learning (DRL) has proved its great potential in multi-agent cooperation. However, how to apply DRL to multi-target tracking (MTT) problem for unmanned aerial vehicle (UAV) swarms is challenging: 1) the scale of UAVs may be large, but the existing multi-agent reinforcement learning (MARL) methods that rely on global or joint information of all agents suffer from the dimensionality curse; 2) the dimension of each UAV’s received information is variable, which is incompatible with the neural networks with fixed input dimensions; 3) the UAVs are homogeneous and interchangeable that each UAV’s policy should be irrelevant to the permutation of its received information. To this end, we propose a DRL method for UAV swarms to solve the MTT problem. Firstly, a decentralized swarm-oriented Markov Decision Process (MDP) model is presented for UAV swarms, which involves each UAV’s local communication and partial observation. Secondly, to achieve better scalability, a cartogram feature representation (FR) is proposed to integrate the variable-dimensional information set into a fixed-shape input variable, and the cartogram FR can also maintain the permutation irrelevance to the information. Then, the double deep Q-learning network with dueling architecture is adapted to the MTT problem, and the experience-sharing training mechanism is adopted to learn the shared cooperative policy for UAV swarms. Extensive experiments are provided and the results show that our method can successfully learn a cooperative tracking policy for UAV swarms and outperforms the baseline method in the tracking ratio and scalability.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
星河发布了新的文献求助30
1秒前
研友_ndvWy8完成签到,获得积分10
3秒前
4秒前
6秒前
7秒前
寒酥完成签到,获得积分10
7秒前
DJM完成签到 ,获得积分20
7秒前
ii童歌发布了新的文献求助10
7秒前
英俊的小蝴蝶完成签到,获得积分10
8秒前
wanwan完成签到,获得积分20
8秒前
he完成签到,获得积分10
8秒前
8秒前
9秒前
9秒前
10秒前
华夫饼发布了新的文献求助10
10秒前
科研通AI2S应助紧张的大有采纳,获得10
10秒前
Hugo发布了新的文献求助10
10秒前
10秒前
10秒前
JamesPei应助高高的石头采纳,获得10
10秒前
DJM关注了科研通微信公众号
10秒前
12秒前
丁丁完成签到,获得积分10
12秒前
13秒前
缥缈天思发布了新的文献求助10
14秒前
Freya完成签到,获得积分10
14秒前
宝宝发布了新的文献求助10
14秒前
qiu发布了新的文献求助10
15秒前
舒心砖头发布了新的文献求助10
15秒前
愚畑完成签到,获得积分10
16秒前
xxsnn发布了新的文献求助10
16秒前
124完成签到,获得积分10
17秒前
18秒前
orixero应助爱吃大虾的猫采纳,获得10
18秒前
18秒前
sun完成签到,获得积分20
19秒前
科研通AI6.1应助缥缈天思采纳,获得10
19秒前
脑洞疼应助刻苦的巧荷采纳,获得10
20秒前
20秒前
高分求助中
The Graphene Handbook (2019 Edition) 800
Signals, Systems, and Signal Processing 610
IEST-RP-CC018: Cleanroom Cleaning and Sanitization: Operating and Monitoring Procedures 600
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
久松真一著作集〈第5巻〉禅と芸術 500
Fundamentals of Modern Mathematics: A Practical Review (Dover Books on Mathematics) 500
Cold War Transcended: Australia's China Policy, 1949-1990 470
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6600705
求助须知:如何正确求助?哪些是违规求助? 8369494
关于积分的说明 17913620
捐赠科研通 5756168
什么是DOI,文献DOI怎么找? 2954497
邀请新用户注册赠送积分活动 1929668
关于科研通互助平台的介绍 1825432