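Reinforcement learning
Computer science
Scalability
Task (project management)
Convergence (economics)
Resource allocation
Distributed computing
Resource management (computing)
Task analysis
Artificial intelligence
Computer network
Engineering
Economic growth
Database
Economy
Systems engineering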
Authors
Da Liu,Liqian Dou,Ruilong Zhang,Xiuyun Zhang,Qun Zong
Source
Journal: IEEE Transactions on Vehicular Technology
[Institute of Electrical and Electronics Engineers]
Date: 2022-12-12
Volume (Issue): 72 (4): 4372-4383
Citations: 54
Identifier
DOI: 10.1109/tvt.2022.3228198
Abstract
This paper studies the coordinated dynamic task allocation (CDTA) problem for heterogeneous unmanned aerial vehicles (UAVs) under environment uncertainty. Dynamic task allocation mainly addresses the reallocation of resources after new tasks appear, so that multi-UAV systems can respond quickly to new information and objectives. The paper proposes a CDTA strategy for heterogeneous UAVs based on a proposer-responser mechanism and prioritized experience replay: a multi-agent reinforcement learning (MARL)-based coordinated network is constructed to propose requests, and a Q-network is developed to approximate the expected return and thereby determine whether a responser participates in the dynamic task. The CDTA algorithm accounts for the uncertainty of dynamic tasks and scales well across UAV groups of different sizes, which reduces the burden of online computation and effectively increases the speed of online operation. Experiments show that prioritized experience replay speeds up the convergence of the algorithm, and the scalability of the algorithm is verified for groups of 10 to 180 UAVs. Comparison simulations against game theory-based and reinforcement learning-based methods are provided to show the effectiveness of the proposed algorithm.
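
The abstract names prioritized experience replay as the component that speeds up convergence but gives no implementation detail. As a rough illustration only, the following is a minimal sketch of the generic proportional variant of prioritized replay, not the authors' code. The class name, the hyperparameters `alpha`, `beta`, and `eps`, and the transition format are all assumptions introduced here.

```python
import random


class PrioritizedReplayBuffer:
    """Generic proportional prioritized replay (illustrative, hypothetical names)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-5):
        self.capacity = capacity
        self.alpha = alpha        # assumed: how strongly priorities skew sampling
        self.eps = eps            # assumed: keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0              # next slot to overwrite once the buffer is full

    def add(self, transition):
        # New transitions receive the current maximum priority so that
        # they are likely to be replayed soon after being stored.
        p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4, rng=random):
        # Sampling probability is proportional to priority ** alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        idxs = rng.choices(range(len(self.data)), weights=probs, k=batch_size)
        # Importance-sampling weights correct the bias of non-uniform
        # sampling; they are normalized by the largest weight in the batch.
        n = len(self.data)
        weights = [(n * probs[i]) ** (-beta) for i in idxs]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        return idxs, [self.data[i] for i in idxs], weights

    def update_priorities(self, idxs, td_errors):
        # After a learning step, the priority becomes |TD error| + eps.
        for i, err in zip(idxs, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a Q-learning loop, one would typically call `sample` to draw a batch, scale each sample's loss by its returned weight, and then pass the resulting temporal-difference errors to `update_priorities`. How the paper combines this with its coordinated network is not described in the abstract.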