强化学习
培训(气象学)
计算机科学
人工智能
钢筋
工程类
结构工程
物理
气象学
作者
Gaoqing Shen,Lei Lei,Xinting Zhang,Zhilin Li,Shengsuo Cai,Lijuan Zhang
出处
期刊:IEEE Transactions on Vehicular Technology
[Institute of Electrical and Electronics Engineers]
日期:2023-02-23
卷期号:72 (7): 8354-8368
被引量:41
标识
DOI:10.1109/tvt.2023.3245120
摘要
This paper considers the cooperative search for stationary targets by multiple unmanned aerial vehicles (UAVs) with limited sensing range and communication ability in a dynamic threatening environment. The main purpose is to use multiple UAVs to find more unknown targets as soon as possible, increase the coverage rate of the mission area, and more importantly, guide UAVs away from threats. However, traditional search methods are mostly unscalable and perform poorly in dynamic environments. A new multi-agent deep reinforcement learning (MADRL) method, DNQMIX, is proposed in this study to solve the multi-UAV cooperative target search (MCTS) problem. The reward function is also newly designed for the MCTS problem to guide UAVs to explore and exploit the environment information more efficiently. Moreover, this paper proposes a digital twin (DT) driven training framework "centralized training, decentralized execution, and continuous evolution" (CTDECE). It can facilitate the continuous evolution of MADRL models and solve the tradeoff between training speed and environment fidelity when MADRL is applied to real-world multi-UAV systems. Simulation results show that DNQMIX outperforms state-of-art methods in terms of search rate and coverage rate.
科研通智能强力驱动
Strongly Powered by AbleSci AI