强化学习
人工智能
计算机科学
机器学习
工程类
运筹学
控制工程
作者
Guang Zhan,Kun Zhang,Ke Li,Haiyin Piao
出处
期刊:Chinese Journal of Systems Engineering and Electronics
[Institute of Electrical and Electronics Engineers]
日期:2024-04-23
卷期号:35 (3): 644-665
被引量:2
标识
DOI:10.23919/jsee.2024.000022
摘要
Autonomous umanned aerial vehicle (UAV) manipulation is necessary for the defense department to execute tactical missions given by commanders in the future unmanned battlefield. A large amount of research has been devoted to improving the autonomous decision-making ability of UAV in an interactive environment, where finding the optimal maneuvering decision-making policy became one of the key issues for enabling the intelligence of UAV. In this paper, we propose a maneuvering decision-making algorithm for autonomous air-delivery based on deep reinforcement learning under the guidance of expert experience. Specifically, we refine the guidance towards area and guidance towards specific point tasks for the air-delivery process based on the traditional air-to-surface fire control methods. Moreover, we construct the UAV maneuvering decision-making model based on Markov decision processes (MDPs). Specifically, we present a reward shaping method for the guidance towards area and guidance towards specific point tasks using potential-based function and expert-guided advice. The proposed algorithm could accelerate the convergence of the maneuvering decision-making policy and increase the stability of the policy in terms of the output during the later stage of training process. The effectiveness of the proposed maneuvering decision-making policy is illustrated by the curves of training parameters and extensive experimental results for testing the trained policy.
科研通智能强力驱动
Strongly Powered by AbleSci AI