终端(电信)
终端速度
强化学习
钢筋
计算机科学
人工智能
模拟
航空学
工程类
结构工程
物理
机械
电信
作者
Hao Zhang,Jianwen Zhu,LI Xiao-pin,Weimin Bao,Haifeng Sun
标识
DOI:10.1177/09544100241263123
摘要
An adaptive guidance strategy that integrates optimal guidance and deep reinforcement learning to address a highly dynamic terminal guidance problem that entails meeting terminal position, angle, and velocity constraints. The proposed strategy leverages optimal guidance commands to accomplish position and angle control while introducing a deep reinforcement learning-based bias for the velocity constraint. In the training process, a dual-velocity state space is constructed to enhance the adaptability of the strategy to different guidance tasks, while training is optimized using the prediction-correction and expert knowledge to improve the training efficiency and optimality of the strategy. Simulations demonstrate that the proposed guidance strategy can achieve simultaneous control of terminal position, angle and velocity, and adapt to different guidance tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI