Neural network-based reinforcement learning control for combined spacecraft attitude tracking maneuvers

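Reinforcement learning; Computer science; Artificial neural network; Control theory (sociology); Spacecraft; Bellman equation; Background (archaeology); Optimal control; Theory (learning stability); Function (biology); Monotonic function; Control (management); Mathematical optimization; Artificial intelligence; Machine learning; Mathematics; Engineering; Evolutionary biology; Biology; Mathematical analysis; Aerospace engineering; Paleontology
Authors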
Yuhan Liu,Guangfu Ma,Yueyong Lyu,Pengyu Wang
Source
Journal: Neurocomputing [Elsevier]
Volume: 484, pages 67-78. Citations: 10
Identifier
DOI: 10.1016/j.neucom.2021.07.099
Abstract

This paper proposes a novel reinforcement learning-based attitude tracking control strategy for combined spacecraft takeover maneuvers with completely unknown dynamics. One major issue in combined spacecraft attitude takeover control is that the accurate dynamic model is highly nonlinear, complex, and costly to identify online, which makes it impractical for control design. To address this issue, we take advantage of the Q-learning algorithm to acquire the control strategy directly from system input/output measurement data in a model-free manner, thereby avoiding the online inertia parameter identification procedure. More specifically, the attitude tracking problem is first formulated as a regulation problem by introducing an augmented system, where the system dynamic model is still required in control design. Then, in order to achieve a model-free control strategy, an online policy-iteration (PI) Q-learning procedure is derived to solve the Bellman optimality equation using the generated measurement data. In the theoretical analysis, it is proved that the iteration sequences of the Q value function and the control strategy converge to the optimal ones. In addition, a rigorous proof of the stability and monotonicity guarantees of the proposed control strategy is provided. Furthermore, for the purpose of online implementation, an off-policy learning scheme is employed to find the optimal Q value function approximator with a neural network structure after the data-collection phase. Numerical simulations are presented to validate the effectiveness of the proposed strategy.
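
The policy-iteration Q-learning idea summarized above can be illustrated on a much simpler problem. The sketch below is not the authors' algorithm: it assumes a discrete-time linear system with a quadratic cost, so the Q-function is exactly quadratic and ordinary least squares stands in for the neural network approximator. The matrices `A`, `B`, `Qc`, `Rc` and all function names are hypothetical and chosen only for illustration. The model is used solely to generate data; the learner sees only the recorded (state, input, next state) samples, which are collected once and reused in every iteration, in the spirit of the off-policy scheme.

```python
import numpy as np

def collect_data(A, B, n_samples, rng):
    """Generate exploratory transitions; the learner never sees A or B."""
    X = rng.standard_normal((n_samples, A.shape[0]))
    U = rng.standard_normal((n_samples, B.shape[1]))
    X_next = X @ A.T + U @ B.T
    return X, U, X_next

def quad_features(Z):
    """Row-wise Kronecker product, so that features @ vec(H) = z' H z."""
    return np.einsum("ni,nj->nij", Z, Z).reshape(len(Z), -1)

def evaluate_policy(X, U, X_next, K, Qc, Rc):
    """Policy evaluation: fit H in Q(x,u) = z' H z from the Bellman equation
    Q(x,u) = cost(x,u) + Q(x', -K x') using the stored data."""
    Z = np.hstack([X, U])
    Z_next = np.hstack([X_next, -X_next @ K.T])  # next input follows policy K
    cost = (np.einsum("ni,ij,nj->n", X, Qc, X)
            + np.einsum("ni,ij,nj->n", U, Rc, U))
    h, *_ = np.linalg.lstsq(quad_features(Z) - quad_features(Z_next),
                            cost, rcond=None)
    n = Z.shape[1]
    H = h.reshape(n, n)
    return 0.5 * (H + H.T)  # enforce symmetry

def improve_policy(H, nx):
    """Policy improvement: u = -K x minimizes z' H z over u."""
    return np.linalg.solve(H[nx:, nx:], H[nx:, :nx])

def pi_q_learning(X, U, X_next, Qc, Rc, K0, n_iter=20):
    """Alternate evaluation and improvement on the same fixed data set."""
    K = K0
    for _ in range(n_iter):
        H = evaluate_policy(X, U, X_next, K, Qc, Rc)
        K = improve_policy(H, X.shape[1])
    return K, H

# Hypothetical open-loop-stable system, so K0 = 0 is a stabilizing start.
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
Qc, Rc = np.eye(2), np.eye(1)

rng = np.random.default_rng(0)
X, U, X_next = collect_data(A, B, 200, rng)
K_learned, H_learned = pi_q_learning(X, U, X_next, Qc, Rc, np.zeros((1, 2)))
print("learned feedback gain:", K_learned)
```

In this linear-quadratic setting the learned gain can be checked against the one obtained from the discrete-time Riccati equation. The spacecraft problem in the paper is nonlinear, which is why a neural network approximator is used there instead of a quadratic form.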