初始化
控制理论(社会学)
强化学习
计算机科学
迭代学习控制
数学优化
系统动力学
控制(管理)
理论(学习稳定性)
数学
人工智能
机器学习
程序设计语言
作者
Ci Chen,Frank L. Lewis,Bo Li
出处
期刊:Automatica
[Elsevier BV]
日期:2022-01-28
卷期号:138: 110153-110153
被引量:39
标识
DOI:10.1016/j.automatica.2021.110153
摘要
Recent results have emerged that policy iteration is a powerful reinforcement learning tool in designing a stabilizing control policy for continuous-time systems with unknown system dynamics. Policy iteration involves a model-based initialization stage, i.e., seeking an initial stabilizing control policy, which is, however, dependent on the full system dynamics including the drift dynamics and system input matrix. To remove such model requirements, this paper utilizes a homotopy-based initialization strategy for policy iteration, wherein a stabilizing control policy for continuous-time systems is obtained by gradually moving a stable system to the original system. We propose two homotopic policy iteration-based stabilizing control schemes, namely, a model-based design and a model-free design using system data, which are proved to place unstable poles into a stable region. The effectiveness of the proposed designs is validated through an illustrative example.
科研通智能强力驱动
Strongly Powered by AbleSci AI