动态规划
不变(物理)
最优控制
离散时间和连续时间
数学优化
LTI系统理论
计算机科学
系统动力学
自适应控制
数学
控制理论(社会学)
控制(管理)
线性系统
人工智能
统计
数学物理
数学分析
作者
Yuanheng Zhu,Dongbin Zhao,Haibo He
标识
DOI:10.1109/tsmc.2019.2911900
摘要
For systems that can only be locally stabilized, control laws and their effective regions are both important. In this paper, invariant policy iteration is proposed to solve the optimal control of discrete-time systems. At each iteration, a given policy is evaluated in its invariantly admissible region, and a new policy and a new region are updated for the next iteration. Theoretical analysis shows the method is regionally convergent to the optimal value and the optimal policy. Combined with sum-of-squares polynomials, the method is able to achieve the near-optimal control of a class of discrete-time systems. An invariant adaptive dynamic programming algorithm is developed to extend the method to scenarios where system dynamics is not available. Online data are utilized to learn the near-optimal policy and the invariantly admissible region. Simulated experiments verify the effectiveness of our method.
科研通智能强力驱动
Strongly Powered by AbleSci AI