强化学习
最优控制
趋同(经济学)
乘法函数
国家(计算机科学)
数学优化
代数Riccati方程
计算机科学
线性二次高斯控制
随机控制
线性二次调节器
数学
二次方程
过程(计算)
控制(管理)
线性系统
控制理论(社会学)
Riccati方程
马尔可夫决策过程
代数数
随机过程
代数方程
缩小
随机逼近
控制器(灌溉)
状态向量
控制系统
鲁棒控制
随机优化
作者
Jianglin Yu,Bing‐Chang Wang,Deyuan Meng
摘要
ABSTRACT This paper aims at solving the infinite‐horizon stochastic linear quadratic (SLQ) optimal control problem online for continuous‐time systems with both additive and multiplicative noises. To eliminate the requirement for prior knowledge of system dynamics, a novel policy iteration approach is proposed, which leverages integral reinforcement learning (RL) techniques to iteratively solve the stochastic algebraic Riccati equation (SARE) using real‐time state and input data. The proposed approach is an off‐policy RL algorithm, where the learning process can be executed by using identical state and input data collected online over fixed time intervals, thereby enabling the optimal control law to be computed. The convergence of the proposed algorithm to the solution of the SARE is verified, and the effectiveness is validated through a numerical example.
科研通智能强力驱动
Strongly Powered by AbleSci AI