线性二次调节器
离散时间和连续时间
国家(计算机科学)
计算机科学
时间范围
二次方程
数学优化
最优控制
控制(管理)
功能(生物学)
控制器(灌溉)
控制理论(社会学)
数学
人工智能
算法
农学
几何学
统计
进化生物学
生物
作者
P. Muthukumar,Hamidreza Modares,Frank L. Lewis,Muhammad Aurangzeb
标识
DOI:10.1109/tcyb.2014.2322116
摘要
This paper presents a method of Q-learning to solve the discounted linear quadratic regulator (LQR) problem for continuous-time (CT) continuous-state systems. Most available methods in the existing literature for CT systems to solve the LQR problem generally need partial or complete knowledge of the system dynamics. Q-learning is effective for unknown dynamical systems, but has generally been well understood only for discrete-time systems. The contribution of this paper is to present a Q-learning methodology for CT systems which solves the LQR problem without having any knowledge of the system dynamics. A natural and rigorous justified parameterization of the Q-function is given in terms of the state, the control input, and its derivatives. This parameterization allows the implementation of an online Q-learning algorithm for CT systems. The simulation results supporting the theoretical development are also presented.
科研通智能强力驱动
Strongly Powered by AbleSci AI