强化学习
控制(管理)
计算机科学
钢筋
心理学
人工智能
社会心理学
作者
Bahare Kiumarsi,Kyriakos G. Vamvoudakis,Hamidreza Modares,Frank L. Lewis
标识
DOI:10.1109/tnnls.2017.2773458
摘要
This paper reviews the current state of the art on reinforcement learning (RL)-based feedback control solutions to optimal regulation and tracking of single and multiagent systems. Existing RL solutions to both optimal and control problems, as well as graphical games, will be reviewed. RL methods learn the solution to optimal control and game problems online and using measured data along the system trajectories. We discuss Q-learning and the integral RL algorithm as core algorithms for discrete-time (DT) and continuous-time (CT) systems, respectively. Moreover, we discuss a new direction of off-policy RL for both CT and DT systems. Finally, we review several applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI