强化学习
跳跃
增强学习
马尔可夫链
跟踪(教育)
计算机科学
离散时间和连续时间
算法
马尔可夫决策过程
控制(管理)
马尔可夫过程
控制理论(社会学)
人工智能
数学
机器学习
统计
心理学
教育学
物理
量子力学
作者
Jiahui Shi,Dakuo He,Qiang Zhang
标识
DOI:10.1080/00207721.2024.2395928
摘要
In this paper, the H∞ tracking control problem of linear discrete-time Markov jump systems is studied by using the data-based reinforcement learning method. Specifically, a new performance index function is established by using Markov chain and weighted sum technique, and thus the tracking game algebraic Riccati equation with weight vector and discount factor is obtained. A Q-learning algorithm is proposed to solve the tracking game algebra Riccati equation problem online without knowing the information of the system model. In addition, the convergence analysis of the algorithm is given, and it is proved that the added probing noise will not bias the algorithm. Finally, two simulation examples are given to verify the effectiveness of the proposed algorithm.
科研通智能强力驱动
Strongly Powered by AbleSci AI