Keywords
Reinforcement learning, Markov decision process, Computer science, Fuzzy control system, Control theory, Convergence, Mathematical optimization, Fuzzy logic, Discrete time and continuous time, Nonlinear system, Controller, Mathematics, Markov process, Control, Artificial intelligence, Quantum mechanics, Biology, Statistics, Economic growth, Physics, Economics, Agronomy
Authors
Jing Wang, Jiacheng Wu, Hao Shen, Jinde Cao, Leszek Rutkowski
Identifier
DOI: 10.1109/tcyb.2022.3220537
Abstract
In this article, a novel hybrid reinforcement Q-learning control method is proposed to solve the adaptive fuzzy H∞ control problem of discrete-time nonlinear Markov jump systems based on the Takagi-Sugeno fuzzy model. First, the core problem of adaptive fuzzy H∞ control is converted into solving the fuzzy game-coupled algebraic Riccati equation, which can hardly be solved directly by mathematical methods. To address this, an offline parallel hybrid learning algorithm is designed, in which the system dynamics must be known a priori. Furthermore, an online parallel Q-learning hybrid learning algorithm is developed. The main characteristics of the proposed online hybrid learning algorithm are threefold: 1) knowledge of the system dynamics is not required during the learning process; 2) compared with the policy iteration method, the restriction of an initial stabilizing control policy is removed; and 3) compared with the value iteration method, a faster convergence rate is obtained. Finally, we provide a tunnel diode circuit system model to validate the effectiveness of the proposed learning algorithms.
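The abstract centers on a game-coupled algebraic Riccati equation and a hybrid scheme that blends value-iteration and policy-iteration steps. Below is a minimal, model-based sketch of that general idea for a single known linear mode; it omits the fuzzy rules, the Markov jumps, and the online Q-learning of the paper, and the matrices, the attenuation level gamma, and the names gare_map and hybrid_iteration are illustrative assumptions rather than the authors' algorithm.

```python
# Illustrative sketch (assumed, not the paper's method): hybrid value-/policy-iteration
# solver for a discrete-time zero-sum (H-infinity type) game Riccati equation with one
# known linear mode x_{k+1} = A x_k + B u_k + D w_k and cost sum(x'Qx + u'Ru - gamma^2 w'w).
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

def gare_map(P, A, B, D, Q, R, gamma):
    """One sweep of the game Riccati map; returns updated P and stacked gains [Ku; Kw]."""
    BD = np.hstack([B, D])
    m, q = B.shape[1], D.shape[1]
    S = BD.T @ P @ BD
    S[:m, :m] += R                        # control weighting block
    S[m:, m:] -= gamma ** 2 * np.eye(q)   # disturbance attenuation block
    G = np.linalg.solve(S, BD.T @ P @ A)          # saddle-point gains: u = -Ku x, w = -Kw x
    P_next = Q + A.T @ P @ A - A.T @ P @ BD @ G   # Riccati recursion
    return P_next, G

def hybrid_iteration(A, B, D, Q, R, gamma, n_vi=50, n_pi=10, tol=1e-10):
    """Value-iteration warm start (no stabilizing initial policy needed),
    then policy-iteration refinement (faster convergence once the gains stabilize)."""
    n, m = A.shape[0], B.shape[1]
    P, G = gare_map(np.zeros((n, n)), A, B, D, Q, R, gamma)
    for _ in range(n_vi):                 # phase 1: value iteration
        P_next, G = gare_map(P, A, B, D, Q, R, gamma)
        if np.linalg.norm(P_next - P) < tol:
            break
        P = P_next
    for _ in range(n_pi):                 # phase 2: policy iteration
        Ku, Kw = G[:m, :], G[m:, :]
        Acl = A - B @ Ku - D @ Kw
        Qcl = Q + Ku.T @ R @ Ku - gamma ** 2 * Kw.T @ Kw
        # Policy evaluation: P = Acl' P Acl + Qcl (discrete Lyapunov equation);
        # meaningful only when Acl is Schur stable, which the warm start is meant to ensure.
        P = solve_discrete_lyapunov(Acl.T, Qcl)
        _, G = gare_map(P, A, B, D, Q, R, gamma)  # policy improvement
    return P, G

if __name__ == "__main__":
    # Toy 2-state example with a loose attenuation level (numbers are assumed).
    A = np.array([[0.9, 0.2], [0.0, 0.8]])
    B = np.array([[0.0], [1.0]])
    D = np.array([[0.1], [0.0]])
    P, G = hybrid_iteration(A, B, D, Q=np.eye(2), R=np.eye(1), gamma=5.0)
    print("P =\n", P, "\ngains [Ku; Kw] =\n", G)
```

The two-phase schedule mirrors the trade-off stated in the abstract: the value-iteration phase needs no stabilizing initial policy, while the policy-iteration phase, once the frozen gains stabilize the closed loop, typically converges in far fewer sweeps.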