Learning-based control: A tutorial and some recent results

控制(管理)
作者
Zhong-Ping Jiang,Tao Bian,Weinan Gao
出处
期刊:Foundations and trends in systems and control [Now Publishers]
卷期号:8 (3): 176-284 被引量:15
标识
DOI:10.1561/2600000023
摘要

The recent success of Reinforcement Learning and related methods can be attributed to several key factors. First, it is driven by reward signals obtained through the interaction with the environment. Second, it is closely related to the human learning behavior. Third, it has a solid mathematical foundation. Nonetheless, conventional Reinforcement Learning theory exhibits some shortcomings particularly in a continuous environment or in considering the stability and robustness of the controlled process. In this monograph, the authors build on Reinforcement Learning to present a learning-based approach for controlling dynamical systems from real-time data and review some major developments in this relatively young field. In doing so the authors develop a framework for learning-based control theory that shows how to learn directly suboptimal controllers from input-output data. There are three main challenges on the development of learning-based control. First, there is a need to generalize existing recursive methods. Second, as a fundamental difference between learning-based control and Reinforcement Learning, stability and robustness are important issues that must be addressed for the safety-critical engineering systems such as self-driving cars. Third, data efficiency of Reinforcement Learning algorithms need be addressed for safety-critical engineering systems. This monograph provides the reader with an accessible primer on a new direction in control theory still in its infancy, namely Learning-Based Control Theory, that is closely tied to the literature of safe Reinforcement Learning and Adaptive Dynamic Programming.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
建议保存本图,每天支付宝扫一扫(相册选取)领红包
实时播报
root完成签到 ,获得积分10
1秒前
杨幂完成签到,获得积分10
3秒前
ming完成签到,获得积分10
6秒前
Tree_完成签到 ,获得积分10
9秒前
LJJ完成签到 ,获得积分10
11秒前
尚岩完成签到 ,获得积分10
12秒前
Chem34完成签到,获得积分10
12秒前
Harlotte完成签到 ,获得积分10
16秒前
小小雪完成签到 ,获得积分10
16秒前
cyril完成签到 ,获得积分10
17秒前
kryptonite完成签到 ,获得积分10
19秒前
Kitty完成签到,获得积分10
21秒前
巴山夜雨完成签到 ,获得积分10
21秒前
机智幻香完成签到 ,获得积分10
22秒前
ponytail完成签到 ,获得积分10
23秒前
小趴菜完成签到 ,获得积分10
24秒前
老弍完成签到 ,获得积分10
31秒前
恋恋青葡萄完成签到,获得积分10
37秒前
小二郎应助wybdsj采纳,获得10
41秒前
42秒前
江幻天完成签到,获得积分10
46秒前
48秒前
hajy完成签到 ,获得积分10
48秒前
fenggen发布了新的文献求助10
49秒前
风之谷完成签到,获得积分10
53秒前
HG20220101完成签到 ,获得积分10
56秒前
58秒前
xinjiasuki完成签到 ,获得积分10
59秒前
木又完成签到 ,获得积分10
59秒前
晶莹黎完成签到,获得积分10
1分钟前
神经蛙完成签到 ,获得积分10
1分钟前
李恩慧完成签到 ,获得积分10
1分钟前
plant完成签到,获得积分10
1分钟前
Minjalee完成签到,获得积分10
1分钟前
孤独映天完成签到,获得积分20
1分钟前
东方欲晓完成签到 ,获得积分0
1分钟前
科研螺丝完成签到 ,获得积分10
1分钟前
研通通完成签到,获得积分0
1分钟前
王小乐完成签到 ,获得积分10
1分钟前
xiaoxiaoxingqiu完成签到 ,获得积分10
1分钟前
高分求助中
Teaching Social and Emotional Learning in Physical Education 1000
Guide to Using WVASE Spectroscopic Ellipsometry Data Acquisition and Analysis Software 600
Multifunctionality Agriculture: A New Paradigm for European Agriculture and Rural Development 500
grouting procedures for ground source heat pump 500
ANDA Litigation: Strategies and Tactics for Pharmaceutical Patent Litigators Second 版本 500
中国志愿服务发展报告(2022~2023) 300
The Commercialization of Pharmaceutical Patents in China (Asian Commercial, Financial and Economic Law and Policy series) 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2335948
求助须知:如何正确求助?哪些是违规求助? 2024323
关于积分的说明 5065637
捐赠科研通 1773345
什么是DOI,文献DOI怎么找? 887491
版权声明 555761
科研通“疑难数据库(出版商)”最低求助积分说明 473023