Integration of Adaptive Control and Reinforcement Learning for Real-Time Control and Learning

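Keywords: Control theory (sociology); Reinforcement learning; Adaptive control; Lyapunov function; Controller (irrigation); Nonlinear system; Lyapunov stability; Computer science; Theory (learning stability); Iterative learning control; Trajectory; Control system; Engineering; Control (management); Artificial intelligence; Physics; Electrical engineering; Quantum mechanics; Machine learning; Astronomy; Agronomy; Biology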

Authors

Anuradha M. Annaswamy, Anubhav Guha, Yingnan Cui, Sunbochen Tang, Peter Fisher, Joseph E. Gaudio

Source

Journal: IEEE Transactions on Automatic Control [Institute of Electrical and Electronics Engineers]
Date: 2023-06-27; Volume (issue): 68 (12): 7740-7755; Citations: 17

Identifier

DOI: 10.1109/tac.2023.3290037

Abstract

This article considers the problem of real-time control and learning in dynamic systems subject to parametric uncertainties. We propose a combination of a reinforcement learning (RL)-based policy in the outer loop, suitably chosen to ensure stability and optimality for the nominal dynamics, together with adaptive control (AC) in the inner loop, so that in real time AC contracts the closed-loop dynamics toward a stable trajectory traced out by RL. Two classes of nonlinear dynamic systems are considered, both of which are control affine. The first class of dynamic systems utilizes equilibrium points and a Lyapunov approach, whereas the second class of nonlinear systems uses contraction theory. AC-RL controllers are proposed for both classes of systems and shown to lead to online policies that guarantee stability using a high-order tuner and accommodate parametric uncertainties and magnitude limits on the input. In addition to establishing a stability guarantee with real-time control, the AC-RL controller is also shown to lead to parameter learning with persistent excitation for the first class of systems. Numerical validations of all algorithms are carried out using a quadrotor landing task on a moving platform.
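
The two-loop structure described in the abstract can be illustrated on a toy problem. The Python sketch below is not the paper's algorithm: it stands in a fixed linear gain for the RL policy, omits the high-order tuner and the input magnitude limits, and uses a textbook gradient adaptive law on a scalar plant. The function name `simulate` and all numerical values (`a`, `k`, `theta`, `gamma`, step size, horizon) are assumptions chosen only for illustration.

```python
def simulate(adapt, a=1.0, k=3.0, theta=2.5, gamma=5.0, dt=1e-3, t_end=10.0):
    """Forward-Euler simulation of a scalar plant with matched uncertainty.

    True plant (theta unknown to the controller): x_dot = a*x + theta*x + u
    Nominal plant used to design the outer policy: x_dot = a*x + u
    Outer policy (hypothetical stand-in for RL):   u_nom = -k*x
    Reference trajectory traced by the policy:     x_ref_dot = (a - k)*x_ref
    Inner adaptive loop: u = u_nom - theta_hat*x, theta_hat_dot = gamma*e*x
    """
    x, x_ref, theta_hat = 1.0, 1.0, 0.0
    for _ in range(int(round(t_end / dt))):
        e = x - x_ref                      # tracking error w.r.t. the reference
        u = -k * x - theta_hat * x         # nominal policy plus adaptive correction
        x_dot = a * x + theta * x + u      # true plant dynamics
        x_ref_dot = (a - k) * x_ref        # nominal closed-loop dynamics
        # Gradient law from V = e^2/2 + (theta_hat - theta)^2 / (2*gamma),
        # which gives V_dot = (a - k) * e^2 <= 0 when a - k < 0.
        theta_hat_dot = gamma * e * x if adapt else 0.0
        x += dt * x_dot
        x_ref += dt * x_ref_dot
        theta_hat += dt * theta_hat_dot
    return x, x_ref, theta_hat


if __name__ == "__main__":
    x_nom, _, _ = simulate(adapt=False)
    x_ad, x_ref, theta_hat = simulate(adapt=True)
    print("nominal policy only, final |x|:", abs(x_nom))
    print("with adaptive inner loop, final |x - x_ref|:", abs(x_ad - x_ref))
    print("final parameter estimate:", theta_hat)
```

With these assumed values the nominal policy alone leaves the true plant unstable, while the adaptive inner loop pulls the state back toward the reference trajectory. The estimate `theta_hat` need not converge to `theta` in this run, because the decaying trajectory is not persistently exciting.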




