发布文献求助

亲爱的研友该休息了！由于当前在线用户较少，发布求助请尽量完整的填写文献信息，科研通机器人24小时在线，伴您度过漫漫科研夜！身体可是革命的本钱，早点休息，好梦！

Safe reinforcement learning: A control barrier function optimization approach

强化学习控制器（灌溉）计算机科学最优控制理论（学习稳定性）控制（管理）集合（抽象数据类型）功能（生物学）数学优化控制理论（社会学）人工智能数学生物进化生物学机器学习程序设计语言农学

作者

Zahra Marvi,Bahare Kiumarsi

出处

期刊：International Journal of Robust and Nonlinear Control [Wiley]
日期：2020-08-11 卷期号：31 (6): 1923-1940 被引量：73

标识

DOI：10.1002/rnc.5132

摘要

Summary This article presents a learning‐based barrier certified method to learn safe optimal controllers that guarantee operation of safety‐critical systems within their safe regions while providing an optimal performance. The cost function that encodes the designer's objectives is augmented with a control barrier function (CBF) to ensure safety and optimality. A damping coefficient is incorporated into the CBF which specifies the trade‐off between safety and optimality. The proposed formulation provides a look‐ahead and proactive safety planning and results in a smooth transition of states within the feasible set. That is, instead of applying an optimal controller and intervening with it only if the safety constraints are violated, the safety is planned and optimized along with the performance to minimize the intervention with the optimal controller. It is shown that addition of the CBF into the cost function does not affect the stability and optimality of the designed controller within the safe region. This formulation enables us to find the optimal safe solution iteratively. An off‐policy reinforcement learning (RL) algorithm is then employed to find a safe optimal policy without requiring the complete knowledge about the system dynamics, while satisfies the safety constraints. The efficacy of the proposed safe RL control design approach is demonstrated on the lane keeping as an automotive control problem.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 论文查重

更新

大幅提高文件上传限制，最高150M (2024-4-1)

更新

新增期刊收藏功能 (2024-03-23)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 坚强的广山的应助被科研通管家采纳，获得10

11秒前; 爆米花的应助被笑点低千雁采纳，获得10

33秒前; 天天快乐的应助被蛙蛙的呱呱采纳，获得10

56秒前; 爆米花上传了应助文件

1分钟前; 笑点低千雁发布了新的文献求助10

1分钟前; 深情安青上传了应助文件

2分钟前; 认真的成风完成签到，获得积分20

2分钟前; 风一样的我发布了新的文献求助10

2分钟前; 情怀的应助被科研通管家采纳，获得10

2分钟前; 彩色莞完成签到，获得积分10

2分钟前; 蛙蛙的呱呱完成签到，获得积分10

2分钟前; 天天快乐上传了应助文件

2分钟前; 蛙蛙的呱呱发布了新的文献求助10

2分钟前; FashionBoy的应助被蛙蛙的呱呱采纳，获得10

2分钟前; OCDer上传了应助文件

2分钟前; 笑点低千雁发布了新的文献求助10

2分钟前; 华仔的应助被蛙蛙的呱呱采纳，获得10

3分钟前; FashionBoy上传了应助文件

3分钟前; 科研通AI2.0上传了应助文件

3分钟前; 蛙蛙的呱呱发布了新的文献求助10

3分钟前; 彭佳丽发布了新的文献求助10

4分钟前; 华仔上传了应助文件

4分钟前; 坚强的广山的应助被科研通管家采纳，获得10

4分钟前; orixero的应助被蛙蛙的呱呱采纳，获得10

4分钟前; 晨光完成签到，获得积分10

5分钟前; 蛙蛙的呱呱发布了新的文献求助10

5分钟前; orixero上传了应助文件

5分钟前; zokor完成签到，获得积分10

5分钟前; 蛙蛙的呱呱发布了新的文献求助10

5分钟前; 小蘑菇的应助被晨光采纳，获得10

5分钟前; 科研通AI2.0上传了应助文件

7分钟前; chloe发布了新的文献求助10

7分钟前; 大个的应助被chloe采纳，获得10

7分钟前; 大个上传了应助文件

8分钟前; 曾经寄文完成签到，获得积分10

8分钟前; chloe发布了新的文献求助10

8分钟前; 曾经寄文发布了新的文献求助20

8分钟前; 有水无木杨关闭了有水无木杨的文献求助

8分钟前; 有水无木杨发布了新的文献求助10

9分钟前; 爆米花的应助被刺猬hedgehog采纳，获得10

9分钟前

高分求助中: One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000; 巫和雄 -《毛泽东选集》英译研究 (2013) 800; Yuwu Song, Biographical Dictionary of the People's Republic of China 700; [Lambert-Eaton syndrome without calcium channel autoantibodies] 520; The three stars each: the Astrolabes and related texts 500; Revolutions 400; Diffusion in Solids: Key Topics in Materials Science and Engineering 400

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 2450841; 求助须知：如何正确求助？哪些是违规求助？ 2124449; 关于积分的说明 5405774; 捐赠科研通 1853223; 什么是DOI，文献DOI怎么找？ 921688; 版权声明 562263; 科研通“疑难数据库（出版商）”最低求助积分说明 493029

今日热心研友

互助遵法尚德

坚强的广山

紫金大萝卜

清脆的问枫

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2024 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：826996720【点击一键加群】如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通