Precise and dexterous robotic manipulation via human-in-the-loop reinforcement learning

强化学习 人在回路中 稳健性(进化) 机器人学 计算机科学 软件部署 人机交互 机器人 机械臂 人工智能 控制工程 工程类 软件工程 生物化学 化学 基因
作者
Jianlan Luo,Charles Xu,Jeffrey Wu,Sergey Levine
出处
期刊:Science robotics [American Association for the Advancement of Science]
卷期号:10 (105): eads5033-eads5033 被引量:30
标识
DOI:10.1126/scirobotics.ads5033
摘要

Robotic manipulation remains one of the most difficult challenges in robotics, with approaches ranging from classical model-based control to modern imitation learning. Although these methods have enabled substantial progress, they often require extensive manual design, struggle with performance, and demand large-scale data collection. These limitations hinder their real-world deployment at scale, where reliability, speed, and robustness are essential. Reinforcement learning (RL) offers a powerful alternative by enabling robots to autonomously acquire complex manipulation skills through interaction. However, realizing the full potential of RL in the real world remains challenging because of issues of sample efficiency and safety. We present a human-in-the-loop, vision-based RL system that achieved strong performance on a wide range of dexterous manipulation tasks, including precise assembly, dynamic manipulation, and dual-arm coordination. These tasks reflect realistic industrial tolerances, with small but critical variations in initial object placements that demand sophisticated reactive control. Our method integrates demonstrations, human corrections, sample-efficient RL algorithms, and system-level design to directly learn RL policies in the real world. Within 1 to 2.5 hours of real-world training, our approach outperformed other baselines by improving task success by 2×, achieving near-perfect success rates, and executing 1.8× faster on average. Through extensive experiments and analysis, our results suggest that RL can learn a wide range of complex vision-based manipulation policies directly in the real world within practical training times. We hope that this work will inspire a new generation of learned robotic manipulation techniques, benefiting both industrial applications and research advancements.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
corazon完成签到 ,获得积分10
刚刚
芳芳完成签到,获得积分10
刚刚
应如音完成签到,获得积分10
刚刚
刚刚
小谢完成签到,获得积分10
刚刚
宁阿霜完成签到,获得积分10
刚刚
刚刚
刚刚
weiww完成签到,获得积分10
1秒前
1秒前
敏感的夜阑完成签到,获得积分20
1秒前
郝123完成签到,获得积分10
1秒前
那小子真帅完成签到,获得积分10
1秒前
迷人的安寒完成签到,获得积分10
1秒前
小张在努力完成签到 ,获得积分10
1秒前
共享精神应助冷静的朝雪采纳,获得10
1秒前
Ellie完成签到 ,获得积分10
3秒前
科研通AI6.3应助ZJK采纳,获得10
3秒前
4秒前
4秒前
淮海路小佩奇完成签到,获得积分10
4秒前
22完成签到,获得积分10
4秒前
4秒前
万幸鹿发布了新的文献求助10
4秒前
蜂鸟5156完成签到,获得积分10
5秒前
5秒前
5秒前
5秒前
姜忆莲完成签到,获得积分10
5秒前
yyc完成签到,获得积分10
6秒前
黄健丰完成签到,获得积分10
6秒前
雪白的觅松完成签到,获得积分10
7秒前
就叫烨烨发布了新的文献求助10
7秒前
7秒前
甜蜜阑悦完成签到,获得积分10
7秒前
8秒前
烟花应助666采纳,获得10
8秒前
8秒前
嗡嗡嗡完成签到,获得积分10
9秒前
Mingda完成签到,获得积分10
9秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Organometallic Chemistry of the Transition Metals 800
Chemistry and Physics of Carbon Volume 18 800
The Organometallic Chemistry of the Transition Metals 800
The formation of Australian attitudes towards China, 1918-1941 640
Signals, Systems, and Signal Processing 610
全相对论原子结构与含时波包动力学的理论研究--清华大学 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6441083
求助须知:如何正确求助?哪些是违规求助? 8255037
关于积分的说明 17574304
捐赠科研通 5499660
什么是DOI,文献DOI怎么找? 2900128
邀请新用户注册赠送积分活动 1876853
关于科研通互助平台的介绍 1716955