一般化
计算机科学
人工智能
模仿
任务(项目管理)
机器人
对象(语法)
弹道
分解
人机交互
演示式编程
学习迁移
克隆(编程)
机器学习
任务分析
机器人学
人机交互
代表(政治)
仿人机器人
内隐学习
理论(学习稳定性)
编码(内存)
视觉对象识别的认知神经科学
机器人学习
先验概率
作者
Kamil Dreczkowski,Pietro Vitiello,Vitalis Vosylius,Edward Johns
出处
期刊:Science robotics
[American Association for the Advancement of Science]
日期:2025-11-12
卷期号:10 (108)
标识
DOI:10.1126/scirobotics.adv7594
摘要
Humans are remarkably efficient at learning tasks from demonstrations, but today’s imitation learning methods for robot manipulation often require hundreds or thousands of demonstrations per task. We investigated two fundamental priors for improving learning efficiency: decomposing manipulation trajectories into sequential alignment and interaction phases and retrieval-based generalization. Through 3450 real-world rollouts, we systematically studied this decomposition. We compared different design choices for the alignment and interaction phases and examined generalization and scaling trends relative to today’s dominant paradigm of behavioral cloning with a single-phase monolithic policy. In the few-demonstrations-per-task regime (<10 demonstrations), decomposition achieved an order of magnitude of improvement in data efficiency over single-phase learning, with retrieval consistently outperforming behavioral cloning for both alignment and interaction. Building on these insights, we developed Multi-Task Trajectory Transfer (MT3), an imitation learning method based on decomposition and retrieval. MT3 learns everyday manipulation tasks from as little as a single demonstration each while also generalizing to previously unseen object instances. This efficiency enabled us to teach a robot 1000 distinct everyday tasks in under 24 hours of human demonstrator time. Through 2200 additional real-world rollouts, we reveal MT3’s capabilities and limitations across different task families.
科研通智能强力驱动
Strongly Powered by AbleSci AI