人工智能
计算机科学
机器人
领域(数学)
机器人学
特征(语言学)
钥匙(锁)
生成模型
工程类
机器学习
生成语法
动作(物理)
计算机视觉
理论(学习稳定性)
作者
Hao Qi,Haocheng Yin,Aris Zhu,Yilun Du,H. J. Yang
出处
期刊:IEEE robotics and automation letters
日期:2026-03-13
卷期号:11 (5): 5534-5541
标识
DOI:10.1109/lra.2026.3673995
摘要
We present generative predictive control (GPC), a framework for inference-time enhancement of pretrained behavior-cloning policies. Rather than retraining or fine-tuning, GPC augments a frozen diffusion policy at deployment by coupling it with a predictive world model. Concretely, we train an action-conditioned world model on expert demonstrations and random exploration rollouts to forecast the consequences of action proposals produced by the diffusion policy, then perform lightweight online planning that ranks and refines these proposals via model-based look-ahead. This combination of a generative prior with predictive foresight enables test-time adaptation. Across diverse robotic manipulation tasks—state- and vision-based, in simulation and on real hardware—GPC consistently outperforms standard behavior cloning and compares favorably to other inference-time adaptation baselines.
科研通智能强力驱动
Strongly Powered by AbleSci AI