水准点(测量)
强化学习
计算机科学
集合(抽象数据类型)
空格(标点符号)
机器学习
人工智能
人机交互
程序设计语言
大地测量学
操作系统
地理
作者
Abdus Salam Azad,Edward Kim,Kimin Lee,Qiancheng Wu,Ion Stoica,Pieter Abbeel,Sanjit A. Seshia
出处
期刊:Cornell University - arXiv
日期:2021-06-07
摘要
The capability of reinforcement learning (RL) agent directly depends on the diversity of learning scenarios the environment generates and how closely it captures real-world situations. However, existing environments/simulators lack the support to systematically model distributions over initial states and transition dynamics. Furthermore, in complex domains such as soccer, the space of possible scenarios is infinite, which makes it impossible for one research group to provide a comprehensive set of scenarios to train, test, and benchmark RL algorithms. To address this issue, for the first time, we adopt an existing formal scenario specification language, SCENIC, to intuitively model and generate interactive scenarios. We interfaced SCENIC to Google Research Soccer environment to create a platform called SCENIC4RL. Using this platform, we provide a dataset consisting of 36 scenario programs encoded in SCENIC and demonstration data generated from a subset of them. We share our experimental results to show the effectiveness of our dataset and the platform to train, test, and benchmark RL algorithms. More importantly, we open-source our platform to enable RL community to collectively contribute to constructing a comprehensive set of scenarios.
科研通智能强力驱动
Strongly Powered by AbleSci AI