计算机科学
四轴飞行器
强化学习
对抗制
模仿
生成语法
人工智能
无人机
人机交互
控制(管理)
工程类
生物
遗传学
心理学
社会心理学
航空航天工程
作者
Suraj R. Bandela,Yongcan Cao
摘要
Learning from human demonstrations is fundamental to harnessing human intelligence in many tasks. A critical approach to learning from human demonstrations is inverse reinforcement learning, which aims to learn rewards from limited human demonstrations and then train control policies based on the learned reward. The existing inverse reinforcement learning methods perform well in less complex environments but often fail in complex high-dimensional environments. To overcome these difficulties and limitations, this paper studies the implementation of a generative adversarial imitation learning (GAIL) method that controls a quadcopter Unmanned Aerial Vehicle (UAV) to navigate between two defined positions in a virtual environment created in Unreal Engine, whose simulation environments can reflect real-world physics. We present procedures to build a customized virtual environment using the Epic game's {Unreal engine}, collect expert demonstrations, and optimize the control policy using GAIL. Finally, the simulation results discuss and explain the performance of GAIL in 3-dimensional UAV navigation.
科研通智能强力驱动
Strongly Powered by AbleSci AI