概化理论
计算机科学
强化学习
现存分类群
人工智能
机器学习
不完美的
推论
感知
数据科学
心理学
生物
进化生物学
发展心理学
哲学
语言学
神经科学
作者
Saurabh Arora,Prashant Doshi
标识
DOI:10.1016/j.artint.2021.103500
摘要
Abstract Inverse reinforcement learning ( IRL ) is the problem of inferring the reward function of an agent, given its policy or observed behavior. Analogous to RL , IRL is perceived both as a problem and as a class of methods. By categorically surveying the extant literature in IRL , this article serves as a comprehensive reference for researchers and practitioners of machine learning as well as those new to it to understand the challenges of IRL and select the approaches best suited for the problem on hand. The survey formally introduces the IRL problem along with its central challenges such as the difficulty in performing accurate inference and its generalizability, its sensitivity to prior knowledge, and the disproportionate growth in solution complexity with problem size. The article surveys a vast collection of foundational methods grouped together by the commonality of their objectives, and elaborates how these methods mitigate the challenges. We further discuss extensions to the traditional IRL methods for handling imperfect perception, an incomplete model, learning multiple reward functions and nonlinear reward functions. The article concludes the survey with a discussion of some broad advances in the research area and currently open research questions.
科研通智能强力驱动
Strongly Powered by AbleSci AI