强化学习
时差学习
计算机科学
进化算法
人工智能
优势和劣势
机器学习
空格(标点符号)
钢筋
功能(生物学)
贝尔曼方程
增强学习
数学
数学优化
进化生物学
生物
社会心理学
认识论
操作系统
哲学
心理学
作者
David E. Moriarty,Alan C. Schultz,John J. Grefenstette
摘要
There are two distinct approaches to solving reinforcement learning problems, namely, searching in value function space and searching in policy space. Temporal difference methods and evolutionary algorithms are well-known examples of these approaches. Kaelbling, Littman and Moore recently provided an informative survey of temporal difference methods. This article focuses on the application of evolutionary algorithms to the reinforcement learning problem, emphasizing alternative policy representations, credit assignment methods, and problem-specific genetic operators. Strengths and weaknesses of the evolutionary approach to reinforcement learning are presented, along with a survey of representative applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI