强化学习
地震(自然现象)
计算机科学
锦标赛
人口
人工智能
钢筋
人机交互
心理学
数学
社会心理学
组合数学
地质学
社会学
人口学
地震学
作者
Max Jaderberg,Wojciech Marian Czarnecki,Iain Dunning,Luke Marris,Guy Lever,Antonio García Castañeda,Charles Beattie,Neil C. Rabinowitz,Ari S. Morcos,Avraham Ruderman,Nicolas Sonnerat,Tim Green,Louise Deason,Joel Z. Leibo,David Silver,Demis Hassabis,Koray Kavukcuoglu,Thore Graepel
出处
期刊:Science
[American Association for the Advancement of Science]
日期:2019-05-30
卷期号:364 (6443): 859-865
被引量:672
标识
DOI:10.1126/science.aau6249
摘要
Reinforcement learning (RL) has shown great success in increasingly complex single-agent environments and two-player turn-based games. However, the real world contains multiple agents, each learning and acting independently to cooperate and compete with other agents. We used a tournament-style evaluation to demonstrate that an agent can achieve human-level performance in a three-dimensional multiplayer first-person video game, Quake III Arena in Capture the Flag mode, using only pixels and game points scored as input. We used a two-tier optimization process in which a population of independent RL agents are trained concurrently from thousands of parallel matches on randomly generated environments. Each agent learns its own internal reward signal and rich representation of the world. These results indicate the great potential of multiagent reinforcement learning for artificial intelligence research.
科研通智能强力驱动
Strongly Powered by AbleSci AI