可解释性
计算机科学
人工神经网络
人工智能
限制
认知科学
航程(航空)
机器学习
心理学
机械工程
材料科学
工程类
复合材料
作者
Thomas McGrath,Andrei Kapishnikov,Nenad Tomašev,Adam Pearce,Martin Wattenberg,Demis Hassabis,Been Kim,Ulrich Paquet,Vladimir Kramnik
标识
DOI:10.1073/pnas.2206625119
摘要
We analyze the knowledge acquired by AlphaZero, a neural network engine that learns chess solely by playing against itself yet becomes capable of outperforming human chess players. Although the system trains without access to human games or guidance, it appears to learn concepts analogous to those used by human chess players. We provide two lines of evidence. Linear probes applied to AlphaZero's internal state enable us to quantify when and where such concepts are represented in the network. We also describe a behavioral analysis of opening play, including qualitative commentary by a former world chess champion.
科研通智能强力驱动
Strongly Powered by AbleSci AI