多巴胺能
动作(物理)
价值(数学)
信号(编程语言)
神经科学
计算机科学
多巴胺
生物
机器学习
物理
量子力学
程序设计语言
作者
Francesca Greenstreet,Hernando Martínez Vergara,Yvonne Johansson,Sthitapranjya Pati,Laura Maria Loss Schwarz,Stephen C. Lenzi,J. P. Geerts,Matthew Wisdom,Alina Gubanova,Lars Rollik,Jasvin Kaur,Theodore Moskovitz,Joseph Cohen,Emmett Thompson,Troy W. Margrie,Claudia Clopath,Marcus Stephenson‐Jones
出处
期刊:Nature
[Nature Portfolio]
日期:2025-05-14
标识
DOI:10.1038/s41586-025-09008-9
摘要
Abstract Choice behaviour of animals is characterized by two main tendencies: taking actions that led to rewards and repeating past actions 1,2 . Theory suggests that these strategies may be reinforced by different types of dopaminergic teaching signals: reward prediction error to reinforce value-based associations and movement-based action prediction errors to reinforce value-free repetitive associations 3–6 . Here we use an auditory discrimination task in mice to show that movement-related dopamine activity in the tail of the striatum encodes the hypothesized action prediction error signal. Causal manipulations reveal that this prediction error serves as a value-free teaching signal that supports learning by reinforcing repeated associations. Computational modelling and experiments demonstrate that action prediction errors alone cannot support reward-guided learning, but when paired with the reward prediction error circuitry they serve to consolidate stable sound–action associations in a value-free manner. Together we show that there are two types of dopaminergic prediction errors that work in tandem to support learning, each reinforcing different types of association in different striatal areas.
科研通智能强力驱动
Strongly Powered by AbleSci AI