眶额皮质
神经科学
心理学
局部场电位
强化学习
前额叶皮质
奖励制度
钢筋
纹状体
光遗传学
腹侧纹状体
认知心理学
多巴胺
认知
计算机科学
人工智能
社会心理学
作者
Miranda Francoeur Koloski,Morteza Salimi,Sidharth Hulyalkar,Tianzhi Tang,Samuel A. Barnes,Jyoti Mishra,Dhakshin Ramanathan
标识
DOI:10.1523/jneurosci.0858-25.2025
摘要
Behavioral outcomes are rarely certain, requiring subjects to discriminate between available choices by using feedback to guide future decisions. Probabilistic reversal learning (PRL) tasks test subjects’ ability to learn and flexibly adapt to changes in reward contingencies. Cortico-striatal circuitry has been broadly implicated in flexible decision-making – though what role these circuits play remains complicated. In this study we leveraged the fast temporal dynamics of local field potentials to precisely identify the role that cortico-striatal networks play during PRL reward-feedback. We measured widespread (32-CH) local field potential activity of male Long-Evans rats during a PRL task wherein a target response delivered reward on 80% of trials while a non-target response delivered reward on 20% of trials. When subjects learned those reward probabilities, contingencies were reversed. We found that reward-evoked oscillations at beta (15-30Hz) and high gamma (>70Hz) frequencies, marked positive reward valence and reflected probability of reward. Activity and connectivity at beta-frequencies between orbitofrontal cortex, anterior insula, medial prefrontal cortex, and ventral striatum during expected rewards was correlated with behavioral performance and specific aspects of value/exploitative behavior as defined by a reinforcement learning computational model. Finally, we found that modulating beta activity in orbitofrontal cortex with optogenetic (20Hz) stimulation promoted maladaptive behavior when stimulation was provided during non-target responses, consistent with our data and computational model predictions. Reward-evoked beta oscillations may reflect a crucial component underlying reward learning and erroneous elevations in this physiological signal may contribute to maladaptive task performance and behavioral disruptions. Significance Statement We examined how oscillatory dynamics throughout cortico-striatal regions may represent reward value when reward delivery is uncertain. Beta and high gamma oscillations mark positive reward valence, reflect reward likelihood and show greater connectivity between areas of the cortico-striatal network during successful task performance. Optogenetic stimulation of orbitofrontal cortex at beta frequencies (20 Hz) modulates task performance based on which trial (high or low probability of reward) it is applied. Together, our findings suggest that beta oscillations in the cortico-striatal network represent learned reward value that may be particularly important under conditions of uncertain or changing reward contingencies.
科研通智能强力驱动
Strongly Powered by AbleSci AI