Distributed representations of temporally accumulated reward prediction errors in the mouse cortex

Published 2025 in Science Advances

ABSTRACT

Reward prediction errors (RPEs) quantify the difference between expected and actual rewards, serving to refine future actions. Although reinforcement learning (RL) provides ample theoretical evidence suggesting that the long-term accumulation of these error signals improves learning efficiency, it remains unclear whether the brain uses similar mechanisms. To explore this, we constructed RL-based theoretical models and used multiregional two-photon calcium imaging in the mouse dorsal cortex. We identified a population of neurons whose activity was modulated by varying degrees of RPE accumulation. Consequently, RPE-encoding neurons were sequentially activated within each trial, forming a distributed assembly. RPE representations in mice aligned with theoretical predictions of RL, emerging during learning and being subject to manipulations of the reward function. Interareal comparisons revealed a region-specific code, with higher-order cortical regions exhibiting long-term encoding of RPE accumulation. These results present an additional layer of complexity in cortical RPE computation, potentially augmenting learning efficiency in animals.

PUBLICATION RECORD

Publication year
2025
Venue
Science Advances
Publication date
2025-01-22
Fields of study
Biology, Medicine
Identifiers
DOI 10.1126/sciadv.adi4782 PMID 39841828 PMCID 11753378
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Neurophysiological mechanisms of error monitoring in human and non-human primates
2023cited by this paper
Arithmetic value representation for hierarchical behavior composition
2022cited by this paper
Representation learning in the artificial and biological neural networks underlying sensorimotor integration
2022cited by this paper
A gradual temporal shift of dopamine responses mirrors the progression of temporal difference error in machine learning
2022cited by this paper
Distributional reinforcement learning in prefrontal cortex
2021cited by this paper
A distributional code for value in dopamine-based reinforcement learning
2020cited by this paper
A quantitative reward prediction error signal in the ventral pallidum
2020cited by this paper
Specialized coding of sensory, motor, and cognitive variables in VTA dopamine neurons
2019cited by this paper
Area-Specificity and Plasticity of History-Dependent Value Coding During Learning.
2019cited by this paper
Modern Machine Learning as a Benchmark for Fitting Neural Responses
2018cited by this paper
Dissociable Structural and Functional Hippocampal Outputs via Distinct Subiculum Cell Classes.
2018cited by this paper
Dissociable Structural and Functional Hippocampal Outputs via Distinct Subiculum Cell Classes.
2018cited by this paper
Dynamic Reorganization of Neuronal Activity Patterns in Parietal Cortex.
2017cited by this paper
Neural Circuitry of Reward Prediction Error.
2017cited by this paper
A Multiplexed, Heterogeneous, and Adaptive Code for Navigation in Medial Entorhinal Cortex.
2017cited by this paper
Somatosensory Cortex Plays an Essential Role in Forelimb Motor Adaptation in Mice.
2017cited by this paper
A Corticocortical Circuit Directly Links Retrosplenial Cortex to M2 in the Mouse
2016cited by this paper
Temporal Specificity of Reward Prediction Errors Signaled by Putative Dopamine Neurons in Rat VTA Depends on Ventral Striatum.
2016cited by this paper
A large field of view two-photon mesoscope with subcellular resolution for in vivo imaging
2016cited by this paper
Arithmetic and local circuitry underlying dopamine prediction errors
2015cited by this paper
The dissociable effects of punishment and reward on motor learning
2015cited by this paper
Encoding and decoding in parietal cortex during sensorimotor decision-making
2014cited by this paper
A Causal Link Between Prediction Errors, Dopamine Neurons and Learning
2013cited by this paper
Neural basis of reinforcement learning and decision making.
2012cited by this paper
Two cortical systems for memory-guided behaviour
2012cited by this paper
Learning from Sensory and Reward Prediction Errors during Motor Adaptation
2011cited by this paper
Rethinking motor learning and savings in adaptation paradigms: model-free memory for successful actions combines with internal models.
2011cited by this paper
Double dissociation of value computations in orbitofrontal and anterior cingulate neurons
2011cited by this paper
Dopamine neurons learn to encode the long-term value of multiple future rewards
2011cited by this paper
Encoding of Both Positive and Negative Reward Prediction Errors by Neurons of the Primate Lateral Prefrontal Cortex and Caudate Nucleus
2011cited by this paper
Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis
2011cited by this paper
Neuron-type specific signals for reward and punishment in the ventral tegmental area
2011cited by this paper
Error correction, sensory prediction, and adaptation in motor control.
2010cited by this paper
What does the retrosplenial cortex do?
2009cited by this paper
A computational neuroanatomy for motor control
2008cited by this paper
Reinforcement learning: the good, the bad and the ugly.
2008cited by this paper
Silencing the Critics: Understanding the Effects of Cocaine Sensitization on Dorsolateral and Ventral Striatum in the Context of an Actor/Critic Model
2008cited by this paper
Medial prefrontal cell activity signaling prediction errors of action values
2007cited by this paper
Reinforcement learning: Computational theory and biological mechanisms
2007cited by this paper
Lateral habenula as a source of negative reward signals in dopamine neurons
2007cited by this paper
Anterior cingulate error‐related activity is modulated by predicted reward
2005cited by this paper
Electrophysiological correlates of reward prediction error recorded in the human prefrontal cortex.
2005cited by this paper
A Neural Substrate of Prediction and Reward
1997cited by this paper
A framework for mesencephalic dopamine systems based on predictive Hebbian learning
1996cited by this paper

CITED BY

No citing papers are available for this paper.