Functional Requirements for Reward-Modulated Spike-Timing-Dependent Plasticity

Nicolas Frémaux,Henning Sprekeler,W. Gerstner

Published 2010 in Journal of Neuroscience

ABSTRACT

Recent experiments have shown that spike-timing-dependent plasticity is influenced by neuromodulation. We derive theoretical conditions for successful learning of reward-related behavior for a large class of learning rules where Hebbian synaptic plasticity is conditioned on a global modulatory factor signaling reward. We show that all learning rules in this class can be separated into a term that captures the covariance of neuronal firing and reward and a second term that presents the influence of unsupervised learning. The unsupervised term, which is, in general, detrimental for reward-based learning, can be suppressed if the neuromodulatory signal encodes the difference between the reward and the expected reward—but only if the expected reward is calculated for each task and stimulus separately. If several tasks are to be learned simultaneously, the nervous system needs an internal critic that is able to predict the expected reward for arbitrary stimuli. We show that, with a critic, reward-modulated spike-timing-dependent plasticity is capable of learning motor trajectories with a temporal resolution of tens of milliseconds. The relation to temporal difference learning, the relevance of block-based learning paradigms, and the limitations of learning with a critic are discussed.

PUBLICATION RECORD

Publication year
2010
Venue
Journal of Neuroscience
Publication date
2010-10-06
Fields of study
Biology, Medicine, Psychology
Identifiers
DOI 10.1523/JNEUROSCI.6249-09.2010 PMID 20926659 PMCID PMC6634722
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

With an internal critic, reward-modulated spike-timing-dependent plasticity can learn motor trajectories with temporal resolution on the order of tens of milliseconds.
Confidence 0.95

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
When several tasks are learned simultaneously, an internal critic that predicts expected reward for arbitrary stimuli is required.
Confidence 0.92

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
The unsupervised learning term can be suppressed when the neuromodulatory signal encodes reward relative to expected reward, but only when expected reward is computed separately for each task and stimulus.
Confidence 0.94

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
Learning rules in this class decompose into a covariance term capturing neuronal firing-reward covariance and a second term reflecting unsupervised learning influences.
Confidence 0.97

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review

CONCEPTS

covariance of neuronal firing and reward
model term

The component of the learning-rule decomposition that reflects how neuronal activity covaries with reward.

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
expected reward
quantity

The predicted reward level used as a reference for modulatory signaling.

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
internal critic
predictor

An internal predictor that estimates expected reward for arbitrary stimuli.

Aliases: critic

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
motor trajectories
behavior

Time-varying movement patterns used as the target behavior in the learning example.

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
neuromodulatory signal
signal

The global reward-related modulatory factor that gates synaptic plasticity.

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
reward-modulated spike-timing-dependent plasticity
learning rule

A spike-timing-dependent plasticity rule whose synaptic updates are gated by a global reward-related modulatory factor.

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
simultaneous task learning
learning setting

A setting in which several tasks are learned at the same time.

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
task- and stimulus-specific expected reward
quantity

The expected reward estimated separately for each task and stimulus condition.

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
temporal difference learning
learning framework

A reinforcement-learning framework based on predicting reward differences over time.

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review
unsupervised learning term
model term

The component of the learning-rule decomposition associated with unsupervised plasticity effects independent of reward.

박진우 (dztg5apj7m) extractionB (s683577b42) reviewq (76h6bfydm6) review

REFERENCES

Dopamine signals for reward value and risk: basic and recent data
2010cited by this paper
Molecular mechanisms of HIV-1 persistence in the monocyte-macrophage lineage
2010influential reference
Gain in sensitivity and loss in temporal contrast of STDP by dopaminergic modulation at hippocampal synapses
2009influential reference
A Spiking Neural Network Model of an Actor-Critic Learning Agent
2009cited by this paper
Spike-Based Reinforcement Learning in Continuous State and Action Space: When Policy Gradient Methods Fail
2009cited by this paper
Synaptic plasticity in the basal ganglia.
2009cited by this paper
Dopamine Receptor Activation Is Required for Corticostriatal Spike-Timing-Dependent Plasticity
2008influential reference
A Learning Theory for Reward-Modulated Spike-Timing-Dependent Plasticity with Application to Biofeedback
2008influential reference
Dendritic excitability and synaptic plasticity.
2008cited by this paper
Reinforcement learning with modulated spike timing dependent synaptic plasticity.
2007influential reference
Space, time and dopamine.
2007cited by this paper
Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule
2007influential reference
Behavioral dopamine signals.
2007influential reference
Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity
2007influential reference
Neuromodulators control the polarity of spike-timing-dependent synaptic plasticity.
2007influential reference
Solving the distal reward problem through linkage of STDP and dopamine signaling
2007influential reference
A synaptic memory trace for cortical receptive field plasticity
2007cited by this paper
Optimal Spike-Timing-Dependent Plasticity for Precise Action Potential Firing in Supervised Learning
2005influential reference
Neurons Tune to the Earliest Spikes Through STDP
2005cited by this paper
Learning in neural networks by reinforcement of irregular spiking.
2004influential reference
Cortical neural prosthetics.
2004cited by this paper
Dopamine: a potential substrate for synaptic plasticity and memory mechanisms.
2003influential reference
Learning Input Correlations through Nonlinear Temporally Asymmetric Hebbian Plasticity
2003cited by this paper
Learning in spiking neural networks by reinforcement of stochastic synaptic transmission.
2003cited by this paper
The nucleus basalis and memory codes: auditory cortical plasticity and the induction of specific, associative behavioral memory.
2003cited by this paper
Spiking Neuron Models
2002influential reference
Dopamine-dependent plasticity of corticostriatal synapses
2002cited by this paper
Rate, timing, and cooperativity jointly determine cortical synaptic plasticity.
2001cited by this paper
A cellular mechanism of reward-related learning
2001cited by this paper
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
2001cited by this paper
Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems
2001cited by this paper
Competitive Hebbian learning through spike-timing-dependent synaptic plasticity
2000cited by this paper
Stable Hebbian Learning from Spike Timing-Dependent Plasticity
2000cited by this paper
Metric-space analysis of spike trains: theory, algorithms and application
1998cited by this paper
Synaptic Modifications in Cultured Hippocampal Neurons: Dependence on Spike Timing, Synaptic Strength, and Postsynaptic Cell Type
1998cited by this paper
Reinforcement learning
1998influential reference
Learning of sequential movements by neural network model with dopamine-like reinforcement signal
1998cited by this paper
Neural correlates of motor memory consolidation.
1997cited by this paper
Regulation of Synaptic Efficacy by Coincidence of Postsynaptic APs and EPSPs
1997cited by this paper
A Neural Substrate of Prediction and Reward
1997cited by this paper
Consolidation in human motor memory
1996cited by this paper
A neuronal learning rule for sub-millisecond temporal coding
1996cited by this paper
Different voltage-dependent thresholds for inducing long-term depression and long-term potentiation in slices of rat visual cortex
1990influential reference
Primate motor cortex and free arm movements to visual targets in three- dimensional space. II. Coding of the direction of movement by a neuronal population
1988cited by this paper
Neuronal population coding of movement direction.
1986cited by this paper
Theory for the development of neuron selectivity: orientation specificity and binocular interaction in visual cortex
1982cited by this paper
Simplified neuron model as a principal component analyzer
1982cited by this paper
A Theory of Attention: Variations in the Associability of Stimuli with Reinforcement
1975cited by this paper
Long‐lasting potentiation of synaptic transmission in the dentate area of the unanaesthetized rabbit following stimulation of the perforant path
1973cited by this paper
Long‐lasting potentiation of synaptic transmission in the dentate area of the anaesthetized rabbit following stimulation of the perforant path
1973cited by this paper
A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement
1972cited by this paper
The organization of behavior: A neuropsychological theory
1951cited by this paper

CITED BY

Reward-Modulated Local Learning in Spiking Encoders: Controlled Benchmarks with STDP and Hybrid Rate Readouts
2026cites this paper
Three-factor learning in spiking neural networks: An overview of methods and trends from a machine learning perspective
2025influential citation
Closed-loop vagus nerve stimulation aids recovery from spinal cord injury
2025cites this paper
Challenging Backpropagation: Evidence for Target Learning in the Neocortex
2025cites this paper
A spiking neural network for active efficient coding
2025cites this paper
Synaptic bundle theory for spike-driven sensor-motor system: More than eight independent synaptic bundles collapse reward-STDP learning
2025cites this paper
A burst-dependent algorithm for neuromorphic on-chip learning of spiking neural networks
2024cites this paper
From "What" to "When" - a Spiking Neural Network Predicting Rare Events and Time to their Occurrence
2024cites this paper
Inferring plasticity rules from single-neuron spike trains using deep learning methods
2024cites this paper
A Brain-Inspired Spiking Neural Network Model for Decision-Making in Reinforcement Learning: Intelligent Self-Balancing Control
2024cites this paper
Learning what matters: Synaptic plasticity with invariance to second-order input correlations
2024cites this paper
Brain-inspired learning rules for spiking neural network-based control: a tutorial
2024cites this paper
Assistive sensory-motor perturbations influence learned neural representations
2024cites this paper
Stimulus-to-stimulus learning in RNNs with cortical inductive biases
2024cites this paper
Brain mechanism of foraging: Reward-dependent synaptic plasticity versus neural integration of values
2024cites this paper
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
2024cites this paper
A Spiking Binary Neuron - Detector of Causal Links
2023cites this paper
Hebbian learning inspired estimation of the linear regression parameters from queries
2023cites this paper
Improving Spiking Neural Network Performance with Auxiliary Learning
2023cites this paper
Brain-inspired learning in artificial neural networks: a review
2023cites this paper
Interpreting learning in biological neural networks as zero-order optimization method
2023cites this paper
Brain mechanism of foraging: reward-dependent synaptic plasticity or neural integration of values?
2023cites this paper
A Cerebellum-Inspired Prediction and Correction Model for Motion Control of a Musculoskeletal Robot
2023cites this paper
From "What" to "When" - a Spiking Neural Network Predicting Rare Events and Time to their Occurrence
2023cites this paper
Emergent computations in trained artificial neural networks and real brains
2022cites this paper
Brain mechanism of foraging: reward-dependent synaptic plasticity or neural integration of values?
2022cites this paper
Thunderstruck: The ACDC model of flexible sequences and rhythms in recurrent neural circuits
2022cites this paper
Introducing principles of synaptic integration in the optimization of deep neural networks
2022cites this paper
Stabilization of a pendulum on an elastic foundation using a multilayer perceptron
2022cites this paper
Adaptive control of synaptic plasticity integrates micro- and macroscopic network function
2022cites this paper
Enhancing efficiency of object recognition in different categorization levels by reinforcement learning in modular spiking neural networks
2021cites this paper
Computational Roles of Intrinsic Synaptic Dynamics
2021cites this paper
Norepinephrine potentiates and serotonin depresses visual cortical responses by transforming eligibility traces
2021cites this paper
Learning in Deep Neural Networks Using a Biologically Inspired Optimizer
2021cites this paper
Learning to acquire novel cognitive tasks with evolution, plasticity and meta-meta-learning
2021cites this paper
Beyond Gradients: Noise Correlations Control Hebbian Plasticity to Shape Credit Assignment
2021cites this paper
Temporal stimulus segmentation by reinforcement learning in populations of spiking neurons
2020cites this paper
A differential Hebbian framework for biologically-plausible motor control
2020cites this paper
Emergent Inference of Hidden Markov Models in Spiking Neural Networks Through Winner-Take-All
2020cites this paper
Supervised learning in spiking neural networks: A review of algorithms and evaluations
2020cites this paper
Burst-dependent synaptic plasticity can coordinate learning in hierarchical circuits
2020cites this paper
Closed-loop experiments on the BrainScaleS-2 architecture
2020cites this paper
A neural circuit mechanism of categorical perception: top-down signaling in the primate cortex
2020cites this paper
Supervised Learning in Temporally-Coded Spiking Neural Networks with Approximate Backpropagation
2020influential citation
Stochastic Models of Neural Synaptic Plasticity
2020cites this paper
Dual exploration strategies using artificial spiking neural networks in a robotic learning task
2020cites this paper
Task difficulty and expertise mediate the effects of roving on perceptual performance
2019influential citation
Revisiting the XOR problem: a neurorobotic implementation
2019cites this paper
Pattern Classification by Spiking Neural Networks Combining Self-Organized and Reward-Related Spike-Timing-Dependent Plasticity
2019cites this paper
Spatial Concept Learning: A Spiking Neural Network Implementation in Virtual and Physical Robots
2019cites this paper
Learning with naturalistic odor representations in a dynamic model of the Drosophila olfactory system
2019cites this paper
An online supervised learning algorithm based on triple spikes for spiking neural networks
2019cites this paper
Versatile Emulation of Spiking Neural Networks on an Accelerated Neuromorphic Substrate
2019cites this paper
A Closed-Loop Toolchain for Neural Network Simulations of Learning Autonomous Agents
2019cites this paper
Winner-Take-All as Basic Probabilistic Inference Unit of Neuronal Circuits
2018cites this paper
A model of operant learning based on chaotically varying synaptic strength
2018cites this paper
Eligibility Traces and Plasticity on Behavioral Time Scales: Experimental Support of NeoHebbian Three-Factor Learning Rules
2018cites this paper
Brain-Inspired Motion Learning in Recurrent Neural Network With Emotion Modulation
2018cites this paper
Failure to learn during roving, analysing the unsupervised bias hypothesis
2018influential citation
Multi-context blind source separation by error-gated Hebbian rule
2018cites this paper
Demonstrating Advantages of Neuromorphic Computation: A Pilot Study
2018influential citation
Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity
2018cites this paper
Spiking Neurons Integrating Visual Stimuli Orientation and Direction Selectivity in a Robotic Context
2018cites this paper
Reshaping Movement Distributions With Limit-Push Robotic Training
2018cites this paper
SuperSpike: Supervised Learning in Multilayer Spiking Neural Networks
2017influential citation
United States Patent ( 10 ) Patent No . : US 8 , 063 , 008 B 2
2017cites this paper
Sameness/difference spiking neural circuit as a relational concept precursor model: A bio-inspired robotic implementation
2017cites this paper
Learning spatio-temporal spike train encodings with ReSuMe, DelReSuMe, and Reward-modulated Spike-timing Dependent Plasticity in Spiking Neural Networks
2017influential citation
Reshaping Movements Through Avoidance of Negative Events: A General Adaptive Controller
2017cites this paper
Sequential neuromodulation of Hebbian plasticity offers mechanism for effective reward-based navigation
2017cites this paper
Improving learning efficiency of recurrent neural network through adjusting weights of all layers in a biologically-inspired framework
2017cites this paper
Acetylcholine-modulated plasticity in reward-driven navigation: a computational study
2017cites this paper
A Dynamic Connectome Supports the Emergence of Stable Computational Function of Neural Circuits through Reward-Based Learning
2017cites this paper
First-Spike-Based Visual Categorization Using Reward-Modulated STDP
2017cites this paper
Reward-based stochastic self-configuration of neural circuits
2017cites this paper
Closing the loop between neural network simulators and the OpenAI Gym
2017cites this paper
Striatal action-value neurons reconsidered
2017cites this paper
Inhibitory Plasticity: Balance, Control, and Codependence.
2017cites this paper
A heterosynaptic spiking neural system for the development of autonomous agents
2017cites this paper
Learning with Surprise - Theory and Applications
2016cites this paper
Biologically plausible learning in recurrent neural networks reproduces neural dynamics observed during cognitive tasks
2016influential citation
Mapping spatio-temporally encoded patterns by reward-modulated STDP in Spiking neurons
2016cites this paper
Goal-Directed Decision Making with Spiking Neurons
2016cites this paper
In Search for the Neural Mechanisms of Individual Development: Behavior-Driven Differential Hebbian Learning
2016cites this paper
An Efficient Approach to Boosting Performance of Deep Spiking Network Training
2016cites this paper
Somato-dendritic Synaptic Plasticity and Error-backpropagation in Active Dendrites
2016cites this paper
Review of advances in neural networks: Neural design technology stack
2016cites this paper
Optimal Supervised Learning in Spiking Neural Networks for Precise Temporal Encoding
2016cites this paper
A Local Learning Rule for Independent Component Analysis
2016cites this paper
Supervised Learning in Spiking Neural Networks for Precise Temporal Encoding
2016cites this paper
Flexible decision-making in recurrent neural networks trained with a biologically plausible rule
2016cites this paper
Nonlinear Hebbian Learning as a Unifying Principle in Receptive Field Formation
2016cites this paper
The neural marketplace
2016cites this paper
Parametrization of Neuromodulation in Reinforcement Learning
2016cites this paper
Neuromodulated Spike-Timing-Dependent Plasticity, and Theory of Three-Factor Learning Rules
2016influential citation
Functional Relevance of Different Basal Ganglia Pathways Investigated in a Spiking Model with Reward Dependent Plasticity
2016cites this paper
Reward-based training of recurrent neural networks for cognitive and value-based tasks
2016influential citation
Does computational neuroscience need new synaptic learning paradigms
2016cites this paper
A Reinforcement-learning Framework for Interpreting Trial-by-trial Motor Adaptation to Novel Haptic Environments
2015cites this paper
Central Cholinergic Neurons Are Rapidly Recruited by Reinforcement Feedback.
2015cites this paper