A Theoretical Framework for Inference Learning

N. Alonso, Beren Millidge, J. Krichmar, Emre Neftci

Published 2022 in Neural Information Processing Systems

ABSTRACT

Backpropagation (BP) is the most successful and widely used algorithm in deep learning. However, the computations required by BP are challenging to reconcile with known neurobiology. This difficulty has stimulated interest in more biologically plausible alternatives to BP. One such algorithm is the inference learning algorithm (IL). IL has close connections to neurobiological models of cortical function and has achieved equal performance to BP on supervised learning and auto-associative tasks. In contrast to BP, however, the mathematical foundations of IL are not well-understood. Here, we develop a novel theoretical framework for IL. Our main result is that IL closely approximates an optimization method known as implicit stochastic gradient descent (implicit SGD), which is distinct from the explicit SGD implemented by BP. Our results further show how the standard implementation of IL can be altered to better approximate implicit SGD. Our novel implementation considerably improves the stability of IL across learning rates, which is consistent with our theory, as a key property of implicit SGD is its stability. We provide extensive simulation results that further support our theoretical interpretations and also demonstrate IL achieves quicker convergence when trained with small mini-batches while matching the performance of BP for large mini-batches.
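
The abstract's contrast between explicit and implicit SGD can be illustrated with a toy sketch. The example below is not IL and not the authors' implementation; it assumes a one-parameter least-squares problem, where the implicit update has a well-known closed form, and all function names and constants are hypothetical. Explicit SGD evaluates the gradient at the current parameter, while implicit SGD evaluates it at the updated parameter, which damps the step and keeps it stable at large learning rates.

```python
import random


def explicit_sgd_step(theta, x, y, lr):
    """Explicit SGD on 0.5 * (y - x * theta)**2: gradient taken at the current theta."""
    residual = y - x * theta
    return theta + lr * residual * x


def implicit_sgd_step(theta, x, y, lr):
    """Implicit SGD: theta_new = theta + lr * (y - x * theta_new) * x.

    For squared error this fixed-point equation solves in closed form,
    shrinking the effective step size by 1 / (1 + lr * x**2).
    """
    residual = y - x * theta
    return theta + (lr / (1.0 + lr * x * x)) * residual * x


def run(step, lr, n_steps=100, true_theta=2.0, seed=0):
    """Run one update rule on noiseless samples y = true_theta * x (toy data)."""
    rng = random.Random(seed)
    theta = 0.0
    for _ in range(n_steps):
        x = rng.uniform(1.0, 2.0)
        y = true_theta * x
        theta = step(theta, x, y, lr)
    return theta


if __name__ == "__main__":
    for lr in (0.1, 5.0):
        print("lr =", lr)
        print("  explicit:", run(explicit_sgd_step, lr))
        print("  implicit:", run(implicit_sgd_step, lr))
```

In this toy setting both rules recover the true parameter at the small learning rate, while at the large learning rate the explicit rule diverges and the implicit rule still converges. This shows only the stability property the abstract attributes to implicit SGD; it says nothing about how IL itself approximates that update.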

PUBLICATION RECORD

Publication year
2022
Venue
Neural Information Processing Systems
Publication date
2022-06-01
Fields of study
Mathematics, Computer Science
Identifiers
DOI 10.48550/arXiv.2206.00164 arXiv 2206.00164
Source metadata
Semantic Scholar

REFERENCES

Inferring Neural Activity Before Plasticity: A Foundation for Learning Beyond Backpropagation
2022, cited by this paper
Learning on Arbitrary Graph Topologies via Predictive Coding
2022, cited by this paper
Towards Scaling Difference Target Propagation by Learning Backprop Targets
2022, cited by this paper
Predictive Coding: a Theoretical and Experimental Review
2021, cited by this paper
Associative Memories via Predictive Coding
2021, cited by this paper
Tightening the Biological Constraints on Gradient-Based Predictive Coding
2021, influential reference
Inertial Proximal Deep Learning Alternating Minimization for Efficient Neutral Network Training
2021, cited by this paper
Deriving Differential Target Propagation from Iterating Approximate Inverses
2020, cited by this paper
Predictive Coding Approximates Backprop Along Arbitrary Computation Graphs
2020, cited by this paper
Backpropagation and the brain
2020, cited by this paper
Scaling Equilibrium Propagation to Deep ConvNets by Drastically Reducing Its Gradient Estimator Bias
2020, cited by this paper
Hopfield Networks is All You Need
2020, cited by this paper
A Theoretical Framework for Target Propagation
2020, cited by this paper
Can the Brain Do Backpropagation? - Exact Implementation of Backpropagation in Predictive Coding Networks
2020, cited by this paper
Theories of Error Back-Propagation in the Brain
2019, cited by this paper
Predictive Coding, Variational Autoencoders, and Biological Connections
2019, cited by this paper
Assessing the Scalability of Biologically-Motivated Deep Learning Algorithms and Architectures
2018, cited by this paper
Beyond Backprop: Online Alternating Minimization with Auxiliary Variables
2018, cited by this paper
Proximal Dehaze-Net: A Prior Learning-Based Deep Network for Single Image Dehazing
2018, cited by this paper
A Proximal Block Coordinate Descent Algorithm for Deep Neural Network Training
2018, cited by this paper
Biologically-plausible learning algorithms can scale to large datasets
2018, cited by this paper
Biologically Motivated Algorithms for Propagating Local Target Representations
2018, cited by this paper
Predictive Processing: A Canonical Cortical Computation.
2018, cited by this paper
An Approximation of the Error Backpropagation Algorithm in a Predictive Coding Network with Local Hebbian Synaptic Plasticity
2017, influential reference
A tutorial on the free-energy framework for modelling perception and learning
2017, cited by this paper
Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems
2017, cited by this paper
Equilibrium Propagation: Bridging the Gap between Energy-Based Models and Backpropagation
2016, cited by this paper
Dense Associative Memory for Pattern Recognition
2016, cited by this paper
Random synaptic feedback weights support error backpropagation for deep learning
2016, cited by this paper
Towards Stability and Optimality in Stochastic Gradient Descent
2015, cited by this paper
Difference Target Propagation
2014, cited by this paper
How Auto-Encoders Could Provide Credit Assignment in Deep Networks via Target Propagation
2014, cited by this paper
Asymptotic and finite-sample properties of estimators based on stochastic gradients
2014, cited by this paper
Statistical analysis of stochastic gradient methods for generalized linear models
2014, cited by this paper
Implicit stochastic gradient descent for principled estimation with large datasets
2014, cited by this paper
Proximal Algorithms
2013, influential reference
Anatomy of hierarchy: Feedforward and feedback pathways in macaque visual cortex
2013, cited by this paper
Predictions not commands: active inference in the motor system
2012, cited by this paper
Canonical microcircuits for predictive coding.
2012, cited by this paper
The free-energy principle: a unified brain theory?
2010, cited by this paper
Predictive coding under the free-energy principle
2009, cited by this paper
Fast communication: Derivation of a new normalized least mean squares algorithm with modified minimization criterion
2009, cited by this paper
Predictive coding as a model of biased competition in visual attention.
2008, cited by this paper
The Expectation-Maximization Algorithm
2007, cited by this paper
A theory of cortical responses
2005, cited by this paper
Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects.
1999, influential reference
Backpropagation: the basic theory
1995, cited by this paper
The recent excitement about neural networks
1989, cited by this paper
A Theoretical Framework for Back-Propagation
1988, cited by this paper
Learning representations by back-propagating errors
1986, cited by this paper
A learning method for system identification
1967, cited by this paper
Incremental Proximal Methods for Large Scale Convex Optimization
year unknown, cited by this paper

CITED BY

On the Infinite Width and Depth Limits of Predictive Coding Networks
2026, cites this paper
Efficient Learning in Predictive Coding Networks Using Global Error Signals
2025, cites this paper
Generalising E-prop to Deep Networks
2025, cites this paper
A survey on neuro-mimetic deep learning via predictive coding
2025, cites this paper
Introduction to Predictive Coding Networks for Machine Learning
2025, cites this paper
Bridging Predictive Coding and MDL: A Two-Part Code Framework for Deep Learning
2025, cites this paper
Predictive Coding Networks and Inference Learning: Tutorial and Survey
2024, influential citation
PhiNets: Brain-inspired Non-contrastive Learning Based on Temporal Prediction Hypothesis
2024, cites this paper
Benchmarking Predictive Coding Networks - Made Simple
2024, cites this paper
Tight Stability, Convergence, and Robustness Bounds for Predictive Coding Networks
2024, cites this paper
Local Loss Optimization in the Infinite Width: Stable Parameterization of Predictive Coding Networks and Target Propagation
2024, influential citation
Feedback control guides credit assignment in recurrent neural networks
2024, cites this paper
Understanding and Improving Optimization in Predictive Coding Networks
2023, influential citation
Predictive Coding as a Neuromorphic Alternative to Backpropagation: A Critical Evaluation
2023, cites this paper
Gibbs sampling the posterior of neural networks
2023, cites this paper
Learning in Deep Factor Graphs with Gaussian Belief Propagation
2023, influential citation
Coincidence detection and integration behavior in spiking neural networks
2023, cites this paper
Understanding Predictive Coding as an Adaptive Trust-Region Method
2023, cites this paper
Brain-Inspired Computational Intelligence via Predictive Coding
2023, cites this paper
A Stable, Fast, and Fully Automatic Learning Algorithm for Predictive Coding Networks
2022, cites this paper
Recurrent predictive coding models for associative memory employing covariance learning
2022, cites this paper
Understanding Predictive Coding as a Second-Order Trust-Region Method
year unknown, influential citation