Deep learning is a broad set of techniques that uses multiple layers of representation to automatically learn relevant features directly from structured data. Recently, such techniques have yielded record-breaking results on a diverse set of difficult machine learning tasks in computer vision, speech recognition, and natural language processing. Despite the enormous success of deep learning, relatively little is understood theoretically about why these techniques are so successful at feature learning and compression. Here, we show that deep learning is intimately related to one of the most important and successful techniques in theoretical physics, the renormalization group (RG). RG is an iterative coarse-graining scheme that allows for the extraction of relevant features (i.e. operators) as a physical system is examined at different length scales. We construct an exact mapping from the variational renormalization group, first introduced by Kadanoff, and deep learning architectures based on Restricted Boltzmann Machines (RBMs). We illustrate these ideas using the nearest-neighbor Ising Model in one and two-dimensions. Our results suggests that deep learning algorithms may be employing a generalized RG-like scheme to learn relevant features from data.
An exact mapping between the Variational Renormalization Group and Deep Learning
Published 2014 in arXiv.org
ABSTRACT
PUBLICATION RECORD
- Publication year
2014
- Venue
arXiv.org
- Publication date
2014-10-14
- Fields of study
Mathematics, Physics, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
CONCEPTS
- deep learning
A set of machine learning techniques that use multiple layers of representation to learn relevant features directly from structured data.
Anonymous (b2adb6bfad) extraction - feature learning
The process of automatically learning relevant features from data, which deep learning is described as performing in this paper.
Anonymous (b2adb6bfad) extraction - nearest-neighbor ising model
An Ising model with nearest-neighbor interactions that is used here in one and two dimensions as an example system.
Aliases: Ising Model
Anonymous (b2adb6bfad) extraction - renormalization group
An iterative coarse-graining scheme for extracting relevant features or operators as a physical system is viewed at different length scales.
Aliases: RG
Anonymous (b2adb6bfad) extraction - restricted boltzmann machines
A class of deep learning architectures used in this paper to establish the mapping to variational renormalization group.
Aliases: RBMs, RBM
Anonymous (b2adb6bfad) extraction - variational renormalization group
A variational form of the renormalization group introduced by Kadanoff and used here as one side of the exact mapping.
Anonymous (b2adb6bfad) extraction
REFERENCES
Showing 1-15 of 15 references · Page 1 of 1