We consider the problem of designing models to leverage a recently introduced approximate model averaging technique called dropout. We define a simple new model called maxout (so named because its output is the max of a set of inputs, and because it is a natural companion to dropout) designed to both facilitate optimization by dropout and improve the accuracy of dropout's fast approximate model averaging technique. We empirically verify that the model successfully accomplishes both of these tasks. We use maxout and dropout to demonstrate state of the art classification performance on four benchmark datasets: MNIST, CIFAR-10, CIFAR-100, and SVHN.
Maxout Networks
I. Goodfellow,David Warde-Farley,Mehdi Mirza,Aaron C. Courville,Yoshua Bengio
Published 2013 in International Conference on Machine Learning
ABSTRACT
PUBLICATION RECORD
- Publication year
2013
- Venue
International Conference on Machine Learning
- Publication date
2013-02-18
- Fields of study
Mathematics, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
CONCEPTS
- approximate model averaging
A fast technique for approximating the prediction of an ensemble of many dropout-thinned models.
Aliases: fast approximate model averaging
박진우 (dztg5apj7m) extractionAnonymous (12632b8b5f) review - classification performance
The quality of a model's class prediction accuracy on evaluation data.
Aliases: classification accuracy
박진우 (dztg5apj7m) extractionAnonymous (12632b8b5f) review - dropout
A training and approximate model-averaging technique that randomly omits units during learning.
Aliases: dropout regularization
박진우 (dztg5apj7m) extractionAnonymous (12632b8b5f) review - maxout
A neural network unit that outputs the maximum value from a set of learned inputs.
Aliases: maxout networks, maxout unit
박진우 (dztg5apj7m) extractionAnonymous (12632b8b5f) review - mnist, cifar-10, cifar-100, and svhn
Four standard image classification benchmarks used to evaluate the reported models.
Aliases: MNIST, CIFAR-10, CIFAR-100, SVHN
박진우 (dztg5apj7m) extractionAnonymous (12632b8b5f) review - optimization
The process of fitting the model parameters during training.
Aliases: training optimization
박진우 (dztg5apj7m) extractionAnonymous (12632b8b5f) review
REFERENCES
Showing 1-24 of 24 references · Page 1 of 1