Maxout Networks

I. Goodfellow,David Warde-Farley,Mehdi Mirza,Aaron C. Courville,Yoshua Bengio

Published 2013 in International Conference on Machine Learning

ABSTRACT

We consider the problem of designing models to leverage a recently introduced approximate model averaging technique called dropout. We define a simple new model called maxout (so named because its output is the max of a set of inputs, and because it is a natural companion to dropout) designed to both facilitate optimization by dropout and improve the accuracy of dropout's fast approximate model averaging technique. We empirically verify that the model successfully accomplishes both of these tasks. We use maxout and dropout to demonstrate state of the art classification performance on four benchmark datasets: MNIST, CIFAR-10, CIFAR-100, and SVHN.

PUBLICATION RECORD

  • Publication year

    2013

  • Venue

    International Conference on Machine Learning

  • Publication date

    2013-02-18

  • Fields of study

    Mathematics, Computer Science

  • Identifiers
  • External record

    Open on Semantic Scholar

  • Source metadata

    Semantic Scholar

CITATION MAP

EXTRACTION MAP

CONCEPTS

REFERENCES

Showing 1-24 of 24 references · Page 1 of 1

CITED BY

Showing 1-100 of 2238 citing papers · Page 1 of 23