2013

Maxout Networks

Yoshua Bengio

citations

Cite Score

64

AI summary

The paper introduces maxout networks, a novel model that leverages dropout for optimization and model averaging. Using maxout with dropout, the authors achieved state-of-the-art results on MNIST, CIFAR-10, CIFAR-100, and SVHN datasets, demonstrating the model's effectiveness in classification tasks.

Main Contributions

  • Introduced the maxout unit, a new type of activation function suitable for dropout.
  • Proved that maxout networks are universal approximators.
  • Demonstrated that dropout attains good approximation to model averaging in deep models, especially with maxout units.
  • Showed that maxout improves the bagging-style training phase of dropout.
  • Achieved state-of-the-art results on MNIST, CIFAR-10, CIFAR-100 and SVHN datasets using maxout and dropout.

Abstract

We consider the problem of designing models to leverage a recently introduced approximate model averaging technique called dropout. We define a simple new model called maxout (so named because its output is the max of a set of inputs, and because it is a natural companion to dropout) designed to both facilitate optimization by dropout and improve the accuracy of dropout's fast approximate model averaging technique. We empirically verify that the model successfully accomplishes both of these tasks. We use maxout and dropout to demonstrate state of the art classification performance on four benchmark datasets: MNIST, CIFAR-10, CIFAR-100, and SVHN.

Citation Graph

Loading graph...

References [24]

Sort:
Filter:

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

Yann Lecun, Leon Bottou, Yoshua Bengio, Patrick Haffner - 1998

62 papers in library cite

Alex Krizhevsky - 2009

27 papers in library cite

Xavier Glorot, Antoine Bordes, Yoshua Bengio - 2011

17 papers in library cite

Geoffrey E. Hinton, N. Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2012

25 papers in library cite

Y. Netzer, Tianle Wang, A. Coates, Alessandro Bissacco, Bo Wu, Andrew Y. Ng - 2011

8 papers in library cite

K. Jarrett, Koray Kavukcuoglu, Marc'aurelio Ranzato, Yann Lecun - 2009

20 papers in library cite

F. Bastien, P. Lamblin, Razvan Pascanu, James Bergstra, I. Goodfellow, A. Bergeron, A. Bouchard, N. Nicolas, Yoshua Bengio - 2012

13 papers in library cite

P. Sermanet, S. Chintala, Yann Lecun - 2012

6 papers in library cite

Dan C. Ciresan, Ueli Meier, Luca M. Gambardella, Jürgen Schmidhuber - 2010

10 papers in library cite

James Bergstra, O. Breuleux, F. Bastien, P. Lamblin, Razvan Pascanu, G. Desjardins, J. Turian, D. W. Farley, Yoshua Bengio - 2010

22 papers in library cite

L. Breiman - 1994

4 papers in library cite

Ruslan Salakhutdinov, Geoffrey E. Hinton - 2009

9 papers in library cite

J. Snoek, Hugo Larochelle, R. P. Adams - 2012

9 papers in library cite

N. Srivastava - 2013

6 papers in library cite

Matthew D. Zeiler, Rob Fergus - 2013

5 papers in library cite

N. Srebro, A. Shraibman - 2005

3 papers in library cite

E. Salinas, L. F. Abbott - 1996

2 papers in library cite

L. Deng, D. Yu - 2011

2 papers in library cite

M. Malinowski, M. Fritz - 2013

2 papers in library cite

R. H. R. Hahnloser - 1998

2 papers in library cite

S. Rifai, Yann Dauphin, Pascal Vincent, Yoshua Bengio, X. Muller - 2011

2 papers in library cite

Shijie Wang - 2004

1 paper in library cites

Ian J. Goodfellow, Aaron Courville, Yoshua Bengio - 2013

1 paper in library cites

Cited by

17

papers in your library

Cites

13

papers in your library

Read

on April 29, 2025

Your review

Tags

Paper Aliases

No aliases