2010

Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition

Dan C. Ciresan, Ueli Meier, Luca M. Gambardella, Jürgen Schmidhuber

citations

Cite Score

1

AI summary

This paper introduces a deep multi-layer perceptron (MLP) model trained with back-propagation on MNIST dataset using GPUs, achieving a 0.35% error rate through extensive training image deformations and graphics card acceleration.

Main Contributions

  • Demonstrated that deep, plain MLPs can achieve state-of-the-art results on MNIST with sufficient training and computational resources.
  • Achieved a low 0.35% error rate on the MNIST handwritten digits benchmark using online back-propagation.
  • Utilized graphics cards (GPUs) to significantly speed up the training process.
  • Employed extensive training image deformations to improve the network's generalization.
  • Showed that hardware progress is important in deep learning.

Abstract

Good old on-line back-propagation for plain multi-layer perceptrons yields a very low 0.35% error rate on the famous MNIST handwritten digits benchmark. All we need to achieve this best result so far are many hidden layers, many neurons per layer, numerous deformed training images, and graphics cards to greatly speed up learning.

Citation Graph

Loading graph...

References [21]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Yann Lecun, Leon Bottou, Yoshua Bengio, Patrick Haffner - 1998

62 papers in library cite

D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams - 1986

46 papers in library cite

Yoshua Bengio, P. Lamblin, D. Popovici, Hugo Larochelle - 2006

33 papers in library cite

John C. Platt - 2003

12 papers in library cite

Sepp Hochreiter, Yoshua Bengio, Paolo Frasconi, Jürgen Schmidhuber - 2001

16 papers in library cite

Marc'aurelio Ranzato, C. Poultney, S. Chopra, Yann Lecun - 2006

20 papers in library cite

Marc'aurelio Ranzato, F. Huang, Y. Boureau, Yann Lecun - 2007

8 papers in library cite

K. Chellapilla, S. Puri, Patrice Y. Simard - 2006

3 papers in library cite

Yann Lecun - 1985

5 papers in library cite

Geoffrey Hinton - 2006

5 papers in library cite

D. Steinkraus, I. Buck, Patrice Simard - 2005

3 papers in library cite

P. Werbos - 1974

14 papers in library cite

Sepp Hochreiter - 1991

18 papers in library cite

D. Decoste, B. Scholkopf - 2002

6 papers in library cite

Ruslan Salakhutdinov, Geoffrey Hinton - 2007

5 papers in library cite

S. Russell, P. Norvig - 1995

4 papers in library cite

F. Lauer, C. Suen, G. Bloch - 2007

1 paper in library cites

D. Scherer, Sven Behnke - 2009

1 paper in library cites

Nvidia - 2009

1 paper in library cites

G. Ruetsch, P. Micikevicius - 2009

1 paper in library cites

Cited by

10

papers in your library

Cites

13

papers in your library

Read

on October 15, 2025

Your review

Tags

Paper Aliases

No aliases