1998

Gradient-Based Learning Applied to Document Recognition

Yann Lecun, Leon Bottou, Yoshua Bengio, Patrick Haffner

citations

Cite Score

99

AI summary

This paper reviews gradient-based learning methods for handwritten character recognition, highlighting convolutional neural networks for 2D shape variability. It introduces graph transformer networks (GTNs) for training multi-module systems globally and describes systems for online handwriting and bank check recognition, achieving record accuracy.

Main Contributions

  • Review of gradient-based learning techniques for handwritten character recognition.
  • Demonstrates the effectiveness of convolutional neural networks in handling the variability of 2D shapes.
  • Introduction of graph transformer networks (GTNs) as a learning paradigm for training multi-module systems globally.
  • Description of two systems for online handwriting recognition, showcasing the advantages of global training and the flexibility of GTNs.
  • Application of GTNs to reading bank checks, achieving record accuracy through the combination of convolutional neural networks and global training techniques.

Abstract

Multilayer neural networks trained with the back-propagation algorithm constitute the best example of a successful gradient-based learning technique. Given an appropriate network architecture, gradient-based learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional neural networks, which are specifically designed to deal with the variability of two dimensional (2-D) shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation, recognition, and language modeling. A new learning paradigm, called graph transformer networks (GTN's), allows such multimodule systems to be trained globally using gradient-based methods so as to minimize an overall performance measure. Two systems for online handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of graph transformer networks. A graph transformer network for reading a bank check is also described. It uses convolutional neural network character recognizers combined with global training techniques to provide record accuracy on business and personal checks. It is deployed commercially and reads several million checks per day.

Citation Graph

Loading graph...

References [65]

Sort:
Filter:

D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams - 1986

46 papers in library cite

Yann Lecun, B. Boser, John S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackal - 1989

24 papers in library cite

Yann Lecun, B. Boser, John S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel - 1990

10 papers in library cite

A. H. Waibel, T. Hanazawa, Geoffrey Hinton, K. Shikano, K. Lang - 1989

13 papers in library cite

Yann Lecun - 1989

5 papers in library cite

Kunihiko Fukushima - 1975

4 papers in library cite

V. Vapnik - 1995

2 papers in library cite

S. Becker, Yann Lecun - 1988

9 papers in library cite

O. Matan, C. J. C. Burges, Yann Lecun, John S. Denker - 1992

3 papers in library cite

M. Minsky, Oliver G. Selfridge - 1961

1 paper in library cites

Yann Lecun - 1985

5 papers in library cite

Yann Lecun - 1986

3 papers in library cite

Geoffrey E. Hinton, T. J. Sejnowski - 1986

9 papers in library cite

V. N. Vapnik - 1998

10 papers in library cite

R. O. Duda, P. E. Hart - 1973

9 papers in library cite

V. Vapnik - 1995

9 papers in library cite

D. B. Parker - 1985

8 papers in library cite

D. H. Hubel, T. N. Wiesel - 1962

8 papers in library cite

Kunihiko Fukushima, S. Miyake - 1982

7 papers in library cite

D. H. Ackley, Geoffrey E. Hinton, T. J. Sejnowski - 1985

6 papers in library cite

J. A. E. Bryson, Y. C. Ho - 1969

4 papers in library cite

L. Bahl, P. Brown, P. D. Souza, R. Mercer - 1986

4 papers in library cite

Patrice Simard, Yann Lecun, John Denker - 1993

3 papers in library cite

Leon Bottou, P. Gallinari - 1991

2 papers in library cite

Yann Lecun - 1988

2 papers in library cite

S. I. Amari - 1967

2 papers in library cite

Kevin J. Lang, Geoffrey E. Hinton - 1988

2 papers in library cite

Y. Tsypkin - 1971

2 papers in library cite

L. T. Niles, H. F. Silverman - 1990

2 papers in library cite

A. H. Kramer, A. S. Vincentelli - 1988

2 papers in library cite

Yann Lecun, I. Kanter, Sara Solla - 1991

2 papers in library cite

C. J. C. Burges, B. Schoelkopf - 1997

2 papers in library cite

J. Keeler, D. Rumelhart, W. K. Leow - 1991

2 papers in library cite

C. Cortes, Lawrence Jackel, Sara Solla, V. N. Vapnik, John Denker - 1993

2 papers in library cite

Yann Lecun - 1987

2 papers in library cite

W. H. Press, B. P. Flannery, S. A. Teukolsky, W. T. Vetterling - 1986

2 papers in library cite

J. Bromley, J. W. Bentz, Leon Bottou, I. Guyon, Yann Lecun, C. Moore, E. Sackinger, R. Shah - 1993

2 papers in library cite

T. G. Dietterich, G. Bakiri - 1995

2 papers in library cite

S. Manke, U. Bodenhausen - 1994

1 paper in library cites

Samy Bengio, Yoshua Bengio - 1996

1 paper in library cites

Lucas Lam, C. Y. Suen, D. Guillevic, N. W. Strathy, M. Cheriet, K. Liu, J. N. Said - 1995

1 paper in library cites

C. Y. Suen, C. Nadal, R. Legault, T. A. Mai, Lucas Lam - 1992

1 paper in library cites

D. Guillevic, C. Y. Suen - 1995

1 paper in library cites

I. Guyon, P. Albrecht, Yann Lecun, John S. Denker, W. Hubbard - 1991

1 paper in library cites

B. H. Juang, S. Katagiri - 1992

1 paper in library cites

M. Moller - 1993

1 paper in library cites

U. Muller, A. Gunzinger, W. Guggenbuhl - 1995

1 paper in library cites

Y. Tsypkin - 1973

1 paper in library cites

Yann Lecun, L. D. Jackel, B. Boser, John S. Denker, H. P. Graf, I. Guyon, D. Henderson, R. E. Howard, W. Hubbard - 1989

1 paper in library cites

S. N. Srihari - 1992

1 paper in library cites

V. N. Vapnik, E. Levin, Yann Lecun - 1994

1 paper in library cites

J. Wang, J. Jean - 1993

1 paper in library cites

Patrick Haffner, A. H. Waibel - 1992

1 paper in library cites

Yoshua Bengio - 1996

1 paper in library cites

C. J. C. Burges, J. I. Ben, John S. Denker, Yann Lecun, C. R. Nohl - 1993

1 paper in library cites

Yann Lecun, Yoshua Bengio, D. Henderson, A. Weisbuch, H. Weissman, Lawrence Jackel - 1993

1 paper in library cites

M. Gilloux, M. Leroux - 1993

1 paper in library cites

Leon Bottou, F. Fogelman, P. Blanchet, J. S. Lienard - 1990

1 paper in library cites

L. R. Bahl, P. F. Brown, P. V. D. Souza, R. L. Mercer - 1987

1 paper in library cites

S. Seung, H. Sompolinsky, N. Tishby - 1992

1 paper in library cites

M. C. Mozer - 1991

1 paper in library cites

C. Tappert, C. Suen, T. Wakahara - 1990

1 paper in library cites

Patrick Haffner, A. H. Waibel - 1991

1 paper in library cites

Cited by

62

papers in your library

Cites

14

papers in your library

Read

on June 24, 2025

Your review

Tags

Paper Aliases

The MNIST Database of Handwritten Digits