2001

Recognizing Handwritten Digits Using Hierarchical Products of Experts

G. Mayraz, Geoffrey E. Hinton

citations

Cite Score

6

AI summary

This paper introduces a hierarchical product of experts model for recognizing handwritten digits on the MNIST dataset, achieving comparable performance to state-of-the-art discriminative methods. It uses a three-level hierarchy of models to extract features and a logistic classification network for classification.

Main Contributions

  • Introduces a hierarchical product of experts (PoE) model for handwritten digit recognition.
  • Demonstrates that the hierarchical PoE model can achieve comparable performance to state-of-the-art discriminative methods on the MNIST dataset.
  • Shows that generative models can be effective for discriminative tasks.
  • Presents a method for training hierarchical PoE models sequentially.
  • Explores the use of goodness scores from PoE models as inputs to a logistic classifier.

Abstract

The product of experts learning procedure [1] can discover a set of stochastic binary features that constitute a non-linear generative model of handwritten images of digits. The quality of generative models learned in this way can be assessed by learning a separate model for each class of digit and then comparing the unnormalized probabilities of test images under the 10 different class-specific models. To improve discriminative performance, it is helpful to learn a hierarchy of separate models for each digit class. Each model in the hierarchy has one layer of hidden units and the nth level model is trained on data that consists of the activities of the hidden units in the already trained (n - 1)th level model. After training, each level produces a separate, unnormalized log probabilty score. With a three-level hierarchy for each of the 10 digit classes, a test image produces 30 scores which can be used as inputs to a supervised, logistic classification network that is trained on separate data. On the MNIST database, our system is comparable with current state-of-the-art discriminative methods, demonstrating that the product of experts learning procedure can produce effective generative models of high-dimensional data.

Citation Graph

Loading graph...

References [6]

Sort:
Filter:

Geoffrey Hinton - 2002

23 papers in library cite

V. Vapnik - 1995

2 papers in library cite

P. Smolensky - 1986

11 papers in library cite

Geoffrey E. Hinton, T. J. Sejnowski - 1986

9 papers in library cite

Y. Freund, D. Haussler - 1992

8 papers in library cite

C. J. C. Burges, B. Schoelkopf - 1997

2 papers in library cite

Cited by

1

papers in your library

Cites

4

papers in your library

Read

on June 21, 2025

Your review

Tags

Paper Aliases

No aliases