2009

Learning Multiple Layers of Features From Tiny Images

Alex Krizhevsky

citations

Cite Score

95

AI summary

This paper introduces a multi-layer generative model using Restricted Boltzmann Machines (RBMs) and Deep Belief Networks (DBNs) for extracting meaningful features from tiny images, achieving improved object recognition with the CIFAR-10 and CIFAR-100 datasets through pre-training on unlabeled data, as well as introducing a parallelization algorithm.

Main Contributions

  • Demonstrates the ability to train a multi-layer generative model to extract meaningful features from tiny images.
  • Introduces a novel parallelization algorithm for training the model on a network of machines.
  • Creates and releases two labeled datasets, CIFAR-10 and CIFAR-100, for object recognition experiments.
  • Shows that object recognition can be significantly improved by pre-training a layer of features on a large set of unlabeled tiny images.
  • Demonstrates the use of RBMs and DBNs for feature extraction and pre-training to improve classification performance.

Abstract

Groups at MIT and NYU have collected a dataset of millions of tiny colour images from the web. It is, in principle, an excellent dataset for unsupervised training of deep generative models, but previous researchers who have tried this have found it difficult to learn a good set of filters from the images. We show how to train a multi-layer generative model that learns to extract meaningful features which resemble those found in the human visual cortex. Using a novel parallelization algorithm to distribute the work among multiple machines connected on a network, we show how training such a model can be done in reasonable time. A second problematic aspect of the tiny images dataset is that there are no reliable class labels which makes it hard to use for object recognition experiments. We created two sets of reliable labels. The CIFAR-10 set has 6000 examples of each of 10 classes and the CIFAR-100 set has 600 examples of each of 100 non-overlapping classes. Using these labels, we show that object recognition is significantly improved by pre-training a layer of features on a large set of unlabeled tiny images.

Citation Graph

Loading graph...

References [12]

Sort:
Filter:

Geoffrey Hinton, Ruslan Salakhutdinov - 2006

37 papers in library cite

G. Miller - 1995

5 papers in library cite

Yoshua Bengio, P. Lamblin, D. Popovici, Hugo Larochelle - 2006

33 papers in library cite

Geoffrey Hinton - 2002

23 papers in library cite

Antonio Torralba, Rob Fergus, W. Freeman - 2008

8 papers in library cite

P. Smolensky - 1986

11 papers in library cite

Y. Freund, D. Haussler - 1992

8 papers in library cite

Ruslan Salakhutdinov, I. Murray - 2008

4 papers in library cite

Y. Teh, Geoffrey Hinton - 2001

4 papers in library cite

T. Serre, Lior Wolf, S. Bileschi, M. Riesenhuber, T. Poggio - 2007

4 papers in library cite

A. Bell, T. Sejnowski - 1997

4 papers in library cite

T. Tieleman - 2008

4 papers in library cite

Cited by

27

papers in your library

Cites

6

papers in your library

Read

on November 26, 2025

Your review

Tags

Paper Aliases

No aliases