Papperoni

2009

Learning Multiple Layers of Features From Tiny Images

Alex Krizhevsky

citations

Cite Score

AI summary

This paper introduces a multi-layer generative model using Restricted Boltzmann Machines (RBMs) and Deep Belief Networks (DBNs) for extracting meaningful features from tiny images, achieving improved object recognition with the CIFAR-10 and CIFAR-100 datasets through pre-training on unlabeled data, as well as introducing a parallelization algorithm.

Main Contributions

Demonstrates the ability to train a multi-layer generative model to extract meaningful features from tiny images.
Introduces a novel parallelization algorithm for training the model on a network of machines.
Creates and releases two labeled datasets, CIFAR-10 and CIFAR-100, for object recognition experiments.
Shows that object recognition can be significantly improved by pre-training a layer of features on a large set of unlabeled tiny images.
Demonstrates the use of RBMs and DBNs for feature extraction and pre-training to improve classification performance.

Abstract

Groups at MIT and NYU have collected a dataset of millions of tiny colour images from the web. It is, in principle, an excellent dataset for unsupervised training of deep generative models, but previous researchers who have tried this have found it difficult to learn a good set of filters from the images. We show how to train a multi-layer generative model that learns to extract meaningful features which resemble those found in the human visual cortex. Using a novel parallelization algorithm to distribute the work among multiple machines connected on a network, we show how training such a model can be done in reasonable time. A second problematic aspect of the tiny images dataset is that there are no reliable class labels which makes it hard to use for object recognition experiments. We created two sets of reliable labels. The CIFAR-10 set has 6000 examples of each of 10 classes and the CIFAR-100 set has 600 examples of each of 100 non-overlapping classes. Using these labels, we show that object recognition is significantly improved by pre-training a layer of features on a large set of unlabeled tiny images.

Citation Graph

Loading graph...

References [12]

Sort:

Filter:

[1]Reducing the Dimensionality of Data With Neural Networks

Geoffrey Hinton, Ruslan Salakhutdinov - 2006

37 papers in library cite

Google Scholar

I didn't like the way this is written, very hard to understand without a ton of background knowledge. But hey, it's the first deep learning model!

[2]WordNet: A Lexical Database for English

G. Miller - 1995

5 papers in library cite

Google Scholar

Meh. It seems like it was publish so that people could cite the dataset somehow. Nothing interesting, but quick read and very used.

[3]Greedy Layer-Wise Training of Deep Networks

Yoshua Bengio, P. Lamblin, D. Popovici, Hugo Larochelle - 2006

33 papers in library cite

Google Scholar

Bengio is perfect. This is everything that Hinton's paper hoped to be. Very well explained, and also tying back to real use cases (not just "hey, the math works and it reduced the score")

[4]Training Products of Experts by Minimizing Contrastive Divergence

Geoffrey Hinton - 2002

23 papers in library cite

Google Scholar

Good read, but I think I need to revisit it after I understand RBMs better.

[5]80 Million Tiny Images: A Large Dataset for Non-Parametric Object and Scene Recognition

Antonio Torralba, Rob Fergus, W. Freeman - 2008

8 papers in library cite

Google Scholar

The initial part about data collection and dataset description was nice, but the part of classifying was a bit overkill

[6]Information Processing in Dynamical Systems: Foundations of Harmony Theory

P. Smolensky - 1986

11 papers in library cite

Google Scholar

88 pages; Introduced RBMs

[7]Unsupervised Learning of Distributions on Binary Vectors Using Two Layer Networks

Y. Freund, D. Haussler - 1992

8 papers in library cite

Google Scholar

[8]On the Quantitative Analysis of Deep Belief Networks

Ruslan Salakhutdinov, I. Murray - 2008

4 papers in library cite

Google Scholar

[9]Rate-Coded Restricted Boltzmann Machines for Face Recognition

Y. Teh, Geoffrey Hinton - 2001

4 papers in library cite

Google Scholar

[10]Robust Object Recognition With Cortex-Like Mechanisms

T. Serre, Lior Wolf, S. Bileschi, M. Riesenhuber, T. Poggio - 2007

4 papers in library cite

Google Scholar

[11]The "Independent Components" of Natural Scenes Are Edge Filters

A. Bell, T. Sejnowski - 1997

4 papers in library cite

Google Scholar

[12]Training Restricted Boltzmann Machines Using Approximations to the Likelihood Gradient

T. Tieleman - 2008

4 papers in library cite

Google Scholar

Cited by

papers in your library

Cites

papers in your library

Read

on November 26, 2025

It's alright. It mainly focuses on RBMs and their features and the actual part that describes the dataset is like 1 page. However, it's maybe the best intuitive description of an RBM I have seen. Other than that, it reads very much like an undergraduate thesis.