2012

Building High-Level Features Using Large Scale Unsupervised Learning

Quoc V. Le, M. A. Ranzato, R. Monga, M. Devin, K. Chen, G. S. Corrado, Jeffrey Dean, Andrew Y. Ng

citations

Cite Score

64

AI summary

This paper introduces a 9-layer locally connected sparse autoencoder to learn high-level features from 10 million unlabeled images, achieving 15.8% accuracy on ImageNet with 22,000 categories, a 70% relative improvement over the state-of-the-art, demonstrating the possibility of training class-specific feature detectors without labeled data.

Main Contributions

  • Demonstrates that it is possible to train a face detector without having to label images as containing a face or not.
  • Introduces a 9-layered locally connected sparse autoencoder with pooling and local contrast normalization.
  • Uses a large dataset of 10 million 200x200 pixel images downloaded from the Internet.
  • Achieved 15.8% accuracy in recognizing 22,000 object categories from ImageNet, a leap of 70% relative improvement over the previous state-of-the-art.
  • Shows that the learned feature detector is robust not only to translation but also to scaling and out-of-plane rotation.

Abstract

We consider the problem of building high-level, class-specific feature detectors from only unlabeled data. For example, is it possible to learn a face detector using only unlabeled images? To answer this, we train a 9-layered locally connected sparse autoencoder with pooling and local contrast normalization on a large dataset of images (the model has 1 billion connections, the dataset has 10 million 200x200 pixel images downloaded from the Internet). We train this network using model parallelism and asynchronous SGD on a cluster with 1,000 machines (16,000 cores) for three days. Contrary to what appears to be a widely-held intuition, our experimental results reveal that it is possible to train a face detector without having to label images as containing a face or not. Control experiments show that this feature detector is robust not only to translation but also to scaling and out-of-plane rotation. We also find that the same network is sensitive to other high-level concepts such as cat faces and human bodies. Starting with these learned features, we trained our network to obtain 15.8% accuracy in recognizing 22,000 object categories from ImageNet, a leap of 70% relative improvement over the previous state-of-the-art.

Citation Graph

Loading graph...

References [40]

Sort:
Filter:

J. Deng, W. Dong, Richard Socher, L. J. Li, K. Li, Li Fei Fei - 2009

28 papers in library cite

Yann Lecun, Leon Bottou, Yoshua Bengio, Patrick Haffner - 1998

62 papers in library cite

Alex Krizhevsky - 2009

27 papers in library cite

Geoffrey Hinton, Ruslan Salakhutdinov - 2006

37 papers in library cite

Geoffrey E. Hinton, S. Osindero, Y. Teh - 2006

43 papers in library cite

B. Olshausen, D. Field - 1996

5 papers in library cite

Yoshua Bengio, P. Lamblin, D. Popovici, Hugo Larochelle - 2006

33 papers in library cite

K. Jarrett, Koray Kavukcuoglu, Marc'aurelio Ranzato, Yann Lecun - 2009

20 papers in library cite

Rajat Raina, Alexis Battle, Honglak Lee, Benjamin Packer, A. Ng - 2007

7 papers in library cite

Yoshua Bengio, Yann Lecun - 2007

15 papers in library cite

Dumitru Erhan, Yoshua Bengio, Aaron Courville, Pascal Vincent - 2009

4 papers in library cite

Marc'aurelio Ranzato, F. Huang, Y. Boureau, Yann Lecun - 2007

8 papers in library cite

Dan C. Ciresan, Ueli Meier, Luca M. Gambardella, Jürgen Schmidhuber - 2010

10 papers in library cite

Honglak Lee, R. Grosse, R. Ranganath, Andrew Y. Ng - 2009

12 papers in library cite

G. B. Huang, M. Ramesh, T. Berg, E. L. Miller - 2007

5 papers in library cite

Honglak Lee, C. Ekanadham, A. Ng - 2008

10 papers in library cite

M. Riesenhuber, T. Poggio - 1999

8 papers in library cite

A. Coates, A. Ng, Honglak Lee - 2011

7 papers in library cite

Kunihiko Fukushima, S. Miyake - 1982

7 papers in library cite

Honglak Lee, Alexis Battle, Rajat Raina, A. Ng - 2007

6 papers in library cite

D. H. Hubel, T. N. Wiesel - 1959

6 papers in library cite

N. Pinto, D. D. Cox, J. J. Dicarlo - 2008

5 papers in library cite

J. Sanchez, F. Perronnin - 2011

4 papers in library cite

Quoc Le, A. Karpenko, J. Ngiam, A. Ng - 2011

4 papers in library cite

Rajat Raina, A. Madhavan, Andrew Y. Ng - 2009

4 papers in library cite

Quoc V. Le, J. Ngiam, A. Coates, A. Lahiri, B. Prochnow, Andrew Y. Ng - 2011

4 papers in library cite

Quoc V. Le, J. Ngiam, Ziru Chen, D. Chia, P. W. Koh, Andrew Y. Ng - 2010

4 papers in library cite

P. Sermanet, Yann Lecun - 2011

4 papers in library cite

K. Gregor, Yann Lecun - 2010

3 papers in library cite

S. Lyu, E. Simoncelli - 2008

3 papers in library cite

P. Berkes, L. Wiskott - 2005

3 papers in library cite

Jason Weston, Samy Bengio, Nicolas Usunier - 2011

3 papers in library cite

J. J. Dicarlo, D. Zoccolan, N. C. Rust - 2012

2 papers in library cite

R. Q. Quiroga, L. Reddy, G. Kreiman, C. Koch, I. Fried - 2005

2 papers in library cite

C. Keller, M. Enzweiler, D. M. Gavrila - 2009

1 paper in library cites

B. Pakkenberg, P. D. Marner, L. Bundgaard, M. J. Gundersen, H. J. G. Nyengaard, J. R. Regeur - 2003

1 paper in library cites

Wenxuan Zhang, Jian Sun, X. Tang - 2008

1 paper in library cites

A. Hyvarinen, J. Hurri, P. O. Hoyer - 2009

1 paper in library cites

R. Desimone, T. Albright, C. Gross, C. Bruce - 1984

1 paper in library cites

J. Deng, A. Berg, K. Li, Li Fei Fei - 2010

1 paper in library cites

Cited by

10

papers in your library

Cites

14

papers in your library

Read

on October 19, 2025

Your review

Tags

Paper Aliases

No aliases