2011

Reading Digits in Natural Images With Unsupervised Feature Learning

Y. Netzer, Tao Wang, A. Coates, Alessandro Bissacco, Bo Wu, Andrew Y. Ng

AI summary

This paper applies unsupervised feature learning to reading digits in natural images. It introduces the Street View House Numbers (SVHN) dataset of over 600,000 labeled digits and shows that learned features outperform hand-designed features on it.

Main Contributions

  • Applies variants of two recently proposed unsupervised feature learning methods to reading digits in natural images.
  • Introduces the Street View House Numbers (SVHN) dataset, containing over 600,000 labeled digits cropped from Street View images.
  • Demonstrates that unsupervised feature learning methods outperform hand-designed features on the SVHN dataset.
  • Reports that, of the two learned-feature systems, the K-means-based one performs slightly better, a difference attributed to its spatial-pooling stage.
  • Reports that the learned features also give higher accuracy in the final end-to-end application on SVHN.
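
The K-means-plus-spatial-pooling idea mentioned in the list above can be sketched roughly as follows. This is a generic illustration, not the paper's configuration: the patch size, stride, number of centroids, the "triangle" soft encoding, the 2 x 2 pooling grid, and all function names are assumptions chosen for brevity, random arrays stand in for SVHN crops, and the whitening step commonly used in such pipelines is omitted.

```python
import numpy as np

def extract_patches(images, patch, stride):
    """Return all patch x patch windows: shape (n_images, rows, cols, patch*patch)."""
    n, h, w = images.shape
    rows = range(0, h - patch + 1, stride)
    cols = range(0, w - patch + 1, stride)
    out = np.empty((n, len(rows), len(cols), patch * patch))
    for i, r in enumerate(rows):
        for j, c in enumerate(cols):
            out[:, i, j, :] = images[:, r:r + patch, c:c + patch].reshape(n, -1)
    return out

def normalize(p, eps=10.0):
    """Per-patch brightness and contrast normalization (eps is an assumed constant)."""
    p = p - p.mean(axis=-1, keepdims=True)
    return p / np.sqrt(p.var(axis=-1, keepdims=True) + eps)

def kmeans(x, k, iters, rng):
    """Plain Lloyd's algorithm; returns (k, d) centroids."""
    centroids = x[rng.choice(len(x), size=k, replace=False)].copy()
    for _ in range(iters):
        d2 = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)
        assign = d2.argmin(axis=1)
        for c in range(k):
            members = x[assign == c]
            if len(members):  # an empty cluster keeps its previous centroid
                centroids[c] = members.mean(axis=0)
    return centroids

def encode(p, centroids):
    """Soft 'triangle' encoding: max(0, mean distance - distance to each centroid)."""
    flat = p.reshape(-1, p.shape[-1])
    d = np.sqrt(((flat[:, None, :] - centroids[None, :, :]) ** 2).sum(-1))
    f = np.maximum(0.0, d.mean(axis=1, keepdims=True) - d)
    return f.reshape(*p.shape[:-1], len(centroids))

def pool_quadrants(f):
    """Sum-pool the feature map over a 2 x 2 grid of spatial regions."""
    n, r, c, k = f.shape
    hr, hc = r // 2, c // 2
    quads = [f[:, :hr, :hc], f[:, :hr, hc:], f[:, hr:, :hc], f[:, hr:, hc:]]
    return np.concatenate([q.sum(axis=(1, 2)) for q in quads], axis=1)

rng = np.random.default_rng(0)
images = rng.random((20, 32, 32))  # stand-in for grayscale digit crops
patches = normalize(extract_patches(images, patch=8, stride=4))
centroids = kmeans(patches.reshape(-1, 64), k=16, iters=5, rng=rng)
features = pool_quadrants(encode(patches, centroids))
print(features.shape)  # (20, 64): 4 pooling regions x 16 centroids per image
```

The pooled vectors would then be fed to a supervised classifier such as a linear SVM; that stage is left out here. Pooling over regions trades spatial detail for a compact feature vector and some tolerance to small shifts of the digit.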

Abstract

Detecting and reading text from natural images is a hard computer vision task that is central to a variety of emerging applications. Related problems like document character recognition have been widely studied by computer vision and machine learning researchers and are virtually solved for practical applications like reading handwritten digits. Reliably recognizing characters in more complex scenes like photographs, however, is far more difficult: the best existing methods lag well behind human performance on the same tasks. In this paper we attack the problem of recognizing digits in a real application using unsupervised feature learning methods: reading house numbers from street level photos. To this end, we introduce a new benchmark dataset for research use containing over 600,000 labeled digits cropped from Street View images. We then demonstrate the difficulty of recognizing these digits when the problem is approached with hand-designed features. Finally, we employ variants of two recently proposed unsupervised feature learning methods and find that they are convincingly superior on our benchmarks.
