2011
Cite Score
80
AI summary
This paper introduces a method for reading digits in natural images using unsupervised feature learning, achieving superior results on a new Street View House Numbers (SVHN) dataset of over 600,000 labeled digits compared to hand-designed features.
Main Contributions
Abstract
Detecting and reading text from natural images is a hard computer vision task that is central to a variety of emerging applications. Related problems like document character recognition have been widely studied by computer vision and machine learning researchers and are virtually solved for practical applications like reading handwritten digits. Reliably recognizing characters in more complex scenes like photographs, however, is far more difficult: the best existing methods lag well behind human performance on the same tasks. In this paper we attack the problem of recognizing digits in a real application using unsupervised feature learning methods: reading house numbers from street level photos. To this end, we introduce a new benchmark dataset for research use containing over 600,000 labeled digits cropped from Street View images. We then demonstrate the difficulty of recognizing these digits when the problem is approached with hand-designed features. Finally, we employ variants of two recently proposed unsupervised feature learning methods and find that they are convincingly superior on our benchmarks.
Citation Graph
References [26]
P. H. Vincent, Larochelle, Isabelle Lajoie, Yoshua Bengio, Pierre Antoine Manzagol - 2010
6 papers in library cite
G. Dahl, D. Yu, L. Deng, Alex Acero - 2012
19 papers in library cite
Antonio Torralba, Rob Fergus, W. Freeman - 2008
8 papers in library cite
Dragomir Anguelov, Carole Dulong, Daniel Filip, Christian Frueh, Stephane Lafon, Richard Lyon, Abhijit Ogale, Luc Vincent, Josh Weaver - 2010
1 paper in library cites
I. Goodfellow, Quoc Le, A. Saxe, A. Ng - 2009
7 papers in library cite
Andrea Frome, German Cheung, Ahmad Abdulkader, Marco Zennaro, Bo Wu, Alessandro Bissacco, Hartwig Adam, Hartmut Neven, Luc Vincent - 2009
2 papers in library cite
Dan C. Ciresan, Ueli Meier, Luca M. Gambardella, Jürgen Schmidhuber - 2010
10 papers in library cite
N. Dalal, B. Triggs - 2005
12 papers in library cite
Honglak Lee, C. Ekanadham, A. Ng - 2008
10 papers in library cite
Jihan Yang, K. Yu, Y. Gong, T. Huang - 2009
8 papers in library cite
A. Coates, A. Ng, Honglak Lee - 2011
7 papers in library cite
S. Russell, P. Norvig - 1995
4 papers in library cite
Quoc Le, W. Zou, S. Y. Yeung, A. Ng - 2011
4 papers in library cite
N. Pinto, D. Doukhan, J. J. Dicarlo, D. D. Cox - 2009
3 papers in library cite
R. Plamondon, S. N. Srihari - 2000
2 papers in library cite
Honglak Lee, P. Pham, Y. Largman, A. Ng - 2009
2 papers in library cite
Y. Pan, X. Hou, C. L. Liu - 2008
1 paper in library cites
A. Mishra, K. Alahari, C. Jawahar - 2011
1 paper in library cites
G. Nagy - 1992
1 paper in library cites
F. Shafait, D. Keysers, T. M. Breuel - 2008
1 paper in library cites
K. Wang, B. Babenko, S. Belongie - 2011
1 paper in library cites
S. Mori, C. Y. Suen, K. Yamamoto - 1992
1 paper in library cites
F. Kimura, T. Wakabayashi, S. Tsuruoka, Y. Miyake - 1997
1 paper in library cites
A. Coates, B. Carpenter, C. Case, S. Satheesh, B. Suresh, Tianle Wang, D. Wu, A. Ng - 2011
1 paper in library cites
T. M. Breuel - 2008
1 paper in library cites
J. J. Weinman - 2008
1 paper in library cites
Cited by
8
papers in your library
Cites
7
papers in your library
Read
on October 15, 2025
Your review
Tags
Paper Aliases
No aliases