2009

Large-Scale Object Recognition With CUDA-accelerated Hierarchical Neural Networks

R. Uetz, Sven Behnke

citations

Cite Score

6

AI summary

This paper introduces a hierarchical, locally-connected neural network model (LCNP) accelerated by NVIDIA CUDA, suited for large-scale object recognition. A new realistic dataset was created from the LabelMe dataset, and the model achieved a testing error rate of 0.76% and 2.87% on MNIST and NORB datasets, respectively.

Main Contributions

  • Introduced a hierarchical, locally-connected neural network model (LCNP) optimized for large-scale, high-performance object recognition.
  • Implemented the model using the NVIDIA CUDA framework, allowing for massively parallel execution on a state-of-the-art graphics card.
  • Created a new realistic dataset by extracting a large number of objects from the LabelMe dataset of natural images.
  • Achieved a testing error rate of 0.76% on the MNIST dataset and 2.87% on the NORB dataset.
  • Demonstrated a speedup factor of up to 82 times compared to a single-core CPU version of the system.

Abstract

Robust recognition of arbitrary object classes in natural visual scenes is an aspiring goal with numerous practical applications, for instance, in the area of autonomous robotics and autonomous vehicles. One obstacle on the way towards human-like recognition performance is the limitation of computational power, restricting the size of the training and testing dataset as well as the complexity of the object recognition system. In this work, we present a hierarchical, locally-connected neural network model that is well-suited for large-scale, high-performance object recognition. By using the NVIDIA CUDA framework, we create a massively parallel implementation of the model which is executed on a state-of-the-art graphics card. This implementation is up to 82 times faster than a single-core CPU version of the system. This significant gain in computational performance allows us to evaluate the model on a very large, realistic, and challenging set of natural images which we extracted from the LabelMe dataset. To compare our model to other approaches, we also evaluate the recognition performance using the well-known MNIST and NORB datasets, achieving a testing error rate of 0.76% and 2.87%, respectively.

Citation Graph

Loading graph...

References [13]

Sort:
Filter:

Yann Lecun, Leon Bottou, Yoshua Bengio, Patrick Haffner - 1998

62 papers in library cite

F. Rosenblatt - 1958

3 papers in library cite

Yann Lecun, Leon Bottou, G. B. Orr, Klaus Robert Muller - 1998

20 papers in library cite

Li Fei Fei, Rob Fergus, Pietro Perona - 2004

15 papers in library cite

Bryan C. Russell, Antonio Torralba, Kevin P. Murphy, William T. Freeman - 2008

10 papers in library cite

Yann Lecun, Fu Jie Huang, Leon Bottou - 2004

18 papers in library cite

Yann Lecun - 1998

8 papers in library cite

F. Huang, Yann Lecun - 2006

5 papers in library cite

N. Pinto, D. D. Cox, J. J. Dicarlo - 2008

5 papers in library cite

Sven Behnke - 2003

3 papers in library cite

Nvidia - 2009

2 papers in library cite

E. R. Kandel, J. H. Schwartz, T. M. Jessel - 2000

1 paper in library cites

Missing year

R. Uetz

1 paper in library cites

Cited by

2

papers in your library

Cites

7

papers in your library

Read

on August 2, 2025

Your review

Tags

Paper Aliases

No aliases