2010
Cite Score
17
AI summary
This paper introduces a CUDA-based high-performance library for training and classification with CNNs on the GPU and demonstrates performance and scalability improvements over CPU implementations on the MNIST dataset. The GPU version runs 2 to 24 times faster than the CPU version and scales better for larger networks.
Abstract
In this paper we present the implementation of a framework for accelerating training and classification of arbitrary Convolutional Neural Networks (CNNs) on the GPU. CNNs are a derivative of standard Multilayer Perceptron (MLP) neural networks optimized for two-dimensional pattern recognition problems such as Optical Character Recognition (OCR) or face detection. We describe the basic parts of a CNN and demonstrate the performance and scalability improvements that can be achieved by shifting the computation-intensive tasks of a CNN to the GPU. Depending on the network topology, training and classification on the GPU run 2 to 24 times faster than on the CPU. Furthermore, the GPU version scales much better than the CPU implementation with respect to network size.