2010

Performance and Scalability of gpu-based Convolutional Neural Networks

D. Strigl, K. Kofler, Stefan Podlipnig

citations

Cite Score

17

AI summary

This paper introduces a CUDA-based high-performance library for training and classification of CNNs on the GPU, demonstrating performance and scalability improvements compared to CPU implementations on the MNIST dataset. The GPU version achieves speedups of 2 to 24 times compared to the CPU, with better scaling for larger networks.

Main Contributions

  • Developed a high-performance library in CUDA for fast training and classification of CNNs on the GPU.
  • Demonstrated significant performance and scalability improvements by shifting computation-intensive CNN tasks to the GPU.
  • Achieved 2 to 24 times faster training and classification on the GPU compared to the CPU, depending on network topology.
  • Showed that the GPU implementation scales much better than the CPU implementation with respect to network size.
  • The library is publicly available to encourage further development in this research area.

Abstract

In this paper we present the implementation of a framework for accelerating training and classification of arbitrary Convolutional Neural Networks (CNNs) on the GPU. CNNs are a derivative of standard Multilayer Perceptron (MLP) neural networks optimized for two-dimensional pattern recognition problems such as Optical Character Recognition (OCR) or face detection. We describe the basic parts of a CNN and demonstrate the performance and scalability improvement that can be achieved by shifting the computation-intensive tasks of a CNN to the GPU. Depending on the network topology training and classification on the GPU performs 2 to 24 times faster than on the CPU. Furthermore, the GPU version scales much better than the CPU implementation with respect to the network size.

Citation Graph

Loading graph...

References [29]

Sort:
Filter:

Yann Lecun, Leon Bottou, Yoshua Bengio, Patrick Haffner - 1998

62 papers in library cite

Ronan Collobert, Jason Weston - 2008

32 papers in library cite

Yann Lecun, Leon Bottou, G. B. Orr, Klaus Robert Muller - 1998

20 papers in library cite

John C. Platt - 2003

12 papers in library cite

K. Chellapilla, S. Puri, Patrice Y. Simard - 2006

3 papers in library cite

K. S. Oh, Keechul Jung - 2004

2 papers in library cite

D. Steinkraus, I. Buck, Patrice Simard - 2005

3 papers in library cite

Alex Graves, Santiago Fernandez, Jürgen Schmidhuber - 2007

2 papers in library cite

D. Luebke, Greg Humphreys - 2007

1 paper in library cites

S. Lahabar, P. Agrawal, P. J. Narayanan - 2008

1 paper in library cites

C. M. Bishop - 1995

12 papers in library cite

C. Garcia, M. Delakis - 2004

3 papers in library cite

Nvidia - 2009

3 papers in library cite

Intel - 2009

2 papers in library cite

Nvidia - 2009

2 papers in library cite

Zhuoye Zhao, Shusheng Yang, X. Ma - 2008

1 paper in library cites

D. Strigl, K. Kofler - 2009

1 paper in library cites

J. C. L. Lam, M. Eizenman - 2008

1 paper in library cites

Missing year

C. Boyd

1 paper in library cites

S. Duffner - 2007

1 paper in library cites

Bryan Catanzaro, N. Sundaram, Kurt Keutzer - 2008

1 paper in library cites

Intel - 2009

1 paper in library cites

I. Durdanovic, E. Cosatto, H. P. Graf - 2007

1 paper in library cites

S. L. Phung, A. Bouzerdoum - 2009

1 paper in library cites

Yann Lecun, C. Cortes - 2009

1 paper in library cites

J. V. Bouvrie - 2006

1 paper in library cites

Nvidia - 2009

1 paper in library cites

T. K. Group - 2009

1 paper in library cites

G. Poli, J. H. Saito, J. F. Mari, M. R. Zorzan - 2008

1 paper in library cites

Cited by

1

papers in your library

Cites

11

papers in your library

Read

on August 2, 2025

Your review

Tags

Paper Aliases

No aliases