2005

Using GPUs for Machine Learning Algorithms

D. Steinkraus, I. Buck, Patrice Simard

citations

Cite Score

21

AI summary

This paper introduces a generic 2-layer fully connected neural network GPU implementation, achieving over 3X speedup for both training and testing with respect to a 3GHz, P4 CPU, demonstrating the potential of GPUs for machine learning tasks.

Main Contributions

  • Proposes a generic 2-layer fully connected neural network implementation on GPU.
  • Achieves over 3X speedup for both training and testing compared to a 3GHz, P4 CPU.
  • Demonstrates the feasibility and efficiency of using GPUs for machine learning algorithms.
  • Discusses the implementation of machine learning primitives using pixel shaders in DirectX.
  • Provides a versatile end-to-end learning algorithm that leverages the parallel processing capabilities of GPUs.

Abstract

Using dedicated hardware to do machine learning typically ends up in disaster because of cost, obsolescence, and poor software. The popularization of Graphic Processing Units (GPUs), which are now available on every PC, provides an attractive alternative. We propose a generic 2-layer fully connected neural network GPU implementation which yields over 3X speedup for both training and testing with respect to a 3GHz, P4 CPU.

Citation Graph

Loading graph...

References [5]

Sort:
Filter:

John C. Platt - 2003

12 papers in library cite

J. Kruger, R. Westermann - 2003

2 papers in library cite

T. J. Purcell, I. Buck, W. Mark, P. Hanrahan - 2002

1 paper in library cites

J. Bolz, I. Farmer, E. Grinspun, P. Schroder - 2003

1 paper in library cites

M. Macedonia - 2003

1 paper in library cites

Cited by

3

papers in your library

Cites

1

papers in your library

Read

on August 3, 2025

Your review

Tags

Paper Aliases

No aliases