2005
Cite Score
21
AI summary
This paper introduces a generic 2-layer fully connected neural network GPU implementation, achieving over 3X speedup for both training and testing with respect to a 3GHz, P4 CPU, demonstrating the potential of GPUs for machine learning tasks.
Main Contributions
Abstract
Using dedicated hardware to do machine learning typically ends up in disaster because of cost, obsolescence, and poor software. The popularization of Graphic Processing Units (GPUs), which are now available on every PC, provides an attractive alternative. We propose a generic 2-layer fully connected neural network GPU implementation which yields over 3X speedup for both training and testing with respect to a 3GHz, P4 CPU.
Citation Graph
References [5]
John C. Platt - 2003
12 papers in library cite
J. Kruger, R. Westermann - 2003
2 papers in library cite
T. J. Purcell, I. Buck, W. Mark, P. Hanrahan - 2002
1 paper in library cites
J. Bolz, I. Farmer, E. Grinspun, P. Schroder - 2003
1 paper in library cites
M. Macedonia - 2003
1 paper in library cites
Cited by
3
papers in your library
Cites
1
papers in your library
Read
on August 3, 2025
Your review
Tags
Paper Aliases
No aliases