2004

GPU Implementation of Neural Networks

K. S. Oh, Keechul Jung

citations

Cite Score

28

AI summary

This paper introduces a GPU-based implementation of neural networks for enhanced performance in text detection, achieving a 20-fold speedup on an ATI RADEON 9700 PRO by converting inner-product operations into matrix operations, which efficiently utilizes the parallelism of GPUs.

Main Contributions

  • Demonstrates the use of GPUs to accelerate neural network computations.
  • Achieves a 20-fold performance improvement using an ATI RADEON 9700 PRO board.
  • Presents a method to convert inner-product operations into matrix operations for efficient GPU utilization.
  • Applies the GPU-accelerated neural network to a text detection system.
  • Suggests further research areas including benchmarking with various hardware and GPU-aware learning algorithms.

Abstract

Graphics processing unit (GPU) is used for a faster artificial neural network. It is used to implement the matrix multiplication of a neural network to enhance the time performance of a text detection system. Preliminary results produced a 20-fold performance enhancement using an ATI RADEON 9700 PRO board. The parallelism of a GPU is fully utilized by accumulating a lot of input feature vectors and weight vectors, then converting the many inner-product operations into one matrix operation. Further research areas include benchmarking the performance with various hardware and GPU-aware learning algorithms.

Citation Graph

Loading graph...

References [4]

Sort:
Filter:

Jiacheng Zhu, P. Sutton - 2003

1 paper in library cites

Missing year

D. Manocha

1 paper in library cites

A. Moravanszky - 2003

1 paper in library cites

Keechul Jung - 2001

1 paper in library cites

Cited by

2

papers in your library

Cites

0

papers in your library

Read

on August 1, 2025

Your review

Tags

Paper Aliases

No aliases