2013

Rectifier Nonlinearities Improve Neural Network Acoustic Models

A. L. Maas, A. Y. Hannun, Andrew Y. Ng

citations

Cite Score

85

AI summary

This paper introduces deep rectifier networks as acoustic models for the Switchboard conversational speech recognition task, achieving a 2% absolute reduction in word error rates over sigmoidal counterparts. It analyzes hidden layer representations to quantify differences in how ReL units encode inputs compared to sigmoidal units.

Main Contributions

  • Evaluates rectifier DNNs as acoustic models for a 300-hour Switchboard conversational LVCSR task.
  • Compares rectifier variants, including standard ReL and leaky ReL.
  • Presents a quantitative analysis of how different DNNs encode information, providing insights into why rectifier DNNs perform well.
  • Achieves WER reductions of up to 2% absolute on the full Eval2000 dataset compared to sigmoidal DNNs.
  • Demonstrates that rectifier DNNs benefit more from depth than sigmoidal DNNs due to the lack of vanishing gradients.

Abstract

Deep neural network acoustic models produce substantial gains in large vocabulary continuous speech recognition systems. Emerging work with rectified linear (ReL) hidden units demonstrates additional gains in final system performance relative to more commonly used sigmoidal nonlinearities. In this work, we explore the use of deep rectifier networks as acoustic models for the 300 hour Switchboard conversational speech recognition task. Using simple training procedures without pretraining, networks with rectifier nonlinearities produce 2% absolute reductions in word error rates over their sigmoidal counterparts. We analyze hidden layer representations to quantify differences in how ReL units encode inputs as compared to sigmoidal units. Finally, we evaluate a variant of the ReL unit with a gradient more amenable to optimization in an attempt to further improve deep rectifier networks.

Citation Graph

Loading graph...

References [14]

Sort:
Filter:

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

Geoffrey Hinton - 2012

21 papers in library cite

Yoshua Bengio, Patrice Simard, Paolo Frasconi - 1994

31 papers in library cite

Xavier Glorot, Antoine Bordes, Yoshua Bengio - 2011

17 papers in library cite

G. Dahl, D. Yu, L. Deng, Alex Acero - 2012

19 papers in library cite

Matthew D. Zeiler, M. A. Ranzato, R. Monga, M. Mao, K. Yang, Quoc Le, P. Nguyen, A. Senior, Vincent Vanhoucke, Jeffrey Dean - 2013

3 papers in library cite

A. Coates, Andrew Y. Ng - 2011

5 papers in library cite

D. Povey, A. Ghoshal - 2011

4 papers in library cite

Brian Kingsbury, T. N. Sainath, H. Soltau - 2012

3 papers in library cite

D. Yu, M. L. Seltzer, Jeffrey Li, J. T. Huang, F. Seide - 2013

2 papers in library cite

George E. Dahl, T. N. Sainath, Geoffrey E. Hinton - 2013

2 papers in library cite

B. Willmore, P. A. Watters, D. J. Tolhurst - 2000

1 paper in library cites

B. Willmore, D. J. Tolhurst - 2001

1 paper in library cites

K. Vesely, A. Ghoshal, Lukas Burget, D. Povey - 2013

1 paper in library cites

Cited by

3

papers in your library

Cites

6

papers in your library

Read

on August 1, 2025

Your review

Tags

Paper Aliases

No aliases