2011

Extensions of Recurrent Neural Network Language Model

Tomas Mikolov, S. Kombrink, Lukas Burget, Jan Cernocky, Sanjeev Khudanpur

citations

Cite Score

49

AI summary

This paper introduces several modifications to the recurrent neural network language model (RNN LM) to reduce computational complexity. The modifications lead to a 15x speedup in training and testing. The paper also shows the importance of using a backpropagation through time algorithm.

Main Contributions

  • Introduces several modifications to the recurrent neural network language model (RNN LM) to reduce computational complexity.
  • Shows approaches that lead to more than 15 times speedup for both training and testing phases.
  • Shows the importance of using a backpropagation through time algorithm.
  • Discusses possibilities of reducing the amount of parameters in the model.
  • The resulting RNN model can thus be smaller, faster both during training and testing, and more accurate than the basic one.

Abstract

We present several modifications of the original recurrent neural network language model (RNN LM). While this model has been shown to significantly outperform many competitive language modeling techniques in terms of accuracy, the remaining problem is the computational complexity. In this work, we show approaches that lead to more than 15 times speedup for both training and testing phases. Next, we show importance of using a backpropagation through time algorithm. An empirical comparison with feedforward networks is also provided. In the end, we discuss possibilities how to reduce the amount of parameters in the model. The resulting RNN model can thus be smaller, faster both during training and testing, and more accurate than the basic one.

Citation Graph

Loading graph...

References [18]

Sort:
Filter:

D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams - 1986

34 papers in library cite

Jeffrey L. Elman - 1990

23 papers in library cite

Yoshua Bengio, Patrice Simard, Paolo Frasconi - 1994

31 papers in library cite

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Tomas Mikolov, M. Karafiat, Lukas Burget, Jan Cernocky, Sanjeev Khudanpur - 2010

36 papers in library cite

Yoshua Bengio, Yann Lecun - 2007

15 papers in library cite

F. Morin, Yoshua Bengio - 2005

19 papers in library cite

Yoshua Bengio, Jean Sebastien Senecal - 2008

6 papers in library cite

Holger Schwenk, Jean Luc Gauvain - 2005

7 papers in library cite

J. Goodman - 2001

15 papers in library cite

J. T. Goodman - 2001

7 papers in library cite

D. Filimonov, M. Harper - 2009

4 papers in library cite

A. Emami, Frederick Jelinek - 2004

4 papers in library cite

Tomas Mikolov, J. Kopecky, Lukas Burget, O. Glembek, Jan Cernocky - 2009

4 papers in library cite

P. Xu - 2005

4 papers in library cite

M. Boden - 2002

2 papers in library cite

A. Emami - 2006

2 papers in library cite

A. Alexandrescu, K. Kirchhoff - 2006

2 papers in library cite

Cited by

16

papers in your library

Cites

9

papers in your library

Read

on March 21, 2025

Your review

Tags

Paper Aliases

No aliases