2012

Context Dependent Recurrent Neural Network Language Model

Tomas Mikolov, Geoffrey Zweig

citations

Cite Score

32

AI summary

This paper introduces a context-dependent recurrent neural network language model (RNNLM) that incorporates contextual information using Latent Dirichlet Allocation (LDA), achieving state-of-the-art perplexity on the Penn Treebank and improvements in word-error-rate on the Wall Street Journal task.

Main Contributions

  • Introduced the use of context vectors to improve the performance of a RNNLM.
  • Demonstrated perplexity improvements over the previous state-of-the-art for the Penn Treebank.
  • Developed an efficient method for computing context vectors when using a sliding window of context.
  • Evaluated the models by rescoring N-best lists from a speech recognizer and observe improvements there as well.
  • Achieved WER improvements for the Wall Street Journal task.

Abstract

Recurrent neural network language models (RNNLMs) have recently demonstrated state-of-the-art performance across a variety of tasks. In this paper, we improve their performance by providing a contextual real-valued input vector in association with each word. This vector is used to convey contextual information about the sentence being modeled. By performing Latent Dirichlet Allocation using a block of preceding text, we achieve a topic-conditioned RNNLM. This approach has the key advantage of avoiding the data fragmentation associated with building multiple topic models on different data subsets. We report perplexity results on the Penn Treebank data, where we achieve a new state-of-the-art. We further apply the model to the Wall Street Journal speech recognition task, where we observe improvements in word-error-rate.

Citation Graph

Loading graph...

References [37]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Yoshua Bengio, Patrice Simard, Paolo Frasconi - 1994

31 papers in library cite

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

M. P. Marcus, B. Santorini, Mary Ann Marcinkiewicz - 1993

22 papers in library cite

Tomas Mikolov, M. Karafiat, Lukas Burget, Jan Cernocky, Sanjeev Khudanpur - 2010

36 papers in library cite

Tomas Mikolov, S. Kombrink, Lukas Burget, Jan Cernocky, Sanjeev Khudanpur - 2011

16 papers in library cite

A. Mnih, Geoffrey Hinton - 2007

12 papers in library cite

James Martens, Ilya Sutskever - 2011

13 papers in library cite

Tomas Mikolov, A. Deoras, D. Povey, Lukas Burget, Jan Cernocky - 2011

9 papers in library cite

Holger Schwenk - 2007

12 papers in library cite

Tomas Mikolov, A. Deoras, S. Kombrink, Lukas Burget, Jan Cernocky - 2011

13 papers in library cite

Reference title contains 'et al'

D. M. Blei, Andrew Y. Ng, Michael I. Jordan - 2003

10 papers in library cite

S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, R. Harshman - 1990

12 papers in library cite

H. S. Le, I. Oparin, A. Allauzen, Jean Luc Gauvain, F. Yvon - 2011

7 papers in library cite

Tomas Mikolov - 2012

17 papers in library cite

R. Kuhn, R. D. Mori - 1990

6 papers in library cite

D. Filimonov, M. Harper - 2009

4 papers in library cite

A. Emami, Frederick Jelinek - 2004

4 papers in library cite

P. Xu - 2005

4 papers in library cite

D. Povey, A. Ghoshal - 2011

4 papers in library cite

R. Kneser, V. Steinbiss - 1993

3 papers in library cite

P. Xu, D. Karakos, Sanjeev Khudanpur - 2009

3 papers in library cite

S. F. Chen - 2009

3 papers in library cite

J. Bellegarda - 2000

2 papers in library cite

Sanjeev Khudanpur, Jeffrey Wu - 2000

2 papers in library cite

R. M. Iyer, M. Ostendorf - 1999

2 papers in library cite

N. Coccaro, Dan Jurafsky - 1998

2 papers in library cite

R. Lau, R. Rosenfeld, S. Roukos - 1993

2 papers in library cite

R. Rosenfeld - 1997

1 paper in library cites

F. Z. Martinez, S. E. Boquera, M. J. C. Bleda, R. D. Mori - 2012

1 paper in library cites

E. Arisoy, M. Saraclar, B. Roark, I. Shafran - 2012

1 paper in library cites

S. Chu, L. Mangu - 2012

1 paper in library cites

L. H. Son, A. Allauzen, F. Yvon - 2012

1 paper in library cites

S. Chen - 2009

1 paper in library cites

Geoffrey Zweig, S. Chang - 2011

1 paper in library cites

W. Reichl, W. Chou - 2000

1 paper in library cites

D. Povey, Lukas Burget - 2011

1 paper in library cites

Cited by

12

papers in your library

Cites

13

papers in your library

Read

on November 23, 2025

Your review

Tags

Paper Aliases

No aliases