2012
Cite Score: 32
AI summary
This paper introduces a context-dependent recurrent neural network language model (RNNLM) that incorporates contextual information using Latent Dirichlet Allocation (LDA), achieving state-of-the-art perplexity on the Penn Treebank and improvements in word-error-rate on the Wall Street Journal task.
Abstract
Recurrent neural network language models (RNNLMs) have recently demonstrated state-of-the-art performance across a variety of tasks. In this paper, we improve their performance by providing a contextual real-valued input vector in association with each word. This vector is used to convey contextual information about the sentence being modeled. By performing Latent Dirichlet Allocation using a block of preceding text, we achieve a topic-conditioned RNNLM. This approach has the key advantage of avoiding the data fragmentation associated with building multiple topic models on different data subsets. We report perplexity results on the Penn Treebank data, where we achieve a new state-of-the-art. We further apply the model to the Wall Street Journal speech recognition task, where we observe improvements in word-error-rate.
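As a rough illustration of the idea in the abstract, the sketch below shows one way a real-valued context vector could be fed into a recurrent language model alongside each word. The class name `ContextRNNLM`, the weight names, the sigmoid and softmax choices, the decision to connect the context vector to both the hidden and output layers, and the toy sizes are all assumptions made for illustration, not details stated in the abstract. The context vector is supplied by hand here, whereas the paper derives it by running LDA on a block of preceding text. The weights are random and untrained, so the numbers carry no meaning beyond showing that the context changes the predictions.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class ContextRNNLM:
    """Hypothetical toy RNN language model with an extra context input."""

    def __init__(self, vocab_size, hidden_size, context_size, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        self.U = rand_matrix(hidden_size, vocab_size, rng)    # word -> hidden
        self.W = rand_matrix(hidden_size, hidden_size, rng)   # hidden -> hidden
        self.F = rand_matrix(hidden_size, context_size, rng)  # context -> hidden (assumed)
        self.V = rand_matrix(vocab_size, hidden_size, rng)    # hidden -> output
        self.G = rand_matrix(vocab_size, context_size, rng)   # context -> output (assumed)

    def step(self, word_id, state, context):
        """Consume one word plus its context vector; return (next-word probs, new state)."""
        recurrent = matvec(self.W, state)
        from_context = matvec(self.F, context)
        # A one-hot word input simply selects column `word_id` of U.
        new_state = [
            sigmoid(self.U[i][word_id] + recurrent[i] + from_context[i])
            for i in range(self.hidden_size)
        ]
        logits = [
            a + b
            for a, b in zip(matvec(self.V, new_state), matvec(self.G, context))
        ]
        return softmax(logits), new_state


def perplexity(model, word_ids, context):
    """Perplexity of a word-id sequence under a fixed context vector."""
    state = [0.0] * model.hidden_size
    log_prob = 0.0
    for current, following in zip(word_ids, word_ids[1:]):
        probs, state = model.step(current, state, context)
        log_prob += math.log(probs[following])
    return math.exp(-log_prob / (len(word_ids) - 1))


model = ContextRNNLM(vocab_size=5, hidden_size=4, context_size=3)
sentence = [0, 3, 1, 4, 2]
topic_a = [0.8, 0.1, 0.1]  # made-up topic posterior for illustration
topic_b = [0.1, 0.1, 0.8]  # a different made-up topic posterior
print(perplexity(model, sentence, topic_a))
print(perplexity(model, sentence, topic_b))
```

Because a single set of weights is shared across all contexts, the model is trained on the full corpus rather than on per-topic subsets, which is the data-fragmentation point the abstract makes.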