2014
Cite Score
70
AI summary
This paper shows how to apply dropout to LSTMs correctly: only to the non-recurrent connections. Standard dropout also perturbs the recurrent connections, which makes it difficult for the LSTM to retain information across time steps. Applied this way, dropout reduces overfitting in language modeling, speech recognition, image caption generation, and machine translation.
Abstract
We present a simple regularization technique for Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) units. Dropout, the most successful technique for regularizing neural networks, does not work well with RNNs and LSTMs. In this paper, we show how to correctly apply dropout to LSTMs, and show that it substantially reduces overfitting on a variety of tasks. These tasks include language modeling, speech recognition, image caption generation, and machine translation.
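The idea in the abstract can be illustrated with a minimal sketch in plain Python. It is not the authors' implementation: the helper names (`dropout`, `init_layer`, `lstm_step`, `run_stack`), the bias-free gates, the weight range, and the choice to sample a fresh mask at every time step are all assumptions made for illustration. The point it shows is that dropout is applied only to the activations passed upward between layers, while the hidden and cell states carried across time steps are left untouched.

```python
import math
import random


def dropout(vec, rate, rng, train=True):
    """Inverted dropout: zero each unit with probability `rate`, rescale the rest."""
    if not train or rate == 0.0:
        return list(vec)
    keep = 1.0 - rate
    return [v / keep if rng.random() < keep else 0.0 for v in vec]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def init_layer(n_in, n_hid, rng):
    """Random weights for the four gates (i, f, o, g), stacked row-wise (hypothetical init)."""
    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    return {"Wx": mat(4 * n_hid, n_in), "Wh": mat(4 * n_hid, n_hid), "n": n_hid}


def lstm_step(p, x, h_prev, c_prev):
    """One LSTM step; x is the (possibly dropped-out) input from the layer below."""
    n = p["n"]
    z = [a + b for a, b in zip(matvec(p["Wx"], x), matvec(p["Wh"], h_prev))]
    i = [sigmoid(v) for v in z[0:n]]           # input gate
    f = [sigmoid(v) for v in z[n:2 * n]]       # forget gate
    o = [sigmoid(v) for v in z[2 * n:3 * n]]   # output gate
    g = [math.tanh(v) for v in z[3 * n:4 * n]]  # candidate cell update
    c = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c_prev, i, g)]
    h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
    return h, c


def run_stack(layers, inputs, rate, rng, train=True):
    """Run a stacked LSTM over a sequence with dropout on non-recurrent connections only."""
    states = [([0.0] * p["n"], [0.0] * p["n"]) for p in layers]
    outputs = []
    for x in inputs:
        below = x
        for k, p in enumerate(layers):
            h_prev, c_prev = states[k]
            # Dropout is applied to the input arriving from the layer below ...
            h, c = lstm_step(p, dropout(below, rate, rng, train), h_prev, c_prev)
            # ... while h_prev and c_prev pass through time unperturbed.
            states[k] = (h, c)
            below = h
        # The top layer's output is also dropped out before any output layer.
        outputs.append(dropout(below, rate, rng, train))
    return outputs


if __name__ == "__main__":
    rng = random.Random(0)
    layers = [init_layer(3, 4, rng), init_layer(4, 4, rng)]
    sequence = [[0.1, 0.2, 0.3], [0.0, -0.1, 0.5]]
    for h_top in run_stack(layers, sequence, rate=0.5, rng=rng):
        print(h_top)
```

Passing `train=False` disables the masks, so evaluation uses the full network; because the sketch uses inverted dropout (rescaling kept units during training), no extra scaling is needed at test time.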