2014

Recurrent Neural Network Regularization

Wojciech Zaremba, Ilya Sutskever, Oriol Vinyals

citations

Cite Score

70

AI summary

This paper introduces a method to correctly apply dropout to LSTMs, reducing overfitting in language modeling, speech recognition, image caption generation, and machine translation, achieving improved results. It shows that standard dropout perturbs recurrent connections, which makes it difficult for the LSTM to learn.

Main Contributions

  • Introduces a method to correctly apply dropout to LSTMs
  • Dropout is only applied to the non-recurrent connections
  • Demonstrates that the method reduces overfitting on a variety of tasks
  • Achieves improved results on language modeling, speech recognition, image caption generation, and machine translation

Abstract

We present a simple regularization technique for Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) units. Dropout, the most successful technique for regularizing neural networks, does not work well with RNNS and LSTMs. In this paper, we show how to correctly apply dropout to LSTMs, and show that it substantially reduces overfitting on a variety of tasks. These tasks include language modeling, speech recognition, image caption generation, and machine translation.

Citation Graph

Loading graph...

References [34]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Christian Szegedy, Weizhou Liu, Y. Jia, P. Sermanet, S. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich - 2015

20 papers in library cite

T. Y. Lin, M. Maire, S. Belongie, James Hays, Pietro Perona, D. Ramanan, Piotr Dollar, C. L. Zitnick - 2014

14 papers in library cite

Kyunghyun Cho, B. V. Merrienboer, C. G. Gulcehre, D. Bahdanau, F. Bougares, Holger Schwenk, Yoshua Bengio - 2014

38 papers in library cite

Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014

58 papers in library cite

Geoffrey Hinton - 2013

13 papers in library cite

M. P. Marcus, B. Santorini, Mary Ann Marcinkiewicz - 1993

22 papers in library cite

Dumitru Erhan - 2015

11 papers in library cite

Tomas Mikolov, M. Karafiat, Lukas Burget, Jan Cernocky, Sanjeev Khudanpur - 2010

36 papers in library cite

Alex Graves - 2013

27 papers in library cite

L. Wan, M. Zeiler, S. Zhang, Rob Fergus - 2013

8 papers in library cite

M. Sundermeyer, R. Schluter, Hermann Ney - 2010

7 papers in library cite

N. Kalchbrenner, Phil Blunsom - 2013

27 papers in library cite

Tomas Mikolov, Quoc V. Le, Ilya Sutskever - 2013

6 papers in library cite

Razvan Pascanu, C. G. Gulcehre, Kyunghyun Cho, Yoshua Bengio - 2013

7 papers in library cite

Tomas Mikolov, Geoffrey Zweig - 2012

12 papers in library cite

Tomas Mikolov, A. Deoras, D. Povey, Lukas Burget, Jan Cernocky - 2011

9 papers in library cite

Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard Schwartz, John Makhoul - 2014

9 papers in library cite

V. Pham, T. Bluche, C. Kermorvant, J. Louradour - 2014

5 papers in library cite

J. Koutnik, K. Greff, Faustino Gomez, Jürgen Schmidhuber - 2014

4 papers in library cite

Tomas Mikolov - 2012

17 papers in library cite

H. Bourlard, N. Morgan - 1993

8 papers in library cite

N. Srivastava - 2013

6 papers in library cite

Alex Graves, M. Liwicki, Santiago Fernandez, R. Bertolami, H. Bunke, Jürgen Schmidhuber - 2009

5 papers in library cite

Shijie Wang, C. Manning - 2013

4 papers in library cite

J. Bayer - 2013

3 papers in library cite

Herbert Jaeger, M. Lukosevicius, D. Popovici, U. Siewert - 2007

3 papers in library cite

Y. Chow, M. Dunham, O. Kimball, M. Krasner, G. Kubala, John Makhoul, P. Price, S. Roucos, Richard Schwartz - 1987

2 papers in library cite

W. C. Cheng, S. Kok, H. V. Pham, H. L. Chieu, K. M. A. Chai - 2014

2 papers in library cite

M. Pachitariu, M. Sahani - 2013

2 papers in library cite

Tony Robinson, M. Hochberg, S. Renals - 1996

2 papers in library cite

Holger Schwenk - 2014

2 papers in library cite

Holger Schwenk, P. Lambert, L. Barrault, C. Servan, H. Afli, S. A. Rauf, K. Shah - 2011

1 paper in library cites

H. Sak, Oriol Vinyals, Georg Heigold, A. Senior, E. Mcdermott, R. Monga, M. Mao - 2014

1 paper in library cites

Cited by

22

papers in your library

Cites

20

papers in your library

Read

on October 13, 2025

Your review

Tags

Paper Aliases

No aliases