2014

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling

J. Chung, C. G. Gulcehre, Kyunghyun Cho, Yoshua Bengio

Cite Score: 91

AI summary

This paper compares LSTM, GRU, and tanh units on polyphonic music modeling and speech signal modeling tasks. The GRU and LSTM units outperform the traditional tanh unit, and GRU is better than LSTM on some datasets, which suggests that the choice of gated recurrent unit may depend heavily on the dataset and the corresponding task.

Main Contributions

  • Empirically evaluated RNNs with three widely used recurrent units: the traditional tanh unit, the LSTM unit, and the GRU.
  • Compared the three units on sequence modeling tasks: polyphonic music modeling and speech signal modeling.
  • Showed that LSTM and GRU units outperform the traditional tanh unit.
  • Showed that GRU is better than LSTM on some datasets, but not all.
  • Suggested that the choice of the type of gated recurrent unit may depend heavily on the dataset and corresponding task.

Abstract

In this paper we compare different types of recurrent units in recurrent neural networks (RNNs). Especially, we focus on more sophisticated units that implement a gating mechanism, such as a long short-term memory (LSTM) unit and a recently proposed gated recurrent unit (GRU). We evaluate these recurrent units on the tasks of polyphonic music modeling and speech signal modeling. Our experiments revealed that these advanced recurrent units are indeed better than more traditional recurrent units such as tanh units. Also, we found GRU to be comparable to LSTM.
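To make the gating mechanism mentioned in the abstract concrete, the sketch below writes out one time step for each of the three unit types in plain Python. It is an illustration under stated assumptions, not the authors' implementation: biases are omitted, the LSTM is the common formulation without peephole connections (the variant evaluated in the paper may differ), and the function names, parameter dictionary layout, and `init_params` helper are hypothetical.

```python
import math
import random


def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def matvec(W, v):
    # Matrix-vector product for a list-of-rows matrix.
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def add(*vs):
    # Element-wise sum of equally sized vectors.
    return [sum(t) for t in zip(*vs)]


def affine(p, name, x, h):
    # W_name x + U_name h (biases omitted in this sketch).
    return add(matvec(p["W" + name], x), matvec(p["U" + name], h))


def tanh_step(x, h, p):
    # Traditional unit: h_t = tanh(W x_t + U h_{t-1}).
    return [math.tanh(a) for a in affine(p, "", x, h)]


def gru_step(x, h, p):
    z = [sigmoid(a) for a in affine(p, "z", x, h)]  # update gate
    r = [sigmoid(a) for a in affine(p, "r", x, h)]  # reset gate
    rh = [ri * hi for ri, hi in zip(r, h)]
    cand = [math.tanh(a) for a in affine(p, "", x, rh)]  # candidate state
    # Interpolate between the previous state and the candidate.
    return [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]


def lstm_step(x, h, c, p):
    i = [sigmoid(a) for a in affine(p, "i", x, h)]  # input gate
    f = [sigmoid(a) for a in affine(p, "f", x, h)]  # forget gate
    o = [sigmoid(a) for a in affine(p, "o", x, h)]  # output gate
    g = [math.tanh(a) for a in affine(p, "c", x, h)]  # candidate memory
    c_new = [fi * ci + ii * gi for fi, ci, ii, gi in zip(f, c, i, g)]
    h_new = [oi * math.tanh(ci) for oi, ci in zip(o, c_new)]
    return h_new, c_new


def init_params(names, n_in, n_hid, rng):
    # Hypothetical helper: small uniform random weights for each gate name.
    p = {}
    for n in names:
        p["W" + n] = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
                      for _ in range(n_hid)]
        p["U" + n] = [[rng.uniform(-0.1, 0.1) for _ in range(n_hid)]
                      for _ in range(n_hid)]
    return p


if __name__ == "__main__":
    rng = random.Random(0)
    n_in, n_hid = 3, 4
    seq = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(5)]

    p_tanh = init_params([""], n_in, n_hid, rng)
    p_gru = init_params(["z", "r", ""], n_in, n_hid, rng)
    p_lstm = init_params(["i", "f", "o", "c"], n_in, n_hid, rng)

    h_t = h_g = h_l = c_l = [0.0] * n_hid
    for x in seq:
        h_t = tanh_step(x, h_t, p_tanh)
        h_g = gru_step(x, h_g, p_gru)
        h_l, c_l = lstm_step(x, h_l, c_l, p_lstm)
    print("tanh:", h_t)
    print("GRU: ", h_g)
    print("LSTM:", h_l)
```

The GRU keeps a single state vector and blends old and new content through the update gate, while the LSTM carries a separate memory cell whose exposure is controlled by the output gate. The tanh unit has neither, so every step fully overwrites its state.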

Cited by 11 papers in your library

Cites 15 papers in your library

Read on August 7, 2025
