2014

Learning Phrase Representations Using RNN Encoder-Decoder for Statistical Machine Translation

Kyunghyun Cho, Bart van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio

citations

Cite Score: 95

AI summary

This paper introduces the RNN Encoder-Decoder, a model for statistical machine translation in which one recurrent neural network encodes a source sequence into a fixed-length vector and a second decodes that vector into a target sequence. Adding the phrase-pair scores computed by the model as a feature in an existing statistical machine translation system improves translation performance on the English/French task of the WMT'14 workshop, and the learned phrase representations are semantically meaningful.

Main Contributions

  • Introduces a novel neural network architecture called RNN Encoder-Decoder for statistical machine translation.
  • Proposes a new type of hidden unit whose reset and update gates adaptively control how much each unit remembers or forgets.
  • Demonstrates improved translation performance on the English/French translation task of the WMT'14 workshop by scoring phrase pairs with the RNN Encoder-Decoder.
  • Shows that the RNN Encoder-Decoder captures linguistic regularities in phrase pairs and proposes well-formed target phrases.
  • Visualizes word and phrase representations learned by the model, revealing semantic and syntactic clustering.
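The gated hidden unit in the second contribution can be illustrated with a minimal NumPy sketch. It assumes the common formulation in which the reset gate r scales the previous state before the candidate state is computed, and the update gate z interpolates between the previous state and the candidate. The parameter names (W_r, U_r, W_z, U_z, W, U), the absence of bias terms, and the random initialization are assumptions made for illustration, not details taken from this page.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gated_step(x, h_prev, p):
    """One time step of a hidden unit with a reset gate r and an update gate z."""
    r = sigmoid(p["W_r"] @ x + p["U_r"] @ h_prev)  # reset gate: how much past state feeds the candidate
    z = sigmoid(p["W_z"] @ x + p["U_z"] @ h_prev)  # update gate: how much past state is carried over
    h_tilde = np.tanh(p["W"] @ x + p["U"] @ (r * h_prev))  # candidate state
    return z * h_prev + (1.0 - z) * h_tilde

def init_params(input_dim, hidden_dim, seed=0):
    """Small random weights; names and scale are illustrative assumptions."""
    rng = np.random.default_rng(seed)
    shapes = {
        "W_r": (hidden_dim, input_dim), "U_r": (hidden_dim, hidden_dim),
        "W_z": (hidden_dim, input_dim), "U_z": (hidden_dim, hidden_dim),
        "W": (hidden_dim, input_dim), "U": (hidden_dim, hidden_dim),
    }
    return {name: rng.normal(0.0, 0.1, size=shape) for name, shape in shapes.items()}

# Run the unit over a toy sequence of four one-hot input vectors.
params = init_params(input_dim=4, hidden_dim=3)
h = np.zeros(3)
for x in np.eye(4):
    h = gated_step(x, h, params)
```

When z is close to 1 the unit mostly copies its previous state, and when r is close to 0 the candidate state ignores the past and depends mainly on the current input.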

Abstract

In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder-Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.
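The pipeline described in the abstract can be sketched roughly as follows: encode a source phrase into a fixed-length vector, compute the log conditional probability of a target phrase given that vector, and use the result as one more feature in a log-linear score. This is a simplified illustration, not the authors' implementation. It uses plain tanh recurrences instead of gated units, untrained random weights, no end-of-sequence handling, and hypothetical names (E_src, W_enc, E_tgt, W_dec, C_dec, W_out, bos). The baseline feature values and weights in the usage lines are placeholders.

```python
import numpy as np

def log_softmax(v):
    v = v - v.max()  # shift for numerical stability
    return v - np.log(np.exp(v).sum())

def init_params(src_vocab, tgt_vocab, hidden, seed=0):
    rng = np.random.default_rng(seed)
    def mat(rows, cols):
        return rng.normal(0.0, 0.1, size=(rows, cols))
    return {
        "E_src": mat(hidden, src_vocab),  # source word embeddings (one column per word)
        "W_enc": mat(hidden, hidden),     # encoder recurrence
        "E_tgt": mat(hidden, tgt_vocab),  # target word embeddings
        "W_dec": mat(hidden, hidden),     # decoder recurrence
        "C_dec": mat(hidden, hidden),     # projection of the summary vector c
        "W_out": mat(tgt_vocab, hidden),  # output layer over the target vocabulary
    }

def encode(src_ids, p):
    """Fold the source sequence into one fixed-length vector."""
    h = np.zeros(p["W_enc"].shape[0])
    for i in src_ids:
        h = np.tanh(p["E_src"][:, i] + p["W_enc"] @ h)
    return h

def log_prob(src_ids, tgt_ids, p, bos=0):
    """log p(target | source) = sum over t of log p(y_t | y_<t, c)."""
    c = encode(src_ids, p)
    h = np.tanh(p["C_dec"] @ c)  # decoder state initialised from the summary
    prev, total = bos, 0.0
    for y in tgt_ids:
        # Each decoder step sees the previous target word, its own state, and c.
        h = np.tanh(p["E_tgt"][:, prev] + p["W_dec"] @ h + p["C_dec"] @ c)
        total += log_softmax(p["W_out"] @ h)[y]
        prev = y
    return float(total)

def log_linear_score(features, weights):
    """Weighted sum of feature values, as in a log-linear model."""
    return float(np.dot(weights, features))

params = init_params(src_vocab=5, tgt_vocab=6, hidden=8)
rnn_feature = log_prob([1, 2, 3], [2, 4, 1], params)
# The first two feature values stand in for existing baseline features (placeholders).
score = log_linear_score([-1.2, -0.7, rnn_feature], [0.5, 0.3, 0.2])
```

In a trained system the encoder and decoder weights would be learned jointly to maximize this conditional log probability over phrase pairs, and the feature weights would be tuned together with the other features of the translation system.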

Cited by 38 papers in your library

Cites 19 papers in your library

Read on June 7, 2025
