2014
Cite Score
95
AI summary
This paper introduces a new RNN Encoder-Decoder model for statistical machine translation, using two recurrent neural networks to encode and decode sequences. The model improves translation performance by learning semantically meaningful phrase representations, achieving better results on the English/French translation task of the WMT'14 workshop.
Main Contributions
Abstract
In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder–Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.
Citation Graph
References [32]
Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012
71 papers in library cite
Sepp Hochreiter, Jürgen Schmidhuber - 1997
94 papers in library cite
Tomas Mikolov, Ilya Sutskever, K. Chen, G. S. Corrado, Jeffrey Dean - 2013
32 papers in library cite
Xavier Glorot, Antoine Bordes, Yoshua Bengio - 2011
17 papers in library cite
Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001
62 papers in library cite
Matthew D. Zeiler - 2012
13 papers in library cite
G. Dahl, D. Yu, L. Deng, Alex Acero - 2012
19 papers in library cite
Yoshua Bengio - 2013
17 papers in library cite
Surya Ganguli - 2014
9 papers in library cite
N. Kalchbrenner, Phil Blunsom - 2013
27 papers in library cite
F. Bastien, P. Lamblin, Razvan Pascanu, James Bergstra, I. Goodfellow, A. Bergeron, A. Bouchard, N. Nicolas, Yoshua Bengio - 2012
13 papers in library cite
Razvan Pascanu, C. G. Gulcehre, Kyunghyun Cho, Yoshua Bengio - 2013
7 papers in library cite
Richard Socher, Eric H. Huang, J. Pennin, C. Manning, A. Ng - 2011
10 papers in library cite
Yoshua Bengio, N. B. Lewandowski, Razvan Pascanu - 2013
4 papers in library cite
Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard Schwartz, John Makhoul - 2014
9 papers in library cite
Holger Schwenk - 2007
12 papers in library cite
Holger Schwenk - 2012
5 papers in library cite
James Bergstra, O. Breuleux, F. Bastien, P. Lamblin, Razvan Pascanu, G. Desjardins, J. Turian, D. W. Farley, Yoshua Bengio - 2010
22 papers in library cite
W. Zou, Richard Socher, D. Cer, C. Manning - 2013
4 papers in library cite
P. Koehn, F. J. Och, D. Marcu - 2003
8 papers in library cite
Alex Graves - 2012
6 papers in library cite
Ashish Vaswani, Y. Zhao, V. Fossum, D. Chiang - 2013
5 papers in library cite
A. Axelrod, X. Fe, Jianfeng Gao - 2011
5 papers in library cite
L. H. Son, A. Allauzen, F. Yvon - 2012
4 papers in library cite
Michael Auli, M. Galley, C. Quirk, Geoffrey Zweig - 2013
3 papers in library cite
P. Koehn - 2005
2 papers in library cite
D. Marcu, W. Wong - 2002
1 paper in library cites
S. Chandar, S. Lauly, Hugo Larochelle, M. Khapra, B. Ravindran, V. Raykar, A. Saha - 2014
1 paper in library cites
Laurens Van Der Maaten - 2013
1 paper in library cites
Holger Schwenk, M. R. C. Jussa, J. A. R. Fonollosa - 2006
1 paper in library cites
R. C. Moore, W. Lewis - 2010
1 paper in library cites
Jianfeng Gao, X. He, W. Yih, L. Deng - 2013
1 paper in library cites
Cited by
38
papers in your library
Cites
19
papers in your library
Read
on June 7, 2025
Your review
Tags
Paper Aliases
No aliases