2014

On the Properties of Neural Machine Translation: Encoder-Decoder Approaches

Kyunghyun Cho, B. V. Merrienboer, D. Bahdanau, Yoshua Bengio

citations

Cite Score

84

AI summary

This paper analyzes neural machine translation using RNN Encoder-Decoder and gated recursive convolutional neural networks. The models are evaluated on the task of translation from French to English. Results show good performance on short sentences but degradation as the sentence length increases. The gated recursive convolutional network can learn grammatical structures.

Main Contributions

  • Analysis of Neural Machine Translation models: RNN Encoder-Decoder and gated recursive convolutional neural network.
  • Showed that neural machine translation performs well on short sentences but degrades rapidly as the length of the sentence increases.
  • Demonstrated that the gated recursive convolutional network learns a grammatical structure of a sentence automatically.
  • Experiments on English-to-French translation task.
  • Evaluation of translation performance using BLEU scores.

Abstract

Neural machine translation is a relatively new approach to statistical machine translation based purely on neural networks. The neural machine translation models often consist of an encoder and a decoder. The encoder extracts a fixed-length representation from a variable-length input sentence, and the decoder generates a correct translation from this representation. In this paper, we focus on analyzing the properties of the neural machine translation using two models; RNN Encoder–Decoder and a newly proposed gated recursive convolutional neural network. We show that the neural machine translation performs relatively well on short sentences without unknown words, but its performance degrades rapidly as the length of the sentence and the number of unknown words increase. Furthermore, we find that the proposed gated recursive convolutional network learns a grammatical structure of a sentence automatically.

Citation Graph

Loading graph...

References [12]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Kyunghyun Cho, B. V. Merrienboer, C. G. Gulcehre, D. Bahdanau, F. Bougares, Holger Schwenk, Yoshua Bengio - 2014

38 papers in library cite

Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014

58 papers in library cite

Matthew D. Zeiler - 2012

13 papers in library cite

Alex Graves - 2013

27 papers in library cite

Alex Graves - 2012

7 papers in library cite

N. Kalchbrenner, Phil Blunsom - 2013

27 papers in library cite

Pascal Vincent - 2013

2 papers in library cite

P. Koehn, F. J. Och, D. Marcu - 2003

8 papers in library cite

A. Axelrod, X. Fe, Jianfeng Gao - 2011

5 papers in library cite

C. L. Liu, D. Dahlmeier, H. T. Ng - 2011

1 paper in library cites

X. Song, T. Cohn, L. Specia - 2013

1 paper in library cites

Cited by

9

papers in your library

Cites

8

papers in your library

Read

on June 8, 2025

Your review

Tags

Paper Aliases

No aliases