2013

Recurrent Continuous Translation Models

N. Kalchbrenner, Phil Blunsom

citations

Cite Score

54

AI summary

This paper introduces Recurrent Continuous Translation Models (RCTM) using continuous representations for words, phrases, and sentences, conditioned by a Convolutional Sentence Model, achieving over 43% lower perplexity than state-of-the-art alignment-based translation models and demonstrating sensitivity to word order and syntax.

Main Contributions

  • Introduces Recurrent Continuous Translation Models (RCTM) for sentence-level translation.
  • Uses continuous representations for words, phrases, and sentences without relying on alignments.
  • Models translation generation with a target Recurrent Language Model.
  • Conditions translation on the source sentence using a Convolutional Sentence Model.
  • Achieves a perplexity > 43% lower than state-of-the-art alignment-based translation models.

Abstract

We introduce a class of probabilistic con tinuous translation models called Recurrent Continuous Translation Models that are purely based on continuous representations for words, phrases and sentences and do not rely on alignments or phrasal translation units. The models have a generation and a conditioning aspect. The generation of the transla tion is modelled with a target Recurrent Language Model, whereas the conditioning on the source sentence is modelled with a Convolu tional Sentence Model. Through various ex periments, we show first that our models ob tain a perplexity with respect to gold transla tions that is > 43% lower than that of state of-the-art alignment-based translation models. Secondly, we show that they are remarkably sensitive to the word order, syntax, and mean ing of the source sentence despite lacking alignments. Finally we show that they match a state-of-the-art system when rescoring n-best lists of translations.

Citation Graph

Loading graph...

References [18]

Sort:
Filter:

John Duchi, Elad Hazan, Yoram Singer - 2011

19 papers in library cite

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Tomas Mikolov, M. Karafiat, Lukas Burget, Jan Cernocky, Sanjeev Khudanpur - 2010

36 papers in library cite

Ronan Collobert, Jason Weston - 2008

32 papers in library cite

Ilya Sutskever, James Martens, Geoffrey E. Hinton - 2011

13 papers in library cite

Tomas Mikolov, S. Kombrink, Lukas Burget, Jan Cernocky, Sanjeev Khudanpur - 2011

16 papers in library cite

Richard Socher, Eric H. Huang, J. Pennin, C. Manning, A. Ng - 2011

10 papers in library cite

Tomas Mikolov, Geoffrey Zweig - 2012

12 papers in library cite

Holger Schwenk - 2012

5 papers in library cite

Holger Schwenk, D. Dchelotte, Jean Luc Gauvain - 2006

5 papers in library cite

Richard Socher, B. Huval, Christopher D. Manning, Andrew Y. Ng - 2012

7 papers in library cite

P. F. Brown, S. D. Pietra, Vincent J. Della Pietra, R. L. Mercer - 1993

7 papers in library cite

C. Dyer, V. Chahuneau, Noah A. Smith - 2013

4 papers in library cite

L. H. Son, A. Allauzen, F. Yvon - 2012

4 papers in library cite

K. Hermann, Phil Blunsom - 2013

3 papers in library cite

C. Dyer, J. Weese, H. Setiawan, A. Lopez, F. Ture, V. Eidelman, J. Ganitkevitch, Phil Blunsom, P. Resnik - 2010

2 papers in library cite

N. Kalchbrenner, Phil Blunsom - 2013

2 papers in library cite

Edward Grefenstette, M. Sadrzadeh, S. Clark, B. Coecke, S. Pulman - 2011

1 paper in library cites

Cited by

27

papers in your library

Cites

10

papers in your library

Read

on April 29, 2025

Your review

Tags

Paper Aliases

No aliases