2006

Continuous Space Language Models for Statistical Machine Translation

Holger Schwenk, D. Dchelotte, Jean Luc Gauvain

citations

Cite Score

8

AI summary

This paper introduces a continuous space language model using neural networks for statistical machine translation, achieving BLEU score improvements on European Parliament Speeches data by effectively modeling word representations and probability estimation, particularly when combined with lattice rescoring techniques.

Main Contributions

  • Proposed a continuous space language model based on neural networks for statistical machine translation.
  • Demonstrated improved generalization to unknown n-grams through smooth probability functions of word representations.
  • Achieved consistent improvements in BLEU scores on the translation of European Parliament Speeches using the proposed method.
  • Presented algorithms to improve language model probability estimation by splitting long sentences into shorter chunks.
  • Showed that the continuous space language model can be trained on large corpora and used to rescore translation lattices.

Abstract

Statistical machine translation systems are based on one or more translation models and a language model of the target language. While many different translation models and phrase extraction algorithms have been proposed, a standard word n-gram back-off language model is used in most systems. In this work, we propose to use a new statistical language model that is based on a continuous representation of the words in the vocabulary. A neural network is used to perform the projection and the probability estimation. We consider the translation of European Parliament Speeches. This task is part of an international evaluation organized by the TC-STAR project in 2006. The proposed method achieves consistent improvements in the BLEU score on the development and test data. We also present algorithms to improve the estimation of the language model probabilities when splitting long sentences into shorter chunks.

Citation Graph

Loading graph...

References [17]

Sort:
Filter:

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Andreas Stolcke - 2002

13 papers in library cite

A. L. Berger, S. A. D. Pietra, Vincent J. Della Pietra - 1996

10 papers in library cite

Holger Schwenk, Jean Luc Gauvain - 2005

7 papers in library cite

Holger Schwenk - 2004

6 papers in library cite

P. F. Brown, S. D. Pietra, Vincent J. Della Pietra, R. L. Mercer - 1993

7 papers in library cite

A. Emami, Frederick Jelinek - 2005

4 papers in library cite

P. Xu, Frederick Jelinek - 2004

2 papers in library cite

F. Och, D. Gildea, Sanjeev Khudanpur, A. Sarkar, K. Yamada, A. Fraser, S. Kumar, L. Shen, D. Smith, K. Eng, V. Jain, Z. Jin, D. R. Radev - 2004

1 paper in library cites

C. Gollan, M. Bisani, S. Kanthak, R. Schlueter, Hermann Ney - 2005

1 paper in library cites

F. Och, Hermann Ney - 2002

1 paper in library cites

K. Kirchhoff, Michael Yang - 2005

1 paper in library cites

S. Hasan, O. Bender, Hermann Ney - 2006

1 paper in library cites

E. Charniak, K. Knight, K. Yamada - 2003

1 paper in library cites

D. Dechelotte, Holger Schwenk, J. Gauvain - 2006

1 paper in library cites

Cited by

5

papers in your library

Cites

5

papers in your library

Read

on April 25, 2025

Your review

Tags

Paper Aliases

No aliases