2018

When and Why Are Pre-Trained Word Embeddings Useful for Neural Machine Translation?

Graham Neubig

AI summary

This paper analyzes when pre-trained word embeddings help in neural machine translation (NMT). It uses TED talk transcripts to build parallel corpora between English and three pairs of languages. It finds that pre-trained embeddings can provide gains of up to 20 BLEU points in the most favorable setting.

Main Contributions

  • It examines the effectiveness of pre-trained word embeddings for NMT across several languages.
  • It analyzes whether pre-training is more effective when the source and target languages are similar.
  • It investigates whether aligning the word embeddings improves performance.
  • It studies the impact of pre-training in multilingual translation systems.
  • It identifies a sweet spot in low-resource scenarios where pre-trained word embeddings are most effective.

Abstract

The performance of Neural Machine Translation (NMT) systems often suffers in low-resource scenarios where sufficiently large-scale parallel corpora cannot be obtained. Pre-trained word embeddings have proven to be invaluable for improving performance in natural language analysis tasks, which often suffer from paucity of data. However, their utility for NMT has not been extensively explored. In this work, we perform five sets of experiments that analyze when we can expect pre-trained word embeddings to help in NMT tasks. We show that such embeddings can be surprisingly effective in some cases – providing gains of up to 20 BLEU points in the most favorable setting.
