2018
Cite Score
20
AI summary
This paper analyzes the use of pre-trained word embeddings in NMT tasks. It uses TED talks transcripts to create parallel corpus between English and three pairs of languages. It finds that pre-trained embeddings can provide gains of up to 20 BLEU points in favorable settings.
Main Contributions
Abstract
The performance of Neural Machine Translation (NMT) systems often suffers in low-resource scenarios where sufficiently large-scale parallel corpora cannot be obtained. Pre-trained word embeddings have proven to be invaluable for improving performance in natural language analysis tasks, which often suffer from paucity of data. However, their utility for NMT has not been extensively explored. In this work, we perform five sets of experiments that analyze when we can expect pre-trained word embeddings to help in NMT tasks. We show that such embeddings can be surprisingly effective in some cases – providing gains of up to 20 BLEU points in the most favorable setting.
Citation Graph
References [25]
D. P. Kingma, Jimmy Lei Ba - 2014
49 papers in library cite
Tomas Mikolov, Ilya Sutskever, K. Chen, G. S. Corrado, Jeffrey Dean - 2013
32 papers in library cite
D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014
59 papers in library cite
K. Papineni, S. Roukos, T. Ward, Wei Jing Zhu - 2002
19 papers in library cite
Yoshua Bengio - 2010
20 papers in library cite
Yoon Kim - 2014
8 papers in library cite
Tomas Mikolov - 2017
7 papers in library cite
Yonghui Wu, M. Schuster, Ziru Chen, Quoc V. Le, M. Norouzi, W. Macherey, M. Krikun, Yue Cao, Q. Gao, K. Macherey, J. Klingner, A. Shah, M. J. Johnson, Xiaodong Liu, Lukasz Kaiser, S. Gouws, Y. Kato, T. Kudo, H. Kazawa, K. Stevens, G. Kurian, N. Patil, Wenyi Wang, C. Young, J. Smith, J. Riesa, A. Rudnick, Oriol Vinyals, G. S. Corrado, M. Hughes, Jeffrey Dean - 2016
15 papers in library cite
Alexis Conneau, G. Lample, Marc'aurelio Ranzato, L. Denoyer, Hervé Jégou - 2018
3 papers in library cite
M. Artetxe, G. Labaka, E. Agirre, Kyunghyun Cho - 2017
4 papers in library cite
P. Ramachandran, P. J. Liu, Quoc V. Le - 2017
9 papers in library cite
M. J. Johnson, M. Schuster, Quoc V. Le, M. Krikun, Yonghui Wu, Ziru Chen, N. Thorat, F. B. Viegas, M. Wattenberg, G. S. Corrado, M. Hughes, Jeffrey Dean - 2017
7 papers in library cite
Tomas Mikolov, E. Grave, Piotr Bojanowski, C. Puhrsch, Armand Joulin - 2017
1 paper in library cites
D. He, Y. Xia, T. Qin, Lisa Wang, N. Yu, T. Liu, W. Y. Ma - 2016
2 papers in library cite
O. Firat, Kyunghyun Cho, Yoshua Bengio - 2016
2 papers in library cite
S. L. Smith, D. H. Turban, S. Hamblin, N. Y. Hammerla - 2017
4 papers in library cite
M. Denkowski, Graham Neubig - 2017
2 papers in library cite
G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, C. Dyer - 2016
4 papers in library cite
X. Ma, Eduard Hovy - 2016
3 papers in library cite
Y. Cheng, Weixin Xu, Z. He, Weiran He, H. Wu, Maosong Sun, Yibo Liu - 2016
2 papers in library cite
M. Neishi, J. Sakuma, S. Tohda, S. Ishiwatari, N. Yoshinaga, M. Toyoda - 2017
1 paper in library cites
M. A. D. Gangi, M. Federico - 2017
1 paper in library cites
P. H. Matthews - 1997
1 paper in library cites
G. Corbett, B. Comrie - 2003
1 paper in library cites
Graham Neubig, M. Sperber, Xinpeng Wang, M. Felix, A. Matthews, S. Padmanabhan, Y. Qi, D. S. Sachan, P. Arthur, P. Godard, J. Hewitt, R. Riad, Lisa Wang - 2018
1 paper in library cites
Cited by
1
papers in your library
Cites
17
papers in your library
Read
on October 23, 2025
Your review
Tags
Paper Aliases
No aliases