2018

Word Translation Without Parallel Data

Alexis Conneau, G. Lample, Marc'aurelio Ranzato, L. Denoyer, Hervé Jégou

citations

Cite Score

45

AI summary

This paper introduces a new unsupervised approach that leverages adversarial training and cross-domain similarity local scaling to learn cross-lingual word embeddings, achieving state-of-the-art results on word translation, sentence translation retrieval, and cross-lingual word similarity tasks on several language pairs using fastText embeddings.

Main Contributions

  • Introduces an unsupervised approach for cross-lingual word embeddings that matches or exceeds supervised methods.
  • Introduces a cross-domain similarity adaptation to address the hubness problem.
  • Proposes an unsupervised criterion for model selection correlated with mapping quality.
  • Releases high-quality dictionaries for 12 oriented language pairs and corresponding word embeddings.
  • Demonstrates effectiveness on a low-resource language pair (English-Esperanto).

Abstract

State-of-the-art methods for learning cross-lingual word embeddings have relied on bilingual dictionaries or parallel corpora. Recent studies showed that the need for parallel data supervision can be alleviated with character-level information. While these methods showed encouraging results, they are not on par with their supervised counterparts and are limited to pairs of languages sharing a common alphabet. In this work, we show that we can build a bilingual dictionary between two languages without using any parallel corpora, by aligning monolingual word embedding spaces in an unsupervised way. Without using any character information, our model even outperforms existing supervised methods on cross-lingual tasks for some language pairs. Our experiments demonstrate that our method works very well also for distant language pairs, like English-Russian or English-Chinese. We finally describe experiments on the English-Esperanto low-resource language pair, on which there only exists a limited amount of parallel data, to show the potential impact of our method in fully unsupervised machine translation. Our code, embeddings and dictionaries are publicly available.

Citation Graph

Loading graph...

References [52]

Sort:
Filter:

Ian J. Goodfellow, J. P. Abadie, M. Mirza, B. Xu, D. W. Farley, S. Ozair, Aaron Courville, Yoshua Bengio - 2014

2 papers in library cite

Tomas Mikolov, K. Chen, G. S. Corrado, Jeffrey Dean - 2013

26 papers in library cite

Tomas Mikolov, Ilya Sutskever, K. Chen, G. S. Corrado, Jeffrey Dean - 2013

32 papers in library cite

Jeffrey Pennington, Richard Socher, Christopher D. Manning - 2014

31 papers in library cite

Tomas Mikolov - 2017

7 papers in library cite

Tomas Mikolov, Quoc V. Le, Ilya Sutskever - 2013

6 papers in library cite

Yaroslav Ganin, E. Ustinova, H. Ajakan, P. Germain, Hugo Larochelle, F. Laviolette, M. Marchand, Victor Lempitsky - 2016

1 paper in library cites

J. Johnson, M. Douze, Hervé Jégou - 2017

4 papers in library cite

I. Goodfellow - 2016

1 paper in library cites

Omer Levy, Y. Goldberg - 2014

4 papers in library cite

W. Zou, Richard Socher, D. Cer, C. Manning - 2013

4 papers in library cite

S. L. Smith, D. H. Turban, S. Hamblin, N. Y. Hammerla - 2017

4 papers in library cite

M. Artetxe, G. Labaka, E. Agirre - 2017

2 papers in library cite

M. Artetxe, G. Labaka, E. Agirre - 2016

2 papers in library cite

Mingchuan Zhang, Yibo Liu, H. Luan, Maosong Sun - 2017

2 papers in library cite

A. Lazaridou, G. Dinu, M. Baroni - 2015

2 papers in library cite

R. Parker, D. Graff, J. Kong, K. Chen, K. Maeda - 2011

5 papers in library cite

T. Luong, Richard Socher, Christopher D. Manning - 2013

4 papers in library cite

Z. Harris - 1954

3 papers in library cite

S. Gouws, Yoshua Bengio, G. Corrado - 2015

2 papers in library cite

S. Ravi, K. Knight - 2011

2 papers in library cite

Manaal Faruqui, C. Dyer - 2014

2 papers in library cite

P. Koehn, K. Knight - 2002

2 papers in library cite

A. Haghighi, Percy Liang, T. B. Kirkpatrick, Dan Klein - 2008

2 papers in library cite

W. Ammar, G. Mulcaire, Y. Tsvetkov, G. Lample, C. Dyer, Noah A. Smith - 2016

2 papers in library cite

C. Xing, D. Wang, C. L. Liu, Yutong Lin - 2015

2 papers in library cite

J. Tiedemann - 2012

2 papers in library cite

J. C. Collados, M. T. Pilehvar, N. Collier, R. Navigli - 2017

2 papers in library cite

Y. Rubner, C. Tomasi, L. J. Guibas - 2000

2 papers in library cite

Q. Dou, Ashish Vaswani, K. Knight, C. Dyer - 2015

2 papers in library cite

H. Cao, T. Z. Zhao, S. Zhang, Y. Meng - 2016

1 paper in library cites

P. H. Schonemann - 1966

1 paper in library cites

Hervé Jégou, Cordelia Schmid, H. Harzallah, J. Verbeek - 2010

1 paper in library cites

S. Umeyama - 1988

1 paper in library cites

Pascale Fung, L. Y. Yee - 1998

1 paper in library cites

G. Kondrak, B. Hauer, G. Nicolai - 2017

1 paper in library cites

Pascale Fung - 1995

1 paper in library cites

N. Pourdamghani, K. Knight - 2017

1 paper in library cites

Mingchuan Zhang, Yibo Liu, H. Luan, Maosong Sun - 2017

1 paper in library cites

M. Radovanovic, A. Nanopoulos, M. Ivanovic - 2010

1 paper in library cites

R. Rapp - 1995

1 paper in library cites

G. D'souza, N. Vasic, A. R. Soriano, Yann Dauphin, M. Florescu - 2015

1 paper in library cites

A. Klementiev, I. Titov, B. Bhattarai - 2012

1 paper in library cites

C. Schafer, D. Yarowsky - 2002

1 paper in library cites

L. Duong, H. Kanayama, T. Ma, S. Bird, T. Cohn - 2016

1 paper in library cites

J. C. Collados, M. T. Pilehvar, R. Navigli - 2016

1 paper in library cites

M. Cisse, Piotr Bojanowski, E. Grave, Yann Dauphin, Nicolas Usunier - 2017

1 paper in library cites

L. Z. Manor, Pietro Perona - 2005

1 paper in library cites

A. Irvine, Chris Callison Burch - 2013

1 paper in library cites

M. Baroni, S. Bernardini, A. Ferraresi, E. Zanchetta - 2009

1 paper in library cites

Cited by

3

papers in your library

Cites

16

papers in your library

Read

on November 3, 2025

Your review

Tags

Paper Aliases

No aliases