2017

Supervised Learning of Universal Sentence Representations From Natural Language Inference Data

Alexis Conneau, Douwe Kiela, Holger Schwenk, L. Barrault, Antoine Bordes


AI summary

This paper learns universal sentence representations by training a BiLSTM-based encoder on the SNLI dataset, achieving state-of-the-art results on a variety of transfer tasks and outperforming unsupervised methods like SkipThought.

Main Contributions

  • Demonstrates that supervised learning on the SNLI dataset can produce high-quality universal sentence representations.
  • Shows that a BiLSTM architecture with max pooling, trained on SNLI, outperforms unsupervised methods like SkipThought on various transfer tasks.
  • Introduces a sentence evaluation toolkit called SentEval for automating evaluation on transfer tasks.
  • Achieves state-of-the-art results on several transfer tasks using sentence embeddings trained on SNLI.
  • Shows that training sentence encoders on a broader-coverage dataset (MultiNLI) yields even better general-purpose representations.
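
The "BiLSTM with max pooling" encoder named in the contributions above can be illustrated with a minimal sketch. It assumes the per-token BiLSTM outputs are already available (the numbers below are made up, not taken from a trained model) and shows only the pooling step that turns them into one fixed-size sentence vector. The pair-feature combination (u, v, |u - v|, u * v) is not stated in this summary; it is included as an assumption about how the two sentence vectors are fed to an NLI classifier. The names `max_pool` and `nli_features` are hypothetical.

```python
from typing import List

Vector = List[float]


def max_pool(hidden_states: List[Vector]) -> Vector:
    """Element-wise max over time steps: one value per hidden dimension."""
    if not hidden_states:
        raise ValueError("need at least one time step")
    # zip(*...) regroups the values by hidden dimension instead of by token.
    return [max(dim_values) for dim_values in zip(*hidden_states)]


def nli_features(u: Vector, v: Vector) -> Vector:
    """Concatenate u, v, |u - v| and u * v (assumed pair features)."""
    abs_diff = [abs(a - b) for a, b in zip(u, v)]
    product = [a * b for a, b in zip(u, v)]
    return u + v + abs_diff + product


# Hypothetical BiLSTM outputs: one 4-dimensional vector per token.
premise_states = [
    [0.1, -0.4, 0.7, 0.0],
    [0.5, 0.2, -0.3, 0.9],
    [-0.2, 0.6, 0.1, 0.3],
]
hypothesis_states = [
    [0.3, 0.3, 0.3, 0.3],
    [0.0, 0.8, -0.5, 0.1],
]

u = max_pool(premise_states)     # sentence vector for the premise
v = max_pool(hypothesis_states)  # sentence vector for the hypothesis
features = nli_features(u, v)    # input to a (not shown) 3-way classifier

print(u)
print(v)
print(len(features))
```

Because max pooling reduces over the time axis, sentences of different lengths (three and two tokens here) map to vectors of the same size, which is what lets the encoder be reused as a fixed feature extractor on transfer tasks.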

Abstract

Many modern NLP systems rely on word embeddings, previously trained in an unsupervised manner on large corpora, as base features. Efforts to obtain embeddings for larger chunks of text, such as sentences, have however not been so successful. Several attempts at learning unsupervised representations of sentences have not reached satisfactory enough performance to be widely adopted. In this paper, we show how universal sentence representations trained using the supervised data of the Stanford Natural Language Inference datasets can consistently outperform unsupervised methods like SkipThought vectors on a wide range of transfer tasks. Much like how computer vision uses ImageNet to obtain features, which can then be transferred to other tasks, our work tends to indicate the suitability of natural language inference for transfer learning to other NLP tasks. Our encoder is publicly available.


