2015

Skip-Thought Vectors

R. Kiros, Yuxuan Zhu, Ruslan Salakhutdinov, Richard S. Zemel, R. Urtasun, Antonio Torralba, Sanja Fidler

citations

Cite Score

65

AI summary

This paper introduces skip-thoughts, a novel unsupervised sentence encoder, trained on the BookCorpus dataset. The encoder-decoder model reconstructs surrounding sentences, mapping sentences with similar properties to similar vector representations. Evaluated on 8 tasks, skip-thoughts demonstrate robust and generic sentence representations.

Main Contributions

  • Introduces skip-thoughts, an unsupervised sentence encoder that learns generic sentence representations by predicting surrounding sentences.
  • Presents a vocabulary expansion method to encode words not seen during training, expanding the vocabulary to a million words.
  • Evaluates skip-thought vectors on 8 tasks using linear models, demonstrating robust performance across semantic relatedness, paraphrase detection, and image-sentence ranking.
  • Achieves comparable results to LSTM models trained specifically for semantic relatedness tasks, highlighting the effectiveness of unsupervised pre-training.
  • Demonstrates that combining skip-thoughts with bag-of-words features results in a strong baseline for text classification tasks.

Abstract

We describe an approach for unsupervised learning of a generic, distributed sentence encoder. Using the continuity of text from books, we train an encoder-decoder model that tries to reconstruct the surrounding sentences of an encoded passage. Sentences that share semantic and syntactic properties are thus mapped to similar vector representations. We next introduce a simple vocabulary expansion method to encode words that were not seen as part of training, allowing us to expand our vocabulary to a million words. After training our model, we extract and evaluate our vectors with linear models on 8 tasks: semantic relatedness, paraphrase detection, image-sentence ranking, question-type classification and 4 benchmark sentiment and subjectivity datasets. The end result is an off-the-shelf encoder that can produce highly generic sentence representations that are robust and perform well in practice. We will make our encoder publicly available.

Citation Graph

Loading graph...

References [42]

Sort:
Filter:

D. P. Kingma, Jimmy Lei Ba - 2014

49 papers in library cite

K. Simonyan, Andrew Zisserman - 2014

20 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

T. Y. Lin, M. Maire, S. Belongie, James Hays, Pietro Perona, D. Ramanan, Piotr Dollar, C. L. Zitnick - 2014

14 papers in library cite

Tomas Mikolov, K. Chen, G. S. Corrado, Jeffrey Dean - 2013

26 papers in library cite

Geoffrey Hinton - 2008

7 papers in library cite

D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014

59 papers in library cite

Kyunghyun Cho, B. V. Merrienboer, C. G. Gulcehre, D. Bahdanau, F. Bougares, Holger Schwenk, Yoshua Bengio - 2014

38 papers in library cite

Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014

58 papers in library cite

Yoon Kim - 2014

8 papers in library cite

J. Chung, C. G. Gulcehre, Kyunghyun Cho, Yoshua Bengio - 2014

11 papers in library cite

Quoc Le, Tomas Mikolov - 2014

13 papers in library cite

Richard Socher, A. Perelygin, Jeffrey Wu, J. Chuang, C. Manning, A. Ng, Christopher Potts - 2013

24 papers in library cite

Kyunghyun Cho, B. V. Merrienboer, D. Bahdanau, Yoshua Bengio - 2014

9 papers in library cite

Phil Blunsom, Edward Grefenstette, N. Kalchbrenner - 2014

7 papers in library cite

Yuxuan Zhu, R. Kiros, R. Zemel, Ruslan Salakhutdinov, R. Urtasun, Antonio Torralba, Sanja Fidler - 2015

18 papers in library cite

Surya Ganguli - 2014

9 papers in library cite

N. Kalchbrenner, Phil Blunsom - 2013

27 papers in library cite

Tomas Mikolov, Quoc V. Le, Ilya Sutskever - 2013

6 papers in library cite

Richard Socher, Eric H. Huang, J. Pennin, C. Manning, A. Ng - 2011

10 papers in library cite

A. Karpathy, Li Fei Fei - 2014

6 papers in library cite

K. S. Tai, Richard Socher, Christopher D. Manning - 2015

6 papers in library cite

B. Dolan, C. Quirk, C. A. Brockett, C. Chris - 2004

5 papers in library cite

Bo Pang, L. A. Lee, L. Lillian - 2004

8 papers in library cite

J. Wiebe, T. Wilson, T. Theresa, C. A. Cardie, C. Claire - 2005

7 papers in library cite

Shijie Wang, Manning, C. Christopher - 2012

7 papers in library cite

Marco Marelli, L. Bentivogli, M. Baroni, R. Bernardi, S. Menini, R. Zamparelli - 2014

7 papers in library cite

M. Hu, B. A. Liu, B. Bing - 2004

6 papers in library cite

Richard Socher, Quoc Le, C. Manning, A. Ng - 2014

5 papers in library cite

A. Lai, J. Hockenmaier - 2014

5 papers in library cite

J. Mao, Weixin Xu, Yining Yang, J. Wang, A. Yuille - 2014

3 papers in library cite

Xiang Lisa Li, Dan Roth - 2002

3 papers in library cite

Dipanjan Das, Noah A. Smith - 2009

3 papers in library cite

H. Zhao, Z. L. Lu, Z. A. Poupart, P. Pascal - 2015

3 papers in library cite

J. Zhao, T. T. Zhu, M. Lan - 2014

2 papers in library cite

N. Madnani, J. Tetreault, J. A. Chodorow, M. M. K. Martin - 2012

2 papers in library cite

S. Jimenez, G. Duenas, J. Baquero, A. Gelbukh, A. J. D. Batiz, A. Mendiz'abal - 2014

2 papers in library cite

S. Wan, M. Dras, R. Dale, C. Paris - 2006

2 papers in library cite

B. Klein, G. Lev, G. Sadeh, Lior Wolf - 2015

1 paper in library cites

J. Bjerva, J. Bos, R. V. D. Goot, M. Nissim - 2014

1 paper in library cites

A. Finch, Y. Hwang, E. Sumita - 2005

1 paper in library cites

Cited by

23

papers in your library

Cites

22

papers in your library

Read

on August 4, 2025

Your review

Tags

Paper Aliases

No aliases