2017

A Structured Self-Attentive Sentence Embedding

Zongyu Lin, M. Feng, C. D. Santos, M. Yu, Bing Xiang, B. Zhou, Yoshua Bengio

citations

Cite Score

63

AI summary

This paper introduces a self-attention mechanism to create a fixed-size matrix sentence embedding, enabling interpretability and improved performance on author profiling, sentiment classification, and textual entailment tasks, achieving significant gains compared to other sentence embedding methods.

Main Contributions

  • Introduces a self-attention mechanism for fixed-size matrix sentence embeddings.
  • Presents a 2-D matrix embedding where each row attends to a different part of the sentence.
  • Proposes a special regularization term to encourage diversity in attention.
  • Provides a visualization method to understand what the embedding encodes.
  • Achieves significant performance gains on author profiling, sentiment classification, and textual entailment tasks.

Abstract

This paper proposes a new model for extracting an interpretable sentence embedding by introducing self-attention. Instead of using a vector, we use a 2-D matrix to represent the embedding, with each row of the matrix attending on a different part of the sentence. We also propose a self-attention mechanism and a special regularization term for the model. As a side effect, the embedding comes with an easy way of visualizing what specific parts of the sentence are encoded into the embedding. We evaluate our model on 3 different tasks: author profiling, sentiment classification and textual entailment. Results show that our model yields a significant performance gain compared to other sentence embedding methods in all of the 3 tasks.

Citation Graph

Loading graph...

References [36]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Tomas Mikolov, K. Chen, G. S. Corrado, Jeffrey Dean - 2013

26 papers in library cite

Jeffrey Pennington, Richard Socher, Christopher D. Manning - 2014

31 papers in library cite

Yoon Kim - 2014

8 papers in library cite

J. Chung, C. G. Gulcehre, Kyunghyun Cho, Yoshua Bengio - 2014

11 papers in library cite

Quoc Le, Tomas Mikolov - 2014

13 papers in library cite

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Richard Socher, A. Perelygin, Jeffrey Wu, J. Chuang, C. Manning, A. Ng, Christopher Potts - 2013

24 papers in library cite

Samuel R. Bowman, G. Angeli, Christopher Potts, Christopher D. Manning - 2015

25 papers in library cite

Phil Blunsom, Edward Grefenstette, N. Kalchbrenner - 2014

7 papers in library cite

R. Kiros, Yuxuan Zhu, Ruslan Salakhutdinov, Richard S. Zemel, R. Urtasun, Antonio Torralba, Sanja Fidler - 2015

23 papers in library cite

A. P. Parikh, O. Tackstrom, Dipanjan Das, Jakob Uszkoreit - 2016

11 papers in library cite

Richard Socher, Jeffrey Pennington, Eric H. Huang, Andrew Y. Ng, Christopher D. Manning - 2011

10 papers in library cite

Mirella Lapata - 2016

8 papers in library cite

F. Hill, Kyunghyun Cho, Anna Korhonen - 2016

12 papers in library cite

K. S. Tai, Richard Socher, Christopher D. Manning - 2015

6 papers in library cite

I. Vendrov, R. Kiros, Sanja Fidler, R. Urtasun - 2016

4 papers in library cite

S. Bowman, J. Gauthier, Abhinav Rastogi, R. Gupta, C. Manning, Christopher Potts - 2016

5 papers in library cite

H. Yu, T. Munkhdalai - 2017

4 papers in library cite

H. Yu, T. Munkhdalai - 2017

4 papers in library cite

P. L. Li, Wentao Li, Z. He, Xinpeng Wang, Yue Cao, Jingren Zhou, Weixin Xu - 2016

3 papers in library cite

L. Mou, H. Peng, G. Li, Yiheng Xu, Li Zhang, Z. Jin - 2015

3 papers in library cite

L. Mou, M. Rui, G. Li, Yiheng Xu, Li Zhang, R. Yan, Z. Jin - 2016

3 papers in library cite

C. D. Santos, M. Tan, Bing Xiang, B. Zhou - 2016

2 papers in library cite

W. Yin, Hinrich Schutze - 2015

2 papers in library cite

C. N. D. Santos, M. Gatti - 2014

2 papers in library cite

Yibo Liu, C. Sun, L. Lin, Xinpeng Wang - 2016

2 papers in library cite

W. Ling, L. C. Cheng, Y. Tsvetkov, S. Amir - 2015

2 papers in library cite

T. T. D. Team, R. A. Rfou, G. Alain, Amjad Almahairi, C. Angermueller, D. Bahdanau, Nicolas Ballas, F. Bastien, J. Bayer, A. Belikov - 2016

2 papers in library cite

H. Margarit, R. Subramaniam - 2016

1 paper in library cites

M. Feng, Bing Xiang, M. R. Glass, Lisa Wang, B. Zhou - 2015

1 paper in library cites

H. Palangi, L. Deng, Y. Shen, Jianfeng Gao, X. He, Jixuan Chen, X. Song, R. Ward - 2016

1 paper in library cites

M. Ma, L. Huang, Bing Xiang, B. Zhou - 2015

1 paper in library cites

M. Tan, C. D. Santos, Bing Xiang, B. Zhou - 2016

1 paper in library cites

R. Memisevic - 2013

1 paper in library cites

J. Y. Lee, F. Dernoncourt - 2016

1 paper in library cites

Cited by

2

papers in your library

Cites

17

papers in your library

Read

on August 6, 2025

Your review

Tags

Paper Aliases

No aliases