2017

Learned in Translation: Contextualized Word Vectors

B. Mccann, J. Bradbury, Caiming Xiong, Richard Socher

citations

Cite Score

42

AI summary

This paper introduces a method to contextualize word vectors (CoVe) using a deep LSTM encoder from a machine translation model, demonstrating improved performance on sentiment analysis, question classification, entailment, and question answering tasks, achieving state-of-the-art results on fine-grained sentiment analysis and entailment.

Main Contributions

  • Introduces context vectors (CoVe) from a MT-LSTM to improve NLP task performance.
  • Demonstrates that CoVe improves performance over using unsupervised word and character vectors on various NLP tasks.
  • Achieves state-of-the-art performance on fine-grained sentiment analysis and entailment tasks using CoVe.
  • Shows that the quantity of MT training data positively correlates with downstream task performance.
  • Presents a general biattentive classification network (BCN) for testing CoVe transferability.

Abstract

Computer vision has benefited from initializing multiple deep layers with weights pretrained on large supervised training sets like ImageNet. Natural language processing (NLP) typically sees initialization of only the lowest layer of deep models with pretrained word vectors. In this paper, we use a deep LSTM encoder from an attentional sequence-to-sequence model trained for machine translation (MT) to contextualize word vectors. We show that adding these context vectors (CoVe) improves performance over using only unsupervised word and character vectors on a wide variety of common NLP tasks: sentiment analysis (SST, IMDb), question classification (TREC), entailment (SNLI), and question answering (SQuAD). For fine-grained sentiment analysis and entailment, CoVe improves performance of our baseline models to the state of the art.

Citation Graph

Loading graph...

References [71]

Sort:
Filter:

K. He, X. Zhang, S. Ren, Jian Sun - 2016

20 papers in library cite

K. Simonyan, Andrew Zisserman - 2014

20 papers in library cite

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

J. Deng, W. Dong, Richard Socher, L. J. Li, K. Li, Li Fei Fei - 2009

28 papers in library cite

S. Ioffe, Christian Szegedy - 2015

18 papers in library cite

Tomas Mikolov, K. Chen, G. S. Corrado, Jeffrey Dean - 2013

26 papers in library cite

Jeffrey Pennington, Richard Socher, Christopher D. Manning - 2014

31 papers in library cite

Ross Girshick, J. Donahue, Trevor Darrell, Jitendra Malik - 2014

18 papers in library cite

D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014

59 papers in library cite

Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014

58 papers in library cite

V. Nair, Geoffrey E. Hinton - 2010

18 papers in library cite

T. Luong, H. Pham, Christopher D. Manning - 2015

15 papers in library cite

Richard Socher, A. Perelygin, Jeffrey Wu, J. Chuang, C. Manning, A. Ng, Christopher Potts - 2013

24 papers in library cite

Ronan Collobert, Jason Weston, Leon Bottou, M. Karlen, Koray Kavukcuoglu, P. P. Kuksa - 2011

23 papers in library cite

P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016

37 papers in library cite

A. L. Maas, R. E. Daly, P. T. Pham, Dong Huang, Andrew Y. Ng, Christopher Potts - 2011

12 papers in library cite

Samuel R. Bowman, G. Angeli, Christopher Potts, Christopher D. Manning - 2015

25 papers in library cite

R. Nallapati, B. Zhou, C. N. D. Santos, C. G. Gulcehre, Bing Xiang - 2016

10 papers in library cite

R. Kiros, Yuxuan Zhu, Ruslan Salakhutdinov, Richard S. Zemel, R. Urtasun, Antonio Torralba, Sanja Fidler - 2015

23 papers in library cite

Yoshua Bengio - 2013

17 papers in library cite

Alexis Conneau, Douwe Kiela, Holger Schwenk, L. Barrault, Antoine Bordes - 2017

11 papers in library cite

M. Seo, A. Kembhavi, Ali Farhadi, Hananneh Hajishirzi - 2017

13 papers in library cite

A. P. Parikh, O. Tackstrom, Dipanjan Das, Jakob Uszkoreit - 2016

11 papers in library cite

A. M. Dai, Quoc V. Le - 2015

27 papers in library cite

F. Hill, Kyunghyun Cho, Anna Korhonen - 2016

12 papers in library cite

Alec Radford, R. Jozefowicz, Ilya Sutskever - 2017

8 papers in library cite

P. Ramachandran, P. J. Liu, Quoc V. Le - 2017

9 papers in library cite

Alex Graves, Jürgen Schmidhuber - 2005

14 papers in library cite

G. Klein, Yoon Kim, Y. Deng, J. Senellart, A. Rush - 2017

4 papers in library cite

A. Fukui, D. H. Park, Diyi Yang, A. Rohrbach, Trevor Darrell, M. Rohrbach - 2016

2 papers in library cite

A. Kumar, O. Irsoy, P. Ondruska, M. Iyyer, J. Bradbury, I. Gulrajani, Victor Zhong, R. Paulus, Richard Socher - 2015

9 papers in library cite

Wenyi Wang, N. Yang, F. Wei, B. Chang, M. Zhou - 2017

5 papers in library cite

J. Wieting, Mohit Bansal, Kevin Gimpel, K. A. Livescu, K. Karen - 2015

4 papers in library cite

Shijie Wang, J. J. Jiang - 2017

6 papers in library cite

P. Koehn, H. Hoang, Alexandra Birch, Chris Callison Burch, M. Federico, N. Bertoldi, B. Cowan, W. Shen, C. Moran, R. Zens, C. Dyer, O. Bojar, A. Constantin, E. Herbst - 2007

8 papers in library cite

K. Hashimoto, Caiming Xiong, Y. Tsuruoka, Richard Socher - 2016

5 papers in library cite

Caiming Xiong, S. Merity, Richard Socher - 2016

5 papers in library cite

Richard Socher, Quoc Le, C. Manning, A. Ng - 2014

5 papers in library cite

E. M. Voorhees, D. M. Tice - 1999

5 papers in library cite

T. Miyato, A. M. Dai, I. Goodfellow - 2016

4 papers in library cite

H. Yu, T. Munkhdalai - 2017

4 papers in library cite

H. Yu, T. Munkhdalai - 2017

4 papers in library cite

S. Min, M. J. Seo, Hananneh Hajishirzi - 2017

4 papers in library cite

E. Agirre, C. Banea, C. Cardie, D. M. Cer, M. T. Diab, A. G. Agirre, W. Guo, R. Mihalcea, G. Rigau, J. Wiebe - 2014

4 papers in library cite

L. Mou, H. Peng, G. Li, Yiheng Xu, Li Zhang, Z. Jin - 2015

3 papers in library cite

Caiming Xiong, Victor Zhong, Richard Socher - 2017

3 papers in library cite

Y. Yu, Wenxuan Zhang, K. S. Hasan, M. Yu, Bing Xiang, B. Zhou - 2016

3 papers in library cite

J. P. C. G. D. Silva, L. Coheur, A. C. Mendes, A. Wichert - 2011

3 papers in library cite

P. Zhou, Z. Qi, S. Zheng, Jiacheng Xu, H. Bao, B. Xu - 2016

3 papers in library cite

A. Dieng, A. B., Caitlin Wang, Jianfeng Gao, J. A. Paisley, J. John - 2016

3 papers in library cite

K. Saenko, B. Kulis, M. Fritz, Trevor Darrell - 2010

2 papers in library cite

M. Looks, M. Herreshoff, M. Hutchins, D. A. Norvig, P. Peter - 2017

2 papers in library cite

Samuel R. Bowman, Christopher Potts, Christopher D. Manning - 2015

2 papers in library cite

R. Johnson, Tong Zhang - 2016

2 papers in library cite

M. Cettolo, J. Niehues, S. Stuker, L. Bentivogli, R. Cattoni, M. J. Federico - 2016

2 papers in library cite

B. Paria, K. M. Annervaz, A. Dukkipati, A. Chatterjee, S. Podder - 2016

1 paper in library cites

L. Specia, S. Frank, K. Sima'an, D. Elliott - 2016

1 paper in library cites

Robert Zhang, Honglak Lee, D. R. Radev - 2016

1 paper in library cites

M. Huang, Q. Qian, X. Zhu - 2017

1 paper in library cites

H. Guo, C. Cherry, Jianlin Su - 2017

1 paper in library cites

Qinlang Chen, X. D. Zhu, Z. H. Ling, S. Wei, H. Jiang - 2016

1 paper in library cites

Y. Qi, S. Zhang, Lianhui Qin, H. Yao, Q. Huang, J. Lim, M. H. Yang - 2016

1 paper in library cites

Yuxuan Zhu, Yanru Chen, Z. L. Lu, Sinno Jialin Pan, G. R. Xue, Y. Yu, Qiang Yang - 2011

1 paper in library cites

H. T. Madabushi, M. Lee - 2016

1 paper in library cites

N. V. Tu, L. A. Cuong - 2016

1 paper in library cites

J. Lu, Caiming Xiong, D. Parikh, Richard Socher - 2016

1 paper in library cites

L. Dong, Mirella Lapata - 2016

1 paper in library cites

Xiang Lisa Li, Dan Roth - 2006

1 paper in library cites

B. Loni, G. V. Tulder, P. Wiggers, D. M. J. Tax, M. Loog - 2011

1 paper in library cites

L. Sha, B. Chang, Zhifang Sui, Shanda Li - 2016

1 paper in library cites

F. Hill, Kyunghyun Cho, S. Jean, Yoshua Bengio - 2017

1 paper in library cites

Cited by

14

papers in your library

Cites

34

papers in your library

Read

on October 23, 2025

Your review

Tags

Paper Aliases

No aliases