2018

Deep Contextualized Word Representations

M. E. Peters, M. Neumann, M. Iyyer, Matt Gardner, C. Clark, K. Lee, L. S. Zettlemoyer

citations

Cite Score

90

AI summary

This paper introduces ELMo (Embeddings from Language Models), a novel deep contextualized word representation using bidirectional LSTMs, pre-trained on a large corpus, that significantly improves state-of-the-art results on six NLP tasks.

Main Contributions

  • Introduces ELMo, a new type of deep contextualized word representation.
  • ELMo word vectors are learned functions of the internal states of a deep bidirectional language model (biLM).
  • Demonstrates that ELMo representations can be easily added to existing models.
  • Achieves significant improvements on six challenging NLP problems: question answering, textual entailment, sentiment analysis, semantic role labeling, coreference resolution, and named entity extraction.
  • Shows that exposing the deep internals of the pre-trained network is crucial.

Abstract

We introduce a new type of deep contextualized word representation that models both (1) complex characteristics of word use (e.g., syntax and semantics), and (2) how these uses vary across linguistic contexts (i.e., to model polysemy). Our word vectors are learned functions of the internal states of a deep bidirectional language model (biLM), which is pre-trained on a large text corpus. We show that these representations can be easily added to existing models and significantly improve the state of the art across six challenging NLP problems, including question answering, textual entailment and sentiment analysis. We also present an analysis showing that exposing the deep internals of the pre-trained network is crucial, allowing downstream models to mix different types of semi-supervision signals.

Citation Graph

Loading graph...

References [61]

Sort:
Filter:

D. P. Kingma, Jimmy Lei Ba - 2014

49 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

N. Srivastava, Geoffrey E. Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2014

20 papers in library cite

Tomas Mikolov, Ilya Sutskever, K. Chen, G. S. Corrado, Jeffrey Dean - 2013

32 papers in library cite

Jeffrey Pennington, Richard Socher, Christopher D. Manning - 2014

31 papers in library cite

Jimmy Lei Ba, R. Kiros, Geoffrey E. Hinton - 2016

14 papers in library cite

Tomas Mikolov - 2017

7 papers in library cite

M. P. Marcus, B. Santorini, Mary Ann Marcinkiewicz - 1993

22 papers in library cite

Richard Socher, A. Perelygin, Jeffrey Wu, J. Chuang, C. Manning, A. Ng, Christopher Potts - 2013

24 papers in library cite

Ronan Collobert, Jason Weston, Leon Bottou, M. Karlen, Koray Kavukcuoglu, P. P. Kuksa - 2011

23 papers in library cite

P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016

37 papers in library cite

Kyunghyun Cho, B. V. Merrienboer, D. Bahdanau, Yoshua Bengio - 2014

9 papers in library cite

Matthew D. Zeiler - 2012

13 papers in library cite

Samuel R. Bowman, G. Angeli, Christopher Potts, Christopher D. Manning - 2015

25 papers in library cite

R. K. Srivastava, K. Greff, Jürgen Schmidhuber - 2015

6 papers in library cite

J. Turian, L. Ratinov, Yoshua Bengio - 2010

17 papers in library cite

M. Seo, A. Kembhavi, Ali Farhadi, Hananneh Hajishirzi - 2017

13 papers in library cite

Yarin Gal - 2015

9 papers in library cite

A. M. Dai, Quoc V. Le - 2015

27 papers in library cite

R. Jozefowicz, Oriol Vinyals, M. Schuster, Noam Shazeer, Yonghui Wu - 2016

20 papers in library cite

C. Chelba, Tomas Mikolov, M. Schuster, Q. Ge, T. Brants, P. Koehn, Tony Robinson - 2013

13 papers in library cite

B. Mccann, J. Bradbury, Caiming Xiong, Richard Socher - 2017

14 papers in library cite

M. E. Peters, W. Ammar, C. Bhagavatula, Russell Power - 2017

5 papers in library cite

O. Melamud, J. Goldberger, Ido Dagan - 2016

5 papers in library cite

C. Clark, Matt Gardner - 2017

7 papers in library cite

A. Kumar, O. Irsoy, P. Ondruska, M. Iyyer, J. Bradbury, I. Gulrajani, Victor Zhong, R. Paulus, Richard Socher - 2015

9 papers in library cite

Qinlang Chen, X. Zhu, Z. H. Ling, S. Wei, H. Jiang, D. Inkpen - 2017

5 papers in library cite

S. Merity, Nitish Shirish Keskar, Richard Socher - 2017

6 papers in library cite

Wenyi Wang, N. Yang, F. Wei, B. Chang, M. Zhou - 2017

5 papers in library cite

G. Melis, C. Dyer, Phil Blunsom - 2018

6 papers in library cite

Yonatan Belinkov, N. Durrani, F. Dalvi, H. Sajjad, J. R. Glass - 2017

2 papers in library cite

A. Raganato, J. C. Collados, R. Navigli - 2017

2 papers in library cite

P. Ramachandran, P. Liu, Quoc Le - 2017

1 paper in library cites

A. Raganato, C. D. Bovi, R. Navigli - 2017

1 paper in library cites

Yoon Kim, Yacine Jernite, D. Sontag, Alexander M. Rush - 2016

7 papers in library cite

J. Lafferty, Andrew Mccallum, F. C. Pereira - 2001

6 papers in library cite

K. Hashimoto, Caiming Xiong, Y. Tsuruoka, Richard Socher - 2016

5 papers in library cite

W. Ling, C. Dyer, A. W. Black, I. Trancoso, R. Fermandez, S. Amir, L. Marujo, T. Luis - 2015

5 papers in library cite

R. Jozefowicz, Wojciech Zaremba, Ilya Sutskever - 2015

4 papers in library cite

E. F. T. K. Sang, F. D. Meulder - 2003

4 papers in library cite

G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, C. Dyer - 2016

4 papers in library cite

H. Yu, T. Munkhdalai - 2017

4 papers in library cite

A. Sogaard, Y. Goldberg - 2016

3 papers in library cite

Jingren Zhou, Weixin Xu - 2015

3 papers in library cite

K. Lee, Luheng He, Martha Lewis, L. S. Zettlemoyer - 2017

3 papers in library cite

X. Ma, Eduard Hovy - 2016

3 papers in library cite

P. Zhou, Z. Qi, S. Zheng, Jiacheng Xu, H. Bao, B. Xu - 2016

3 papers in library cite

M. Palmer, P. Kingsbury, D. Gildea - 2005

3 papers in library cite

J. Wieting, Mohit Bansal, Kevin Gimpel, Karen Livescu - 2016

2 papers in library cite

S. Pradhan, A. Moschitti, N. Xue, O. Uryupina, Y. Z. Zhang - 2012

2 papers in library cite

Luheng He, K. Lee, Martha Lewis, L. S. Zettlemoyer - 2017

2 papers in library cite

J. Chiu, E. Nichols - 2016

2 papers in library cite

Y. Gong, H. Luo, J. Zhang - 2018

2 papers in library cite

Xiaodong Liu, Y. Shen, K. Duh, Jianfeng Gao - 2017

2 papers in library cite

K. Clark, Christopher D. Manning - 2016

1 paper in library cites

G. Durrett, Dan Klein - 2013

1 paper in library cites

Arvind Neelakantan, J. Shankar, A. Passos, Andrew Mccallum - 2014

1 paper in library cites

I. Iacobacci, M. T. Pilehvar, R. Navigli - 2016

1 paper in library cites

Sam Wiseman, Alexander M. Rush, Stuart M. Shieber - 2016

1 paper in library cites

S. Pradhan, A. Moschitti, N. Xue, H. Ng, A. Bjorkelund, O. Uryupina, Y. Z. Zhang, Z. Zhong - 2013

1 paper in library cites

George A. Miller, M. Chodorow, S. Landes, C. Leacock, R. G. Thomas - 1994

1 paper in library cites

Cited by

27

papers in your library

Cites

34

papers in your library

Read

on August 7, 2025

Your review

Tags

Paper Aliases

No aliases