2019

SpanBERT: Improving Pre-Training by Representing and Predicting Spans

Mandar Joshi, Danqi Chen, Yinhan Liu, Daniel S. Weld, Luke Zettlemoyer, Omer Levy

AI summary

This paper introduces SpanBERT, a pre-training method designed to represent and predict spans of text. It masks contiguous random spans rather than individual tokens, and trains the span boundary representations to predict the entire content of the masked span. With the same training data and model size as BERT-large, a single model obtains 94.6% and 88.7% F1 on SQuAD 1.1 and 2.0, respectively, and sets a new state of the art on OntoNotes coreference resolution (79.6% F1).

Main Contributions

  • Introduces a new pre-training method called SpanBERT that is designed to better represent and predict spans of text.
  • Extends BERT by masking contiguous random spans, rather than random tokens.
  • Trains the span boundary representations to predict the entire content of the masked span, without relying on the individual token representations within it.
  • Achieves state-of-the-art results on the OntoNotes coreference resolution task (79.6% F1).
  • Achieves strong performance on the TACRED relation extraction benchmark and also improves on GLUE.
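
The span-masking idea in the second bullet can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation. The function names, the geometric length distribution with `p=0.2` clipped at 10 tokens, and the 15% masking budget are assumptions chosen for the example, since the summary above does not state them. For brevity the sketch masks individual tokens rather than whole words and always substitutes `[MASK]`.

```python
import random

MASK = "[MASK]"

def sample_span_length(rng, p=0.2, max_len=10):
    """Draw a span length from a geometric distribution, clipped at max_len.

    p and max_len are illustrative defaults, not values taken from this page.
    """
    length = 1
    while length < max_len and rng.random() > p:
        length += 1
    return length

def mask_contiguous_spans(tokens, mask_ratio=0.15, seed=0):
    """Mask whole contiguous spans until the masking budget is used up.

    Returns the masked token list and the sorted list of masked positions.
    """
    if not tokens:
        return [], []
    rng = random.Random(seed)
    budget = max(1, round(len(tokens) * mask_ratio))
    masked = set()
    while len(masked) < budget:
        # Never exceed the remaining budget with a single span.
        length = min(sample_span_length(rng), budget - len(masked))
        start = rng.randrange(0, len(tokens) - length + 1)
        span = range(start, start + length)
        if any(i in masked for i in span):
            continue  # overlaps an earlier span; sample again
        masked.update(span)
    out = [MASK if i in masked else tok for i, tok in enumerate(tokens)]
    return out, sorted(masked)
```

Calling `mask_contiguous_spans(sentence.split())` returns the masked sequence together with the masked positions, which tend to form runs of adjacent indices rather than isolated tokens.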

Abstract

We present SpanBERT, a pre-training method that is designed to better represent and predict spans of text. Our approach extends BERT by (1) masking contiguous random spans, rather than random tokens, and (2) training the span boundary representations to predict the entire content of the masked span, without relying on the individual token representations within it. SpanBERT consistently outperforms BERT and our better-tuned baselines, with substantial gains on span selection tasks such as question answering and coreference resolution. In particular, with the same training data and model size as BERT-large, our single model obtains 94.6% and 88.7% F1 on SQuAD 1.1 and 2.0, respectively. We also achieve a new state of the art on the OntoNotes coreference resolution task (79.6% F1), strong performance on the TACRED relation extraction benchmark, and even gains on GLUE.
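
To make point (2) of the abstract concrete, the sketch below shows which pieces of information a boundary-based predictor would be allowed to use for each masked token: the positions just outside the span and the token's offset within it, but nothing from inside the span. It is a hedged illustration only. The helper names and the tuple layout are hypothetical, and the actual prediction network is not described on this page, so it is omitted.

```python
def contiguous_spans(masked_positions):
    """Group masked positions into inclusive (start, end) spans."""
    spans = []
    for i in sorted(masked_positions):
        if spans and i == spans[-1][1] + 1:
            spans[-1] = (spans[-1][0], i)  # extend the current span
        else:
            spans.append((i, i))  # start a new span
    return spans

def boundary_prediction_inputs(masked_positions):
    """List the inputs available for predicting each masked token.

    Each entry is (target index, left boundary index, right boundary index,
    relative position inside the span, starting at 1). Boundary indices may
    fall outside the sequence if a span touches its edge; the caller is
    assumed to handle that case, e.g. via special start/end tokens.
    """
    examples = []
    for start, end in contiguous_spans(masked_positions):
        for i in range(start, end + 1):
            examples.append((i, start - 1, end + 1, i - start + 1))
    return examples
```

For masked positions `[3, 4, 5, 9]`, every token in the span 3 to 5 is paired with the same boundary positions 2 and 6 and differs only in its relative position, which is what forces the boundary representations to summarize the whole span.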
