2018
Cite Score
82
AI summary
The paper introduces GLUE, a multi-task benchmark for NLU, comprising diverse tasks, limited data, and a diagnostic test suite. Baselines using ELMO achieve improved performance via multi-task training, yet reveal the necessity for enhanced general NLU systems.
Main Contributions
Abstract
For natural language understanding (NLU) technology to be maximally useful, it must be able to process language in a way that is not exclusive to a single task, genre, or dataset. In pursuit of this objective, we introduce the General Language Understanding Evaluation (GLUE) benchmark, a collection of tools for evaluating the performance of models across a diverse set of existing NLU tasks. By including tasks with limited training data, GLUE is designed to favor and encourage models that share general linguistic knowledge across tasks. GLUE also includes a hand-crafted diagnostic test suite that enables detailed linguistic analysis of models. We evaluate baselines based on current methods for transfer and representation learning and find that multi-task training on all tasks performs better than training a separate model per task. However, the low absolute performance of our best model indicates the need for improved general NLU systems.
Citation Graph
References [52]
D. P. Kingma, Jimmy Lei Ba - 2014
49 papers in library cite
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017
47 papers in library cite
Jeffrey Pennington, Richard Socher, Christopher D. Manning - 2014
31 papers in library cite
D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014
59 papers in library cite
M. E. Peters, M. Neumann, M. Iyyer, Matt Gardner, C. Clark, K. Lee, L. S. Zettlemoyer - 2018
27 papers in library cite
Quoc Le, Tomas Mikolov - 2014
13 papers in library cite
Richard Socher, A. Perelygin, Jeffrey Wu, J. Chuang, C. Manning, A. Ng, Christopher Potts - 2013
24 papers in library cite
Ronan Collobert, Jason Weston, Leon Bottou, M. Karlen, Koray Kavukcuoglu, P. P. Kuksa - 2011
23 papers in library cite
P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016
37 papers in library cite
Samuel R. Bowman, G. Angeli, Christopher Potts, Christopher D. Manning - 2015
25 papers in library cite
A. Williams, Nikita Nangia, S. Bowman - 2018
19 papers in library cite
Yuxuan Zhu, R. Kiros, R. Zemel, Ruslan Salakhutdinov, R. Urtasun, Antonio Torralba, Sanja Fidler - 2015
18 papers in library cite
R. Kiros, Yuxuan Zhu, Ruslan Salakhutdinov, Richard S. Zemel, R. Urtasun, Antonio Torralba, Sanja Fidler - 2015
23 papers in library cite
Ido Dagan, O. Glickman, Bernardo Magnini - 2005
19 papers in library cite
Alexis Conneau, Douwe Kiela, Holger Schwenk, L. Barrault, Antoine Bordes - 2017
11 papers in library cite
M. Seo, A. Kembhavi, Ali Farhadi, Hananneh Hajishirzi - 2017
13 papers in library cite
W. Dolan, Chris Brockett - 2005
9 papers in library cite
Hector J. Levesque, E. Davis, Leora Morgenstern - 2011
13 papers in library cite
Suchin Gururangan, Swabha Swayamdipta, Omer Levy, Richard Schwartz, S. Bowman, Noah A. Smith - 2018
6 papers in library cite
C. Chelba, Tomas Mikolov, M. Schuster, Q. Ge, T. Brants, P. Koehn, Tony Robinson - 2013
13 papers in library cite
B. Mccann, J. Bradbury, Caiming Xiong, Richard Socher - 2017
14 papers in library cite
Tim Rocktaschel, Edward Grefenstette, K. Hermann, T. Kocisky, Phil Blunsom - 2016
5 papers in library cite
Richard Socher - 2018
9 papers in library cite
F. Hill, Kyunghyun Cho, Anna Korhonen - 2016
12 papers in library cite
Alex Warstadt, A. Singh, S. Bowman - 2018
8 papers in library cite
Matt Gardner, J. Grus, M. Neumann, Oyvind Tafjord, P. Dasigi, N. Liu, M. Peters, M. Schmitz, Luke Zettlemoyer - 2018
5 papers in library cite
Alexis Conneau, Douwe Kiela - 2018
5 papers in library cite
A. Poliak, J. Naradowsky, A. Haldar, R. Rudinger, B. V. Durme - 2018
5 papers in library cite
S. Subramanian, A. Trischler, Yoshua Bengio, C. Pal - 2018
4 papers in library cite
Richard Schwartz, Maarten Sap, I. Konstas, L. Zilles, Yejin Choi, Noah A. Smith - 2017
3 papers in library cite
Bo Pang, L. Lee - 2005
13 papers in library cite
Bo Pang, L. A. Lee, L. Lillian - 2004
8 papers in library cite
J. Wiebe, T. Wilson, T. Theresa, C. A. Cardie, C. Claire - 2005
7 papers in library cite
L. Bentivogli, Peter Clark, Ido Dagan, D. Giampiccolo - 2009
7 papers in library cite
D. Giampiccolo, Bernardo Magnini, Ido Dagan, B. Dolan - 2007
7 papers in library cite
M. Hu, B. A. Liu, B. Bing - 2004
6 papers in library cite
D. Cer, M. Diab, E. Agirre, I. L. Gazpio, L. Specia - 2017
6 papers in library cite
R. B. Haim, Ido Dagan, B. Dolan, L. Ferro, D. Giampiccolo, Bernardo Magnini, I. Szpektor - 2006
6 papers in library cite
K. Hashimoto, Caiming Xiong, Y. Tsuruoka, Richard Socher - 2016
5 papers in library cite
E. M. Voorhees, D. M. Tice - 1999
5 papers in library cite
Armand Joulin, E. Grave, Piotr Bojanowski, Tomas Mikolov - 2017
4 papers in library cite
Allen Nie, E. Bennett, N. Goodman - 2017
4 papers in library cite
A. Sogaard, Y. Goldberg - 2016
3 papers in library cite
R. T. Mccoy, Tal Linzen - 2019
3 papers in library cite
R. Cooper, D. Crouch, J. Eijck, C. Fox, J. Genabith, J. Jaspars, H. Kamp, D. Milward, M. Pinkal, M. Poesio, S. Pulman, T. Briscoe, H. Maier, K. Konrad - 1996
3 papers in library cite
A. White, P. Rastogi, K. Duh, B. Durme - 2017
2 papers in library cite
M. Tsuchiya - 2018
2 papers in library cite
Sebastian Ruder, J. Bingel, I. Augenstein, A. Sogaard - 2017
2 papers in library cite
A. Ettinger, S. Rao, H. D. Iii, E. Bender - 2017
2 papers in library cite
J. Gorodkin - 2004
1 paper in library cites
B. Matthews - 1975
1 paper in library cites
D. Demszky, K. Guu, Percy Liang - 2018
1 paper in library cites
Cited by
26
papers in your library
Cites
30
papers in your library
Read
on August 8, 2025
Your review
Tags
Paper Aliases
No aliases