2019
Cite Score
5
AI summary
This paper analyzes state-of-the-art natural language understanding models and conducts an extensive empirical investigation to evaluate them against criteria through a series of experiments. The paper proposes a new evaluation metric based on an online encoding of the test data that quantifies how quickly an existing agent (model) learns a new task.
Main Contributions
Abstract
We define general linguistic intelligence as the ability to reuse previously acquired knowledge about a language's lexicon, syntax, semantics, and pragmatic conventions to adapt to new tasks quickly. Using this definition, we analyze state-of-the-art natural language understanding models and conduct an extensive empirical investigation to evaluate them against these criteria through a series of experiments that assess the task-independence of the knowledge being acquired by the learning process. In addition to task performance, we propose a new evaluation metric based on an online encoding of the test data that quantifies how quickly an existing agent (model) learns a new task. Our results show that while the field has made impressive progress in terms of model architectures that generalize to many tasks, these models still require a lot of in-domain training examples (e.g., for fine tuning, training task-specific modules), and are prone to catastrophic forgetting. Moreover, we find that far from solving general tasks (e.g., document question answering), our models are overfitting to the quirks of particular datasets (e.g., SQuAD). We discuss missing components and conjecture on how to make progress toward general linguistic intelligence.
Citation Graph
References [35]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017
47 papers in library cite
Jacob Devlin, M. W. Chang, K. Lee, Kristina Toutanova - 2018
39 papers in library cite
Sepp Hochreiter, Jürgen Schmidhuber - 1997
94 papers in library cite
Chelsea Finn, P. Abbeel, Sergey Levine - 2017
4 papers in library cite
M. E. Peters, M. Neumann, M. Iyyer, Matt Gardner, C. Clark, K. Lee, L. S. Zettlemoyer - 2018
27 papers in library cite
Alec Radford, K. Narasimhan, T. Salimans, Ilya Sutskever - 2018
23 papers in library cite
J. Kirkpatrick, Razvan Pascanu, N. C. Rabinowitz, J. Veness, G. Desjardins, A. A. Rusu, K. Milan, J. Quan, T. Ramalho, A. G. Barwinska, Demis Hassabis, C. Clopath, D. Kumaran, Raia Hadsell - 2017
5 papers in library cite
P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016
37 papers in library cite
A. Wang, A. Singh, J. Michael, F. Hill, Omer Levy, Samuel R. Bowman - 2018
26 papers in library cite
Samuel R. Bowman, G. Angeli, Christopher Potts, Christopher D. Manning - 2015
25 papers in library cite
A. Williams, Nikita Nangia, S. Bowman - 2018
19 papers in library cite
T. Kwiatkowski, J. Palomaki, O. Rhinehart, Michael Collins, A. P. Parikh, C. Alberti, D. Epstein, Illia Polosukhin, M. Kelcey, Jacob Devlin, K. Lee, K. N. Toutanova, Llion Jones, M. W. Chang, Andrew Dai, Jakob Uszkoreit, Quoc Le, Slav Petrov - 2019
9 papers in library cite
Robert M. French - 1999
2 papers in library cite
M. Joshi, E. Choi, D. Weld, Luke Zettlemoyer - 2017
18 papers in library cite
M. Seo, A. Kembhavi, Ali Farhadi, Hananneh Hajishirzi - 2017
13 papers in library cite
R. Jia, Percy Liang - 2017
11 papers in library cite
A. M. Dai, Quoc V. Le - 2015
27 papers in library cite
Richard Socher - 2018
9 papers in library cite
E. Grave, Armand Joulin, Nicolas Usunier - 2016
7 papers in library cite
M. Mccloskey, N. J. Cohen - 1989
4 papers in library cite
J. Schwarz, Jelena Luketina, W. M. Czarnecki, A. G. Barwinska, Yee Whye Teh, Razvan Pascanu, Raia Hadsell - 2018
1 paper in library cites
E. Choi, He He, M. Iyyer, M. Yatskar, W. T. Yih, Yejin Choi, Percy Liang, Luke Zettlemoyer - 2018
8 papers in library cite
Omer Levy, M. Seo, E. Choi, L. S. Zettlemoyer - 2017
3 papers in library cite
B. Krause, E. Kahembwe, I. Murray, S. Renals - 2017
3 papers in library cite
Greg Brockman, V. Cheung, L. Pettersson, J. Schneider, John Schulman, Jie Tang, Wojciech Zaremba - 2016
3 papers in library cite
H. Hassan, A. Aue, C. C. Chen, V. Chowdhary, Jack Clark, C. Federmann, X. Huang, M. J. Dowmunt, W. Lewis, M. Li, Shuming Liu, T. Y. Liu, R. Luo, Arul Menezes, T. Qin, F. Seide, X. Tan, F. Tian, L. Wu, S. Wu, Y. Xia, Danyang Zhang, Zhengyou Zhang, M. Zhou - 2018
1 paper in library cites
T. V. Erven, P. Grunwald, S. D. Rooij - 2012
1 paper in library cites
C. Beattie, J. Z. Leibo, D. Teplyashin, T. Ward, M. Wainwright, H. Kuttler, A. Lefrancq, S. Green, V. Valdes, A. Sadik, J. Schrittwieser, K. Anderson, S. York, M. Cant, A. Cain, A. Bolton, S. Gaffney, H. King, Demis Hassabis, Shane Legg, S. Petersen - 2016
1 paper in library cites
J. G. Wolff - 1982
1 paper in library cites
N. Fitzgerald, J. Michael, Luheng He, Luke Zettlemoyer - 2018
1 paper in library cites
Alex Nichol, Josh Achiam, John Schulman - 2018
1 paper in library cites
G. J. Chaitin - 2007
1 paper in library cites
M. Jaderberg, V. Dalibard, S. Osindero, W. M. Czarnecki, J. Donahue, A. Razavi, Oriol Vinyals, T. Green, I. Dunning, K. Simonyan, C. Fernando, Koray Kavukcuoglu - 2017
1 paper in library cites
I. Guyon, G. Cawley, G. Dror, V. Lemaire - 2011
1 paper in library cites
L. Blier, Y. Ollivier - 2018
1 paper in library cites
Cited by
2
papers in your library
Cites
23
papers in your library
Read
on November 13, 2025
Your review
Tags
Paper Aliases
No aliases