2019

Learning and Evaluating General Linguistic Intelligence

D. Yogatama, C. D. M. D'autume, J. Connor, T. Kocisky, M. Chrzanowski, L. Kong, A. Lazaridou, W. Ling, Longhui Yu, C. Dyer

citations

Cite Score

5

AI summary

This paper analyzes state-of-the-art natural language understanding models and conducts an extensive empirical investigation to evaluate them against criteria through a series of experiments. The paper proposes a new evaluation metric based on an online encoding of the test data that quantifies how quickly an existing agent (model) learns a new task.

Main Contributions

  • The paper defines general linguistic intelligence as the ability to reuse previously acquired knowledge to adapt to new tasks quickly.
  • The paper analyzes state-of-the-art natural language understanding models and conducts an extensive empirical investigation to evaluate them.
  • The paper proposes a new evaluation metric based on an online encoding of the test data that quantifies how quickly an existing agent (model) learns a new task.
  • The paper finds that far from solving general tasks, models are overfitting to the quirks of particular datasets (e.g., SQuAD).
  • The paper discusses missing components and conjecture on how to make progress toward general linguistic intelligence.

Abstract

We define general linguistic intelligence as the ability to reuse previously acquired knowledge about a language's lexicon, syntax, semantics, and pragmatic conventions to adapt to new tasks quickly. Using this definition, we analyze state-of-the-art natural language understanding models and conduct an extensive empirical investigation to evaluate them against these criteria through a series of experiments that assess the task-independence of the knowledge being acquired by the learning process. In addition to task performance, we propose a new evaluation metric based on an online encoding of the test data that quantifies how quickly an existing agent (model) learns a new task. Our results show that while the field has made impressive progress in terms of model architectures that generalize to many tasks, these models still require a lot of in-domain training examples (e.g., for fine tuning, training task-specific modules), and are prone to catastrophic forgetting. Moreover, we find that far from solving general tasks (e.g., document question answering), our models are overfitting to the quirks of particular datasets (e.g., SQuAD). We discuss missing components and conjecture on how to make progress toward general linguistic intelligence.

Citation Graph

Loading graph...

References [35]

Sort:
Filter:

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017

47 papers in library cite

Jacob Devlin, M. W. Chang, K. Lee, Kristina Toutanova - 2018

39 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Chelsea Finn, P. Abbeel, Sergey Levine - 2017

4 papers in library cite

M. E. Peters, M. Neumann, M. Iyyer, Matt Gardner, C. Clark, K. Lee, L. S. Zettlemoyer - 2018

27 papers in library cite

Alec Radford, K. Narasimhan, T. Salimans, Ilya Sutskever - 2018

23 papers in library cite

J. Kirkpatrick, Razvan Pascanu, N. C. Rabinowitz, J. Veness, G. Desjardins, A. A. Rusu, K. Milan, J. Quan, T. Ramalho, A. G. Barwinska, Demis Hassabis, C. Clopath, D. Kumaran, Raia Hadsell - 2017

5 papers in library cite

P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016

37 papers in library cite

A. Wang, A. Singh, J. Michael, F. Hill, Omer Levy, Samuel R. Bowman - 2018

26 papers in library cite

Samuel R. Bowman, G. Angeli, Christopher Potts, Christopher D. Manning - 2015

25 papers in library cite

A. Williams, Nikita Nangia, S. Bowman - 2018

19 papers in library cite

T. Kwiatkowski, J. Palomaki, O. Rhinehart, Michael Collins, A. P. Parikh, C. Alberti, D. Epstein, Illia Polosukhin, M. Kelcey, Jacob Devlin, K. Lee, K. N. Toutanova, Llion Jones, M. W. Chang, Andrew Dai, Jakob Uszkoreit, Quoc Le, Slav Petrov - 2019

9 papers in library cite

Robert M. French - 1999

2 papers in library cite

M. Joshi, E. Choi, D. Weld, Luke Zettlemoyer - 2017

18 papers in library cite

M. Seo, A. Kembhavi, Ali Farhadi, Hananneh Hajishirzi - 2017

13 papers in library cite

R. Jia, Percy Liang - 2017

11 papers in library cite

A. M. Dai, Quoc V. Le - 2015

27 papers in library cite

Richard Socher - 2018

9 papers in library cite

E. Grave, Armand Joulin, Nicolas Usunier - 2016

7 papers in library cite

M. Mccloskey, N. J. Cohen - 1989

4 papers in library cite

J. Schwarz, Jelena Luketina, W. M. Czarnecki, A. G. Barwinska, Yee Whye Teh, Razvan Pascanu, Raia Hadsell - 2018

1 paper in library cites

E. Choi, He He, M. Iyyer, M. Yatskar, W. T. Yih, Yejin Choi, Percy Liang, Luke Zettlemoyer - 2018

8 papers in library cite

Omer Levy, M. Seo, E. Choi, L. S. Zettlemoyer - 2017

3 papers in library cite

B. Krause, E. Kahembwe, I. Murray, S. Renals - 2017

3 papers in library cite

Greg Brockman, V. Cheung, L. Pettersson, J. Schneider, John Schulman, Jie Tang, Wojciech Zaremba - 2016

3 papers in library cite

H. Hassan, A. Aue, C. C. Chen, V. Chowdhary, Jack Clark, C. Federmann, X. Huang, M. J. Dowmunt, W. Lewis, M. Li, Shuming Liu, T. Y. Liu, R. Luo, Arul Menezes, T. Qin, F. Seide, X. Tan, F. Tian, L. Wu, S. Wu, Y. Xia, Danyang Zhang, Zhengyou Zhang, M. Zhou - 2018

1 paper in library cites

C. Beattie, J. Z. Leibo, D. Teplyashin, T. Ward, M. Wainwright, H. Kuttler, A. Lefrancq, S. Green, V. Valdes, A. Sadik, J. Schrittwieser, K. Anderson, S. York, M. Cant, A. Cain, A. Bolton, S. Gaffney, H. King, Demis Hassabis, Shane Legg, S. Petersen - 2016

1 paper in library cites

J. G. Wolff - 1982

1 paper in library cites

N. Fitzgerald, J. Michael, Luheng He, Luke Zettlemoyer - 2018

1 paper in library cites

Alex Nichol, Josh Achiam, John Schulman - 2018

1 paper in library cites

M. Jaderberg, V. Dalibard, S. Osindero, W. M. Czarnecki, J. Donahue, A. Razavi, Oriol Vinyals, T. Green, I. Dunning, K. Simonyan, C. Fernando, Koray Kavukcuoglu - 2017

1 paper in library cites

I. Guyon, G. Cawley, G. Dror, V. Lemaire - 2011

1 paper in library cites

L. Blier, Y. Ollivier - 2018

1 paper in library cites

Cited by

2

papers in your library

Cites

23

papers in your library

Read

on November 13, 2025

Your review

Tags

Paper Aliases

No aliases