2019
Cite Score
68
AI summary
This paper introduces LAMA (LAnguage Model Analysis) probe to test the factual and commonsense knowledge in language models, finding that BERT contains relational knowledge competitive with traditional NLP methods and achieves remarkable results for open-domain QA.
Main Contributions
Abstract
Recent progress in pretraining language models on large textual corpora led to a surge of improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these models may also be storing relational knowledge present in the training data, and may be able to answer queries structured as "fill-in-the-blank" cloze statements. Language models have many advantages over structured knowledge bases: they require no schema engineering, allow practitioners to query about an open class of relations, are easy to extend to more data, and require no human supervision to train. We present an in-depth analysis of the relational knowledge already present (without fine-tuning) in a wide range of state-of-the-art pretrained language models. We find that (i) without fine-tuning, BERT contains relational knowledge competitive with traditional NLP methods that have some access to oracle knowledge, (ii) BERT also does remarkably well on open-domain question answering against a supervised baseline, and (iii) certain types of factual knowledge are learned much more readily than others by standard language model pretraining approaches. The surprisingly strong ability of these models to recall factual knowledge without any fine-tuning demonstrates their potential as unsupervised open-domain QA systems. The code to reproduce our analysis is available at https://github.com/facebookresearch/LAMA.
Citation Graph
References [39]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017
47 papers in library cite
Jacob Devlin, M. W. Chang, K. Lee, Kristina Toutanova - 2018
39 papers in library cite
Sepp Hochreiter, Jürgen Schmidhuber - 1997
94 papers in library cite
Alec Radford, Jeffrey Wu, Rewon Child, D. Luan, Dario Amodei, Ilya Sutskever - 2019
27 papers in library cite
M. E. Peters, M. Neumann, M. Iyyer, Matt Gardner, C. Clark, K. Lee, L. S. Zettlemoyer - 2018
27 papers in library cite
Alec Radford, K. Narasimhan, T. Salimans, Ilya Sutskever - 2018
23 papers in library cite
Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001
62 papers in library cite
P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016
37 papers in library cite
A. Wang, A. Singh, J. Michael, F. Hill, Omer Levy, Samuel R. Bowman - 2018
26 papers in library cite
Z. Dai, Zhilin Yang, Yining Yang, W. Cohen, J. Carbonell, Quoc Le, Ruslan Salakhutdinov - 2019
9 papers in library cite
T. Kwiatkowski, J. Palomaki, O. Rhinehart, Michael Collins, A. P. Parikh, C. Alberti, D. Epstein, Illia Polosukhin, M. Kelcey, Jacob Devlin, K. Lee, K. N. Toutanova, Llion Jones, M. W. Chang, Andrew Dai, Jakob Uszkoreit, Quoc Le, Slav Petrov - 2019
9 papers in library cite
Wojciech Zaremba, Ilya Sutskever, Oriol Vinyals - 2014
22 papers in library cite
Yuxuan Zhu, R. Kiros, R. Zemel, Ruslan Salakhutdinov, R. Urtasun, Antonio Torralba, Sanja Fidler - 2015
18 papers in library cite
S. Merity, Caiming Xiong, J. Bradbury, Richard Socher - 2017
12 papers in library cite
R. T. Mccoy, Ellie Pavlick, Tal Linzen - 2019
5 papers in library cite
Siva Reddy, Deli Chen, Christopher D. Manning - 2018
6 papers in library cite
Tomas Mikolov, Geoffrey Zweig - 2012
12 papers in library cite
Jelena Luketina, Nantas Nardelli, Gregory Farquhar, Jakob Foerster, Jacob Andreas, Edward Grefenstette, Shimon Whiteson, Tim Rocktaschel - 2019
3 papers in library cite
Yann N. Dauphin, A. Fan, Michael Auli, D. Grangier - 2016
8 papers in library cite
Deli Chen, Adam Fisch, Jason Weston, Antoine Bordes - 2017
10 papers in library cite
G. Melis, C. Dyer, Phil Blunsom - 2018
6 papers in library cite
D. Bahdanau, F. Hill, Jan Leike, E. Hughes, P. Kohli, Edward Grefenstette - 2019
4 papers in library cite
I. Tenney, P. Xia, Berlin Chen, A. Wang, A. Poliak, R. T. Mccoy, N. Kim, B. V. Durme, S. Bowman, Dipanjan Das, Ellie Pavlick - 2019
4 papers in library cite
Y. Goldberg - 2019
4 papers in library cite
M. E. Peters, M. Neumann, Luke Zettlemoyer, W. T. Yih - 2018
4 papers in library cite
A. Talmor, J. Herzig, N. Lourie, Jonathan Berant - 2019
3 papers in library cite
M. Baroni, G. Dinu, German Kruszewski - 2014
3 papers in library cite
F. Hill, R. Reichart, Anna Korhonen - 2015
3 papers in library cite
R. Marvin, Tal Linzen - 2018
3 papers in library cite
Rowan Zellers, Yonatan Bisk, Ali Farhadi, Yejin Choi - 2018
2 papers in library cite
S. R. K. Branavan, D. Silver, R. Barzilay - 2011
2 papers in library cite
M. Nickel, K. Murphy, V. Tresp, E. Gabrilovich - 2016
1 paper in library cites
M. C. Boisvert, D. Bahdanau, S. Lahlou, L. Willems, C. Saharia, T. H. Nguyen, Yoshua Bengio - 2018
1 paper in library cites
D. Sorokin, I. Gurevych - 2017
1 paper in library cites
S. Welleck, K. Brantley, H. D. Iii, Kyunghyun Cho - 2019
1 paper in library cites
M. Surdeanu, H. Ji - 2014
1 paper in library cites
R. Speer, C. Havasi - 2012
1 paper in library cites
H. Elsahar, P. Vougiouklis, A. Remaci, C. Gravier, J. Hare, F. Laforest, E. Simperl - 2018
1 paper in library cites
Antoine Bordes, Nicolas Usunier, A. G. Duran, Jason Weston, O. Yakhnenko - 2013
1 paper in library cites
Cited by
4
papers in your library
Cites
23
papers in your library
Read
on November 15, 2025
Your review
Tags
Paper Aliases
No aliases