2019

Language Models as Knowledge Bases?

F. Petroni, Tim Rocktaschel, P. Lewis, A. Bakhtin, Yonghui Wu, A. H. Miller, Sebastian Riedel

citations

Cite Score

68

AI summary

This paper introduces LAMA (LAnguage Model Analysis) probe to test the factual and commonsense knowledge in language models, finding that BERT contains relational knowledge competitive with traditional NLP methods and achieves remarkable results for open-domain QA.

Main Contributions

  • Introduces LAMA (LAnguage Model Analysis) probe to test the factual and commonsense knowledge in language models.
  • Finds that BERT contains relational knowledge competitive with traditional NLP methods that have some access to oracle knowledge.
  • Shows that BERT also does remarkably well on open-domain question answering against a supervised baseline.
  • Demonstrates that certain types of factual knowledge are learned much more readily than others by standard language model pretraining approaches.
  • Shows that BERT-large achieves remarkable results for open-domain QA, reaching 57.1% precision@10 compared to 63.5% of a knowledge base constructed using a task-specific supervised relation extraction system.

Abstract

Recent progress in pretraining language models on large textual corpora led to a surge of improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these models may also be storing relational knowledge present in the training data, and may be able to answer queries structured as "fill-in-the-blank" cloze statements. Language models have many advantages over structured knowledge bases: they require no schema engineering, allow practitioners to query about an open class of relations, are easy to extend to more data, and require no human supervision to train. We present an in-depth analysis of the relational knowledge already present (without fine-tuning) in a wide range of state-of-the-art pretrained language models. We find that (i) without fine-tuning, BERT contains relational knowledge competitive with traditional NLP methods that have some access to oracle knowledge, (ii) BERT also does remarkably well on open-domain question answering against a supervised baseline, and (iii) certain types of factual knowledge are learned much more readily than others by standard language model pretraining approaches. The surprisingly strong ability of these models to recall factual knowledge without any fine-tuning demonstrates their potential as unsupervised open-domain QA systems. The code to reproduce our analysis is available at https://github.com/facebookresearch/LAMA.

Citation Graph

Loading graph...

References [39]

Sort:
Filter:

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017

47 papers in library cite

Jacob Devlin, M. W. Chang, K. Lee, Kristina Toutanova - 2018

39 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Alec Radford, Jeffrey Wu, Rewon Child, D. Luan, Dario Amodei, Ilya Sutskever - 2019

27 papers in library cite

M. E. Peters, M. Neumann, M. Iyyer, Matt Gardner, C. Clark, K. Lee, L. S. Zettlemoyer - 2018

27 papers in library cite

Alec Radford, K. Narasimhan, T. Salimans, Ilya Sutskever - 2018

23 papers in library cite

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016

37 papers in library cite

A. Wang, A. Singh, J. Michael, F. Hill, Omer Levy, Samuel R. Bowman - 2018

26 papers in library cite

Z. Dai, Zhilin Yang, Yining Yang, W. Cohen, J. Carbonell, Quoc Le, Ruslan Salakhutdinov - 2019

9 papers in library cite

T. Kwiatkowski, J. Palomaki, O. Rhinehart, Michael Collins, A. P. Parikh, C. Alberti, D. Epstein, Illia Polosukhin, M. Kelcey, Jacob Devlin, K. Lee, K. N. Toutanova, Llion Jones, M. W. Chang, Andrew Dai, Jakob Uszkoreit, Quoc Le, Slav Petrov - 2019

9 papers in library cite

Wojciech Zaremba, Ilya Sutskever, Oriol Vinyals - 2014

22 papers in library cite

Yuxuan Zhu, R. Kiros, R. Zemel, Ruslan Salakhutdinov, R. Urtasun, Antonio Torralba, Sanja Fidler - 2015

18 papers in library cite

S. Merity, Caiming Xiong, J. Bradbury, Richard Socher - 2017

12 papers in library cite

R. T. Mccoy, Ellie Pavlick, Tal Linzen - 2019

5 papers in library cite

Siva Reddy, Deli Chen, Christopher D. Manning - 2018

6 papers in library cite

Tomas Mikolov, Geoffrey Zweig - 2012

12 papers in library cite

Jelena Luketina, Nantas Nardelli, Gregory Farquhar, Jakob Foerster, Jacob Andreas, Edward Grefenstette, Shimon Whiteson, Tim Rocktaschel - 2019

3 papers in library cite

Yann N. Dauphin, A. Fan, Michael Auli, D. Grangier - 2016

8 papers in library cite

Deli Chen, Adam Fisch, Jason Weston, Antoine Bordes - 2017

10 papers in library cite

G. Melis, C. Dyer, Phil Blunsom - 2018

6 papers in library cite

D. Bahdanau, F. Hill, Jan Leike, E. Hughes, P. Kohli, Edward Grefenstette - 2019

4 papers in library cite

I. Tenney, P. Xia, Berlin Chen, A. Wang, A. Poliak, R. T. Mccoy, N. Kim, B. V. Durme, S. Bowman, Dipanjan Das, Ellie Pavlick - 2019

4 papers in library cite

Y. Goldberg - 2019

4 papers in library cite

M. E. Peters, M. Neumann, Luke Zettlemoyer, W. T. Yih - 2018

4 papers in library cite

A. Talmor, J. Herzig, N. Lourie, Jonathan Berant - 2019

3 papers in library cite

M. Baroni, G. Dinu, German Kruszewski - 2014

3 papers in library cite

F. Hill, R. Reichart, Anna Korhonen - 2015

3 papers in library cite

R. Marvin, Tal Linzen - 2018

3 papers in library cite

Rowan Zellers, Yonatan Bisk, Ali Farhadi, Yejin Choi - 2018

2 papers in library cite

S. R. K. Branavan, D. Silver, R. Barzilay - 2011

2 papers in library cite

M. Nickel, K. Murphy, V. Tresp, E. Gabrilovich - 2016

1 paper in library cites

M. C. Boisvert, D. Bahdanau, S. Lahlou, L. Willems, C. Saharia, T. H. Nguyen, Yoshua Bengio - 2018

1 paper in library cites

D. Sorokin, I. Gurevych - 2017

1 paper in library cites

S. Welleck, K. Brantley, H. D. Iii, Kyunghyun Cho - 2019

1 paper in library cites

M. Surdeanu, H. Ji - 2014

1 paper in library cites

R. Speer, C. Havasi - 2012

1 paper in library cites

H. Elsahar, P. Vougiouklis, A. Remaci, C. Gravier, J. Hare, F. Laforest, E. Simperl - 2018

1 paper in library cites

Antoine Bordes, Nicolas Usunier, A. G. Duran, Jason Weston, O. Yakhnenko - 2013

1 paper in library cites

Cited by

4

papers in your library

Cites

23

papers in your library

Read

on November 15, 2025

Your review

Tags

Paper Aliases

No aliases