2020

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

P. Lewis, Ethan Perez, A. Piktus, F. Petroni, V. Karpukhin, N. Goyal, H. Küttler, Mike Lewis, W. T. Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela

Cite Score: 82

AI summary

This paper introduces Retrieval-Augmented Generation (RAG) models, which combine a pre-trained seq2seq model with a dense vector index of Wikipedia accessed by a pre-trained neural retriever. The models achieve state-of-the-art results on three open-domain QA tasks and generate more specific, diverse, and factual language than a parametric-only seq2seq baseline.

Main Contributions

  • Introduces RAG models, which combine pre-trained parametric and non-parametric memory for language generation.
  • Compares two RAG formulations: RAG-Sequence and RAG-Token.
  • Achieves state-of-the-art results on three open-domain QA tasks: Natural Questions, WebQuestions, and CuratedTrec.
  • Demonstrates that RAG models generate more specific, diverse, and factual language than a state-of-the-art parametric-only seq2seq baseline.
  • Shows that the non-parametric memory can be replaced to update the models' knowledge as the world changes.
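
The difference between the two formulations can be illustrated with a small sketch. It assumes the usual reading of the paper's definitions: RAG-Sequence marginalizes over retrieved passages once for the whole output sequence, while RAG-Token marginalizes at every token. The function names and all probabilities below are made up for illustration and are not taken from the paper's code.

```python
import math

def rag_sequence_prob(doc_probs, token_probs):
    """RAG-Sequence: p(y|x) ~= sum_z p(z|x) * prod_i p(y_i | x, z, y_<i)."""
    return sum(
        p_z * math.prod(per_token)
        for p_z, per_token in zip(doc_probs, token_probs)
    )

def rag_token_prob(doc_probs, token_probs):
    """RAG-Token: p(y|x) ~= prod_i sum_z p(z|x) * p(y_i | x, z, y_<i)."""
    num_tokens = len(token_probs[0])
    return math.prod(
        sum(p_z * per_token[i] for p_z, per_token in zip(doc_probs, token_probs))
        for i in range(num_tokens)
    )

# Toy numbers (hypothetical): two retrieved passages, a two-token target.
doc_probs = [0.5, 0.5]        # p(z|x) for each passage
token_probs = [
    [0.9, 0.1],               # p(y_i | x, z_1, y_<i) for tokens 1 and 2
    [0.1, 0.9],               # p(y_i | x, z_2, y_<i) for tokens 1 and 2
]

seq = rag_sequence_prob(doc_probs, token_probs)  # 0.5*0.09 + 0.5*0.09 = 0.09
tok = rag_token_prob(doc_probs, token_probs)     # 0.5 * 0.5 = 0.25
```

In this toy case each passage supports a different token, so the per-token mixture assigns the target a higher probability than the per-sequence mixture. That is the situation in which drawing on different passages per token would be expected to help.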

Abstract

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate knowledge is still limited, and hence on knowledge-intensive tasks, their performance lags behind task-specific architectures. Additionally, providing provenance for their decisions and updating their world knowledge remain open research problems. Pre-trained models with a differentiable access mechanism to explicit non-parametric memory have so far been only investigated for extractive downstream tasks. We explore a general-purpose fine-tuning recipe for retrieval-augmented generation (RAG) - models which combine pre-trained parametric and non-parametric memory for language generation. We introduce RAG models where the parametric memory is a pre-trained seq2seq model and the non-parametric memory is a dense vector index of Wikipedia, accessed with a pre-trained neural retriever. We compare two RAG formulations, one which conditions on the same retrieved passages across the whole generated sequence, and another which can use different passages per token. We fine-tune and evaluate our models on a wide range of knowledge-intensive NLP tasks and set the state of the art on three open domain QA tasks, outperforming parametric seq2seq models and task-specific retrieve-and-extract architectures. For language generation tasks, we find that RAG models generate more specific, diverse and factual language than a state-of-the-art parametric-only seq2seq baseline.
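
The retrieval step described in the abstract can be sketched as follows. This is a minimal illustration, assuming passages and queries are embedded as dense vectors and scored by inner product, with a softmax over the top-k scores standing in for the retriever distribution p(z|x). The helper `top_k_passages`, the passage ids, and the 2-d vectors are hypothetical; a real system would use learned encoders and an approximate nearest-neighbour index rather than a brute-force scan.

```python
import math

def top_k_passages(query_vec, index, k):
    """Return the k highest-scoring passages as (passage_id, probability) pairs."""
    # Score every passage by inner product with the query (brute force).
    scored = [
        (sum(q * d for q, d in zip(query_vec, doc_vec)), passage_id)
        for passage_id, doc_vec in index.items()
    ]
    scored.sort(reverse=True)
    top = scored[:k]
    # Softmax over the top-k scores approximates p(z|x).
    max_score = top[0][0]
    weights = [math.exp(score - max_score) for score, _ in top]
    total = sum(weights)
    return [(pid, w / total) for (_, pid), w in zip(top, weights)]

# Hypothetical 2-d embeddings, for illustration only.
index_old = {"p1": [1.0, 0.0], "p2": [0.0, 1.0], "p3": [0.7, 0.7]}
query = [1.0, 0.2]
retrieved = top_k_passages(query, index_old, k=2)

# Replacing the index changes what is retrieved; no model weights are touched.
index_new = {"p4": [2.0, 0.0], "p2": [0.0, 1.0]}
retrieved_new = top_k_passages(query, index_new, k=2)
```

Because the index is an ordinary data structure outside the model's parameters, swapping `index_old` for `index_new` is all it takes to change the knowledge the generator conditions on.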



Cited by: 5 papers in your library

Cites: 32 papers in your library

Read on May 25, 2025
