2018

Generating Wikipedia by Summarizing Long Sequences

P. J. Liu, M. Saleh, E. Pot, B. Goodrich, R. Sepassi, Lukasz Kaiser, Noam Shazeer

citations

Cite Score

39

AI summary

This paper introduces a new method for generating Wikipedia articles by using a two-stage extractive-abstractive summarization framework on a large, parallel dataset. It uses a decoder-only Transformer architecture to attend to long sequences, achieving strong results in ROUGE scores and human evaluations.

Main Contributions

  • Introduces a two-stage extractive-abstractive framework for generating Wikipedia articles.
  • Proposes a decoder-only Transformer architecture for abstractive summarization that can handle very long sequences.
  • Demonstrates the effectiveness of the proposed model in generating fluent and coherent multi-sentence paragraphs and even whole Wikipedia articles.
  • Achieves strong results in ROUGE scores and human evaluations compared to traditional encoder-decoder architectures.
  • Releases the URLs used in the experiments to encourage further research on large-scale summarization.

Abstract

We show that generating English Wikipedia articles can be approached as a multi-document summarization of source documents. We use extractive summarization to coarsely identify salient information and a neural abstractive model to generate the article. For the abstractive model, we introduce a decoder-only architecture that can scalably attend to very long sequences, much longer than typical encoder-decoder architectures used in sequence transduction. We show that this model can generate fluent, coherent multi-sentence paragraphs and even whole Wikipedia articles. When given reference documents, we show it can extract relevant factual information as reflected in perplexity, ROUGE scores and human evaluations.

Citation Graph

Loading graph...

References [20]

Sort:
Filter:

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017

47 papers in library cite

D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014

59 papers in library cite

Chin Yew Lin - 2004

9 papers in library cite

P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016

37 papers in library cite

Yonghui Wu, M. Schuster, Ziru Chen, Quoc V. Le, M. Norouzi, W. Macherey, M. Krikun, Yue Cao, Q. Gao, K. Macherey, J. Klingner, A. Shah, M. J. Johnson, Xiaodong Liu, Lukasz Kaiser, S. Gouws, Y. Kato, T. Kudo, H. Kazawa, K. Stevens, G. Kurian, N. Patil, Wenyi Wang, C. Young, J. Smith, J. Riesa, A. Rudnick, Oriol Vinyals, G. S. Corrado, M. Hughes, Jeffrey Dean - 2016

15 papers in library cite

Alexander M. Rush, S. Chopra, Jason Weston - 2015

13 papers in library cite

Noam Shazeer, Azalia Mirhoseini, K. Maziarz, A. Davis, Quoc Le, Geoffrey Hinton, Jeffrey Dean - 2017

9 papers in library cite

R. Nallapati, B. Zhou, C. N. D. Santos, C. G. Gulcehre, Bing Xiang - 2016

10 papers in library cite

R. Paulus, Caiming Xiong, Richard Socher - 2017

7 papers in library cite

S. Chopra, Michael Auli, A. Rush, S. Harvard - 2016

5 papers in library cite

R. Parker, D. Graff, J. Kong, K. Chen, K. Maeda - 2011

5 papers in library cite

D. Hewlett, A. Lacoste, Llion Jones, Illia Polosukhin, A. Fandrianto, J. Han, M. Kelcey, D. Berthelot - 2016

4 papers in library cite

J. Lehmann, R. Isele, M. Jakob, A. Jentzsch, D. Kontokostas, P. N. Mendes, S. Hellmann, M. Morsey, P. V. Kleef, S. Auer - 2014

2 papers in library cite

R. Lebret, D. Grangier, Michael Auli - 2016

2 papers in library cite

C. Sauper, R. Barzilay - 2009

1 paper in library cites

H. T. Dang - 2005

1 paper in library cites

R. Mihalcea, P. Tarau - 2004

1 paper in library cites

A. Nenkova, L. Vanderwende - 2005

1 paper in library cites

L. Page, S. Brin, R. Motwani, T. Winograd - 1999

1 paper in library cites

J. Ramos - 2003

1 paper in library cites

Cited by

7

papers in your library

Cites

10

papers in your library

Read

on August 6, 2025

Your review

Tags

Paper Aliases

No aliases