2015

A Neural Attention Model for Abstractive Sentence Summarization

Alexander M. Rush, S. Chopra, Jason Weston

citations

Cite Score

68

AI summary

This paper introduces an attention-based neural network model for abstractive sentence summarization, trained end-to-end, and achieving significant performance gains on the DUC-2004 shared task compared with several strong baselines. The model leverages a local attention mechanism to generate summaries conditioned on input sentences.

Main Contributions

  • Introduces a fully data-driven approach to abstractive sentence summarization.
  • Utilizes a local attention-based neural network model for generating summaries.
  • Demonstrates that the model can be trained end-to-end and scales to large training datasets.
  • Achieves significant performance gains on the DUC-2004 shared task compared with several strong baselines.
  • Explores the use of additional features to trade-off the abstractive/extractive tendency of the system

Abstract

Summarization based on text extraction is inherently limited, but generation-style abstractive methods have proven challenging to build. In this work, we propose a fully data-driven approach to abstractive sentence summarization. Our method utilizes a local attention-based model that generates each word of the summary conditioned on the input sentence. While the model is structurally simple, it can easily be trained end-to-end and scales to a large amount of training data. The model shows significant performance gains on the DUC-2004 shared task compared with several strong baselines.

Citation Graph

Loading graph...

References [28]

Sort:
Filter:

D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014

59 papers in library cite

Kyunghyun Cho, B. V. Merrienboer, C. G. Gulcehre, D. Bahdanau, F. Bougares, Holger Schwenk, Yoshua Bengio - 2014

38 papers in library cite

Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014

58 papers in library cite

Chin Yew Lin - 2004

9 papers in library cite

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Geoffrey E. Hinton, N. Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2012

25 papers in library cite

N. Kalchbrenner, Phil Blunsom - 2013

27 papers in library cite

T. Luong, Ilya Sutskever, Quoc V. Le, Oriol Vinyals, Wojciech Zaremba - 2014

14 papers in library cite

P. Koehn, H. Hoang, Alexandra Birch, Chris Callison Burch, M. Federico, N. Bertoldi, B. Cowan, W. Shen, C. Moran, R. Zens, C. Dyer, O. Bojar, A. Constantin, E. Herbst - 2007

8 papers in library cite

Christopher D. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. J. Bethard, D. Mcclosky - 2014

6 papers in library cite

R. Parker, D. Graff, J. Kong, K. Chen, K. Maeda - 2011

5 papers in library cite

B. Dorr, D. Zajic, Richard Schwartz - 2003

3 papers in library cite

C. Napoles, M. Gormley, B. V. Durme - 2012

2 papers in library cite

D. Zajic, B. J. Dorr, Richard Schwartz - 2004

2 papers in library cite

P. Over, H. Dang, D. Harman - 2007

2 papers in library cite

M. Banko, V. O. Mittal, M. J. Witbrock - 2000

2 papers in library cite

Christopher D. Manning, P. Raghavan, H. Schtze - 2008

2 papers in library cite

F. J. Och - 2003

2 papers in library cite

K. Filippova, Y. Altun - 2013

2 papers in library cite

T. Cohn, Mirella Lapata - 2008

2 papers in library cite

K. Woodsend, Y. Feng, Mirella Lapata - 2010

2 papers in library cite

H. D. Iii, D. Marcu - 2002

1 paper in library cites

J. Clarke, Mirella Lapata - 2008

1 paper in library cites

Missing year

J. Clarke, Mirella Lapata

1 paper in library cites

S. Wubben, A. V. D. Bosch, E. Krahmer - 2012

1 paper in library cites

K. Knight, D. Marcu - 2002

1 paper in library cites

H. Jing - 2002

1 paper in library cites

Cited by

13

papers in your library

Cites

8

papers in your library

Read

on November 26, 2025

Your review

Tags

Paper Aliases

No aliases