2015

End-to-End Memory Networks

S. Sukhbaatar, A. Szlam, Jason Weston, Rob Fergus

citations

Cite Score

66

AI summary

This paper introduces a novel recurrent neural network architecture with an external memory and attention mechanism, trained end-to-end. It is competitive with Memory Networks on question answering tasks, and demonstrates comparable performance to RNNs and LSTMs on language modeling using Penn TreeBank and Text8 datasets.

Main Contributions

  • Introduces an end-to-end memory network with recurrent attention model.
  • Demonstrates the model's applicability to question answering and language modeling tasks.
  • Achieves competitive performance on question answering with less supervision than Memory Networks.
  • Shows comparable language modeling performance to RNNs and LSTMs on Penn TreeBank and Text8.
  • Highlights the improved results from multiple computational hops.

Abstract

We introduce a neural network with a recurrent attention model over a possibly large external memory. The architecture is a form of Memory Network [23] but unlike the model in that work, it is trained end-to-end, and hence requires significantly less supervision during training, making it more generally applicable in realistic settings. It can also be seen as an extension of RNNsearch [2] to the case where multiple computational steps (hops) are performed per output symbol. The flexibility of the model allows us to apply it to tasks as diverse as (synthetic) question answering [22] and to language modeling. For the former our approach is competitive with Memory Networks, but with less supervision. For the latter, on the Penn TreeBank and Text8 datasets our approach demonstrates comparable performance to RNNs and LSTMs. In both cases we show that the key concept of multiple computational hops yields improved results.

Citation Graph

Loading graph...

References [25]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014

59 papers in library cite

J. Chung, C. G. Gulcehre, Kyunghyun Cho, Yoshua Bengio - 2014

11 papers in library cite

K. Xu, Jimmy Lei Ba, R. Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, R. Zemel, Yoshua Bengio - 2015

12 papers in library cite

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

M. P. Marcus, B. Santorini, Mary Ann Marcinkiewicz - 1993

22 papers in library cite

Alex Graves - 2013

27 papers in library cite

Wojciech Zaremba, Ilya Sutskever, Oriol Vinyals - 2014

22 papers in library cite

Alex Graves, G. Wayne, Ivo Danihelka - 2014

18 papers in library cite

M. Sundermeyer, R. Schluter, Hermann Ney - 2010

7 papers in library cite

K. Gregor, Ivo Danihelka, Alex Graves, D. J. Rezende, Daan Wierstra - 2015

5 papers in library cite

Jason Weston, S. Chopra, Antoine Bordes - 2015

18 papers in library cite

Jason Weston, Antoine Bordes, S. Chopra, Tomas Mikolov - 2015

11 papers in library cite

Armand Joulin, Tomas Mikolov - 2015

9 papers in library cite

J. Koutnik, K. Greff, Faustino Gomez, Jürgen Schmidhuber - 2014

4 papers in library cite

Tomas Mikolov, Armand Joulin, S. Chopra, M. Mathieu, Marc'aurelio Ranzato - 2015

8 papers in library cite

Tomas Mikolov - 2012

17 papers in library cite

J. Goodman - 2001

15 papers in library cite

J. Pollack - 1991

4 papers in library cite

M. C. Mozer, S. Das - 1993

2 papers in library cite

B. Peng, Z. L. Lu, H. Li, K. Wong - 2015

2 papers in library cite

K. Steinbuch, U. Piske - 1963

1 paper in library cites

C. G. Atkeson, S. Schaal - 1995

1 paper in library cites

W. K. Taylor - 1959

1 paper in library cites

Cited by

18

papers in your library

Cites

16

papers in your library

Read

on August 2, 2025

Your review

Tags

Paper Aliases

No aliases