2015
Cite Score
66
AI summary
This paper introduces a recurrent neural network architecture that attends over an external memory and is trained end-to-end. On synthetic question answering tasks it is competitive with Memory Networks while requiring less supervision, and on language modeling with the Penn TreeBank and Text8 datasets it performs comparably to RNNs and LSTMs. In both settings, performing multiple computational hops over the memory improves results.
Abstract
We introduce a neural network with a recurrent attention model over a possibly large external memory. The architecture is a form of Memory Network [23] but unlike the model in that work, it is trained end-to-end, and hence requires significantly less supervision during training, making it more generally applicable in realistic settings. It can also be seen as an extension of RNNsearch [2] to the case where multiple computational steps (hops) are performed per output symbol. The flexibility of the model allows us to apply it to tasks as diverse as (synthetic) question answering [22] and to language modeling. For the former our approach is competitive with Memory Networks, but with less supervision. For the latter, on the Penn TreeBank and Text8 datasets our approach demonstrates comparable performance to RNNs and LSTMs. In both cases we show that the key concept of multiple computational hops yields improved results.
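The idea of multiple computational hops over an external memory can be illustrated with a minimal sketch. The abstract does not give the equations, so everything here is an assumption for illustration: dot-product scoring, a softmax over memory slots, and an additive update of the query state after each hop. The names `memory_hop`, `multi_hop`, `keys`, and `values` are hypothetical, and no learned embeddings or training loop are shown. Because every step is a smooth function (soft attention rather than a hard lookup), a model built this way could be trained end-to-end with gradient descent.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def memory_hop(query, keys, values):
    """One hop: soft attention over memory slots, then a query update."""
    # Score every memory slot against the current query state.
    weights = softmax([dot(query, k) for k in keys])
    # Read-out is the attention-weighted sum of the value vectors.
    readout = [
        sum(w * v[d] for w, v in zip(weights, values))
        for d in range(len(query))
    ]
    # Assumed update rule: add the read-out to the query state.
    new_query = [q + r for q, r in zip(query, readout)]
    return new_query, weights


def multi_hop(query, keys, values, hops=3):
    """Apply several hops in sequence, keeping the attention of each hop."""
    state = list(query)
    attention_per_hop = []
    for _ in range(hops):
        state, weights = memory_hop(state, keys, values)
        attention_per_hop.append(weights)
    return state, attention_per_hop


# Toy memory with three slots of dimension 2 (made-up numbers).
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[0.5, 0.0], [0.0, 0.5], [0.2, 0.2]]
state, attention = multi_hop([1.0, 0.0], keys, values, hops=2)
```

Each hop re-reads the same memory with an updated query, so later hops can attend to slots that only become relevant after the first read-out has been folded into the state.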
References [25]
Sepp Hochreiter, Jürgen Schmidhuber - 1997
94 papers in library cite
D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014
59 papers in library cite
J. Chung, C. G. Gulcehre, Kyunghyun Cho, Yoshua Bengio - 2014
11 papers in library cite
K. Xu, Jimmy Lei Ba, R. Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, R. Zemel, Yoshua Bengio - 2015
12 papers in library cite
Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001
62 papers in library cite
M. P. Marcus, B. Santorini, Mary Ann Marcinkiewicz - 1993
22 papers in library cite
Alex Graves - 2013
27 papers in library cite
Wojciech Zaremba, Ilya Sutskever, Oriol Vinyals - 2014
22 papers in library cite
Alex Graves, G. Wayne, Ivo Danihelka - 2014
18 papers in library cite
M. Sundermeyer, R. Schluter, Hermann Ney - 2010
7 papers in library cite
K. Gregor, Ivo Danihelka, Alex Graves, D. J. Rezende, Daan Wierstra - 2015
5 papers in library cite
Jason Weston, S. Chopra, Antoine Bordes - 2015
18 papers in library cite
Jason Weston, Antoine Bordes, S. Chopra, Tomas Mikolov - 2015
11 papers in library cite
Armand Joulin, Tomas Mikolov - 2015
9 papers in library cite
J. Koutnik, K. Greff, Faustino Gomez, Jürgen Schmidhuber - 2014
4 papers in library cite
Tomas Mikolov, Armand Joulin, S. Chopra, M. Mathieu, Marc'aurelio Ranzato - 2015
8 papers in library cite
Tomas Mikolov - 2012
17 papers in library cite
J. Goodman - 2001
15 papers in library cite
S. Das, C. Giles, G. Sun - 1992
5 papers in library cite
J. Pollack - 1991
4 papers in library cite
M. C. Mozer, S. Das - 1993
2 papers in library cite
B. Peng, Z. L. Lu, H. Li, K. Wong - 2015
2 papers in library cite
K. Steinbuch, U. Piske - 1963
1 paper in library cites
C. G. Atkeson, S. Schaal - 1995
1 paper in library cites
W. K. Taylor - 1959
1 paper in library cites