2014

Neural Turing Machines

Alex Graves, Greg Wayne, Ivo Danihelka

Cite Score: 66

AI summary

This paper introduces the Neural Turing Machine (NTM), which couples a neural network to an external memory that it reads from and writes to through attentional processes. Because the whole system is differentiable, it can be trained with gradient descent. In the paper's experiments, NTMs infer simple algorithms such as copying, sorting, and associative recall from input and output examples, and they outperform LSTMs on these algorithmic tasks.

Main Contributions

  • Introduces the Neural Turing Machine (NTM) architecture, combining neural networks with external memory.
  • Demonstrates that NTMs can be trained end-to-end using gradient descent due to their differentiable nature.
  • Shows NTMs can infer simple algorithms such as copying, sorting, and associative recall.
  • Presents experimental results showing that NTMs learn faster and generalize better than LSTMs on the tested algorithmic tasks.
  • Introduces content-based and location-based addressing mechanisms for interacting with the external memory.
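The two addressing mechanisms in the last bullet can be illustrated with a minimal sketch. It assumes memory is a list of row vectors, that content-based addressing is a softmax over cosine similarities scaled by a key strength, and that location-based addressing is a circular shift of the resulting weights. The function names, the toy memory, and the parameter values are hypothetical and chosen only for illustration; the interpolation gate and sharpening step of the full addressing pipeline are left out.

```python
import math

def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)  # small constant avoids division by zero

def content_weights(memory, key, beta):
    """Content-based addressing: softmax over key-to-row similarities.

    beta (assumed name) is the key strength; larger values give a sharper focus.
    """
    scores = [beta * cosine_similarity(key, row) for row in memory]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def shift_weights(weights, shift_dist):
    """Location-based addressing: circular convolution with a shift distribution.

    shift_dist maps an integer offset to its probability (illustrative format).
    """
    n = len(weights)
    shifted = [0.0] * n
    for j, w in enumerate(weights):
        for offset, p in shift_dist.items():
            shifted[(j + offset) % n] += w * p
    return shifted

# Toy memory with three one-hot rows (illustrative data only).
memory = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
w_content = content_weights(memory, key=[0.0, 1.0, 0.0], beta=10.0)
w_shifted = shift_weights(w_content, {1: 1.0})  # move the focus one slot forward
```

Here the key matches the second row, so `w_content` concentrates on index 1, and the shift moves that focus to index 2. Both operations are smooth functions of their inputs, which is what allows gradients to flow through the addressing step.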

Abstract

We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing Machine or Von Neumann architecture but is differentiable end-to-end, allowing it to be efficiently trained with gradient descent. Preliminary results demonstrate that Neural Turing Machines can infer simple algorithms such as copying, sorting, and associative recall from input and output examples.
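How attention makes memory access differentiable can be shown with a short sketch. It assumes a read is a weighted sum of memory rows and a write is an erase step followed by an add step, both scaled by the attention weights. The names `read`, `write`, `erase`, and `add` and the toy values are hypothetical, and in a trained model the weights and vectors would come from the network rather than being fixed by hand.

```python
def read(memory, weights):
    """Read vector: attention-weighted sum of memory rows."""
    width = len(memory[0])
    return [sum(w * row[c] for w, row in zip(weights, memory)) for c in range(width)]

def write(memory, weights, erase, add):
    """Erase-then-add update; returns a new memory and leaves the input untouched.

    Each row i becomes row * (1 - w_i * erase) + w_i * add, elementwise.
    """
    return [
        [m * (1.0 - w * e) + w * a for m, e, a in zip(row, erase, add)]
        for w, row in zip(weights, memory)
    ]

# Toy example: three memory slots of width two, all zeros (illustrative data only).
memory = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
focus = [0.0, 1.0, 0.0]  # attention fully on the second slot
memory = write(memory, focus, erase=[1.0, 1.0], add=[0.5, -0.5])
r_sharp = read(memory, focus)            # reads back the written vector
r_soft = read(memory, [0.5, 0.5, 0.0])   # a blurred focus blends two slots
```

With a sharp focus the read returns exactly what was written; with a spread-out focus it returns a blend of slots. Since every slot takes part in each read and write to a degree set by its weight, there is no discrete choice of address to block gradient descent.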

