1997

LSTM Can Solve Hard Long Time Lag Problems

Sepp Hochreiter, Jürgen Schmidhuber

citations

Cite Score

46

AI summary

This paper introduces LSTM, a novel recurrent network algorithm, to solve a hard problem involving distributed, high-precision, continuous-valued representations and long minimal time lags, demonstrating that LSTM can solve non-trivial problems that other recurrent networks cannot.

Main Contributions

  • Showed that problems used to promote previous algorithms can be solved more quickly by random weight guessing than by the proposed algorithms.
  • Introduced LSTM to solve a hard problem that cannot be solved by random search or any other recurrent net algorithm.
  • Demonstrated LSTM's ability to work well with distributed representations and perform calculations involving high-precision, continuous values.
  • Showed that LSTM can solve tasks that are impossible to solve within reasonable time by other algorithms.
  • Evaluated on an adding problem where the task is to output the sum of the first components of those pairs that are marked by second components equal to 1.0.

Abstract

Standard recurrent nets cannot deal with long minimal time lags between relevant signals. Several recent NIPS papers propose alternative methods. We first show: problems used to promote various previous algorithms can be solved more quickly by random weight guessing than by the proposed algorithms. We then use LSTM, our own recent algorithm, to solve a hard problem that can neither be quickly solved by random search nor by any other recurrent net algorithm we are aware of.

Citation Graph

Loading graph...

References [20]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Yoshua Bengio, Patrice Simard, Paolo Frasconi - 1994

31 papers in library cite

Sepp Hochreiter - 1991

18 papers in library cite

A. J. Robinson, F. Fallside - 1987

10 papers in library cite

Jürgen Schmidhuber - 1992

8 papers in library cite

S. Elhihi, Yoshua Bengio - 1996

6 papers in library cite

Ronald J. Williams, J. Peng - 1990

5 papers in library cite

B. A. Pearlmutter - 1995

5 papers in library cite

M. C. Mozer - 1992

5 papers in library cite

Yoshua Bengio, Paolo Frasconi - 1994

4 papers in library cite

A. Cleeremans, D. S. Schreiber, J. L. Mcclelland - 1989

4 papers in library cite

T. Lin, B. G. Horne, P. Tino, C. L. Giles - 1995

4 papers in library cite

A. W. Smith, David Zipser - 1989

4 papers in library cite

J. Pollack - 1991

4 papers in library cite

C. B. Miller, C. L. Giles - 1993

3 papers in library cite

R. L. Watrous, G. M. Kuhn, J. E. Moody, S. J. Hanson, R. P. Lippman - 1992

3 papers in library cite

S. E. Fahlman - 1991

3 papers in library cite

Yoshua Bengio, Paolo Frasconi, D. S. Touretzky, T. K. Leen - 1995

2 papers in library cite

M. Tomita - 1982

2 papers in library cite

P. Manolios, R. Fanelli - 1994

2 papers in library cite

Cited by

5

papers in your library

Cites

2

papers in your library

Read

on June 23, 2025

Your review

Tags

Paper Aliases

No aliases