1992
Cite Score
36
AI summary
This paper presents a comprehensive analysis of gradient-based learning algorithms for recurrent neural networks, categorizing them by exactness and operational mode, and provides detailed computational complexity analyses for various architectures and tasks, including the impact of teacher forcing.
Main Contributions
Abstract
In this chapter we describe and analyze several gradient-based algorithms for supervised learning in recurrent networks, focusing on discrete-time, feedforward networks having connections that generally include a delay of one time step. We categorize algorithms based on whether they compute the exact gradient or an approximation, and whether they are suitable for continually operating networks or only for epochwise tasks. We analyze the computational complexity of these algorithms in terms of both space and time requirements for general cases and worst-case scenarios. Specific architectures with simplified delay structures and the role of 'teacher forcing' in learning dynamics are also discussed.
Citation Graph
References [44]
D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams - 1986
46 papers in library cite
Fernando J. Pineda - 1987
5 papers in library cite
Paul J. Werbos - 1988
11 papers in library cite
R. Williams, David Zipser - 1989
8 papers in library cite
R. Watrous, Lokendra Shastri - 1987
4 papers in library cite
P. Werbos - 1974
14 papers in library cite
Ronald J. Williams - 1989
6 papers in library cite
L. B. Almeida - 1987
5 papers in library cite
Ronald J. Williams, J. Peng - 1990
5 papers in library cite
B. A. Pearlmutter - 1989
5 papers in library cite
Jürgen Schmidhuber - 1992
4 papers in library cite
B. Widrow, S. D. Stearns - 1985
4 papers in library cite
J. A. E. Bryson, Y. C. Ho - 1969
4 papers in library cite
Fernando J. Pineda - 1988
4 papers in library cite
A. Cleeremans, D. S. Schreiber, J. L. Mcclelland - 1989
4 papers in library cite
A. W. Smith, David Zipser - 1989
4 papers in library cite
G. Kuhn - 1987
3 papers in library cite
M. C. Mozer - 1989
3 papers in library cite
K. Doya, S. Yoshizawa - 1989
3 papers in library cite
Michael I. Jordan - 1986
3 papers in library cite
R. Williams - 1990
2 papers in library cite
J. Bachrach - 1988
2 papers in library cite
L. E. J. Mcbride, K. S. Narendra - 1965
2 papers in library cite
R. Rohwer - 1990
2 papers in library cite
B. Baird - 1989
1 paper in library cites
M. Gherrity - 1989
1 paper in library cites
M. Sato - 1990
1 paper in library cites
D. Arnold, D. A. Robinson - 1989
1 paper in library cites
M. Sato - 1990
1 paper in library cites
David Zipser - 1989
1 paper in library cites
Yann Lecun - 1988
1 paper in library cites
Ronald J. Williams, David Zipser - 1989
1 paper in library cites
Jeffrey L. Elman - 1988
1 paper in library cites
K. S. Narendra, K. Parthasarathy - 1990
1 paper in library cites
F. S. Tsung - 1990
1 paper in library cites
S. Lockery, Y. Fang, T. Sejnowski - 1990
1 paper in library cites
T. J. Anastasio - 1991
1 paper in library cites
A. Waibel, T. Hanazawa, Geoffrey Hinton, K. Shikano, K. Lang - 1987
1 paper in library cites
Fernando J. Pineda - 1989
1 paper in library cites
F. S. Tsung, G. W. Cottrell, A. Selverston - 1990
1 paper in library cites
K. S. Narendra, A. M. Annaswamy - 1989
1 paper in library cites
S. W. Piche - 1994
1 paper in library cites
A. J. Robinson, F. Fallside - 1987
1 paper in library cites
R. Rohwer, S. Renals - 1989
1 paper in library cites
Cited by
8
papers in your library
Cites
6
papers in your library
Read
on February 8, 2026
Your review
Tags
Paper Aliases
No aliases