2007

Training Recurrent Networks by EVOLINO

Jürgen Schmidhuber, Daan Wierstra, Matteo Gagliolo, F. J. Gomez

citations

Cite Score

16

AI summary

This paper introduces Evolino, a novel method for training recurrent neural networks by evolving weights to hidden nodes and using linear methods for optimal linear mappings, demonstrating its ability to solve tasks unlearnable by Echo State nets and outperform gradient descent RNNs.

Main Contributions

  • Introduces Evolino, a novel method for training RNNs that combines neuroevolution for hidden unit weights and analytical linear methods for output mappings.
  • Presents PI-Evolino, which uses pseudoinverse-based linear regression to minimize summed squared error for output weights.
  • Presents Evoke, which uses quadratic programming to maximize the margin, creating the first evolutionary recurrent support vector machines.
  • Demonstrates that Evolino-based LSTM can solve context-sensitive language tasks and multiple superimposed sine wave generation tasks that Echo State Networks (ESNs) cannot.
  • Achieves higher accuracy in continuous function generation tasks compared to conventional gradient descent RNNs, including gradient-based LSTM, and generalizes well from minimal training data.

Abstract

In recent years, gradient-based LSTM recurrent neural networks (RNNs) solved many previously RNN-unlearnable tasks. Sometimes, however, gradient information is of little use for training RNNs, due to numerous local minima. For such cases, we present a novel method: EVOlution of systems with LINear Outputs (Evolino). Evolino evolves weights to the nonlinear, hidden nodes of RNNs while computing optimal linear mappings from hidden state to output, using methods such as pseudo-inverse-based linear regression. If we instead use quadratic programming to maximize the margin, we obtain the first evolutionary recurrent support vector machines. We show that Evolino-based LSTM can solve tasks that Echo State nets (Jaeger, 2004a) cannot and achieves higher accuracy in certain continuous function generation tasks than conventional gradient descent RNNs, including gradient-based LSTM.

Citation Graph

Loading graph...

References [54]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Felix A. Gers, Jürgen Schmidhuber, Fred Cummins - 2000

13 papers in library cite

Herbert Jaeger, Harald Haas - 2004

4 papers in library cite

Sepp Hochreiter, Yoshua Bengio, Paolo Frasconi, Jürgen Schmidhuber - 2001

16 papers in library cite

Paul J. Werbos - 1988

11 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

5 papers in library cite

Jürgen Schmidhuber, Sepp Hochreiter, Yoshua Bengio - 2001

1 paper in library cites

Alex Graves, Jürgen Schmidhuber - 2005

14 papers in library cite

D. E. Rumelhart, J. L. Mcclelland, P. R. Group - 1986

15 papers in library cite

P. Werbos - 1974

14 papers in library cite

Sepp Hochreiter - 1991

18 papers in library cite

A. J. Robinson, F. Fallside - 1987

10 papers in library cite

F. Gers, N. Schraudolph, Jürgen Schmidhuber - 2002

9 papers in library cite

V. Vapnik - 1995

9 papers in library cite

Ronald J. Williams - 1989

6 papers in library cite

B. A. Pearlmutter - 1995

5 papers in library cite

J. H. Holland - 1975

4 papers in library cite

Felix A. Gers, Jürgen Schmidhuber - 2001

3 papers in library cite

H. P. Schwefel - 1995

2 papers in library cite

J. A. P. Ortiz, F. Gers, D. Eck, Jürgen Schmidhuber - 2002

2 papers in library cite

K. Muller, A. Smola, G. Ratsch, B. Scholkopf, J. Kohlmorgen, V. Vapnik - 1997

2 papers in library cite

Herbert Jaeger - 2001

2 papers in library cite

W. Maass, T. Natschlager, H. Markram - 2002

1 paper in library cites

R. Penrose - 1955

1 paper in library cites

Xingcheng Yao - 1993

1 paper in library cites

F. J. Gomez, Jürgen Schmidhuber - 2005

1 paper in library cites

G. Miller, P. Todd, S. Hedge - 1989

1 paper in library cites

H. Shimodaira, K. I. Noma, M. Nakai, S. Sagayama - 2002

1 paper in library cites

Jürgen Schmidhuber - 1990

1 paper in library cites

D. E. Moriarty, R. Miikkulainen - 1996

1 paper in library cites

Jürgen Schmidhuber, Matteo Gagliolo, Daan Wierstra, Faustino Gomez - 2006

1 paper in library cites

Jürgen Schmidhuber, Daan Wierstra, F. J. Gomez - 2005

1 paper in library cites

O. Miglino, H. Lund, S. Nolfi - 1995

1 paper in library cites

Faustino Gomez, Jürgen Schmidhuber - 2005

1 paper in library cites

M. A. Potter, K. A. D. Jong - 1995

1 paper in library cites

K. Sims - 1994

1 paper in library cites

T. V. D. Zant, V. Becanovic, K. Ishii, H. U. Kobialka, P. G. Ploger - 2004

1 paper in library cites

J. Salomon, S. King, M. Osborne - 2002

1 paper in library cites

S. Nolfi, D. Floreano, O. Miglino, F. Mondada - 1994

1 paper in library cites

K. Ishii, T. V. D. Zant, V. Becanovic, P. G. Ploger - 2004

1 paper in library cites

Jürgen Schmidhuber, F. Gers, D. Eck - 2002

1 paper in library cites

Daan Wierstra, F. J. Gomez, Jürgen Schmidhuber - 2005

1 paper in library cites

S. Mukherjee, E. Osuna, F. Girosi - 1997

1 paper in library cites

H. P. Schwefel - 1977

1 paper in library cites

M. C. Mackey, L. Glass - 1977

1 paper in library cites

E. P. Maillard, D. Gueriot - 1997

1 paper in library cites

J. Suykens, J. Vandewalle - 2000

1 paper in library cites

F. J. Gomez - 2003

1 paper in library cites

B. M. Yamauchi, R. D. Beer - 1994

1 paper in library cites

Faustino Gomez, R. Miikkulainen - 1999

1 paper in library cites

D. E. Moriarty - 1997

1 paper in library cites

H. T. Siegelmann, E. D. Sontag - 1991

1 paper in library cites

Cited by

1

papers in your library

Cites

10

papers in your library

Read

on January 29, 2026

Your review

Tags

Paper Aliases

No aliases