2001

Evaluating Benchmark Problems by Random Guessing

Jürgen Schmidhuber, Sepp Hochreiter, Yoshua Bengio

citations

Cite Score

2

AI summary

This paper demonstrates that Random Weight Guessing (RG) often outperforms more complex methods on long-term dependency benchmark problems for recurrent neural networks, suggesting that some benchmark solutions are dense in weight space and questioning the difficulty of these benchmarks.

Main Contributions

  • Introduced Random Weight Guessing (RG) as a simple baseline for evaluating long-term dependency benchmark problems.
  • Showed that RG often outperforms previous, more complex learning algorithms on widely used benchmarks like latch and 2-sequence problems.
  • Proposed that RG can be used as a first test to evaluate the difficulty of benchmark problems for recurrent neural networks.
  • Argued that the success of RG indicates that solutions to many benchmarks are dense in weight space and correspond to 'flat minima'.
  • Suggested that future benchmarks should be designed to make simple random search algorithms fail.

Abstract

Numerous recent papers focus on standard recurrent nets' problems with tasks involving long-term dependencies. We solve such tasks by random weight guessing (RG). Although RG cannot be viewed as a reasonable learning algorithm we find that it often outperforms previous, more complex methods on widely used benchmark problems. One reason for RG's success is that the solutions to many of these benchmarks are dense in weight space. An analysis of cases in which RG works well versus those in which it does not can serve to improve the quality of benchmarks for novel recurrent net algorithms.

Citation Graph

Loading graph...

References [24]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Yoshua Bengio, Patrice Simard, Paolo Frasconi - 1994

31 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

5 papers in library cite

Sepp Hochreiter - 1991

18 papers in library cite

Jürgen Schmidhuber - 1992

8 papers in library cite

Ronald J. Williams - 1989

6 papers in library cite

S. Elhihi, Yoshua Bengio - 1996

6 papers in library cite

B. A. Pearlmutter - 1995

5 papers in library cite

M. C. Mozer - 1992

5 papers in library cite

Yoshua Bengio, Paolo Frasconi - 1994

4 papers in library cite

T. Lin, B. G. Horne, P. Tino, C. L. Giles - 1995

4 papers in library cite

J. Pollack - 1991

4 papers in library cite

C. B. Miller, C. L. Giles - 1993

3 papers in library cite

R. L. Watrous, G. M. Kuhn, J. E. Moody, S. J. Hanson, R. P. Lippman - 1992

3 papers in library cite

Yoshua Bengio, Paolo Frasconi, D. S. Touretzky, T. K. Leen - 1995

2 papers in library cite

M. Tomita - 1982

2 papers in library cite

L. Saul, M. Jordan - 1996

2 papers in library cite

P. Manolios, R. Fanelli - 1994

2 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

2 papers in library cite

S. I. Gallant - 1990

1 paper in library cites

Jürgen Schmidhuber - 1997

1 paper in library cites

K. S. Fu, T. L. Booth - 1975

1 paper in library cites

M. Jordan, Zoubin Ghahramani, L. Saul - 1997

1 paper in library cites

Kevin J. Lang - 1996

1 paper in library cites

Cited by

1

papers in your library

Cites

3

papers in your library

Read

on January 31, 2026

Your review

Tags

Paper Aliases

No aliases