2001

Learning to Learn Using Gradient Descent

Sepp Hochreiter, A. Steven Younger, Peter R. Conwell

citations

Cite Score

61

AI summary

This paper introduces gradient descent to meta-learning using recurrent neural networks, enabling the derivation of complex, well-performing learning algorithms from scratch and demonstrating non-stationary time series prediction.

Main Contributions

  • Introduces gradient descent methods for meta-learning, making it feasible for large systems.
  • Utilizes recurrent neural networks and their learning routines as meta-learning systems.
  • Demonstrates the system's ability to derive complex, well-performing learning algorithms from scratch.
  • Shows that the approach performs non-stationary time series prediction.
  • Identifies LSTM as a good meta-learner for this approach, outperforming other recurrent architectures.

Abstract

This paper introduces the application of gradient descent methods to meta-learning. The concept of "meta-learning", i.e. of a system that improves or discovers a learning algorithm, has been of interest in machine learning for decades because of its appealing applications. Previous meta-learning approaches have been based on evolutionary methods and, therefore, have been restricted to small models with few free parameters. We make meta-learning in large systems feasible by using recurrent neural networks with their attendant learning routines as meta-learning systems. Our system derived complex well performing learning algorithms from scratch. In this paper we also show that our approach performs non-stationary time series prediction.

Citation Graph

Loading graph...

References [19]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Jeffrey L. Elman - 1990

23 papers in library cite

Paul J. Werbos - 1988

11 papers in library cite

Ronald J. Williams, David Zipser - 1992

8 papers in library cite

Rich Caruana - 1995

3 papers in library cite

R. Williams, David Zipser - 1989

8 papers in library cite

Sebastian Thrun, L. Pratt - 1998

3 papers in library cite

A. J. Robinson, F. Fallside - 1987

10 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

2 papers in library cite

D. Haussler - 1988

2 papers in library cite

T. P. Runarsson, M. T. Jonsson - 2000

1 paper in library cites

N. E. Cotter, Peter R. Conwell - 1990

1 paper in library cites

A. Steven Younger, Peter R. Conwell, N. E. Cotter - 1999

1 paper in library cites

P. Utgoff - 1986

1 paper in library cites

Jürgen Schmidhuber, J. Zhao, M. Wiering - 1997

1 paper in library cites

Jürgen Schmidhuber, J. Zhao, M. Wiering - 1996

1 paper in library cites

D. Chalmers - 1990

1 paper in library cites

H. Ellis - 1965

1 paper in library cites

Cited by

4

papers in your library

Cites

7

papers in your library

Read

on January 11, 2026

Your review

Tags

Paper Aliases

No aliases