2009

Curriculum Learning

Yoshua Bengio, J. Louradour, Ronan Collobert, Jason Weston

AI summary

This paper introduces curriculum learning, a training strategy in which examples are presented in a meaningful order that gradually introduces more concepts and more complex ones. Experiments on vision (shape recognition) and language tasks with deep neural networks show improved generalization. The authors hypothesize that a curriculum also speeds convergence and, for non-convex training criteria, leads to better local minima.

Main Contributions

  • Formalizes curriculum learning as a training strategy in machine learning.
  • Demonstrates experimentally that curriculum learning can significantly improve generalization.
  • Hypothesizes that curriculum learning speeds convergence and, for non-convex criteria, helps find better local minima.
  • Introduces simple multi-stage curriculum strategies for vision and language tasks.
  • Shows how curriculum learning can act as a regularizer.
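
To make the idea of a multi-stage curriculum concrete, here is a minimal sketch, not the authors' implementation. It assumes that a per-example difficulty score is available (the paper's own difficulty criteria are not given on this page) and that each stage trains on a growing pool that ends with the full training set. The function name `curriculum_stages` and its parameters are hypothetical.

```python
from typing import Callable, Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def curriculum_stages(
    examples: Sequence[T],
    difficulty: Callable[[T], float],  # assumed scoring function, lower = easier
    n_stages: int,
) -> Iterator[List[T]]:
    """Yield growing training pools, easiest examples first.

    The last pool always contains every example, so the final stage
    trains on the full target set.
    """
    if n_stages < 1:
        raise ValueError("n_stages must be at least 1")
    ranked = sorted(examples, key=difficulty)  # stable sort: easy -> hard
    for stage in range(1, n_stages + 1):
        # Ceiling of len(ranked) * stage / n_stages using integer arithmetic.
        cutoff = -(-len(ranked) * stage // n_stages)
        yield ranked[:cutoff]


if __name__ == "__main__":
    # Toy data: sentence length stands in for difficulty (an assumption).
    sentences = ["a b c", "a", "a b c d e", "a b", "a b c d"]
    for i, pool in enumerate(curriculum_stages(sentences, len, n_stages=2), 1):
        print(f"stage {i}: train on {len(pool)} examples")
```

In a real training loop, each pool would be shuffled and fed to the optimizer for some number of epochs before moving to the next stage. How long to stay in each stage, and whether earlier examples are kept or dropped, are design choices this sketch does not settle.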

Abstract

Humans and animals learn much better when the examples are not randomly presented but organized in a meaningful order which illustrates gradually more concepts, and gradually more complex ones. Here, we formalize such training strategies in the context of machine learning, and call them "curriculum learning". In the context of recent research studying the difficulty of training in the presence of non-convex training criteria (for deep deterministic and stochastic neural networks), we explore curriculum learning in various set-ups. The experiments show that significant improvements in generalization can be achieved. We hypothesize that curriculum learning has both an effect on the speed of convergence of the training process to a minimum and, in the case of non-convex criteria, on the quality of the local minima obtained: curriculum learning can be seen as a particular form of continuation method (a general strategy for global optimization of non-convex functions).
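
The continuation-method view mentioned at the end of the abstract can be written schematically as follows. This is a generic sketch of a continuation method; the symbols $C_\lambda$, $\theta$, and $\lambda_k$ are notation assumed here and do not appear on this page.

```latex
% A family of training criteria indexed by a smoothing parameter \lambda:
\[
  C_\lambda(\theta), \quad \lambda \in [0, 1], \qquad
  C_0 \text{ is a smoothed, easy-to-optimize criterion}, \quad
  C_1 \text{ is the criterion of actual interest}.
\]
% Track a local minimizer while \lambda is increased step by step:
\[
  \theta^{*}_{\lambda_{k+1}} \;=\; \text{local minimizer of } C_{\lambda_{k+1}}
  \text{ reached by starting from } \theta^{*}_{\lambda_k},
  \qquad 0 = \lambda_0 < \lambda_1 < \dots < \lambda_K = 1 .
\]
```

Under this reading, training on easier examples plays the role of small $\lambda$, and the full training distribution corresponds to $\lambda = 1$.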
