2009
Cite Score
79
AI summary
This paper introduces curriculum learning, a training strategy for machine learning where examples are presented in a meaningful order, illustrating gradually more concepts and increasing complexity, resulting in improved generalization, faster convergence, and better local minima, with experiments on vision and language tasks, deep neural networks, and shape recognition.
Main Contributions
Abstract
Humans and animals learn much better when the examples are not randomly presented but organized in a meaningful order which illustrates gradually more concepts, and gradually more complex ones. Here, we formalize such training strategies in the context of machine learning, and call them "curriculum learning". In the context of recent research studying the difficulty of training in the presence of non-convex training criteria (for deep deterministic and stochastic neural networks), we explore curriculum learning in various set-ups. The experiments show that significant improvements in generalization can be achieved. We hypothesize that curriculum learning has both an effect on the speed of convergence of the training process to a minimum and, in the case of non-convex criteria, on the quality of the local minima obtained: curriculum learning can be seen as a particular form of continuation method (a general strategy for global optimization of non-convex functions).
Citation Graph
References [30]
Geoffrey Hinton, Ruslan Salakhutdinov - 2006
37 papers in library cite
Geoffrey E. Hinton, S. Osindero, Y. Teh - 2006
43 papers in library cite
Yoshua Bengio - 2009
25 papers in library cite
Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001
62 papers in library cite
Pascal Vincent, Hugo Larochelle, Yoshua Bengio, Pierre Antoine Manzagol - 2008
25 papers in library cite
Ronan Collobert, Jason Weston - 2008
32 papers in library cite
Yoshua Bengio, P. Lamblin, D. Popovici, Hugo Larochelle - 2006
33 papers in library cite
Jeffrey L. Elman - 1993
5 papers in library cite
Marc'aurelio Ranzato, C. Poultney, S. Chopra, Yann Lecun - 2006
20 papers in library cite
Hugo Larochelle, Dumitru Erhan, Aaron Courville, James Bergstra, Yoshua Bengio - 2007
13 papers in library cite
Jason Weston, F. Ratle, Ronan Collobert - 2008
10 papers in library cite
Pascal Vincent - 2009
5 papers in library cite
K. A. Krueger, Peter Dayan - 2009
1 paper in library cites
Holger Schwenk, Jean Luc Gauvain - 2002
14 papers in library cite
Marc'aurelio Ranzato, Y. Boureau, Yann Lecun - 2008
12 papers in library cite
Y. Freund, D. Haussler - 1992
8 papers in library cite
J. Hastad, M. Goldmann - 1991
7 papers in library cite
Ruslan Salakhutdinov, Geoffrey Hinton - 2007
5 papers in library cite
Ruslan Salakhutdinov, A. Mnih, Geoffrey E. Hinton - 2007
5 papers in library cite
Ruslan Salakhutdinov, Geoffrey E. Hinton - 2008
4 papers in library cite
G. B. Peterson - 2004
3 papers in library cite
D. Cohn, Zoubin Ghahramani, M. Jordan - 1995
2 papers in library cite
Sebastian Thrun - 1996
2 papers in library cite
Ziyi Wu - 1997
2 papers in library cite
D. Rohde, D. Plaut - 1999
2 papers in library cite
E. L. Allgower, K. Georg - 1980
2 papers in library cite
T. Coleman, Ziyi Wu - 1994
2 papers in library cite
B. F. Skinner - 1958
2 papers in library cite
I. Derenyi, T. Geszti, G. Gyorgyi - 1994
1 paper in library cites
T. D. Sanger - 1994
1 paper in library cites
Cited by
6
papers in your library
Cites
15
papers in your library
Read
on March 26, 2025
Your review
Tags
Paper Aliases
No aliases