Cite Score
43
AI summary
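This paper introduces a second-order, Hessian-free optimization method and applies it to training deep auto-encoders. Without any pre-training, it obtains results superior to those reported by Hinton & Salakhutdinov (2006) on the same tasks, and it scales well to very large datasets. The method is not limited to auto-encoders or any specific model class. The paper also discusses "pathological curvature" as a possible explanation for the difficulty of deep learning, and how second-order optimization deals with it.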
Abstract
We develop a 2nd-order optimization method based on the "Hessian-free" approach, and apply it to training deep auto-encoders. Without using pre-training, we obtain results superior to those reported by Hinton & Salakhutdinov (2006) on the same tasks they considered. Our method is practical, easy to use, scales nicely to very large datasets, and isn't limited in applicability to autoencoders, or any specific model class. We also discuss the issue of "pathological curvature" as a possible explanation for the difficulty of deep-learning and how 2nd-order optimization, and our method in particular, effectively deals with it.
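As a rough illustration of what "Hessian-free" means in general, the sketch below takes one damped Newton-type step without ever forming the Hessian. It only needs Hessian-vector products, which it passes to conjugate gradient. This is a minimal sketch under stated assumptions, not the paper's implementation: the toy quadratic objective, the finite-difference product, the fixed damping value `lam`, and all function names are hypothetical choices made for this example.

```python
# Minimal Hessian-free sketch (illustrative only; not the paper's code).
# Assumptions: toy quadratic objective, finite-difference Hessian-vector
# products, a fixed damping constant `lam`, and plain conjugate gradient.

def grad(theta):
    """Gradient of the toy objective f(x, y) = 0.5 * (x**2 + 100 * y**2)."""
    x, y = theta
    return [x, 100.0 * y]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def hessian_vector_product(grad_fn, theta, v, eps=1e-6):
    """Approximate H v as (grad(theta + eps * v) - grad(theta)) / eps."""
    shifted = [t + eps * vi for t, vi in zip(theta, v)]
    g0, g1 = grad_fn(theta), grad_fn(shifted)
    return [(b - a) / eps for a, b in zip(g0, g1)]

def conjugate_gradient(matvec, b, iters=50, tol=1e-12):
    """Solve A p = b for symmetric positive-definite A, given only v -> A v."""
    p = [0.0] * len(b)
    r = list(b)              # residual b - A p, starting from p = 0
    d = list(r)              # current search direction
    rr = dot(r, r)
    bb = dot(b, b)
    for _ in range(iters):
        if rr <= tol * bb:   # relative residual small enough (or b is zero)
            break
        Ad = matvec(d)
        alpha = rr / dot(d, Ad)
        p = [pi + alpha * di for pi, di in zip(p, d)]
        r = [ri - alpha * adi for ri, adi in zip(r, Ad)]
        rr_new = dot(r, r)
        d = [ri + (rr_new / rr) * di for ri, di in zip(r, d)]
        rr = rr_new
    return p

def hf_step(grad_fn, theta, lam=1e-3):
    """One damped Newton-type step: solve (H + lam * I) p = -g, then move by p."""
    g = grad_fn(theta)

    def damped_matvec(v):
        hv = hessian_vector_product(grad_fn, theta, v)
        return [h + lam * vi for h, vi in zip(hv, v)]

    step = conjugate_gradient(damped_matvec, [-gi for gi in g])
    return [t + s for t, s in zip(theta, step)]

theta0 = [1.0, 1.0]
theta1 = hf_step(grad, theta0)   # lands close to the minimum at the origin
```

The toy objective is 100 times steeper along `y` than along `x`, a small-scale stand-in for badly conditioned curvature. Gradient descent with a step size small enough to stay stable along the steep direction makes only slow progress along the shallow one, whereas the curvature-aware step above rescales both directions and moves close to the minimum in a single update.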
Citation Graph
References [9]
Geoffrey Hinton, Ruslan Salakhutdinov - 2006
37 papers in library cite
Yann LeCun, Léon Bottou, G. B. Orr, Klaus-Robert Müller - 1998
20 papers in library cite
Yoshua Bengio, P. Lamblin, D. Popovici, Hugo Larochelle - 2006
33 papers in library cite
Dumitru Erhan, Yoshua Bengio, Aaron Courville, Pierre-Antoine Manzagol, Pascal Vincent, Samy Bengio - 2010
12 papers in library cite
N. N. Schraudolph - 2002
4 papers in library cite
B. Pearlmutter - 1994
4 papers in library cite
J. Nocedal, S. Wright - 1999
2 papers in library cite
S. Amari, H. Park, K. Fukumizu - 2000
1 paper in library cites
E. Mizutani, S. Dreyfus - 2008
1 paper in library cites
Cited by 12 papers in your library
Cites 4 papers in your library
Read on July 31, 2025