2010

Deep Learning via Hessian-Free Optimization

James Martens

citations

Cite Score

43

AI summary

This paper introduces a Hessian-free optimization method for training deep auto-encoders. It achieves results superior to existing methods without pre-training on the same tasks, scaling well to large datasets. The method addresses pathological curvature, offering a practical and effective approach to deep learning optimization.

Main Contributions

  • Introduces a 2nd-order optimization method based on the Hessian-free approach.
  • Achieves superior results to existing methods on deep auto-encoder training tasks without pre-training.
  • Scales effectively to very large datasets.
  • Discusses pathological curvature as an explanation for deep-learning difficulties.
  • Provides a practical and easy-to-use optimization method for deep learning.

Abstract

We develop a 2nd-order optimization method based on the "Hessian-free" approach, and apply it to training deep auto-encoders. Without using pre-training, we obtain results superior to those reported by Hinton & Salakhutdinov (2006) on the same tasks they considered. Our method is practical, easy to use, scales nicely to very large datasets, and isn't limited in applicability to autoencoders, or any specific model class. We also discuss the issue of "pathological curvature" as a possible explanation for the difficulty of deep-learning and how 2nd-order optimization, and our method in particular, effectively deals with it.

Citation Graph

Loading graph...

References [9]

Sort:
Filter:

Geoffrey Hinton, Ruslan Salakhutdinov - 2006

37 papers in library cite

Yann Lecun, Leon Bottou, G. B. Orr, Klaus Robert Muller - 1998

20 papers in library cite

Yoshua Bengio, P. Lamblin, D. Popovici, Hugo Larochelle - 2006

33 papers in library cite

Dumitru Erhan, Yoshua Bengio, Aaron Courville, Pierre Antoine Manzagol, Pascal Vincent, Samy Bengio - 2010

12 papers in library cite

N. N. Schraudolph - 2002

4 papers in library cite

B. Pearlmutter - 1994

4 papers in library cite

J. Nocedal, S. Wright - 1999

2 papers in library cite

S. Amari, H. Park, K. Fukumizu - 2000

1 paper in library cites

Cited by

12

papers in your library

Cites

4

papers in your library

Read

on July 31, 2025

Your review

Tags

Paper Aliases

No aliases