2014

Deeply-Supervised Nets

Chen-Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, Zhuowen Tu

Cite Score

65

AI summary

This paper introduces Deeply-Supervised Nets (DSN), which minimize classification error and make hidden-layer learning more transparent by adding a "companion objective" to the hidden layers. The method achieves state-of-the-art results on the MNIST, CIFAR-10, CIFAR-100, and SVHN datasets.

Main Contributions

  • Introduced Deeply-Supervised Nets (DSN) for direct and early supervision of hidden layers and the output layer.
  • Proposed a "companion objective" for individual hidden layers as an additional constraint/regularization.
  • Showed that the formulation significantly improves the performance of existing supervised deep learning methods.
  • Provided justification for the formulation using stochastic gradient techniques and demonstrated improved convergence rates.
  • Achieved state-of-the-art classification error on benchmark datasets including MNIST, CIFAR-10, CIFAR-100, and SVHN.

Abstract

Our proposed deeply-supervised nets (DSN) method minimizes classification error while making the learning process of hidden layers direct and transparent. We attempt to boost classification performance by studying a new formulation in deep networks. We examine three aspects of convolutional neural network (CNN) style architectures: (1) transparency of the intermediate layers to the overall classification; (2) discriminativeness and robustness of learned features, especially in the early layers; (3) effectiveness of training in the presence of exploding and vanishing gradients. We introduce a "companion objective" for the individual hidden layers, in addition to the overall objective at the output layer (a strategy distinct from layer-wise pre-training). We extend techniques from stochastic gradient methods to analyze our algorithm. The advantage of our method is evident, and our experimental results on benchmark datasets show significant performance gains over existing methods (e.g. all state-of-the-art results on MNIST, CIFAR-10, CIFAR-100, and SVHN).
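
The idea of adding a companion objective at hidden layers to the overall objective at the output layer can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a softmax cross-entropy loss, swapped in here for illustration, and hypothetical auxiliary classifiers that map each supervised hidden layer to class scores. The names `deeply_supervised_loss`, `hidden_logits`, and `alphas` are illustrative and do not come from the paper.

```python
import math


def softmax_cross_entropy(logits, label):
    """Cross-entropy of one example given raw class scores (assumed loss)."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


def deeply_supervised_loss(output_logits, hidden_logits, label, alphas):
    """Output-layer loss plus one weighted companion loss per hidden layer.

    hidden_logits[m] holds class scores from a hypothetical auxiliary
    classifier attached to hidden layer m; alphas[m] is its assumed weight.
    """
    if len(hidden_logits) != len(alphas):
        raise ValueError("need one weight per companion classifier")
    output_loss = softmax_cross_entropy(output_logits, label)
    companion_loss = sum(
        a * softmax_cross_entropy(h, label)
        for a, h in zip(alphas, hidden_logits)
    )
    return output_loss + companion_loss


# Toy scores for a 3-class problem with two supervised hidden layers.
output_logits = [2.0, 0.5, -1.0]
hidden_logits = [[0.2, 0.1, 0.0], [1.0, 0.3, -0.5]]

total = deeply_supervised_loss(output_logits, hidden_logits, label=0, alphas=[0.3, 0.3])
plain = deeply_supervised_loss(output_logits, [], label=0, alphas=[])
print(f"output-only loss: {plain:.4f}, with companion terms: {total:.4f}")
```

With the companion terms in place, each supervised hidden layer receives a gradient from its own classifier as well as the gradient backpropagated from the output layer. Passing no companion classifiers recovers the ordinary output-only objective.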
