Cite Score
87
AI summary
This paper quantifies the generality versus specificity of neurons in each layer of a deep convolutional neural network, reporting that transferability is negatively affected by specialization and optimization difficulties, and initializing a network with transferred features can improve generalization performance on ImageNet.
Main Contributions
Abstract
Many deep neural networks trained on natural images exhibit a curious phenomenon in common: on the first layer they learn features similar to Gabor filters and color blobs. Such first-layer features appear not to be specific to a particular dataset or task, but general in that they are applicable to many datasets and tasks. Features must eventually transition from general to specific by the last layer of the network, but this transition has not been studied extensively. In this paper we experimentally quantify the generality versus specificity of neurons in each layer of a deep convolutional neural network and report a few surprising results. Transferability is negatively affected by two distinct issues: (1) the specialization of higher layer neurons to their original task at the expense of performance on the target task, which was expected, and (2) optimization difficulties related to splitting networks between co-adapted neurons, which was not expected. In an example network trained on ImageNet, we demonstrate that either of these two issues may dominate, depending on whether features are transferred from the bottom, middle, or top of the network. We also document that the transferability of features decreases as the distance between the base task and target task increases, but that transferring features even from distant tasks can be better than using random features. A final surprising result is that initializing a network with transferred features from almost any number of layers can produce a boost to generalization that lingers even after fine-tuning to the target dataset.
Citation Graph
References [15]
Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012
71 papers in library cite
J. Deng, W. Dong, Richard Socher, L. J. Li, K. Li, Li Fei Fei - 2009
28 papers in library cite
Ross Girshick, J. Donahue, Trevor Darrell, Jitendra Malik - 2014
18 papers in library cite
Matthew D. Zeiler, Rob Fergus - 2014
15 papers in library cite
Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, Ross Girshick, S. Guadarrama, Trevor Darrell - 2014
12 papers in library cite
Geoffrey E. Hinton, N. Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2012
25 papers in library cite
Li Fei Fei, Rob Fergus, Pietro Perona - 2004
15 papers in library cite
J. Donahue, Y. Jia, Oriol Vinyals, J. Hoffman, N. Zhang, E. Tzeng, Trevor Darrell - 2014
15 papers in library cite
K. Jarrett, Koray Kavukcuoglu, Marc'aurelio Ranzato, Yann Lecun - 2009
20 papers in library cite
Rich Caruana - 1995
3 papers in library cite
P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, Rob Fergus, Yann Lecun - 2014
16 papers in library cite
Honglak Lee, R. Grosse, R. Ranganath, Andrew Y. Ng - 2009
12 papers in library cite
Quoc Le, A. Karpenko, J. Ngiam, A. Ng - 2011
4 papers in library cite
Yoshua Bengio - 2011
2 papers in library cite
Yoshua Bengio, F. Bastien, A. Bergeron, N. B. Lewandowski, T. Breuel, Y. Chherawala, M. Cisse, M. Cote, Dumitru Erhan, J. Eustache, Xavier Glorot, X. Muller, S. P. Lebeuf, Razvan Pascanu, S. Rifai, F. Savard, G. Sicard - 2011
1 paper in library cites
Cited by
2
papers in your library
Cites
12
papers in your library
Read
on October 24, 2025
Your review
Tags
Paper Aliases
No aliases