2016
Cite Score: 88
AI summary
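This paper analyzes signal propagation in deep residual networks and shows that, when both the skip connections and the after-addition activation are identity mappings, forward and backward signals can propagate directly from one block to any other. The authors propose a new residual unit that makes training easier and improves generalization, and they report improved results with a 1001-layer ResNet on CIFAR-10 (4.62% error) and CIFAR-100 and with a 200-layer ResNet on ImageNet.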
Abstract
Deep residual networks [1] have emerged as a family of extremely deep architectures showing compelling accuracy and nice convergence behaviors. In this paper, we analyze the propagation formulations behind the residual building blocks, which suggest that the forward and backward signals can be directly propagated from one block to any other block, when using identity mappings as the skip connections and after-addition activation. A series of ablation experiments support the importance of these identity mappings. This motivates us to propose a new residual unit, which makes training easier and improves generalization. We report improved results using a 1001-layer ResNet on CIFAR-10 (4.62% error) and CIFAR-100, and a 200-layer ResNet on ImageNet. Code is available at: https://github.com/KaimingHe/resnet-1k-layers.
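The direct-propagation claim in the abstract can be illustrated with a small numerical sketch. With an identity skip connection and no activation after the addition, each unit computes x_{l+1} = x_l + F(x_l), so unrolling gives x_L = x_l + sum of F(x_i) for i = l, ..., L-1. The code below is a hedged illustration, not the authors' implementation: it assumes dense matrices in place of convolutions, omits batch normalization, and all names (`residual_branch`, `forward`, `dim`, `depth`) are hypothetical.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def residual_branch(x, w1, w2):
    # Residual function F(x) with the activation placed before each weight
    # layer (pre-activation ordering). Batch normalization is omitted and
    # dense matrices stand in for convolutions; both are assumptions.
    return relu(relu(x) @ w1) @ w2

def forward(x0, weights):
    # Stack units with identity skips and no activation after the addition.
    xs, residuals = [x0], []
    for w1, w2 in weights:
        f = residual_branch(xs[-1], w1, w2)
        residuals.append(f)
        xs.append(xs[-1] + f)  # x_{l+1} = x_l + F(x_l)
    return xs, residuals

rng = np.random.default_rng(0)
dim, depth = 8, 5
weights = [
    (rng.normal(scale=0.1, size=(dim, dim)),
     rng.normal(scale=0.1, size=(dim, dim)))
    for _ in range(depth)
]
x0 = rng.normal(size=(4, dim))
xs, residuals = forward(x0, weights)

# Unrolled form: the last feature equals any earlier feature x_l plus the
# sum of the residual outputs of the units from l up to the last one.
l = 2
unrolled = xs[l] + sum(residuals[l:])
print(np.allclose(xs[-1], unrolled))
```

Because the deep feature is an earlier feature plus a sum of residuals, the gradient with respect to x_l contains a term that is passed back unchanged from x_L, which is the sense in which signals propagate "directly" between blocks.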
References [23]
K. He, X. Zhang, S. Ren, Jian Sun - 2016
20 papers in library cite
K. Simonyan, Andrew Zisserman - 2014
20 papers in library cite
Sepp Hochreiter, Jürgen Schmidhuber - 1997
94 papers in library cite
Christian Szegedy, Wei Liu, Y. Jia, P. Sermanet, S. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich - 2015
20 papers in library cite
S. Ioffe, Christian Szegedy - 2015
18 papers in library cite
T. Y. Lin, M. Maire, S. Belongie, James Hays, Pietro Perona, D. Ramanan, Piotr Dollár, C. L. Zitnick - 2014
14 papers in library cite
Zbigniew Wojna - 2015
5 papers in library cite
Alex Krizhevsky - 2009
27 papers in library cite
V. Nair, Geoffrey E. Hinton - 2010
18 papers in library cite
K. He, X. Zhang, S. Ren, Jian Sun - 2015
10 papers in library cite
Yann LeCun, B. Boser, John S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel - 1989
24 papers in library cite
Geoffrey E. Hinton, N. Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2012
25 papers in library cite
M. Lin, Qiang Chen, Shuicheng Yan - 2013
11 papers in library cite
D. A. Clevert, Thomas Unterthiner, Sepp Hochreiter - 2016
2 papers in library cite
R. K. Srivastava, K. Greff, Jürgen Schmidhuber - 2015
6 papers in library cite
Chen Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, Zhuowen Tu - 2014
8 papers in library cite
O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Zhiheng Huang, A. Karpathy, A. Khosla, M. Bernstein - 2014
18 papers in library cite
Christian Szegedy, S. Ioffe, Vincent Vanhoucke, A. A. Alemi - 2017
3 papers in library cite
A. Romero, Nicolas Ballas, S. E. Kahou, A. Chassang, C. Gatta, Yoshua Bengio - 2015
5 papers in library cite
R. K. Srivastava, K. Greff, Jürgen Schmidhuber - 2015
6 papers in library cite
D. Mishkin, J. Matas - 2016
2 papers in library cite
J. T. Springenberg, Alexey Dosovitskiy, T. Brox, M. Riedmiller - 2014
4 papers in library cite
B. Graham - 2014
2 papers in library cite
Cited by: 4 papers in your library
Cites: 21 papers in your library