2016

Identity Mappings in Deep Residual Networks

K. He, X. Zhang, S. Ren, Jian Sun

Cite Score: 88

AI summary

This paper analyzes deep residual networks and shows that using identity mappings for both the skip connections and the after-addition activation lets forward and backward signals propagate directly between blocks. It proposes a new residual unit that makes training easier and improves generalization, and reports improved results on CIFAR-10/100 and ImageNet with very deep ResNets.

Main Contributions

  • Demonstrates the importance of identity mappings in deep residual networks for direct signal propagation.
  • Proposes a new residual unit that makes training easier and improves generalization.
  • Achieves improved results on CIFAR-10 with a 1001-layer ResNet (4.62% error).
  • Achieves improved results on CIFAR-100 with a 1001-layer ResNet.
  • Achieves improved results on ImageNet with a 200-layer ResNet.
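
The first contribution can be illustrated with a small numerical sketch. It is not the authors' code: `residual_branch` is a hypothetical stand-in for a unit's residual function (activation first, then a scalar weight in place of convolutions), and the input and weights are made up. It only shows that when the skip connection and the after-addition step are both identities, the final output equals the input plus the sum of all residual outputs, and that an after-addition ReLU breaks this additive form.

```python
def relu(v):
    """Element-wise ReLU on a list of floats."""
    return [max(0.0, a) for a in v]


def residual_branch(x, w):
    # Hypothetical stand-in for the residual function F(x, W):
    # activation first, then a scalar weight (assumed for illustration only).
    return [w * a for a in relu(x)]


def run_stack(x0, weights, after_add_relu=False):
    """Run a stack of residual units and record each unit's residual output."""
    x = list(x0)
    residuals = []
    for w in weights:
        f = residual_branch(x, w)
        residuals.append(f)
        x = [a + b for a, b in zip(x, f)]  # identity skip connection
        if after_add_relu:
            x = relu(x)  # non-identity after-addition activation
    return x, residuals


def unrolled(x0, residuals):
    """Compute x0 plus the sum of all recorded residual outputs."""
    total = list(x0)
    for f in residuals:
        total = [a + b for a, b in zip(total, f)]
    return total


x0 = [1.0, -2.0, 0.5]        # made-up input
weights = [0.5, -1.5, 0.25]  # made-up per-unit weights

x_id, res_id = run_stack(x0, weights)
x_relu, res_relu = run_stack(x0, weights, after_add_relu=True)

print(x_id == unrolled(x0, res_id))      # True: the additive form holds
print(x_relu == unrolled(x0, res_relu))  # False: after-addition ReLU breaks it
```

The scalar weights keep the example dependency-free; a real residual unit would use convolutions and normalization, which this sketch does not attempt to model.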

Abstract

Deep residual networks [1] have emerged as a family of extremely deep architectures showing compelling accuracy and nice convergence behaviors. In this paper, we analyze the propagation formulations behind the residual building blocks, which suggest that the forward and backward signals can be directly propagated from one block to any other block, when using identity mappings as the skip connections and after-addition activation. A series of ablation experiments support the importance of these identity mappings. This motivates us to propose a new residual unit, which makes training easier and improves generalization. We report improved results using a 1001-layer ResNet on CIFAR-10 (4.62% error) and CIFAR-100, and a 200-layer ResNet on ImageNet. Code is available at: https://github.com/KaimingHe/resnet-1k-layers.
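
The propagation argument in the abstract can be written out briefly. The notation is an assumption here, since the abstract defines no symbols: x_l is the input to the l-th residual unit, F is its residual function with weights W_l, L indexes any deeper unit, and E is the loss. With an identity skip connection and an identity after-addition activation, the recursion unrolls into a sum, and the gradient picks up a term that passes through unchanged:

```latex
\begin{align}
x_{l+1} &= x_l + \mathcal{F}(x_l, \mathcal{W}_l) \\
x_L &= x_l + \sum_{i=l}^{L-1} \mathcal{F}(x_i, \mathcal{W}_i) \\
\frac{\partial \mathcal{E}}{\partial x_l}
  &= \frac{\partial \mathcal{E}}{\partial x_L}
     \left( 1 + \frac{\partial}{\partial x_l} \sum_{i=l}^{L-1} \mathcal{F}(x_i, \mathcal{W}_i) \right)
\end{align}
```

The constant 1 in the last line is the directly propagated part: the gradient at x_L reaches x_l without passing through any weight layer, which is the sense in which signals move "from one block to any other block".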

References [23]

K. He, X. Zhang, S. Ren, Jian Sun - 2016

20 papers in library cite

K. Simonyan, Andrew Zisserman - 2014

20 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Christian Szegedy, Wei Liu, Y. Jia, P. Sermanet, S. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich - 2015

20 papers in library cite

S. Ioffe, Christian Szegedy - 2015

18 papers in library cite

T. Y. Lin, M. Maire, S. Belongie, James Hays, Pietro Perona, D. Ramanan, Piotr Dollár, C. L. Zitnick - 2014

14 papers in library cite

Zbigniew Wojna - 2015

5 papers in library cite

Alex Krizhevsky - 2009

27 papers in library cite

V. Nair, Geoffrey E. Hinton - 2010

18 papers in library cite

K. He, X. Zhang, S. Ren, Jian Sun - 2015

10 papers in library cite

Yann LeCun, B. Boser, John S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel - 1989

24 papers in library cite

Geoffrey E. Hinton, N. Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2012

25 papers in library cite

M. Lin, Qiang Chen, Shuicheng Yan - 2013

11 papers in library cite

D. A. Clevert, Thomas Unterthiner, Sepp Hochreiter - 2016

2 papers in library cite

R. K. Srivastava, K. Greff, Jürgen Schmidhuber - 2015

6 papers in library cite

Chen-Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, Zhuowen Tu - 2014

8 papers in library cite

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Zhiheng Huang, A. Karpathy, A. Khosla, M. Bernstein - 2014

18 papers in library cite

Christian Szegedy, S. Ioffe, Vincent Vanhoucke, A. A. Alemi - 2017

3 papers in library cite

A. Romero, Nicolas Ballas, S. E. Kahou, A. Chassang, C. Gatta, Yoshua Bengio - 2015

5 papers in library cite

R. K. Srivastava, K. Greff, Jürgen Schmidhuber - 2015

6 papers in library cite

D. Mishkin, J. Matas - 2016

2 papers in library cite

J. T. Springenberg, Alexey Dosovitskiy, T. Brox, M. Riedmiller - 2014

4 papers in library cite

B. Graham - 2014

2 papers in library cite

Cited by: 4 papers in your library

Cites: 21 papers in your library

Read on October 31, 2025
