2016

Deep Residual Learning for Image Recognition

K. He, X. Zhang, S. Ren, Jian Sun

citations

Cite Score

100

AI summary

This paper introduces a deep residual learning framework, enabling the training of significantly deeper networks by learning residual functions with shortcut connections. They achieved 3.57% error on the ImageNet test set, winning 1st place in the ILSVRC 2015 classification task, and demonstrated improved performance on COCO object detection dataset.

Main Contributions

  • Introduces a deep residual learning framework to address the degradation problem in very deep networks, enabling easier optimization and accuracy gains from increased depth.
  • Presents residual networks (ResNets) with identity shortcut connections that allow training of networks with up to 152 layers.
  • Achieves state-of-the-art results on the ImageNet dataset, winning 1st place in the ILSVRC 2015 classification task with a 3.57% error rate.
  • Demonstrates the effectiveness of residual learning on other datasets such as CIFAR-10, and shows improved performance on object detection tasks on the COCO dataset.
  • Provides analysis of layer responses, showing that ResNets have generally smaller responses than their plain counterparts, suggesting that identity mappings provide reasonable preconditioning.

Abstract

Deeper neural networks are more difficult to train. We present a residual learning framework to ease the training of networks that are substantially deeper than those used previously. We explicitly reformulate the layers as learning residual functions with reference to the layer inputs, instead of learning unreferenced functions. We provide comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth. On the ImageNet dataset we evaluate residual nets with a depth of up to 152 layers—8× deeper than VGG nets [41] but still having lower complexity. An ensemble of these residual nets achieves 3.57% error on the ImageNet test set. This result won the 1st place on the ILSVRC 2015 classification task. We also present analysis on CIFAR-10 with 100 and 1000 layers. The depth of representations is of central importance for many visual recognition tasks. Solely due to our extremely deep representations, we obtain a 28% relative improvement on the COCO object detection dataset. Deep residual nets are foundations of our submissions to ILSVRC & COCO 2015 competitions, where we also won the 1st places on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.

Citation Graph

Loading graph...

References [50]

Sort:
Filter:

K. Simonyan, Andrew Zisserman - 2014

20 papers in library cite

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Christian Szegedy, Weizhou Liu, Y. Jia, P. Sermanet, S. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich - 2015

20 papers in library cite

S. Ioffe, Christian Szegedy - 2015

18 papers in library cite

T. Y. Lin, M. Maire, S. Belongie, James Hays, Pietro Perona, D. Ramanan, Piotr Dollar, C. L. Zitnick - 2014

14 papers in library cite

J. Long, E. Shelhamer, Trevor Darrell - 2015

7 papers in library cite

Jian Sun - 2016

2 papers in library cite

Ross Girshick, J. Donahue, Trevor Darrell, Jitendra Malik - 2014

18 papers in library cite

Ross Girshick - 2015

2 papers in library cite

Alex Krizhevsky - 2009

27 papers in library cite

Yoshua Bengio - 2010

20 papers in library cite

V. Nair, Geoffrey E. Hinton - 2010

18 papers in library cite

K. He, X. Zhang, S. Ren, Jian Sun - 2015

10 papers in library cite

Mark Everingham, Luc Van Gool, Christopher K. I. Williams, John Winn, Andrew Zisserman - 2010

7 papers in library cite

Matthew D. Zeiler, Rob Fergus - 2014

15 papers in library cite

Yann Lecun, B. Boser, John S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackal - 1989

24 papers in library cite

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, Ross Girshick, S. Guadarrama, Trevor Darrell - 2014

12 papers in library cite

K. He, X. Zhang, S. Ren, Jian Sun - 2014

6 papers in library cite

Yoshua Bengio, Patrice Simard, Paolo Frasconi - 1994

31 papers in library cite

Geoffrey E. Hinton, N. Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2012

25 papers in library cite

M. Lin, Qinlang Chen, Shuicheng Yan - 2013

11 papers in library cite

Yann Lecun, Leon Bottou, G. B. Orr, Klaus Robert Muller - 1998

20 papers in library cite

R. K. Srivastava, K. Greff, Jürgen Schmidhuber - 2015

6 papers in library cite

Chen Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, Zhuowen Tu - 2014

8 papers in library cite

Yoshua Bengio - 2013

17 papers in library cite

Surya Ganguli - 2014

9 papers in library cite

Tapani Raiko, Harri Valpola, Yann Lecun - 2012

7 papers in library cite

P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, Rob Fergus, Yann Lecun - 2014

16 papers in library cite

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Zhongqiang Huang, A. Karpathy, A. Khosla, M. Bernstein - 2014

18 papers in library cite

C. M. Bishop - 1995

12 papers in library cite

A. Romero, Nicolas Ballas, S. E. Kahou, A. Chassang, C. Gatta, Yoshua Bengio - 2015

5 papers in library cite

K. He, Jian Sun - 2014

2 papers in library cite

G. F. Montufar, Razvan Pascanu, Kyunghyun Cho, Yoshua Bengio - 2014

3 papers in library cite

R. K. Srivastava, K. Greff, Jürgen Schmidhuber - 2015

6 papers in library cite

F. Perronnin, C. Dance - 2007

3 papers in library cite

Hervé Jégou, F. Perronnin, M. Douze, J. Sanchez, P. Perez, Cordelia Schmid - 2012

2 papers in library cite

N. N. Schraudolph - 1998

2 papers in library cite

S. Ren, K. He, Ross Girshick, X. Zhang, Jian Sun - 2015

2 papers in library cite

K. Chatfield, Victor Lempitsky, A. Vedaldi, Andrew Zisserman - 2011

2 papers in library cite

W. L. Briggs, S. F. Mccormick - 2000

1 paper in library cites

N. N. Schraudolph - 1998

1 paper in library cites

R. Szeliski - 1990

1 paper in library cites

R. Szeliski - 2006

1 paper in library cites

W. Venables, B. Ripley - 1999

1 paper in library cites

S. Gidaris, N. Komodakis - 2015

1 paper in library cites

B. D. Ripley - 1996

1 paper in library cites

Hervé Jégou, M. Douze, Cordelia Schmid - 2011

1 paper in library cites

T. Vatanen, Tapani Raiko, Harri Valpola, Yann Lecun - 2013

1 paper in library cites

A. Vedaldi, B. Fulkerson - 2008

1 paper in library cites

Cited by

20

papers in your library

Cites

35

papers in your library

Read

on July 14, 2025

Your review

Tags

Paper Aliases

No aliases