2015

Delving Deep Into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

K. He, X. Zhang, S. Ren, Jian Sun

citations

Cite Score

93

AI summary

The paper introduces Parametric Rectified Linear Unit (PReLU) for image classification, improving model fitting with negligible cost and little overfitting. It also introduces a robust initialization method for training extremely deep rectified models from scratch, achieving 4.94% top-5 error on the ImageNet 2012 dataset.

Main Contributions

  • Introduces Parametric Rectified Linear Unit (PReLU) that generalizes the traditional rectified unit.
  • PReLU improves model fitting with nearly zero extra computational cost and little overfitting risk.
  • Derives a robust initialization method that particularly considers the rectifier nonlinearities.
  • Enables training extremely deep rectified models directly from scratch.
  • Achieves 4.94% top-5 test error on the ImageNet 2012 classification dataset, surpassing human-level performance.

Abstract

Rectified activation units (rectifiers) are essential for state-of-the-art neural networks. In this work, we study rectifier neural networks for image classification from two aspects. First, we propose a Parametric Rectified Linear Unit (PReLU) that generalizes the traditional rectified unit. PReLU improves model fitting with nearly zero extra computational cost and little overfitting risk. Second, we derive a robust initialization method that particularly considers the rectifier nonlinearities. This method enables us to train extremely deep rectified models directly from scratch and to investigate deeper or wider network architectures. Based on our PReLU networks (PReLU-nets), we achieve 4.94% top-5 test error on the ImageNet 2012 classification dataset. This is a 26% relative improvement over the ILSVRC 2014 winner (GoogLeNet, 6.66% [29]). To our knowledge, our result is the first to surpass human-level performance (5.1%, [22]) on this visual recognition challenge.

Citation Graph

Loading graph...

References [34]

Sort:
Filter:

K. Simonyan, Andrew Zisserman - 2014

20 papers in library cite

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

J. Deng, W. Dong, Richard Socher, L. J. Li, K. Li, Li Fei Fei - 2009

28 papers in library cite

Christian Szegedy, Weizhou Liu, Y. Jia, P. Sermanet, S. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich - 2015

20 papers in library cite

N. Srivastava, Geoffrey E. Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2014

20 papers in library cite

Yoshua Bengio - 2010

20 papers in library cite

V. Nair, Geoffrey E. Hinton - 2010

18 papers in library cite

Mark Everingham, Luc Van Gool, Christopher K. I. Williams, John Winn, Andrew Zisserman - 2010

7 papers in library cite

Matthew D. Zeiler, Rob Fergus - 2014

15 papers in library cite

Yann Lecun, B. Boser, John S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackal - 1989

24 papers in library cite

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, Ross Girshick, S. Guadarrama, Trevor Darrell - 2014

12 papers in library cite

K. He, X. Zhang, S. Ren, Jian Sun - 2014

6 papers in library cite

Xavier Glorot, Antoine Bordes, Yoshua Bengio - 2011

17 papers in library cite

Geoffrey E. Hinton, N. Srivastava, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2012

25 papers in library cite

A. L. Maas, A. Y. Hannun, Andrew Y. Ng - 2013

3 papers in library cite

M. Lin, Qinlang Chen, Shuicheng Yan - 2013

11 papers in library cite

Y. Taigman, Michael Yang, Marc'aurelio Ranzato, Lior Wolf - 2014

5 papers in library cite

Dan C. Ciresan, Ueli Meier, Jürgen Schmidhuber - 2012

11 papers in library cite

L. Wan, M. Zeiler, S. Zhang, Rob Fergus - 2013

8 papers in library cite

Chen Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, Zhuowen Tu - 2014

8 papers in library cite

Yoshua Bengio - 2013

17 papers in library cite

Surya Ganguli - 2014

9 papers in library cite

Alex Krizhevsky - 2014

3 papers in library cite

Matthew D. Zeiler, M. A. Ranzato, R. Monga, M. Mao, K. Yang, Quoc Le, P. Nguyen, A. Senior, Vincent Vanhoucke, Jeffrey Dean - 2013

3 papers in library cite

P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, Rob Fergus, Yann Lecun - 2014

16 papers in library cite

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Zhongqiang Huang, A. Karpathy, A. Khosla, M. Bernstein - 2014

18 papers in library cite

K. Chatfield, K. Simonyan, A. Vedaldi, Andrew Zisserman - 2014

5 papers in library cite

K. He, Jian Sun - 2014

2 papers in library cite

A. G. Howard - 2013

4 papers in library cite

R. K. Srivastava, Jonathan Masci, S. Kazerounian, Faustino Gomez, Jürgen Schmidhuber - 2013

3 papers in library cite

D. Eigen, J. Rolfe, Rob Fergus, Yann Lecun - 2013

2 papers in library cite

R. Wu, Y. Shan, G. Sun - 2015

2 papers in library cite

Y. S. Sun, Yanru Chen, Xinpeng Wang, X. Tang - 2014

1 paper in library cites

F. Agostinelli, M. Hoffman, P. Sadowski, P. Baldi - 2014

1 paper in library cites

Cited by

10

papers in your library

Cites

31

papers in your library

Read

on July 20, 2025

Your review

Tags

Paper Aliases

No aliases