2015

Rethinking the Inception Architecture for Computer Vision

Zbigniew Wojna

citations

Cite Score

96

AI summary

This paper introduces a new Inception architecture for computer vision that utilizes factorized convolutions and aggressive regularization. The proposed Inception-v3 achieves state-of-the-art results on the ILSVRC 2012 classification challenge with 21.2% top-1 and 5.6% top-5 error for single frame evaluation. An ensemble of 4 models and multi-crop evaluation achieves 3.5% top-5 error and 17.3% top-1 error.

Main Contributions

  • Introduces design principles for scaling up convolutional networks, emphasizing computational efficiency and parameter count.
  • Presents a novel Inception architecture (Inception-v3) that utilizes factorized convolutions and aggressive regularization.
  • Achieves a new state-of-the-art top-1 error rate of 21.2% and top-5 error rate of 5.6% on the ILSVRC 2012 classification benchmark for single frame evaluation.
  • Demonstrates that high-quality results can be achieved with relatively low receptive field resolution (79x79), which is beneficial for detecting small objects.
  • Shows that combining lower parameter counts with batch-normalized auxiliary classifiers and label smoothing enables the training of high-quality networks on modest-sized training sets.

Abstract

Convolutional networks are at the core of most state-of-the-art computer vision solutions for a wide variety of tasks. Since 2014 very deep convolutional networks started to become mainstream, yielding substantial gains in various benchmarks. Although increased model size and computational cost tend to translate to immediate quality gains for most tasks (as long as enough labeled data is provided for training), computational efficiency and low parameter count are still enabling factors for various use cases such as mobile vision and big-data scenarios. Here we are exploring ways to scale up networks in ways that aim at utilizing the added computation as efficiently as possible by suitably factorized convolutions and aggressive regularization. We benchmark our methods on the ILSVRC 2012 classification challenge validation set demonstrate substantial gains over the state of the art: 21.2% top-1 and 5.6% top-5 error for single frame evaluation using a network with a computational cost of 5 billion multiply-adds per inference and with using less than 25 million parameters. With an ensemble of 4 models and multi-crop evaluation, we report 3.5% top-5 error and 17.3% top-1 error.

Citation Graph

Loading graph...

References [23]

Sort:
Filter:

K. Simonyan, Andrew Zisserman - 2014

20 papers in library cite

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

Christian Szegedy, Weizhou Liu, Y. Jia, P. Sermanet, S. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich - 2015

20 papers in library cite

S. Ioffe, Christian Szegedy - 2015

18 papers in library cite

J. Long, E. Shelhamer, Trevor Darrell - 2015

7 papers in library cite

Ross Girshick, J. Donahue, Trevor Darrell, Jitendra Malik - 2014

18 papers in library cite

K. He, X. Zhang, S. Ren, Jian Sun - 2015

10 papers in library cite

M. Abadi, Akshat Agarwal, P. Barham, E. Brevdo, Ziru Chen, C. Citro, G. Corrado, A. Davis, Jeffrey Dean, M. Devin, Sanjay Ghemawat, I. Goodfellow, A. Harp, Geoffrey Irving, M. Isard, Y. Jia, R. Jozefowicz, Lukasz Kaiser, M. Kudlur, J. Levenberg, D. Mane, R. Monga, S. Moore, D. Murray, Christopher Olah, M. Schuster, J. Shlens, B. Steiner, Ilya Sutskever, K. Talwar, P. Tucker, Vincent Vanhoucke, V. Vasudevan, F. Viegas, Oriol Vinyals, P. Warden, M. Wattenberg, M. Wicke, Y. Yu, Xiaoqiang Zheng - 2015

11 papers in library cite

A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, Li Fei Fei - 2014

2 papers in library cite

Razvan Pascanu, Tomas Mikolov, Yoshua Bengio - 2013

21 papers in library cite

Ilya Sutskever, James Martens, G. Dahl, Geoffrey Hinton - 2013

13 papers in library cite

Chen Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, Zhuowen Tu - 2014

8 papers in library cite

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Zhongqiang Huang, A. Karpathy, A. Khosla, M. Bernstein - 2014

18 papers in library cite

Weizhu Chen, J. T. Wilson, S. Tyree, K. Q. Weinberger, Yanru Chen - 2015

2 papers in library cite

A. Lavin - 2015

3 papers in library cite

T. Tieleman, Geoffrey Hinton - 2012

4 papers in library cite

Dumitru Erhan, Christian Szegedy, A. Toshev, Dragomir Anguelov - 2014

4 papers in library cite

A. Toshev, Christian Szegedy - 2014

2 papers in library cite

F. Schroff, D. Kalenichenko, J. Philbin - 2015

1 paper in library cites

N. Wang, D. Y. Yeung - 2013

1 paper in library cites

C. Dong, C. C. Loy, K. He, X. Tang - 2014

1 paper in library cites

Y. M. Attias, Q. Yu, M. C. Stumpe, V. Shet, S. Arnoud, L. Yatziv - 2015

1 paper in library cites

D. C. Psichogios, L. H. Ungar - 1993

1 paper in library cites

Cited by

5

papers in your library

Cites

15

papers in your library

Read

on August 2, 2025

Your review

Tags

Paper Aliases

No aliases