2016

Quantized Neural Networks: Training Neural Networks With Low Precision Weights and Activations

Yoshua Bengio

citations

Cite Score

60

AI summary

This paper introduces Quantized Neural Networks (QNNs), training neural networks with low precision weights/activations, evaluated on MNIST, CIFAR-10, SVHN, and ImageNet, achieving comparable accuracy to 32-bit counterparts, also demonstrating a binary matrix multiplication GPU kernel for faster QNN execution.

Main Contributions

  • Introduces a method to train Quantized-Neural-Networks (QNNs) with low precision weights and activations at run-time and during gradient computation.
  • Demonstrates the possibility of training BNNs on MNIST, CIFAR-10, and SVHN, achieving near state-of-the-art results.
  • Presents results on the ImageNet dataset using binary weights/activations and quantized versions.
  • Shows that quantized gradients can be used with only 6-bits with small accuracy degradation.
  • Shows that with 4-bit weights and activations Recurrent QNNs achieve similar accuracies as their 32-bit floating point counterparts on the Penn Treebank dataset.

Abstract

We introduce a method to train Quantized Neural Networks (QNNs) neural networks with extremely low precision (e.g., 1-bit) weights and activations, at run-time. At train- time the quantized weights and activations are used for computing the parameter gradients. During the forward pass, QNNs drastically reduce memory size and accesses, and replace most arithmetic operations with bit-wise operations. As a result, power consumption is expected to be drastically reduced. We trained QNNs over the MNIST, CIFAR-10, SVHN and ImageNet datasets. The resulting QNNs achieve prediction accuracy comparable to their 32-bit counterparts. For example, our quantized version of AlexNet with 1-bit weights and 2-bit activations achieves 51% top-1 accuracy. Moreover, we quantize the parameter gradients to 6-bits as well which enables gradients computation using only bit-wise opera- tion. Quantized recurrent neural networks were tested over the Penn Treebank dataset, and achieved comparable accuracy as their 32-bit counterparts using only 4-bits. Last but not least, we programmed a binary matrix multiplication GPU kernel with which it is possible to run our MNIST QNN 7 times faster than with an unoptimized GPU kernel, without suffering any loss in classification accuracy. The QNN code is available online.

Citation Graph

Loading graph...

References [71]

Sort:
Filter:

D. P. Kingma, Jimmy Lei Ba - 2014

49 papers in library cite

K. Simonyan, Andrew Zisserman - 2014

20 papers in library cite

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Yann Lecun, Leon Bottou, Yoshua Bengio, Patrick Haffner - 1998

62 papers in library cite

Christian Szegedy, Weizhou Liu, Y. Jia, P. Sermanet, S. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich - 2015

20 papers in library cite

S. Ioffe, Christian Szegedy - 2015

18 papers in library cite

N. Srivastava, Geoffrey E. Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2014

20 papers in library cite

D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014

59 papers in library cite

V. Mnih - 2015

9 papers in library cite

Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014

58 papers in library cite

Yoshua Bengio - 2010

20 papers in library cite

D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. V. D. Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, Ilya Sutskever, T. Lillicrap, M. Leach, Koray Kavukcuoglu, T. Graepel, Demis Hassabis - 2016

5 papers in library cite

Geoffrey Hinton - 2012

21 papers in library cite

M. P. Marcus, B. Santorini, Mary Ann Marcinkiewicz - 1993

22 papers in library cite

L. Wan, M. Zeiler, S. Zhang, Rob Fergus - 2013

8 papers in library cite

Chen Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, Zhuowen Tu - 2014

8 papers in library cite

Yoshua Bengio - 2013

17 papers in library cite

T. Sainath, Abdel Rahman Mohamed, Brian Kingsbury, Bhuvana Ramabhadran - 2013

2 papers in library cite

Clement Farabet - 2011

5 papers in library cite

F. Bastien, P. Lamblin, Razvan Pascanu, James Bergstra, I. Goodfellow, A. Bergeron, A. Bouchard, N. Nicolas, Yoshua Bengio - 2012

13 papers in library cite

Vincent Vanhoucke, A. Senior, Mark Z. Mao - 2011

4 papers in library cite

Tomas Mikolov, Geoffrey Zweig - 2012

12 papers in library cite

Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard Schwartz, John Makhoul - 2014

9 papers in library cite

Yoshua Bengio - 2013

5 papers in library cite

James Bergstra, O. Breuleux, F. Bastien, P. Lamblin, Razvan Pascanu, G. Desjardins, J. Turian, D. W. Farley, Yoshua Bengio - 2010

22 papers in library cite

S. Han, H. Mao, W. J. Dally - 2015

3 papers in library cite

S. Han, H. Mao, W. J. Dally - 2015

3 papers in library cite

A. Romero, Nicolas Ballas, S. E. Kahou, A. Chassang, C. Gatta, Yoshua Bengio - 2015

5 papers in library cite

S. Gupta, A. Agrawal, Karthik Gopalakrishnan, P. Narayanan - 2015

3 papers in library cite

Weizhu Chen, J. T. Wilson, S. Tyree, K. Q. Weinberger, Yanru Chen - 2015

2 papers in library cite

Yoshua Bengio, N. Leonard, Aaron Courville - 2013

3 papers in library cite

A. Mordvintsev, Christopher Olah, M. Tyka - 2015

2 papers in library cite

Alex Graves - 2011

8 papers in library cite

Y. Tang - 2013

2 papers in library cite

A. Coates, B. Huval, Tianle Wang, D. Wu, Bryan Catanzaro, N. Andrew - 2013

2 papers in library cite

Clement Farabet, Yann Lecun, Koray Kavukcuoglu, E. Culurciello, B. Martini, P. Akselrod, S. Talay - 2011

2 papers in library cite

S. Han, J. Pool, J. Tran, W. Dally - 2015

2 papers in library cite

Clement Farabet, B. Martini, B. Corda, P. Akselrod, E. Culurciello, Yann Lecun - 2011

2 papers in library cite

Geoffrey Hinton - 2012

2 papers in library cite

B. Graham - 2014

2 papers in library cite

M. Rastegari, V. Ordonez, Joseph Redmon, Ali Farhadi - 2016

2 papers in library cite

G. Govindu, L. Zhuo, S. Choi, V. Prasanna - 2004

1 paper in library cites

N. Torii, H. Kokubo, D. Yamamoto, K. Itoh, M. Takenaka, T. Matsumoto - 2016

1 paper in library cites

S. K. Esser, R. Appuswamy, P. Merolla, J. V. Arthur, D. S. Modha - 2015

1 paper in library cites

Missing year

W. Zheng, Y. Tang

1 paper in library cites

M. Courbariaux, Yoshua Bengio, J. P. David - 2015

1 paper in library cites

Missing year

M. Kim, P. Smaragdis

1 paper in library cites

Y. Gong, L. Liu, Michael Yang, L. Bourdev - 2014

1 paper in library cites

M. Horowitz - 2014

1 paper in library cites

D. Miyashita, E. H. Lee, B. Murmann - 2016

1 paper in library cites

Yanru Chen, T. Luo, Shuming Liu, S. Zhang, Luheng He, J. Wang, Lei Li, T. Chen, Zhiwei Xu, N. Sun - 2014

1 paper in library cites

P. Merolla, R. Appuswamy, J. Arthur, S. K. Esser, D. Modha - 2016

1 paper in library cites

T. Chen, Z. Du, N. Sun, J. Wang, Chiyu Wu, Yanru Chen, O. Temam - 2014

1 paper in library cites

Shuyan Zhou, Z. Ni, Xinyu Zhou, H. Wen, Yonghui Wu, Y. Zou - 2016

1 paper in library cites

X. Zhang, James Zou, X. Ming, K. He, Jian Sun - 2015

1 paper in library cites

C. Lomont - 2003

1 paper in library cites

K. Hwang, W. Sung - 2014

1 paper in library cites

Chen Yu Lee, P. W. Gallagher, Zhuowen Tu - 2015

1 paper in library cites

P. Gysel, M. Motamedi, S. Ghiasi - 2016

1 paper in library cites

S. Dieleman, J. Schlter, Colin Raffel, E. Olson, S. K. Snderby, D. Nouri, D. Maturana, M. Thoma, E. Battenberg, J. Kelly, J. D. Fauw, M. Heilman, Diogo149, B. Mcfee, H. Weideman, Takacsg84, Peterderivaz, Jon, Instagibbs, D. K. Rasul, Congliu, Britefury, J. Degrave - 2015

1 paper in library cites

P. H. Pham, D. Jelaca, Clement Farabet, B. Martini, Yann Lecun, E. Culurciello - 2012

1 paper in library cites

Zongyu Lin, M. Courbariaux, R. Memisevic, Yoshua Bengio - 2015

1 paper in library cites

Zongyu Lin, M. Courbariaux, R. Memisevic, Yoshua Bengio - 2015

1 paper in library cites

J. Ott, Zongyu Lin, Y. Z. Zhang, S. C. Liu, Yoshua Bengio - 2016

1 paper in library cites

H. Spang, P. Schultheiss - 1962

1 paper in library cites

C. Baldassi, A. Ingrosso, C. Lucibello, L. Saglietti, R. Zecchina - 2015

1 paper in library cites

Zhoujun Cheng, D. Soudry, Z. Mao, Z. Lan - 2015

1 paper in library cites

M. Courbariaux, Yoshua Bengio, J. P. David - 2014

1 paper in library cites

R. Andri, L. Cavigelli, D. Rossi, L. Benini - 2016

1 paper in library cites

Cited by

0

papers in your library

Cites

33

papers in your library

Read

on November 20, 2025

Your review

Tags

Paper Aliases

No aliases