2016
Cite Score
60
AI summary
This paper introduces Quantized Neural Networks (QNNs), training neural networks with low precision weights/activations, evaluated on MNIST, CIFAR-10, SVHN, and ImageNet, achieving comparable accuracy to 32-bit counterparts, also demonstrating a binary matrix multiplication GPU kernel for faster QNN execution.
Main Contributions
Abstract
We introduce a method to train Quantized Neural Networks (QNNs) neural networks with extremely low precision (e.g., 1-bit) weights and activations, at run-time. At train- time the quantized weights and activations are used for computing the parameter gradients. During the forward pass, QNNs drastically reduce memory size and accesses, and replace most arithmetic operations with bit-wise operations. As a result, power consumption is expected to be drastically reduced. We trained QNNs over the MNIST, CIFAR-10, SVHN and ImageNet datasets. The resulting QNNs achieve prediction accuracy comparable to their 32-bit counterparts. For example, our quantized version of AlexNet with 1-bit weights and 2-bit activations achieves 51% top-1 accuracy. Moreover, we quantize the parameter gradients to 6-bits as well which enables gradients computation using only bit-wise opera- tion. Quantized recurrent neural networks were tested over the Penn Treebank dataset, and achieved comparable accuracy as their 32-bit counterparts using only 4-bits. Last but not least, we programmed a binary matrix multiplication GPU kernel with which it is possible to run our MNIST QNN 7 times faster than with an unoptimized GPU kernel, without suffering any loss in classification accuracy. The QNN code is available online.
Citation Graph
References [71]
D. P. Kingma, Jimmy Lei Ba - 2014
49 papers in library cite
K. Simonyan, Andrew Zisserman - 2014
20 papers in library cite
Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012
71 papers in library cite
Sepp Hochreiter, Jürgen Schmidhuber - 1997
94 papers in library cite
Yann Lecun, Leon Bottou, Yoshua Bengio, Patrick Haffner - 1998
62 papers in library cite
Christian Szegedy, Weizhou Liu, Y. Jia, P. Sermanet, S. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich - 2015
20 papers in library cite
S. Ioffe, Christian Szegedy - 2015
18 papers in library cite
N. Srivastava, Geoffrey E. Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2014
20 papers in library cite
D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014
59 papers in library cite
V. Mnih - 2015
9 papers in library cite
Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014
58 papers in library cite
Yoshua Bengio - 2010
20 papers in library cite
D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. V. D. Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, Ilya Sutskever, T. Lillicrap, M. Leach, Koray Kavukcuoglu, T. Graepel, Demis Hassabis - 2016
5 papers in library cite
Geoffrey Hinton - 2012
21 papers in library cite
M. P. Marcus, B. Santorini, Mary Ann Marcinkiewicz - 1993
22 papers in library cite
L. Wan, M. Zeiler, S. Zhang, Rob Fergus - 2013
8 papers in library cite
Chen Yu Lee, Saining Xie, Patrick Gallagher, Zhengyou Zhang, Zhuowen Tu - 2014
8 papers in library cite
Yoshua Bengio - 2013
17 papers in library cite
T. Sainath, Abdel Rahman Mohamed, Brian Kingsbury, Bhuvana Ramabhadran - 2013
2 papers in library cite
Clement Farabet - 2011
5 papers in library cite
F. Bastien, P. Lamblin, Razvan Pascanu, James Bergstra, I. Goodfellow, A. Bergeron, A. Bouchard, N. Nicolas, Yoshua Bengio - 2012
13 papers in library cite
Vincent Vanhoucke, A. Senior, Mark Z. Mao - 2011
4 papers in library cite
Tomas Mikolov, Geoffrey Zweig - 2012
12 papers in library cite
Jacob Devlin, Rabih Zbib, Zhongqiang Huang, Thomas Lamar, Richard Schwartz, John Makhoul - 2014
9 papers in library cite
Yoshua Bengio - 2013
5 papers in library cite
James Bergstra, O. Breuleux, F. Bastien, P. Lamblin, Razvan Pascanu, G. Desjardins, J. Turian, D. W. Farley, Yoshua Bengio - 2010
22 papers in library cite
S. Han, H. Mao, W. J. Dally - 2015
3 papers in library cite
S. Han, H. Mao, W. J. Dally - 2015
3 papers in library cite
A. Romero, Nicolas Ballas, S. E. Kahou, A. Chassang, C. Gatta, Yoshua Bengio - 2015
5 papers in library cite
S. Gupta, A. Agrawal, Karthik Gopalakrishnan, P. Narayanan - 2015
3 papers in library cite
Weizhu Chen, J. T. Wilson, S. Tyree, K. Q. Weinberger, Yanru Chen - 2015
2 papers in library cite
Yoshua Bengio, N. Leonard, Aaron Courville - 2013
3 papers in library cite
A. Mordvintsev, Christopher Olah, M. Tyka - 2015
2 papers in library cite
Alex Graves - 2011
8 papers in library cite
Y. Tang - 2013
2 papers in library cite
A. Coates, B. Huval, Tianle Wang, D. Wu, Bryan Catanzaro, N. Andrew - 2013
2 papers in library cite
Clement Farabet, Yann Lecun, Koray Kavukcuoglu, E. Culurciello, B. Martini, P. Akselrod, S. Talay - 2011
2 papers in library cite
S. Han, J. Pool, J. Tran, W. Dally - 2015
2 papers in library cite
Clement Farabet, B. Martini, B. Corda, P. Akselrod, E. Culurciello, Yann Lecun - 2011
2 papers in library cite
Geoffrey Hinton - 2012
2 papers in library cite
B. Graham - 2014
2 papers in library cite
M. Rastegari, V. Ordonez, Joseph Redmon, Ali Farhadi - 2016
2 papers in library cite
G. Govindu, L. Zhuo, S. Choi, V. Prasanna - 2004
1 paper in library cites
N. Torii, H. Kokubo, D. Yamamoto, K. Itoh, M. Takenaka, T. Matsumoto - 2016
1 paper in library cites
S. K. Esser, R. Appuswamy, P. Merolla, J. V. Arthur, D. S. Modha - 2015
1 paper in library cites
W. Zheng, Y. Tang
1 paper in library cites
M. Courbariaux, Yoshua Bengio, J. P. David - 2015
1 paper in library cites
Y. Gong, L. Liu, Michael Yang, L. Bourdev - 2014
1 paper in library cites
M. Horowitz - 2014
1 paper in library cites
D. Miyashita, E. H. Lee, B. Murmann - 2016
1 paper in library cites
Yanru Chen, T. Luo, Shuming Liu, S. Zhang, Luheng He, J. Wang, Lei Li, T. Chen, Zhiwei Xu, N. Sun - 2014
1 paper in library cites
P. Merolla, R. Appuswamy, J. Arthur, S. K. Esser, D. Modha - 2016
1 paper in library cites
T. Chen, Z. Du, N. Sun, J. Wang, Chiyu Wu, Yanru Chen, O. Temam - 2014
1 paper in library cites
Shuyan Zhou, Z. Ni, Xinyu Zhou, H. Wen, Yonghui Wu, Y. Zou - 2016
1 paper in library cites
X. Zhang, James Zou, X. Ming, K. He, Jian Sun - 2015
1 paper in library cites
D. Soudry, I. Hubara, R. Meir - 2014
1 paper in library cites
C. Lomont - 2003
1 paper in library cites
K. Hwang, W. Sung - 2014
1 paper in library cites
Chen Yu Lee, P. W. Gallagher, Zhuowen Tu - 2015
1 paper in library cites
P. Gysel, M. Motamedi, S. Ghiasi - 2016
1 paper in library cites
S. Dieleman, J. Schlter, Colin Raffel, E. Olson, S. K. Snderby, D. Nouri, D. Maturana, M. Thoma, E. Battenberg, J. Kelly, J. D. Fauw, M. Heilman, Diogo149, B. Mcfee, H. Weideman, Takacsg84, Peterderivaz, Jon, Instagibbs, D. K. Rasul, Congliu, Britefury, J. Degrave - 2015
1 paper in library cites
P. H. Pham, D. Jelaca, Clement Farabet, B. Martini, Yann Lecun, E. Culurciello - 2012
1 paper in library cites
Zongyu Lin, M. Courbariaux, R. Memisevic, Yoshua Bengio - 2015
1 paper in library cites
Zongyu Lin, M. Courbariaux, R. Memisevic, Yoshua Bengio - 2015
1 paper in library cites
J. Ott, Zongyu Lin, Y. Z. Zhang, S. C. Liu, Yoshua Bengio - 2016
1 paper in library cites
H. Spang, P. Schultheiss - 1962
1 paper in library cites
C. Baldassi, A. Ingrosso, C. Lucibello, L. Saglietti, R. Zecchina - 2015
1 paper in library cites
Zhoujun Cheng, D. Soudry, Z. Mao, Z. Lan - 2015
1 paper in library cites
M. Courbariaux, Yoshua Bengio, J. P. David - 2014
1 paper in library cites
R. Andri, L. Cavigelli, D. Rossi, L. Benini - 2016
1 paper in library cites
Cited by
0
papers in your library
Cites
33
papers in your library
Read
on November 20, 2025
Your review
Tags
Paper Aliases
No aliases