Papperoni

2014

Intriguing Properties of Neural Networks

Rob Fergus

citations

Cite Score

AI summary

This paper explores the counter-intuitive properties of deep neural networks, finding that high-level units lack clear distinctions and that networks exhibit discontinuous input-output mappings, which allows adversarial examples crafting using error maximization.

Main Contributions

Identified that high-level units in neural networks do not have clear distinctions from random linear combinations.
Discovered that deep neural networks have discontinuous input-output mappings.
Introduced a method to generate adversarial examples by maximizing the network's prediction error.
Demonstrated the transferability of adversarial examples across different networks and training sets.
Showed that adversarial training can improve generalization.

Abstract

Deep neural networks are highly expressive models that have recently achieved state of the art performance on speech and visual recognition tasks. While their expressiveness is the reason they succeed, it also causes them to learn uninterpretable solutions that could have counter-intuitive properties. In this paper we report two such properties. First, we find that there is no distinction between individual high level units and random linear combinations of high level units, according to various methods of unit analysis. It suggests that it is the space, rather than the individual units, that contains the semantic information in the high layers of neural networks. Second, we find that deep neural networks learn input-output mappings that are fairly discontinuous to a significant extent. We can cause the network to misclassify an image by applying a certain hardly perceptible perturbation, which is found by maximizing the network’s prediction error. In addition, the specific nature of these perturbations is not a random artifact of learning: the same perturbation can cause a different network, that was trained on a different subset of the dataset, to misclassify the same input.

Citation Graph

Loading graph...

References [13]

Sort:

Filter:

[1]ImageNet Classification With Deep Convolutional Neural Networks

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

Google Scholar

I'm giving this a 5 just because of the impact, but this is VEEERY derivative of earlier work. Kudos for them for putting it all together, but really there's nothing revolutionary here.

[2]ImageNet: A Large-Scale Hierarchical Image Database

J. Deng, W. Dong, Richard Socher, L. J. Li, K. Li, Li Fei Fei - 2009

28 papers in library cite

Google Scholar

Very nice idea and huge impact!

[3]Efficient Estimation of Word Representations in Vector Space

Tomas Mikolov, K. Chen, G. S. Corrado, Jeffrey Dean - 2013

26 papers in library cite

Google Scholar

Expanded wor2vec. Very nice overall.

[4]Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick, J. Donahue, Trevor Darrell, Jitendra Malik - 2014

18 papers in library cite

Google Scholar

Good results, beat overfeat, used pretraining for improving performance. Only issue is that the paper is overly long...

[5]Visualizing and Understanding Convolutional Networks

Matthew D. Zeiler, Rob Fergus - 2014

15 papers in library cite

Google Scholar

Very good explanation and visualization of CNNs, and also nice that they use their findings to improve the performance. The ablation study is also nice.

[6]Learning Deep Architectures for AI

Yoshua Bengio - 2009

25 papers in library cite

Google Scholar

It's a nice overview. Some sections get very theoretical, but the first half is very good and I feel that it does a waaaay better job of explaining RBMs and DBNs than other papers. This feels like Bengio is taking your hand and saying "if you don't know what's going on, here you go, everything you need to know to jump into the deep nets train"

[7]Building High-Level Features Using Large Scale Unsupervised Learning

Quoc V. Le, M. A. Ranzato, R. Monga, M. Devin, K. Chen, G. S. Corrado, Jeffrey Dean, Andrew Y. Ng - 2012

10 papers in library cite

Google Scholar

Very nice and very early work - seems very simple but very insightful to use an autoencoder to detect objects. Also, very similar to the neocognitron :)

[8]Visualizing Higher-Layer Features of a Deep Network

Dumitru Erhan, Yoshua Bengio, Aaron Courville, Pascal Vincent - 2009

4 papers in library cite

Google Scholar

Very nice the way that they tackled it as an optimization problem using gradient descent. I think this is a similar approach to adversarial examples (not sure if this is what inspired them, I don't remember)

[9]Measuring Invariances in Deep Networks

I. Goodfellow, Quoc Le, A. Saxe, A. Ng - 2009

7 papers in library cite

Google Scholar

Very nice concept and methodology, but the results in the end are underwhelming

[10]Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups

Geoffrey E. Hinton, L. Deng, D. Yu, George E. Dahl, A. Mohamed, Navdeep Jaitly, A. Senior, Vincent Vanhoucke, P. Nguyen, T. N. Sainath, Brian Kingsbury - 2012

8 papers in library cite

Google Scholar

The core of the paper itself is a bit boring and doesn't introduce anything new (just RBMs and DBNs again) but I am giving this a 4 because it's probably the best explanation of RBMs and DBNs I've read so far.

[11]The MNIST Database of Handwritten Digits

Yann Lecun - 1998

8 papers in library cite

Google Scholar

Not a paper - it's actually a dataset

[12]A Discriminatively Trained, Multiscale, Deformable Part Model

P. F. Felzenszwalb, D. Mcallester, D. Ramanan - 2008

2 papers in library cite

Google Scholar

[13]How to Explain Individual Classification Decisions

D. Baehrens, T. Schroeter, S. Harmeling, M. Kawanabe, K. Hansen, Klaus Robert Muller - 2010

1 paper in library cites

Google Scholar

Cited by

papers in your library

Cites

papers in your library

Read

on November 8, 2025

Very nice, and the first to notice how flaky NNs are. I think the end they went overboard with math, but the rest of the paper is very good.