Papperoni

2020

Do We Train on Test Data? Purging CIFAR of Near-Duplicates

Björn Barz, Joachim Denzler

Cite Score: 7

AI summary

This paper introduces the ciFAIR dataset, a variant of CIFAR-10 and CIFAR-100 in which test images that have near-duplicates in the training set are replaced with new images. It then re-evaluates CNN architectures on the new test sets and finds a significant drop in classification accuracy, indicating that part of the previously reported performance came from memorized duplicates.

Main Contributions

Identified a significant number of near-duplicate images in CIFAR-10 and CIFAR-100 test sets.
Introduced the ciFAIR dataset by replacing duplicates in the test sets with new images.
Re-evaluated state-of-the-art CNN architectures on the ciFAIR dataset, demonstrating a notable performance drop.
Showed that models can achieve near-perfect classification on duplicate images, indicating memorization.
Found that the relative ranking of models remains consistent, suggesting that research efforts have not heavily overfitted to the duplicates.
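
This summary does not say how the duplicates were found. Purely as an illustration, the sketch below flags candidate near-duplicates by pairing each test item with its most similar training item under cosine similarity of feature vectors. The function names, the 0.95 threshold, and the assumption that non-zero feature vectors have already been computed are hypothetical and not taken from the paper; candidates flagged this way would still need manual inspection.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def near_duplicate_candidates(train_feats, test_feats, threshold=0.95):
    """Return (test_idx, train_idx, similarity) for every test item whose
    nearest training item reaches the (hypothetical) similarity threshold."""
    candidates = []
    for test_idx, test_vec in enumerate(test_feats):
        sims = [cosine_similarity(test_vec, train_vec) for train_vec in train_feats]
        # Index of the most similar training item for this test item.
        train_idx = max(range(len(sims)), key=sims.__getitem__)
        if sims[train_idx] >= threshold:
            candidates.append((test_idx, train_idx, sims[train_idx]))
    return candidates


# Toy feature vectors standing in for real image descriptors.
train = [[1.0, 0.0], [0.0, 1.0]]
test = [[2.0, 0.0], [1.0, 1.0]]
result = near_duplicate_candidates(train, test)
```

In the toy data only the first test vector points in the same direction as a training vector, so it is the only candidate returned. A brute-force loop like this is quadratic in the dataset sizes, so a real run over CIFAR would need a vectorized or indexed nearest-neighbor search.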

Abstract

The CIFAR-10 and CIFAR-100 datasets are two of the most heavily benchmarked datasets in computer vision and are often used to evaluate novel methods and model architectures in the field of deep learning. However, we find that 3.3% and 10% of the images from the test sets of these datasets have duplicates in the training set. These duplicates are easily recognizable by memorization and may, hence, bias the comparison of image recognition techniques regarding their generalization capability. To eliminate this bias, we provide the “fair CIFAR” (ciFAIR) dataset, where we replaced all duplicates in the test sets with new images sampled from the same domain. We then re-evaluate the classification performance of various popular state-of-the-art CNN architectures on these new test sets to investigate whether recent research has overfitted to memorizing data instead of learning abstract concepts. We find a significant drop in classification accuracy of between 9% and 14% relative to the original performance on the duplicate-free test set. The ciFAIR dataset and pre-trained models are available at https://cvjena.github.io/cifair/, where we also maintain a leaderboard.
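
To make the percentages in the abstract concrete, the short sketch below converts the duplicate fractions into approximate image counts and shows how a relative change is computed. It assumes the standard test-set size of 10,000 images for both CIFAR-10 and CIFAR-100, which the abstract does not state; the error rates passed to `relative_change` are made-up values for illustration, not results from the paper.

```python
TEST_SET_SIZE = 10_000  # assumption: standard CIFAR-10 / CIFAR-100 test-set size


def approx_duplicate_count(fraction, n=TEST_SET_SIZE):
    """Approximate number of test images affected, given a fraction of the test set."""
    return round(fraction * n)


def relative_change(original, new):
    """Change from `original` to `new`, expressed relative to `original`."""
    return (new - original) / original


cifar10_duplicates = approx_duplicate_count(0.033)   # 3.3% of the test set
cifar100_duplicates = approx_duplicate_count(0.10)   # 10% of the test set

# Hypothetical error rates, chosen only to illustrate the arithmetic.
example = relative_change(0.050, 0.056)
```

Under these assumptions the fractions correspond to roughly 330 and 1,000 test images, and the hypothetical error rates differ by about 12% in relative terms.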

References [24]

[1]Deep Residual Learning for Image Recognition

K. He, X. Zhang, S. Ren, Jian Sun - 2016

20 papers in library cite

This is simply amazing. Very very simple idea, totally revolutionary. No maths, just "it works!". Amazing.

[2]ImageNet Classification With Deep Convolutional Neural Networks

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

I'm giving this a 5 just because of the impact, but this is VEEERY derivative of earlier work. Kudos to them for putting it all together, but really there's nothing revolutionary here.

[3]ImageNet: A Large-Scale Hierarchical Image Database

J. Deng, W. Dong, Richard Socher, L. J. Li, K. Li, Li Fei-Fei - 2009

28 papers in library cite

Very nice idea and huge impact!

[4]Densely Connected Convolutional Networks

G. Huang, Zhuang Liu, K. Weinberger, Laurens van der Maaten - 2017

5 papers in library cite

I liked this paper so much! The way that it's written makes it very easy to follow. Results are nice, explanations are intuitive. Very nice!

[5]Learning Multiple Layers of Features From Tiny Images

Alex Krizhevsky - 2009

27 papers in library cite

It's alright. It mainly focuses on RBMs and their features and the actual part that describes the dataset is like 1 page. However, it's maybe the best intuitive description of an RBM I have seen. Other than that, it reads very much like an undergraduate thesis.

[6]80 Million Tiny Images: A Large Dataset for Non-Parametric Object and Scene Recognition

Antonio Torralba, Rob Fergus, W. Freeman - 2008

8 papers in library cite

The initial part about data collection and dataset description was nice, but the classification part was a bit overkill.

[7]Do CIFAR-10 Classifiers Generalize to CIFAR-10?

Vaishaal Shankar - 2018

2 papers in library cite

It's a very nice analysis and it's interesting it took so long for people to see these problems.

[8]ImageNet Large Scale Visual Recognition Challenge

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Zhiheng Huang, A. Karpathy, A. Khosla, M. Bernstein - 2014

18 papers in library cite

ImageNet dataset challenge paper

[9]Aggregated Residual Transformations for Deep Neural Networks

Saining Xie, Ross Girshick, Piotr Dollar, Zhuowen Tu, K. He - 2017

3 papers in library cite

SOTA vision

[10]Wide Residual Networks

S. Zagoruyko, N. Komodakis - 2016

5 papers in library cite

Turn networks wide vs. deep

[11]Neural Codes for Image Retrieval

A. Babenko, A. Slesarev, A. Chigorin, Victor Lempitsky - 2014

1 paper in library cites

Basically embeddings for image retrieval

[12]Aggregating Local Deep Features for Image Retrieval

A. Babenko, Victor Lempitsky - 2015

1 paper in library cites

Using CNN image features for image retrieval

[13]Deep Pyramidal Residual Networks

D. Han, Jiwhan Kim, Junmo Kim - 2017

3 papers in library cite

[14]Regularized Evolution for Image Classifier Architecture Search

E. Real, A. Aggarwal, Y. Huang, Q. V. Le - 2018

3 papers in library cite

[15]Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

C. Sun, A. Shrivastava, Saurabh Singh, Abhinav Gupta - 2017

2 papers in library cite

[16]WordNet Then and Now

G. Miller, C. Fellbaum - 2007

2 papers in library cite

[17]Content-Based Image Retrieval at the End of the Early Years

A. W. Smeulders, M. Worring, S. Santini, A. Gupta, R. Jain - 2000

1 paper in library cites

[18]Deep Learning Is Not a Matter of Depth but of Good Training

B. Barz, Joachim Denzler - 2018

1 paper in library cites

[19]Improving Large-Scale Image Retrieval Through Robust Aggregation of Local Descriptors

S. S. Husain, M. Bober - 2017

1 paper in library cites

[20]Learning With Average Precision: Training Image Retrieval With a Listwise Loss

J. Revaud, J. Almazan, R. S. Rezende, C. R. D. Souza - 2019

1 paper in library cites

[21]Spatial Transformer Networks

M. Jaderberg, K. Simonyan, Andrew Zisserman, Koray Kavukcuoglu - 2015

1 paper in library cites

[22]Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning

Bo Wu, Weizhu Chen, Yu Fan, Y. Z. Zhang, J. Hou, J. Huang, Weizhou Liu, Tong Zhang - 2019

1 paper in library cites

[23]The Caltech-UCSD Birds-200-2011 Dataset

C. Wah, S. Branson, Peter Welinder, Pietro Perona, S. Belongie - 2011

1 paper in library cites

[24]The MIR Flickr Retrieval Evaluation

M. J. Huiskes, M. S. Lew - 2008

1 paper in library cites

Cited by 1 paper in your library

Cites 12 papers in your library

Read on November 10, 2025

It's a bit derivative of the other paper regarding CIFAR. It's good though, a solid addition.
