Papperoni

2007

Caltech-256 Object Category Dataset

Greg Griffin, Alex Holub, Pietro Perona


citations

Cite Score

64

AI summary

This paper introduces Caltech-256, an object recognition benchmark with 256 categories and 30,607 images. It addresses limitations of earlier datasets by increasing the number of categories, raising the minimum number of images per category, and adding a clutter category for background rejection, and it demonstrates the dataset's difficulty using spatial pyramid matching.

Main Contributions

Introduces Caltech-256, a new object category dataset with 256 categories and 30,607 images, significantly larger than Caltech-101.
Improves data collection by increasing minimum images per category to 80, avoiding image rotation artifacts, and adding a large clutter category for background rejection.
Proposes several testing paradigms to measure classification performance, including the use of a background clutter class.
Benchmarks the dataset using two simple metrics (Size Classifier, Correlation Classifier) and a state-of-the-art Spatial Pyramid Matching algorithm; the latter reaches roughly half the accuracy on Caltech-256 that it does on Caltech-101, which makes Caltech-256 the considerably harder benchmark.
Demonstrates the use of the clutter category to train an interest detector for rejecting uninformative background regions.

Abstract

We introduce a challenging set of 256 object categories containing a total of 30607 images. The original Caltech-101 [1] was collected by choosing a set of object categories, downloading examples from Google Images and then manually screening out all images that did not fit the category. Caltech-256 is collected in a similar manner with several improvements: a) the number of categories is more than doubled, b) the minimum number of images in any category is increased from 31 to 80, c) artifacts due to image rotation are avoided and d) a new and larger clutter category is introduced for testing background rejection. We suggest several testing paradigms to measure classification performance, then benchmark the dataset using two simple metrics as well as a state-of-the-art spatial pyramid matching [2] algorithm. Finally we use the clutter category to train an interest detector which rejects uninformative background regions.

References [17]

[1]Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Svetlana Lazebnik, Cordelia Schmid, Jean Ponce - 2006

14 papers in library cite

It's a fun read, but in the end it's just an application of the pyramid match kernel from the other paper (Grauman and Darrell, [4]).

[2]Learning Generative Visual Models From Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories

Li Fei-Fei, Rob Fergus, Pietro Perona - 2004

15 papers in library cite

I think most people cite this thinking it's where the Caltech-101 dataset comes from (it's not). Anyway, it's just an extension of the other dataset, and it's very mathy, with no neural networks, and uninteresting.

[3]LabelMe: A Database and Web-Based Tool for Image Annotation

Bryan C. Russell, Antonio Torralba, Kevin P. Murphy, William T. Freeman - 2008

10 papers in library cite

It's a good paper overall, but not worth the read. They just describe the platform (which, for the time, may have been a paradigm shift from in-house datasets). Maybe the basis for Amazon Mechanical Turk?

[4]The Pyramid Match Kernel: Discriminative Classification With Sets of Image Features

Kristen Grauman, Trevor Darrell - 2005

4 papers in library cite

Very simple and elegant solution to set matching. At first I didn't understand, but then it clicked. I think it could be used for other stuff as well!

[5]Rapid Object Detection Using a Boosted Cascade of Simple Features

P. Viola, M. J. Jones - 2001

10 papers in library cite

[6]Distinctive Image Features From Scale-Invariant Keypoints

D. Lowe - 2004

9 papers in library cite

[7]Shape Matching and Object Recognition Using Low Distortion Correspondences

A. C. Berg, T. L. Berg, Jitendra Malik - 2005

8 papers in library cite

[8]SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition

Hao Zhang, A. C. Berg, M. Maire, Jitendra Malik - 2006

6 papers in library cite

[9]Columbia Object Image Library: COIL

S. Nene, S. Nayar, H. Murase - 1996

4 papers in library cite

[10]Sharing Features: Efficient Boosting Procedures for Multiclass Object Detection

Antonio Torralba, Kevin P. Murphy, William T. Freeman - 2004

4 papers in library cite

[11]Multiclass Object Recognition With Sparse, Localized Features

J. Mutch, D. Lowe - 2006

3 papers in library cite

[12]The 2005 PASCAL Visual Object Classes Challenge

Mark Everingham, Andrew Zisserman, Christopher K. I. Williams, Luc Van Gool - 2006

3 papers in library cite

66 pages

[13]Combining Generative Models and Fisher Kernels for Object Class Recognition

Alex Holub, M. Welling, Pietro Perona - 2005

2 papers in library cite

[14]Visual Object Category Recognition

Rob Fergus - 2005

2 papers in library cite

[15]American Surfaces

S. Shore - 2005

1 paper in library cites

[16]Object Localization With Boosting and Weak Supervision for Generic Object Recognition

A. Opelt, A. Pinz - 2005

1 paper in library cites

[17]Uncommon Places: The Complete Works

S. Shore - 2004

1 paper in library cites

Cited by 9 papers in your library

Cites 4 papers in your library

Read on January 31, 2026

Boring read, but at least they were somewhat impactful.
