2006

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Svetlana Lazebnik, Cordelia Schmid, Jean Ponce

Cite Score

86

AI summary

This paper introduces Spatial Pyramid Matching for scene categorization, an extension of bag-of-features that uses multi-resolution histograms of local features to capture approximate geometric correspondence, achieving state-of-the-art results on the Caltech-101 database and high accuracy on a large database of fifteen natural scene categories.

Main Contributions

  • Introduces Spatial Pyramid Matching (SPM) for scene categorization, an extension of the bag-of-features representation that incorporates approximate global geometric correspondence.
  • Proposes a technique that partitions images into increasingly fine sub-regions and computes histograms of local features within each, creating a "spatial pyramid" representation.
  • Demonstrates that SPM significantly improves performance on challenging scene categorization tasks compared to orderless bag-of-features methods.
  • Achieves state-of-the-art performance on the Caltech-101 database and high accuracy on a large database of fifteen natural scene categories.
  • Provides insights into the success of existing image descriptions like Torralba's "gist" and Lowe's SIFT descriptors through the spatial pyramid framework.
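
The partitioning described in the second contribution can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes local features have already been quantized into integer visual-word indices, that feature coordinates are normalized to the unit square, and that level l splits the image into a 2^l x 2^l grid. The function name `spatial_pyramid_histogram` and its parameters are hypothetical.

```python
import numpy as np

def spatial_pyramid_histogram(xy, words, vocab_size, levels=2):
    """Concatenate per-cell visual-word histograms for pyramid levels 0..levels.

    xy:     (N, 2) feature coordinates, assumed normalized to [0, 1].
    words:  (N,) integer visual-word indices in [0, vocab_size).
    """
    xy = np.asarray(xy, dtype=float)
    words = np.asarray(words, dtype=int)
    parts = []
    for level in range(levels + 1):
        cells = 2 ** level  # cells per axis at this level (assumed grid layout)
        # Grid cell of each feature; clip keeps a coordinate of exactly 1.0 in range.
        col = np.clip((xy[:, 0] * cells).astype(int), 0, cells - 1)
        row = np.clip((xy[:, 1] * cells).astype(int), 0, cells - 1)
        # One histogram bin per (cell, visual word) pair.
        flat = (row * cells + col) * vocab_size + words
        parts.append(np.bincount(flat, minlength=cells * cells * vocab_size))
    return np.concatenate(parts)
```

With `levels=0` the result reduces to a single orderless bag-of-features histogram, which is why the pyramid can be read as an extension of that representation.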

Abstract

This paper presents a method for recognizing scene categories based on approximate global geometric correspondence. This technique works by partitioning the image into increasingly fine sub-regions and computing histograms of local features found inside each sub-region. The resulting "spatial pyramid" is a simple and computationally efficient extension of an orderless bag-of-features image representation, and it shows significantly improved performance on challenging scene categorization tasks. Specifically, our proposed method exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories. The spatial pyramid framework also offers insights into the success of several recently proposed image descriptions, including Torralba's "gist" and Lowe's SIFT descriptors.
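
The abstract does not spell out how two spatial pyramids are compared. One way to sketch the matching step is a weighted sum of per-level histogram intersections in which finer levels count more. The weights used here (1/2^L for level 0 and 1/2^(L-l+1) for level l >= 1, where L is the finest level) are an assumption of this sketch and should be checked against the paper itself. The name `pyramid_match` and the list-of-arrays input layout are hypothetical.

```python
import numpy as np

def pyramid_match(hists_x, hists_y):
    """Weighted sum of histogram intersections over pyramid levels.

    hists_x, hists_y: lists of 1-D arrays, one per level, coarsest (level 0) first.
    """
    top = len(hists_x) - 1  # index of the finest level, L
    score = 0.0
    for level, (hx, hy) in enumerate(zip(hists_x, hists_y)):
        # Histogram intersection: number of features matched at this level.
        intersection = np.minimum(hx, hy).sum()
        # Assumed weighting: level 0 gets 1/2^L, level l >= 1 gets 1/2^(L - l + 1).
        weight = 1.0 / 2 ** top if level == 0 else 1.0 / 2 ** (top - level + 1)
        score += weight * intersection
    return float(score)
```

Because the score is a positively weighted sum of histogram intersections, it can be used as a kernel in a support vector machine, provided both images are described with the same vocabulary and number of levels.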
