2005

Learning Object Categories From Google's Image Search

Rob Fergus, Li Fei-Fei, Pietro Perona, Andrew Zisserman

Cite Score: 36

AI summary

This paper introduces TSI-PLSA, an extension of PLSA that incorporates spatial information, to learn object categories directly from noisy, unstructured image search engine results, achieving competitive performance on standard test sets compared to methods using hand-prepared datasets.

Main Contributions

  • Proposed an approach to learn object categories directly from the raw output of Internet image search engines, reducing the need for manually prepared datasets.
  • Introduced TSI-PLSA (Translation and Scale Invariant PLSA), a new model that extends PLSA to include spatial information in a translation- and scale-invariant manner.
  • Demonstrated that the proposed model can handle high intra-class variability and a large proportion of unrelated images from search engine results.
  • Evaluated the model on standard test sets, showing performance competitive with existing methods trained on hand-prepared datasets.
  • Showed the potential of learned models to improve image search quality by re-ranking images based on learned topics.
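
The re-ranking idea in the last point can be illustrated with a minimal sketch. It fits plain PLSA (not TSI-PLSA, so no spatial information) by EM on a matrix of visual-word counts, then sorts images by the posterior of one topic. The function names `fit_plsa` and `rerank`, the toy count matrix, and the assumption that the caller already knows which topic index corresponds to the target category are all illustrative and not taken from the paper.

```python
import numpy as np


def fit_plsa(counts, n_topics, n_iter=50, seed=0):
    """Fit plain PLSA by EM (hypothetical helper, not the paper's code).

    counts: array of shape (n_images, n_words), visual-word histograms.
    Returns P(w|z) with shape (n_topics, n_words) and
            P(z|d) with shape (n_images, n_topics).
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    eps = 1e-12  # guards against division by zero

    # Random initialisation, each row normalised to sum to 1.
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)

    for _ in range(n_iter):
        # E-step: P(z|d,w) is proportional to P(w|z) P(z|d).
        joint = p_z_d[:, :, None] * p_w_z[None, :, :]
        post = joint / (joint.sum(axis=1, keepdims=True) + eps)

        # Expected counts n(d,w) * P(z|d,w), shape (docs, topics, words).
        weighted = counts[:, None, :] * post

        # M-step: re-estimate both distributions from expected counts.
        p_w_z = weighted.sum(axis=0)
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + eps
        p_z_d = weighted.sum(axis=2)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + eps

    return p_w_z, p_z_d


def rerank(p_z_d, topic):
    """Return image indices sorted by descending P(topic | image)."""
    return np.argsort(-p_z_d[:, topic])


# Toy data (assumed): 4 images, 4 visual words, two obvious groups.
counts = np.array(
    [[5, 4, 0, 0], [6, 3, 0, 0], [0, 0, 4, 5], [0, 1, 5, 4]], dtype=float
)
p_w_z, p_z_d = fit_plsa(counts, n_topics=2)
order = rerank(p_z_d, topic=0)
```

Which topic represents the searched-for category is not decided by this sketch; the caller has to pick it, for instance by checking the topics against a few known-good images.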

Abstract

Current approaches to object category recognition require datasets of training images to be manually prepared, with varying degrees of supervision. We present an approach that can learn an object category from just its name, by utilizing the raw output of image search engines available on the Internet. We develop a new model, TSI-PLSA, which extends PLSA (as applied to visual words) to include spatial information in a translation and scale invariant manner. Our approach can handle the high intra-class variability and large proportion of unrelated images returned by search engines. We evaluate the models on standard test sets, showing performance competitive with existing methods trained on hand prepared datasets.
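
For orientation, the standard PLSA model that the abstract builds on can be written as below. This is the textbook formulation, given here as background and not quoted from the paper. The symbols are assumptions of this note: d indexes images, w visual words, z latent topics, and n(d, w) is the count of word w in image d. The spatial extension that defines TSI-PLSA is not reproduced.

```latex
% Each image d is modelled as a mixture of topics z over visual words w.
P(w \mid d) = \sum_{z} P(w \mid z)\, P(z \mid d)

% Parameters are fit by maximising the log-likelihood of the observed counts
% (terms that depend only on P(d) are omitted).
\mathcal{L} = \sum_{d} \sum_{w} n(d, w)\, \log P(w \mid d)
```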

