Papperoni

2005

Learning Object Categories From Google's Image Search

Rob Fergus, Li Fei Fei, Pietro Perona, Andrew Zisserman

Open PDF Google Scholar

citations

Cite Score

36

AI summary

This paper introduces TSI-PLSA, an extension of PLSA that incorporates spatial information, to learn object categories directly from noisy, unstructured image search engine results, achieving competitive performance on standard test sets compared to methods using hand-prepared datasets.

Main Contributions

Proposed an approach to learn object categories directly from raw output of Internet image search engines, reducing the need for manually prepared datasets.
Introduced TSI-PLSA (Translation and Scale Invariant pLSA), a new model that extends pLSA to include spatial information in a translation and scale-invariant manner.
Demonstrated that the proposed model can handle high intra-class variability and a large proportion of unrelated images from search engine results.
Evaluated the model on standard test sets, showing performance competitive with existing methods trained on hand-prepared datasets.
Showed the potential of learned models to improve image search quality by re-ranking images based on learned topics.

Abstract

Current approaches to object category recognition require datasets of training images to be manually prepared, with varying degrees of supervision. We present an approach that can learn an object category from just its name, by uti- lizing the raw output of image search engines available on the Internet. We develop a new model, TSI-PLSA, which extends PLSA (as applied to visual words) to include spa- tial information in a translation and scale invariant man- ner. Our approach can handle the high intra-class vari- ability and large proportion of unrelated images returned by search engines. We evaluate the models on standard test sets, showing performance competitive with existing meth- ods trained on hand prepared datasets.

Citation Graph

Loading graph...

References [24]

Sort:

Filter:

[1]Video google: A Text Retrieval Approach to Object Matching in Videos

Josef Sivic, Andrew Zisserman - 2003

5 papers in library cite

Fun read! It's not really related to AI, but TBH the way they do search on video is more interesting than the object recog.

[2]Learning Methods for Generic Object Recognition With Invariance to Pose and Lighting

Yann Lecun, Fu Jie Huang, Leon Bottou - 2004

18 papers in library cite

Good paper, nice methodology for creating different images. However, I think that this was not too impactful... I don't see this being used a lot.

Reference title contains 'et al'

[3]Latent Dirichlet Allocation

D. M. Blei, Andrew Y. Ng, Michael I. Jordan - 2003

10 papers in library cite

30 pages; LDA

[4]Weak Hypotheses and Boosting for Generic Object Detection and Recognition

A. Opelt, A. Fussenegger, P. Auer - 2004

2 papers in library cite

[5]Rapid Object Detection Using a Boosted Cascade of Simple Features

P. Viola, M. J. Jones - 2001

10 papers in library cite

[6]Shape Matching and Object Recognition Using Low Distortion Correspondences

A. C. Berg, T. L. Berg, Jitendra Malik - 2005

8 papers in library cite

[7]Object Recognition From Local Scale-Invariant Features

D. G. Lowe - 1999

6 papers in library cite

[8]A Bayesian Approach to Unsupervised One-Shot Learning of Object Categories

Li Fei Fei, Rob Fergus, Pietro Perona - 2003

4 papers in library cite

[9]A Bayesian Hierarchical Model for Learning Natural Scene Categories

Li Fei Fei, Pietro Perona - 2005

4 papers in library cite

[10]Combined Object Categorization and Segmentation With an Implicit Shape Model

B. Leibe, A. Leonardis, B. Schiele - 2004

4 papers in library cite

[11]Object Class Recognition by Unsupervised Scale-Invariant Learning

Rob Fergus, Pietro Perona, Andrew Zisserman - 2003

4 papers in library cite

[12]Sharing Features: Efficient Boosting Procedures for Multiclass Object Detection

Antonio Torralba, Kevin P. Murphy, William T. Freeman - 2004

4 papers in library cite

[13]A Visual Category Filter for Google Images

Rob Fergus, Pietro Perona, Andrew Zisserman - 2004

3 papers in library cite

[14]Learning to Detect Objects in Images via a Sparse, Part-Based Representation

Sandhini Agarwal, A. Awan, Dan Roth - 2004

3 papers in library cite

[15]Matching Words and Pictures

K. Barnard, P. Duygulu, N. D. Freitas, David Forsyth, D. Blei, M. Jordan - 2003

3 papers in library cite

[16]Scale, Saliency and Image Description

T. Kadir, M. Brady - 2001

3 papers in library cite

[17]Unsupervised Learning of Models for Recognition

M. Weber, M. Welling, Pietro Perona - 2000

3 papers in library cite

[18]Visual Categorization With Bags of Keypoints

G. Csurka, C. Bray, C. Dance, L. Fan - 2004

3 papers in library cite

[19]Indexing Based on Scale Invariant Interest Points

K. Mikolajczyk, Cordelia Schmid - 2001

2 papers in library cite

[20]Caltech Object Category Datasets

Rob Fergus, Pietro Perona - 2003

1 paper in library cites

[21]Discovering Object Categories in Image Collections

Josef Sivic, B. Russell, A. Efros, Andrew Zisserman, W. Freeman - 2005

1 paper in library cites

[22]Object Recognition With Informative Features and Linear Classification

N. V. Naquet, S. Ullman - 2003

1 paper in library cites

[23]PASCAL Visual Object Challenge Datasets

Mark Everingham, Luc Van Gool, C. Williams, Andrew Zisserman - 2005

1 paper in library cites

[24]Probabilistic Latent Semantic Indexing

T. Hofmann - 1999

1 paper in library cites

Cited by

3

papers in your library

Cites

3

papers in your library

Read

on January 11, 2026

Wow, they focus totally on the LSA stuff, and not on the Google Image Search stuff. Anticlimatic.

Tags

Paper Aliases

No aliases