2010

Large Scale Image Annotation: Learning to Rank With Joint Word-Image Embeddings

Jason Weston, Samy Bengio, Nicolas Usunier

citations

Cite Score

24

AI summary

This paper introduces WSABIE, a scalable method for large-scale image annotation by learning a low-dimensional joint embedding space for images and annotations, optimizing precision at k using the novel WARP loss, and introducing a "sibling precision" metric, outperforming baselines in speed, memory, and accuracy on ImageNet and Web-data datasets.

Main Contributions

  • Introduces WSABIE (Web Scale Annotation by Image Embedding) for large-scale image annotation.
  • Proposes the WARP loss (Weighted Approximate-Rank Pairwise loss) for efficient, online optimization of precision at k, enabling training on datasets that do not fit in memory.
  • Develops a low-dimensional joint embedding space for images and annotations, allowing fast testing and low memory usage.
  • Introduces the "sibling precision" metric to quantify semantic similarity of predictions, giving credit for related annotations.
  • Achieves superior performance compared to competing approaches on ImageNet and a proprietary Web-data dataset, with significant speedups and memory reductions.

Abstract

Image annotation datasets are becoming larger and larger, with tens of millions of images and tens of thousands of possible anno-tations. We propose a strongly performing method that scales to such datasets by simultaneously learning to optimize precision at k of the ranked list of annotations for a given image and learning a low-dimensional joint embedding space for both images and annotations. Our method both outperforms several baseline methods and, in comparison to them, is faster and consumes less memory. We also demonstrate how our method learns an interpretable model, where annotations with alternate spellings or even languages are close in the embedding space. Hence, even when our model does not predict the exact annotation given by a human la-beler, it often predicts similar annotations, a fact that we try to quantify by measuring the newly introduced "sibling" precision metric, where our method also obtains excellent results.

Citation Graph

Loading graph...

References [24]

Sort:
Filter:

J. Deng, W. Dong, Richard Socher, L. J. Li, K. Li, Li Fei Fei - 2009

28 papers in library cite

Sutton Monro - 1951

3 papers in library cite

Greg Griffin, Alex Holub, Pietro Perona - 2007

9 papers in library cite

Antonio Torralba, Rob Fergus, W. Freeman - 2008

8 papers in library cite

Rie Kubota Ando, Tong Zhang - 2005

10 papers in library cite

C. Fellbaum - 1998

12 papers in library cite

R. Tibshirani - 1996

4 papers in library cite

D. Wolpert - 1992

3 papers in library cite

D. Grangier, Samy Bengio - 2008

2 papers in library cite

K. Crammer, O. Dekel, J. Keshet, S. S. Shwartz, Yoram Singer - 2006

2 papers in library cite

Antonio Torralba, Rob Fergus, Y. Weiss - 2008

2 papers in library cite

A. Makadia, V. Pavlovic, S. Kumar - 2008

1 paper in library cites

Y. Yue, T. Finley, F. Radlinski, T. Joachims - 2007

1 paper in library cites

Fanyue Xia, T. Y. Liu, J. Wang, Wenxuan Zhang, H. Li - 2008

1 paper in library cites

F. Monay, D. G. Perez - 2004

1 paper in library cites

B. Bai, Jason Weston, D. Grangier, Ronan Collobert, K. Sadamasa, Y. Qi, C. Cortes, M. Mohri - 2009

1 paper in library cites

Nicolas Usunier, D. Buffoni, P. Gallinari - 2009

1 paper in library cites

Rob Fergus, Y. Weiss, Antonio Torralba - 2009

1 paper in library cites

Zijian Zhou, D. Zhan, Qiang Yang - 2007

1 paper in library cites

Missing year

J. Wang, Jeffrey Li, G. Wiederholdy

1 paper in library cites

M. Guillaumin, T. Mensink, J. Verbeek, Cordelia Schmid, I. Lear, L. Kuntzmann - 2009

1 paper in library cites

Mark Everingham, Luc Van Gool, Christopher K. I. Williams, John Winn, Andrew Zisserman - 2007

1 paper in library cites

Kristen Grauman, Trevor Darrell - 2007

1 paper in library cites

Nicolas Loeff, Ali Farhadi, I. Endres, David Forsyth - 2009

1 paper in library cites

Cited by

3

papers in your library

Cites

6

papers in your library

Read

on February 13, 2026

Your review

Tags

Paper Aliases

No aliases