2008

Utility Data Annotation With Amazon Mechanical Turk

Alexander Sorokin, David Forsyth

citations

Cite Score

33

AI summary

This paper presents a framework for outsourcing data annotation tasks to Amazon Mechanical Turk. It demonstrates that this approach yields good-quality, cheap, and fast annotations for several image labeling problems, and it outlines strategies for specifying and pricing tasks.

Main Contributions

  • Demonstrates how to efficiently outsource image annotation to Amazon Mechanical Turk.
  • Shows that annotations produced via Mechanical Turk are of good quality, inexpensive, and quick to obtain.
  • Introduces strategies for defining, pricing, and ensuring the quality of annotation tasks.
  • Discusses four different annotation protocols, including coarse object segmentation, polygonal labeling, and 14-point human landmark labeling.
  • Presents empirical results from five annotation experiments, collecting 3861 labels for 982 images at a total cost of US$59.
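The totals in the last bullet can be turned into rough per-unit figures. The sketch below is only illustrative arithmetic on the three numbers quoted above (3861 labels, 982 images, US$59); the helper name `per_unit` is hypothetical, and the averages pool all five experiments, so they say nothing about the price of any individual protocol.

```python
# Totals quoted in the contributions list (pooled over all five experiments).
TOTAL_LABELS = 3861
TOTAL_IMAGES = 982
TOTAL_COST_USD = 59.0


def per_unit(total: float, count: int) -> float:
    """Return total / count, rejecting a non-positive count."""
    if count <= 0:
        raise ValueError("count must be positive")
    return total / count


# Pooled averages; individual experiments may have been priced differently.
cost_per_label = per_unit(TOTAL_COST_USD, TOTAL_LABELS)
cost_per_image = per_unit(TOTAL_COST_USD, TOTAL_IMAGES)
labels_per_image = per_unit(TOTAL_LABELS, TOTAL_IMAGES)

print(f"cost per label:   ${cost_per_label:.4f}")
print(f"cost per image:   ${cost_per_image:.4f}")
print(f"labels per image: {labels_per_image:.2f}")
```

On these totals the pooled average comes to roughly 1.5 cents per label and about four labels per image.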

Abstract

We show how to outsource data annotation to Amazon Mechanical Turk. Doing so has produced annotations in quite large numbers relatively cheaply. The quality is good, and can be checked and controlled. Annotations are produced quickly. We describe results for several different annotation problems. We describe some strategies for determining when the task is well specified and properly priced.


