2014

Microsoft COCO: Common Objects in Context

T. Y. Lin, M. Maire, S. Belongie, James Hays, Pietro Perona, D. Ramanan, Piotr Dollar, C. L. Zitnick

AI summary

This paper introduces MS COCO, a large-scale dataset for object recognition and scene understanding with 2.5 million labeled instances in 328k images covering 91 object types. Objects are labeled with per-instance segmentations collected through crowd workers using purpose-built interfaces. The paper compares the dataset statistically with PASCAL, ImageNet, and SUN, and reports baseline bounding box and segmentation detection results using a Deformable Parts Model.

Main Contributions

  • Introduces a new large-scale dataset (MS COCO) for object recognition and scene understanding.
  • The dataset contains 2.5 million labeled instances in 328k images.
  • Presents a detailed statistical analysis of the dataset in comparison to PASCAL, ImageNet, and SUN.
  • Provides baseline performance analysis for bounding box and segmentation detection results using a Deformable Parts Model.
  • Emphasizes the importance of non-iconic views and contextual reasoning for object recognition.

Abstract

We present a new dataset with the goal of advancing the state of the art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding. This is achieved by gathering images of complex everyday scenes containing common objects in their natural context. Objects are labeled using per-instance segmentations to aid in precise object localization. Our dataset contains photos of 91 object types that would be easily recognizable by a 4-year-old. With a total of 2.5 million labeled instances in 328k images, the creation of our dataset drew upon extensive crowd worker involvement via novel user interfaces for category detection, instance spotting and instance segmentation. We present a detailed statistical analysis of the dataset in comparison to PASCAL, ImageNet, and SUN. Finally, we provide baseline performance analysis for bounding box and segmentation detection results using a Deformable Parts Model.
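
To make the link between per-instance segmentation and bounding box localization concrete, here is a minimal sketch. It assumes each labeled instance is stored as a polygon outline. The `InstanceAnnotation` record, its field names, and the example values are hypothetical illustrations, not the dataset's actual schema.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class InstanceAnnotation:
    """Hypothetical record for one labeled object instance (assumed layout)."""
    image_id: int
    category: str
    polygon: List[Tuple[float, float]]  # outline vertices (x, y) in pixels


def bounding_box(ann: InstanceAnnotation) -> Tuple[float, float, float, float]:
    """Tight axis-aligned box around the outline as (x_min, y_min, width, height)."""
    xs = [x for x, _ in ann.polygon]
    ys = [y for _, y in ann.polygon]
    return (min(xs), min(ys), max(xs) - min(xs), max(ys) - min(ys))


def polygon_area(ann: InstanceAnnotation) -> float:
    """Area enclosed by the outline, computed with the shoelace formula."""
    pts = ann.polygon
    total = 0.0
    # Pair each vertex with the next one, wrapping around to the first.
    for (x1, y1), (x2, y2) in zip(pts, pts[1:] + pts[:1]):
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


# Made-up example: a rectangular outline for a single instance.
example = InstanceAnnotation(
    image_id=1,
    category="dog",
    polygon=[(10.0, 20.0), (50.0, 20.0), (50.0, 80.0), (10.0, 80.0)],
)
print(bounding_box(example))
print(polygon_area(example))
```

A segmentation outline determines the bounding box, but not the other way around, which is one reason per-instance segmentation is the richer label for localization.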
