2014

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

J. Donahue, Y. Jia, Oriol Vinyals, J. Hoffman, N. Zhang, E. Tzeng, Trevor Darrell

Cite Score

77

AI summary

This paper introduces DeCAF, a deep convolutional activation feature, achieving state-of-the-art results on several important vision challenges including scene recognition, domain adaptation, and fine-grained recognition. The method uses a convolutional network trained on ImageNet and releases an open-source implementation.

Main Contributions

  • Introduces DeCAF: a generic visual feature based on a convolutional network trained on ImageNet.
  • Demonstrates that convolutional features cluster semantic topics more readily than conventional features.
  • Achieves state-of-the-art results on Caltech-101, the Office domain adaptation dataset, the Caltech-UCSD Birds fine-grained recognition dataset, and the SUN-397 scene recognition database.
  • Releases an open-source implementation of DeCAF, along with all associated network parameters.

Abstract

We evaluate whether features extracted from the activation of a deep convolutional network trained in a fully supervised fashion on a large, fixed set of object recognition tasks can be repurposed to novel generic tasks. Our generic tasks may differ significantly from the originally trained tasks and there may be insufficient labeled or unlabeled data to conventionally train or adapt a deep architecture to the new tasks. We investigate and visualize the semantic clustering of deep convolutional features with respect to a variety of such tasks, including scene recognition, domain adaptation, and fine-grained recognition challenges. We compare the efficacy of relying on various network levels to define a fixed feature, and report novel results that significantly outperform the state-of-the-art on several important vision challenges. We are releasing DeCAF, an open-source implementation of these deep convolutional activation features, along with all associated network parameters to enable vision researchers to be able to conduct experimentation with deep representations across a range of visual concept learning paradigms.
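The idea in the abstract, taking the activations of one layer of an already-trained network as a fixed feature and training only a simple classifier on top, can be illustrated with a toy sketch. This is not the DeCAF release or its API: the frozen network here is a single layer of random weights standing in for ImageNet-trained parameters, the data are synthetic 2-D points, and every function name (`make_frozen_network`, `extract_features`, `train_linear_classifier`, `predict`) is hypothetical.

```python
import math
import random


def make_frozen_network(n_in, n_hidden, seed=0):
    """Stand-in for a pretrained network: fixed weights that are never updated.

    Assumption: random weights replace the ImageNet-trained parameters.
    """
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(n_in)] for _ in range(n_hidden)]


def extract_features(frozen_weights, x):
    """Return the ReLU activations of the frozen layer as a fixed feature vector."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in frozen_weights]


def sigmoid(z):
    # Clamp the input so math.exp cannot overflow.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def train_linear_classifier(features, labels, epochs=50, lr=0.05):
    """Logistic regression by SGD; only these weights are learned."""
    w, b = [0.0] * len(features[0]), 0.0
    for _ in range(epochs):
        for f, y in zip(features, labels):
            p = sigmoid(sum(wi * fi for wi, fi in zip(w, f)) + b)
            g = p - y  # gradient of the log loss with respect to the logit
            w = [wi - lr * g * fi for wi, fi in zip(w, f)]
            b -= lr * g
    return w, b


def predict(w, b, f):
    return 1 if sum(wi * fi for wi, fi in zip(w, f)) + b > 0 else 0


# Synthetic "new task": two clusters of 2-D points with binary labels.
rng = random.Random(1)
inputs = [[rng.gauss(c, 0.5), rng.gauss(c, 0.5)] for c in (2.0, -2.0) for _ in range(20)]
labels = [1] * 20 + [0] * 20

network = make_frozen_network(n_in=2, n_hidden=8)
features = [extract_features(network, x) for x in inputs]  # network stays fixed
w, b = train_linear_classifier(features, labels)
predictions = [predict(w, b, f) for f in features]
```

The design point is the separation of roles: `network` is only read during feature extraction, and all learning happens in the small linear model, which is why the approach remains usable when the new task has too little data to train or adapt a deep architecture.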
