2012
Cite Score
4
AI summary
This paper introduces UCF101, a new large-scale dataset for human action recognition from web videos, featuring 101 action classes, over 13k clips, and 27 hours of data, and provides baseline recognition results using a bag-of-words approach.
Main Contributions
Abstract
We introduce UCF101 which is currently the largest dataset of human actions. It consists of 101 action classes, over 13k clips and 27 hours of video data. The database consists of realistic user-uploaded videos containing camera motion and cluttered background. Additionally, we provide baseline action recognition results on this new dataset using standard bag of words approach with overall performance of 44.5%. To the best of our knowledge, UCF101 is currently the most challenging dataset of actions due to its large number of classes, large number of clips and also unconstrained nature of such clips.
Citation Graph
References [12]
M. Blank, L. Gorelick, E. Shechtman, M. Irani, R. Basri - 2005
2 papers in library cite
J. C. Niebles, C. W. Chen, Li Fei Fei - 2010
2 papers in library cite
Joseph Liu, J. Luo, Mubarak Shah - 2009
2 papers in library cite
M. Rodriguez, J. Ahmed, Mubarak Shah - 2008
1 paper in library cites
D. Weinland, E. Boyer, R. Ronfard - 2007
1 paper in library cites
M. Marszaek, I. Laptev, Cordelia Schmid - 2009
1 paper in library cites
H. Kuehne, H. Jhuang, E. Garrote, T. Poggio, T. Serre - 2011
1 paper in library cites
G. Johansson, S. Bergstrom, W. Epstein - 1994
1 paper in library cites
K. Reddy, Mubarak Shah - 2012
1 paper in library cites
C. Schuldt, I. Laptev, B. Caputo - 2004
1 paper in library cites
Cited by
1
papers in your library
Cites
0
papers in your library
Read
on February 17, 2026
Your review
Tags
Paper Aliases
No aliases