2011

Parsing Natural Scenes and Natural Language With Recursive Neural Networks

Richard Socher, C. C. Lin, C. Manning, Andrew Y. Ng

citations

Cite Score

53

AI summary

This paper introduces a recursive neural network architecture that parses images and sentences, achieving state-of-the-art results on the Stanford background dataset for segmentation and annotation. The image parse tree features outperform Gist descriptors for scene classification. The algorithm also parses natural language sentences with competitive performance.

Main Contributions

  • Introduces a deep learning method to achieve state-of-the-art results on segmentation and annotation of complex scenes.
  • Presents a recursive neural network architecture that predicts hierarchical tree structures for scene images.
  • Demonstrates that learned features outperform state-of-the-art methods, such as Gist descriptors, for scene classification.
  • Shows that the algorithm is general and can parse natural language sentences obtaining competitive performance.
  • Achieves a new state-of-the-art performance on the Stanford background dataset (78.1%).

Abstract

Recursive structure is commonly found in the inputs of different modalities such as natural scene images or natural language sentences. Discovering this recursive structure helps us to not only identify the units that an image or sentence contains but also how they interact to form a whole. We introduce a max-margin structure prediction architecture based on recursive neural networks that can successfully recover such structure both in complex scene images as well as sentences. The same algorithm can be used both to provide a competitive syntactic parser for natural language sentences from the Penn Treebank and to outperform alternative approaches for semantic scene segmentation, annotation and classification. For segmentation and annotation our algorithm obtains a new level of state-of-the-art performance on the Stanford background dataset (78.1%). The features from the image parse tree outperform Gist descriptors for scene classification by 4%.

Citation Graph

Loading graph...

References [25]

Sort:
Filter:

Geoffrey Hinton, Ruslan Salakhutdinov - 2006

37 papers in library cite

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Svetlana Lazebnik, Cordelia Schmid, Jean Ponce - 2006

14 papers in library cite

Ronan Collobert, Jason Weston - 2008

32 papers in library cite

Honglak Lee, R. Grosse, R. Ranganath, Andrew Y. Ng - 2009

12 papers in library cite

Aude Oliva, Antonio Torralba - 2001

7 papers in library cite

Manning, Schutze - 1999

4 papers in library cite

Slav Petrov, L. Barrett, R. Thibaux, Dan Klein - 2006

4 papers in library cite

Richard Socher, Christopher D. Manning, Andrew Y. Ng - 2010

4 papers in library cite

C. Goller, A. Kuchler - 1996

4 papers in library cite

D. Hoiem, A. A. Efros, M. Hebert - 2006

4 papers in library cite

S. C. Zhu, D. Mumford - 2006

3 papers in library cite

J. Shotton, John Winn, C. Rother, A. Criminisi - 2006

3 papers in library cite

Aman Gupta, L. S. Davis - 2008

2 papers in library cite

J. Henderson - 2003

2 papers in library cite

J. Tighe, Svetlana Lazebnik - 2010

2 papers in library cite

N. Ratliff, J. A. Bagnell, M. Zinkevich - 2007

1 paper in library cites

Richard Socher, Li Fei Fei - 2010

1 paper in library cites

S. Gould, R. Fulton, D. Koller - 2009

1 paper in library cites

B. Taskar, Dan Klein, Michael Collins, D. Koller, C. Manning - 2004

1 paper in library cites

D. Comaniciu, P. Meer - 2002

1 paper in library cites

Andrew Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora, S. Belongie - 2007

1 paper in library cites

L. Zhu, C. Yuanhao, Antonio Torralba, William T. Freeman, A. L. Yuille - 2010

1 paper in library cites

J. M. Siskind, J. Sherman, Jr, I. Pollak, M. P. Harper, C. A. Bouman - 2007

1 paper in library cites

L. J. Li, Richard Socher, Li Fei Fei - 2009

1 paper in library cites

Cited by

10

papers in your library

Cites

5

papers in your library

Read

on April 27, 2025

Your review

Tags

Paper Aliases

No aliases