2015

Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books

Yuxuan Zhu, R. Kiros, R. Zemel, Ruslan Salakhutdinov, R. Urtasun, Antonio Torralba, Sanja Fidler

citations

Cite Score

69

AI summary

This paper introduces a model that aligns books to movies to provide rich descriptive explanations for visual content, using a neural sentence embedding trained on the BookCorpus dataset and a video-text neural embedding trained on DVS descriptions of movie clips. It achieves good performance for movie/book alignment.

Main Contributions

  • Introduces a model that aligns books to movies to provide rich descriptive explanations for visual content.
  • Exploits a neural sentence embedding trained on the BookCorpus dataset.
  • Uses a video-text neural embedding trained on DVS descriptions of movie clips.
  • Proposes a context-aware CNN to combine information from multiple sources.
  • Introduces the MovieBook dataset with 11 movie/book pairs annotated with 2,070 shot-to-sentence correspondences.

Abstract

Books are a rich source of both fine-grained information, how a character, an object or a scene looks like, as well as high-level semantics, what someone is thinking, feeling and how these states evolve through a story. This paper aims to align books to their movie releases in order to provide rich descriptive explanations for visual content that go semantically far beyond the captions available in current datasets. To align movies and books we exploit a neural sentence embedding that is trained in an unsupervised way from a large corpus of books, as well as a video-text neural embedding for computing similarities between movie clips and sentences in the book. We propose a context-aware CNN to combine information from multiple sources. We demonstrate good quantitative performance for movie/book alignment and show several qualitative examples that showcase the diversity of tasks our model can be used for.

Citation Graph

Loading graph...

References [38]

Sort:
Filter:

D. P. Kingma, Jimmy Lei Ba - 2014

49 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Christian Szegedy, Weizhou Liu, Y. Jia, P. Sermanet, S. Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich - 2015

20 papers in library cite

T. Y. Lin, M. Maire, S. Belongie, James Hays, Pietro Perona, D. Ramanan, Piotr Dollar, C. L. Zitnick - 2014

14 papers in library cite

Tomas Mikolov, K. Chen, G. S. Corrado, Jeffrey Dean - 2013

26 papers in library cite

D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014

59 papers in library cite

Kyunghyun Cho, B. V. Merrienboer, C. G. Gulcehre, D. Bahdanau, F. Bougares, Holger Schwenk, Yoshua Bengio - 2014

38 papers in library cite

K. Papineni, S. Roukos, T. Ward, Wei Jing Zhu - 2002

19 papers in library cite

Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014

58 papers in library cite

J. Chung, C. G. Gulcehre, Kyunghyun Cho, Yoshua Bengio - 2014

11 papers in library cite

K. Xu, Jimmy Lei Ba, R. Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, R. Zemel, Yoshua Bengio - 2015

12 papers in library cite

Dumitru Erhan - 2015

11 papers in library cite

R. Kiros, Yuxuan Zhu, Ruslan Salakhutdinov, Richard S. Zemel, R. Urtasun, Antonio Torralba, Sanja Fidler - 2015

23 papers in library cite

N. Kalchbrenner, Phil Blunsom - 2013

27 papers in library cite

Richard S. Zemel - 2014

5 papers in library cite

A. Karpathy, Li Fei Fei - 2014

6 papers in library cite

S. Venugopalan, Hu Xu, J. Donahue, M. Rohrbach, R. J. Mooney, K. Saenko - 2014

1 paper in library cites

A. Rohrbach, M. Rohrbach, N. Tandon, B. Schiele - 2015

1 paper in library cites

J. Mao, Weixin Xu, Yining Yang, J. Wang, A. L. Yuille - 2014

3 papers in library cite

H. Pirsiavash, C. Vondrick, Antonio Torralba - 2014

1 paper in library cites

G. Kulkarni, V. Premraj, S. Dhar, Shanda Li, Yejin Choi, A. C. Berg, T. L. Berg - 2011

4 papers in library cite

Ali Farhadi, M. Hejrati, M. A. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, David Forsyth - 2010

3 papers in library cite

Aman Gupta, L. S. Davis - 2008

2 papers in library cite

Mark Everingham, Josef Sivic, Andrew Zisserman - 2006

1 paper in library cites

Josef Sivic, Mark Everingham, Andrew Zisserman - 2009

1 paper in library cites

M. Malinowski, M. Fritz - 2014

1 paper in library cites

Sanja Fidler, Archit Sharma, R. Urtasun - 2013

1 paper in library cites

M. Tapaswi, M. Bauml, R. Stiefelhagen - 2015

1 paper in library cites

M. Tapaswi, M. Bauml, R. Stiefelhagen - 2015

1 paper in library cites

Xiangning Lin, D. Parikh - 2015

1 paper in library cites

A. Schwing, T. Hazan, M. Pollefeys, R. Urtasun - 2012

1 paper in library cites

B. Zhou, A. Lapedriza, Jianxiong Xiao, Antonio Torralba, Aude Oliva - 2014

1 paper in library cites

V. Ramanathan, Armand Joulin, Percy Liang, Li Fei Fei - 2014

1 paper in library cites

T. Cour, C. Jordan, E. Miltsakaki, B. Taskar - 2008

1 paper in library cites

P. Sankar, C. V. Jawahar, Andrew Zisserman - 2009

1 paper in library cites

V. Ramanathan, Percy Liang, Li Fei Fei - 2013

1 paper in library cites

D. Lin, Sanja Fidler, C. Kong, R. Urtasun - 2014

1 paper in library cites

C. Kong, D. Lin, Mohit Bansal, R. Urtasun, Sanja Fidler - 2014

1 paper in library cites

Cited by

18

papers in your library

Cites

20

papers in your library

Read

on August 5, 2025

Your review

Tags

Paper Aliases

No aliases