2012

Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription

Pascal Vincent

citations

Cite Score

35

AI summary

This paper introduces a recurrent neural network (RNN-RBM) model for polyphonic music generation and transcription, outperforming traditional models on realistic datasets. The model combines recurrent neural networks with restricted Boltzmann machines to discover temporal dependencies in high-dimensional symbolic music sequences, serving as a symbolic prior to improve transcription accuracy.

Main Contributions

  • Introduces a novel RNN-RBM architecture for modeling temporal dependencies in high-dimensional sequences.
  • Demonstrates the effectiveness of the model on polyphonic music generation and transcription tasks.
  • Achieves improved accuracy in polyphonic transcription by using the model as a symbolic prior.
  • Shows that pretraining the RNN layer of an RNN-RBM via Hessian-free optimization significantly improves performance.
  • Demonstrates that RNN-NADE is a robust distribution estimator.

Abstract

We investigate the problem of modeling symbolic sequences of polyphonic music in a completely general piano-roll representation. We introduce a probabilistic model based on distribution estimators conditioned on a recurrent neural network that is able to discover temporal dependencies in high-dimensional sequences. Our approach outperforms many traditional models of polyphonic music on a variety of realistic datasets. We show how our musical language model can serve as a symbolic prior to improve the accuracy of polyphonic transcription.

Citation Graph

Loading graph...

References [26]

Sort:
Filter:

D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams - 1986

46 papers in library cite

Yoshua Bengio, Patrice Simard, Paolo Frasconi - 1994

31 papers in library cite

Yoshua Bengio - 2009

25 papers in library cite

Geoffrey Hinton - 2002

23 papers in library cite

James Martens, Ilya Sutskever - 2011

13 papers in library cite

P. Smolensky - 1986

11 papers in library cite

M. Welling, M. R. Zvi, Geoffrey Hinton - 2005

8 papers in library cite

Hugo Larochelle, I. Murray - 2011

5 papers in library cite

Ilya Sutskever, Geoffrey Hinton, G. Taylor - 2008

5 papers in library cite

Ruslan Salakhutdinov, I. Murray - 2008

4 papers in library cite

Ilya Sutskever, Geoffrey E. Hinton - 2007

3 papers in library cite

Yoshua Bengio, Samy Bengio - 2000

3 papers in library cite

Graham W. Taylor, Geoffrey E. Hinton, S. T. Roweis - 2007

3 papers in library cite

J. Nam, J. Ngiam, Honglak Lee, M. Slaney - 2011

2 papers in library cite

D. Eck, Jürgen Schmidhuber - 2002

2 papers in library cite

M. Allan, Christopher K. I. Williams - 2005

2 papers in library cite

M. C. Mozer - 1994

2 papers in library cite

G. E. Poliner, D. P. W. Ellis - 2007

1 paper in library cites

B. Schrauwen, L. Buesing - 2009

1 paper in library cites

A. T. Cemgil - 2004

1 paper in library cites

V. Mnih, Hugo Larochelle, Geoffrey Hinton - 2011

1 paper in library cites

M. Bay, A. F. Ehmann, J. S. Downie - 2009

1 paper in library cites

Yiwei Li, D. L. Wang - 2007

1 paper in library cites

V. Lavrenko, J. Pickens - 2003

1 paper in library cites

J. F. Paiement, Samy Bengio, D. Eck - 2009

1 paper in library cites

E. Hsu, K. Pulli, J. Popovic - 2005

1 paper in library cites

Cited by

8

papers in your library

Cites

6

papers in your library

Read

on July 12, 2025

Your review

Tags

Paper Aliases

No aliases