2006

Connectionist Temporal Classification: Labelling Unsegmented Sequence Data With Recurrent Neural Networks

Alex Graves, Santiago Fernandez, Faustino Gomez, Jürgen Schmidhuber

AI summary

This paper introduces Connectionist Temporal Classification (CTC), a novel method using RNNs for labeling unsegmented sequences. It removes the need for pre-segmented data and post-processing. Applied to the TIMIT speech corpus, CTC outperforms baseline HMM and hybrid HMM-RNN models.

Main Contributions

  • Introduces Connectionist Temporal Classification (CTC), a method for training RNNs to label unsegmented sequence data directly, eliminating the need for pre-segmented training data and output post-processing.
  • The network outputs are interpreted as a probability distribution over all possible label sequences, conditioned on a given input sequence.
  • An objective function is derived that directly maximizes the probabilities of the correct labellings.
  • CTC outperforms both a baseline HMM and a hybrid HMM-RNN on the TIMIT speech corpus.
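
The idea of a distribution over label sequences can be made concrete with a small sketch. This page does not spell out the algorithm, so the code below follows the standard CTC formulation: a frame-level path is collapsed to a labelling by merging repeated symbols and then dropping blanks, and the probability of a labelling is the sum over all paths that collapse to it. The function names (`collapse`, `ctc_prob`, `brute_force_prob`) and the choice of index 0 for the blank symbol are assumptions made for illustration, not taken from the paper.

```python
from itertools import product

BLANK = 0  # assumption: index 0 is the blank symbol


def collapse(path):
    """Map a frame-level path to a labelling: merge repeats, then drop blanks."""
    out = []
    prev = None
    for s in path:
        if s != prev and s != BLANK:
            out.append(s)
        prev = s
    return tuple(out)


def ctc_prob(probs, labels):
    """Forward recursion: total probability of all paths that collapse to labels.

    probs[t][k] is the per-frame output probability of symbol k at frame t.
    """
    # Interleave blanks around the labels: (blank, l1, blank, l2, ..., blank)
    ext = [BLANK]
    for s in labels:
        ext += [s, BLANK]
    S = len(ext)
    alpha = [0.0] * S
    alpha[0] = probs[0][ext[0]]
    if S > 1:
        alpha[1] = probs[0][ext[1]]
    for t in range(1, len(probs)):
        new = [0.0] * S
        for s in range(S):
            total = alpha[s]
            if s >= 1:
                total += alpha[s - 1]
            # A blank may be skipped only between two different labels
            if s >= 2 and ext[s] != BLANK and ext[s] != ext[s - 2]:
                total += alpha[s - 2]
            new[s] = total * probs[t][ext[s]]
        alpha = new
    # Valid paths end on the last label or on the trailing blank
    return alpha[-1] + (alpha[-2] if S > 1 else 0.0)


def brute_force_prob(probs, labels):
    """Sum over every path explicitly; exponential cost, for checking only."""
    num_symbols = len(probs[0])
    total = 0.0
    for path in product(range(num_symbols), repeat=len(probs)):
        if collapse(path) == tuple(labels):
            p = 1.0
            for t, k in enumerate(path):
                p *= probs[t][k]
            total += p
    return total
```

Under these assumptions, the training objective for one example would be the negative log of `ctc_prob(probs, labels)`. A practical implementation would typically work in log space or rescale `alpha` at each frame to avoid underflow on long sequences; this sketch omits that for readability.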

Abstract

Many real-world sequence learning tasks require the prediction of sequences of labels from noisy, unsegmented input data. In speech recognition, for example, an acoustic signal is transcribed into words or sub-word units. Recurrent neural networks (RNNs) are powerful sequence learners that would seem well suited to such tasks. However, because they require pre-segmented training data, and post-processing to transform their outputs into label sequences, their applicability has so far been limited. This paper presents a novel method for training RNNs to label unsegmented sequences directly, thereby solving both problems. An experiment on the TIMIT speech corpus demonstrates its advantages over both a baseline HMM and a hybrid HMM-RNN.
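
To illustrate what labelling an unsegmented sequence directly can look like at inference time, here is a minimal sketch of the simplest decoding strategy usually associated with CTC, often called best-path decoding: take the most probable symbol at each frame, merge repeats, and drop blanks. The function name `best_path_decode` and the use of index 0 as the blank are assumptions for illustration. This is an approximation, since the most probable single path does not necessarily correspond to the most probable labelling.

```python
BLANK = 0  # assumption: index 0 is the blank symbol


def best_path_decode(probs, blank=BLANK):
    """Greedy decoding: per-frame argmax, then merge repeats and drop blanks.

    probs[t][k] is the per-frame output probability of symbol k at frame t.
    """
    # Most probable symbol index at each frame
    best = [max(range(len(frame)), key=frame.__getitem__) for frame in probs]
    decoded = []
    prev = None
    for k in best:
        if k != prev and k != blank:
            decoded.append(k)
        prev = k
    return decoded


# Hypothetical per-frame outputs over the symbols {blank, 1, 2}
frames = [
    [0.1, 0.8, 0.1],
    [0.1, 0.7, 0.2],
    [0.9, 0.05, 0.05],
    [0.2, 0.7, 0.1],
    [0.1, 0.2, 0.7],
]
print(best_path_decode(frames))  # prints [1, 1, 2]
```

The blank frame between the two runs of symbol 1 is what lets the decoder emit the same label twice in a row; without it, the repeated frames would merge into a single label.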
