2005

Framewise Phoneme Classification With Bidirectional LSTM Networks

Alex Graves, Jürgen Schmidhuber

citations

Cite Score

34

AI summary

This paper applies bidirectional training to LSTM networks for framewise phoneme classification on the TIMIT speech database and reports high framewise recognition scores. It compares bidirectional and unidirectional variants of LSTM and conventional RNNs, finding that LSTM outperforms conventional RNNs and that bidirectional networks outperform unidirectional ones.

Main Contributions

  • Applies bidirectional training to LSTM networks for the first time.
  • Calculates the full error gradient for LSTM weights.
  • Achieves high framewise recognition score on the TIMIT database.
  • Demonstrates the architectural advantage of bidirectional over unidirectional networks.
  • Demonstrates the architectural advantage of LSTM over conventional RNNs.

Abstract

In this paper, we apply bidirectional training to a Long Short Term Memory (LSTM) network for the first time. We also present a modified, full gradient version of the LSTM learning algorithm. On the TIMIT speech database, we measure the framewise phoneme classification ability of bidirectional and unidirectional variants of both LSTM and conventional Recurrent Neural Networks (RNNs). We find that the LSTM architecture outperforms conventional RNNs and that bidirectional networks outperform unidirectional ones.
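The bidirectional, framewise setup described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes a standard LSTM cell with input, forget, and output gates (no peephole connections), random untrained weights, and toy dimensions. All function and variable names (`init_lstm`, `run_lstm`, `bidirectional_framewise`, `W_out`) are hypothetical, and no training procedure is shown.

```python
import math
import random


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def softmax(z):
    m = max(z)
    e = [math.exp(a - m) for a in z]
    s = sum(e)
    return [a / s for a in e]


def init_lstm(n_in, n_hid, rng):
    """Random gate weights; each row acts on [input, previous hidden, bias]."""
    n_cols = n_in + n_hid + 1
    return {
        gate: [[rng.uniform(-0.1, 0.1) for _ in range(n_cols)] for _ in range(n_hid)]
        for gate in ("i", "f", "o", "g")
    }


def run_lstm(params, frames, n_hid):
    """Run one LSTM layer over frames in the given order; return a hidden vector per frame."""
    h = [0.0] * n_hid
    c = [0.0] * n_hid
    outputs = []
    for x in frames:
        z = list(x) + h + [1.0]
        i = [sigmoid(a) for a in matvec(params["i"], z)]    # input gate
        f = [sigmoid(a) for a in matvec(params["f"], z)]    # forget gate
        o = [sigmoid(a) for a in matvec(params["o"], z)]    # output gate
        g = [math.tanh(a) for a in matvec(params["g"], z)]  # candidate cell input
        c = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
        outputs.append(h)
    return outputs


def bidirectional_framewise(frames, fwd, bwd, W_out, n_hid):
    """Return one class distribution per frame, using past and future context."""
    h_fwd = run_lstm(fwd, frames, n_hid)
    # The backward layer reads the sequence reversed; flip its outputs back
    # so that index t refers to the same frame in both directions.
    h_bwd = run_lstm(bwd, frames[::-1], n_hid)[::-1]
    return [softmax(matvec(W_out, hf + hb + [1.0])) for hf, hb in zip(h_fwd, h_bwd)]


# Toy dimensions, chosen only for illustration.
rng = random.Random(0)
n_in, n_hid, n_classes, T = 3, 4, 5, 6
frames = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(T)]
fwd = init_lstm(n_in, n_hid, rng)
bwd = init_lstm(n_in, n_hid, rng)
W_out = [[rng.uniform(-0.1, 0.1) for _ in range(2 * n_hid + 1)] for _ in range(n_classes)]

probs = bidirectional_framewise(frames, fwd, bwd, W_out, n_hid)
predicted = [max(range(n_classes), key=p.__getitem__) for p in probs]
print(predicted)
```

In this sketch the forward layer's state at a given frame depends only on that frame and earlier ones, whereas the combined output at every frame also depends on later frames through the backward layer. That access to future context is the structural difference between the bidirectional and unidirectional variants compared in the abstract.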

Cited by 14 papers in your library

Cites 5 papers in your library

Read on June 30, 2025

Paper Aliases

Framewise Phoneme Classification With Bidirectional LSTM and Other Neural Network Architectures