2013

Hybrid Speech Recognition With Deep Bidirectional LSTM

Alex Graves, Navdeep Jaitly, Abdel Rahman Mohamed

citations

Cite Score

58

AI summary

This paper introduces a DBLSTM-HMM hybrid system for speech recognition. It achieves state-of-the-art results on TIMIT and outperforms GMM and deep network benchmarks on a subset of the Wall Street Journal corpus. However, the improvement in word error rate over the deep network is modest.

Main Contributions

  • A DBLSTM-HMM hybrid system for speech recognition is proposed.
  • The system achieves state-of-the-art results on TIMIT.
  • The system outperforms GMM and deep network benchmarks on a subset of the Wall Street Journal corpus.
  • The hybrid approach with DBLSTM appears to be well suited for tasks where acoustic modelling predominates.

Abstract

Deep Bidirectional LSTM (DBLSTM) recurrent neural networks have recently been shown to give state-of-the-art performance on the TIMIT speech database. However, the results in that work relied on recurrent-neural-network-specific objective functions, which are difficult to integrate with existing large vocabulary speech recognition systems. This paper investigates the use of DBLSTM as an acoustic model in a standard neural network-HMM hybrid system. We find that a DBLSTM-HMM hybrid gives equally good results on TIMIT as the previous work. It also outperforms both GMM and deep network benchmarks on a subset of the Wall Street Journal corpus. However the improvement in word error rate over the deep network is modest, despite a great increase in frame-level accuracy. We conclude that the hybrid approach with DBLSTM appears to be well suited for tasks where acoustic modelling predominates. Further investigation needs to be conducted to understand how to better leverage the improvements in frame-level accuracy towards better word error rates.

Citation Graph

Loading graph...

References [20]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Geoffrey Hinton - 2012

21 papers in library cite

Geoffrey Hinton - 2013

13 papers in library cite

M. Schuster, Kuldip K. Paliwal - 1997

10 papers in library cite

Alex Graves, Santiago Fernandez, Faustino Gomez, Jürgen Schmidhuber - 2006

7 papers in library cite

Alex Graves - 2012

7 papers in library cite

A. Robinson - 1994

9 papers in library cite

Alex Graves, Jürgen Schmidhuber - 2005

14 papers in library cite

A. Mohamed, G. Dahl, Geoffrey Hinton - 2012

12 papers in library cite

F. Gers, N. Schraudolph, Jürgen Schmidhuber - 2002

9 papers in library cite

H. Bourlard, N. Morgan - 1993

8 papers in library cite

Alex Graves - 2011

8 papers in library cite

K. F. Lee, H. W. Hon - 1989

5 papers in library cite

D. Isto - 1990

5 papers in library cite

K. C. Jim, C. L. Giles, B. G. Horne - 1996

4 papers in library cite

D. Povey, A. Ghoshal - 2011

4 papers in library cite

Geoffrey E. Hinton, D. V. Camp - 1993

3 papers in library cite

A. Maas, Quoc Le, T. O'neil, Oriol Vinyals, P. Nguyen, A. Ng - 2012

2 papers in library cite

Oriol Vinyals, S. Ravuri, D. Povey - 2012

2 papers in library cite

Qihao Zhu, Berlin Chen, N. Morgan, Andreas Stolcke - 2005

2 papers in library cite

Cited by

2

papers in your library

Cites

9

papers in your library

Read

on April 30, 2025

Your review

Tags

Paper Aliases

No aliases