2012

Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition

G. Dahl, D. Yu, L. Deng, Alex Acero

AI summary

This paper introduces a context-dependent deep neural network hidden Markov model (CD-DNN-HMM) for large-vocabulary speech recognition (LVSR), building on recent advances in using deep belief networks for phone recognition. On a business search dataset, the model improves absolute sentence accuracy by 5.8% and 9.2% over conventional CD-GMM-HMMs trained with the MPE and ML criteria, respectively.

Main Contributions

  • Proposes a novel CD-DNN-HMM hybrid architecture for LVSR.
  • Demonstrates that CD-DNN-HMMs can significantly outperform conventional CD-GMM-HMMs on a challenging business search dataset.
  • Achieves absolute sentence accuracy improvements of 5.8% and 9.2% over CD-GMM-HMMs trained with the MPE and ML criteria, respectively.
  • Illustrates the key components of the CD-DNN-HMM model and describes the procedure for applying it to LVSR.
  • Analyzes the effects of various modeling choices on performance.
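
The absolute gains listed above are also reported in the abstract as relative error reductions of 16.0% and 23.2%. The short sketch below shows how the two kinds of figure relate, by back-computing the baseline sentence error rate each pair implies. The function name and dictionary layout are illustrative assumptions; only the four percentages come from the paper, and the derived error rates are approximate because the published figures are rounded.

```python
# Figures quoted from the abstract: absolute sentence-accuracy gain and
# relative error reduction (both in percent) for each baseline criterion.
reported = {
    "MPE": {"abs_gain": 5.8, "rel_reduction": 16.0},
    "ML": {"abs_gain": 9.2, "rel_reduction": 23.2},
}


def implied_baseline_error(abs_gain, rel_reduction):
    """Back out the baseline sentence error rate, in percent.

    relative reduction = absolute gain / baseline error, therefore
    baseline error = absolute gain / (relative reduction / 100).
    """
    return abs_gain / (rel_reduction / 100.0)


implied = {}
for criterion, r in reported.items():
    base_err = implied_baseline_error(r["abs_gain"], r["rel_reduction"])
    # Error rate of the CD-DNN-HMM implied by this baseline (hypothetical
    # reconstruction from rounded numbers, not a figure from the abstract).
    dnn_err = base_err - r["abs_gain"]
    implied[criterion] = (base_err, dnn_err)
    print(f"{criterion}: baseline error ~{base_err:.2f}%, "
          f"CD-DNN-HMM error ~{dnn_err:.2f}%")
```

Both baselines should imply nearly the same CD-DNN-HMM error rate, since they are compared against the same system; that agreement is a quick consistency check on the reported percentages.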

Abstract

We propose a novel context-dependent (CD) model for large-vocabulary speech recognition (LVSR) that leverages recent advances in using deep belief networks for phone recognition. We describe a pre-trained deep neural network hidden Markov model (DNN-HMM) hybrid architecture that trains the DNN to produce a distribution over senones (tied triphone states) as its output. The deep belief network pre-training algorithm is a robust and often helpful way to initialize deep neural networks generatively that can aid in optimization and reduce generalization error. We illustrate the key components of our model, describe the procedure for applying CD-DNN-HMMs to LVSR, and analyze the effects of various modeling choices on performance. Experiments on a challenging business search dataset demonstrate that CD-DNN-HMMs can significantly outperform the conventional context-dependent Gaussian mixture model (GMM)-HMMs, with an absolute sentence accuracy improvement of 5.8% and 9.2% (or relative error reduction of 16.0% and 23.2%) over the CD-GMM-HMMs trained using the minimum phone error rate (MPE) and maximum-likelihood (ML) criteria, respectively.
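
To make the "distribution over senones" concrete, here is a minimal sketch of an output layer that maps one frame's hidden activations to senone posteriors with a softmax. Everything in it is a toy assumption: the layer sizes, random weights, uniform priors, and function names are hypothetical and are not taken from the paper. The final step, dividing posteriors by senone priors to obtain scaled likelihoods for HMM decoding, reflects the usual hybrid recipe and is not stated in the abstract itself.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax: subtract the max before exponentiating."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def senone_posteriors(hidden, weights, biases):
    """Output layer: one logit per senone, then softmax -> p(senone | frame)."""
    logits = [sum(w * h for w, h in zip(row, hidden)) + b
              for row, b in zip(weights, biases)]
    return softmax(logits)


def scaled_likelihoods(posteriors, priors):
    """Divide each posterior by its senone prior (assumed hybrid-decoding step)."""
    return [p / q for p, q in zip(posteriors, priors)]


random.seed(0)
NUM_HIDDEN, NUM_SENONES = 4, 6  # toy sizes; real systems use far more senones

# Hypothetical top-hidden-layer activations for a single acoustic frame.
hidden = [random.random() for _ in range(NUM_HIDDEN)]
# Hypothetical output-layer parameters (random, untrained).
weights = [[random.gauss(0.0, 1.0) for _ in range(NUM_HIDDEN)]
           for _ in range(NUM_SENONES)]
biases = [0.0] * NUM_SENONES
# Uniform senone priors as a placeholder; in practice these would be
# estimated from the training alignment.
priors = [1.0 / NUM_SENONES] * NUM_SENONES

post = senone_posteriors(hidden, weights, biases)
lik = scaled_likelihoods(post, priors)
print("posterior sum:", sum(post))
print("most likely senone index:", max(range(NUM_SENONES), key=lambda i: post[i]))
```

With uniform priors the scaled likelihoods rank senones exactly as the posteriors do; non-uniform priors would down-weight frequent senones relative to rare ones.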
