2012
Cite Score
70
AI summary
This paper introduces a novel context-dependent deep neural network hidden Markov model (CD-DNN-HMM) for large-vocabulary speech recognition (LVSR), leveraging deep belief networks for phone recognition. The model achieves significant improvements on a business search dataset, outperforming conventional GMM-HMMs.
Main Contributions
Abstract
We propose a novel context-dependent (CD) model for large-vocabulary speech recognition (LVSR) that leverages recent advances in using deep belief networks for phone recognition. We describe a pre-trained deep neural network hidden Markov model (DNN-HMM) hybrid architecture that trains the DNN to produce a distribution over senones (tied triphone states) as its output. The deep belief network pre-training algorithm is a robust and often helpful way to initialize deep neural networks generatively that can aid in optimization and reduce generalization error. We illustrate the key components of our model, describe the procedure for applying CD-DNN-HMMs to LVSR, and analyze the effects of various modeling choices on performance. Experiments on a challenging business search dataset demonstrate that CD-DNN-HMMs can significantly outperform the conventional context-dependent Gaussian mixture model (GMM)-HMMs, with an absolute sentence accuracy improvement of 5.8% and 9.2% (or relative error reduction of 16.0% and 23.2%) over the CD-GMM-HMMs trained using the minimum phone error rate (MPE) and maximum-likelihood (ML) criteria, respectively.
Citation Graph
References [84]
D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams - 1986
34 papers in library cite
Yoshua Bengio - 2010
20 papers in library cite
Geoffrey Hinton, Ruslan Salakhutdinov - 2006
37 papers in library cite
Geoffrey E. Hinton, S. Osindero, Y. Teh - 2006
43 papers in library cite
Yoshua Bengio - 2009
25 papers in library cite
Pascal Vincent, Hugo Larochelle, Yoshua Bengio, Pierre Antoine Manzagol - 2008
25 papers in library cite
Ronan Collobert, Jason Weston - 2008
32 papers in library cite
Geoffrey Hinton - 2002
23 papers in library cite
Dumitru Erhan, Yoshua Bengio, Aaron Courville, Pierre Antoine Manzagol, Pascal Vincent, Samy Bengio - 2010
12 papers in library cite
K. Jarrett, Koray Kavukcuoglu, Marc'aurelio Ranzato, Yann Lecun - 2009
20 papers in library cite
Yoshua Bengio, Yann Lecun - 2007
15 papers in library cite
James Martens - 2010
12 papers in library cite
Pascal Vincent - 2009
5 papers in library cite
P. Smolensky - 1986
11 papers in library cite
A. Mohamed, G. Dahl, Geoffrey Hinton - 2009
3 papers in library cite
A. Mohamed, G. Dahl, Geoffrey Hinton - 2012
12 papers in library cite
H. Bourlard, N. Morgan - 1993
8 papers in library cite
M. Welling, M. R. Zvi, Geoffrey Hinton - 2005
8 papers in library cite
Yoshua Bengio, O. Delalleau, N. L. Roux - 2006
7 papers in library cite
George E. Dahl, Marc'aurelio Ranzato, A. Mohamed, Geoffrey E. Hinton - 2010
6 papers in library cite
V. Mnih - 2009
5 papers in library cite
Ruslan Salakhutdinov, Geoffrey Hinton - 2007
5 papers in library cite
Ilya Sutskever, Geoffrey Hinton, G. Taylor - 2008
5 papers in library cite
D. Povey, D. Kanevsky, Brian Kingsbury, Bhuvana Ramabhadran, G. Saon, K. Visweswariah - 2008
4 papers in library cite
N. Morgan, H. Bourlard - 1990
3 papers in library cite
Yoshua Bengio, O. Delalleau, C. Simard - 2007
3 papers in library cite
A. Mohamed, D. Yu, L. Deng - 2010
3 papers in library cite
D. Yu, L. Deng, G. Dahl - 2010
3 papers in library cite
V. Nair, Geoffrey Hinton - 2009
2 papers in library cite
H. Bourlard, N. Morgan, C. Wooters, S. Renals - 1992
2 papers in library cite
L. Deng - 1999
2 papers in library cite
S. Renals, N. Morgan, H. Boulard, Michael Cohen, H. Franco - 1994
2 papers in library cite
X. He, L. Deng, W. Chou - 2008
2 papers in library cite
D. Povey, P. Woodland - 2002
2 papers in library cite
S. Kapadia, V. Valtchev, S. Young - 1993
2 papers in library cite
Marc'aurelio Ranzato, Geoffrey Hinton - 2010
2 papers in library cite
J. Baker, L. Deng, J. Glass, Sanjeev Khudanpur, C. Lee, N. Morgan, D. O'shaugnessy - 2009
2 papers in library cite
J. Baker, L. Deng, J. Glass, Sanjeev Khudanpur, C. Lee, N. Morgan, D. O'shaugnessy - 2009
2 papers in library cite
Y. Hifny, S. Renals - 2009
2 papers in library cite
L. Deng, D. Yu, Alex Acero - 2006
2 papers in library cite
H. Hermansky, D. P. W. Ellis, S. S. Sharma - 2000
2 papers in library cite
L. Deng, D. Yu, Alex Acero - 2006
1 paper in library cites
F. Valente, M. Doss, C. Plahl, S. Ravuri, Wenyi Wang - 2010
1 paper in library cites
L. Deng - 1998
1 paper in library cites
Georg Heigold - 2010
1 paper in library cites
D. Yu, L. Deng, Y. Gong, Alex Acero - 2009
1 paper in library cites
Geoffrey Zweig, P. Nguyen - 2010
1 paper in library cites
Geoffrey Zweig, P. Nguyen - 2009
1 paper in library cites
E. Trentin, M. Gori - 2001
1 paper in library cites
Jeffrey Li, L. Deng, D. Yu, Y. Gong, Alex Acero - 2009
1 paper in library cites
J. Bridle, L. Deng, J. Picone, H. Richards, J. Ma, T. Kamm, M. Schuster, S. Pike, R. Regan - 1998
1 paper in library cites
D. Yu, Y. Ju, Yuzhi Wang, Geoffrey Zweig, Alex Acero - 2007
1 paper in library cites
J. Morris, E. F. Lussier - 2006
1 paper in library cites
A. Robinson, G. Cook, D. Ellis, E. F. Lussier, S. Renals, D. Williams - 2002
1 paper in library cites
H. Franco, Michael Cohen, N. Morgan, D. Rumelhart, V. Abrash - 1994
1 paper in library cites
H. Boulard, N. Morgan - 1993
1 paper in library cites
D. Yu, L. Deng - 2010
1 paper in library cites
D. Vergyri, A. Mandal, Wenyi Wang, Andreas Stolcke, James Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schluter, K. Kirchhoff, A. Faria, N. Morgan - 2008
1 paper in library cites
D. Povey - 2003
1 paper in library cites
E. Mcdermott, T. Hazen, J. Roux, A. Nakamura, S. Katagiri - 2007
1 paper in library cites
J. Hennebert, C. Ris, H. Bourlard, S. Renals, N. Morgan - 1997
1 paper in library cites
A. Gunawardana, M. Mahajan, Alex Acero, J. Platt - 2005
1 paper in library cites
H. Jiang, Xiang Lisa Li - 2007
1 paper in library cites
F. Sha, L. Saul - 2006
1 paper in library cites
Xiang Lisa Li, H. Jiang, C. L. Liu - 2005
1 paper in library cites
G. Dahl, D. Yu, L. Deng, Alex Acero - 2011
1 paper in library cites
D. Yu, L. Deng - 2007
1 paper in library cites
D. Yu, L. Deng, X. He, Alex Acero - 2008
1 paper in library cites
D. Yu, L. Deng, X. He, Alex Acero - 2007
1 paper in library cites
V. Mnih, Geoffrey Hinton - 2010
1 paper in library cites
Alex Acero, N. Bernstein, R. Chambers, Y. Ju, Xiang Lisa Li, J. Odell, P. Nguyen, O. Scholtz, Geoffrey Zweig - 2008
1 paper in library cites
M. Gales, P. Woodland - 1996
1 paper in library cites
B. Juang, W. Chou, C. Lee - 1997
1 paper in library cites
C. Lee, Q. Huo - 2000
1 paper in library cites
F. Grezl, P. Fousek - 2008
1 paper in library cites
F. Grezl, M. Karafiat, S. Kontar, Jan Cernocky - 2007
1 paper in library cites
N. Morgan, Qihao Zhu, Andreas Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, Berlin Chen, O. Cetin, H. Bourlard, M. Athineos - 2005
1 paper in library cites
M. Hwang, X. Huang - 1993
1 paper in library cites
J. Neto, L. Almeida, M. Hochberg, C. Martins, L. Nunes, S. Renals, Tony Robinson - 1995
1 paper in library cites
J. Neto, C. Martins, L. Almeida - 1996
1 paper in library cites
Y. Yan, M. Fanty, R. Cole - 1997
1 paper in library cites
P. Fousek, L. Lamel, J. Gauvain - 2008
1 paper in library cites
D. Yu, L. Deng, X. He, Alex Acero - 2006
1 paper in library cites
Qihao Zhu, Andreas Stolcke, Berlin Chen, N. Morgan - 2005
1 paper in library cites
Cited by
19
papers in your library
Cites
16
papers in your library
Read
on July 11, 2025
Your review
Tags
Paper Aliases
No aliases