2007

Continuous Space Language Models

Holger Schwenk

Cite Score

28

AI summary

This paper introduces a neural network language model that estimates word probabilities in a continuous space. Trained on corpora of several hundred million words and interpolated with a four-gram back-off language model, it achieves word error rate reductions of 0.4% to almost 1% absolute on international benchmark tasks, in particular the NIST evaluations on broadcast news and conversational speech recognition.

Main Contributions

  • Introduces a neural network language model for large vocabulary continuous speech recognition.
  • Describes highly efficient learning algorithms for training corpora of several hundred million words.
  • Demonstrates incorporation into a large vocabulary continuous speech recognizer using lattice rescoring.
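  • Achieves consistent word error rate reductions of 0.4% to almost 1% absolute across all considered tasks and languages.
  • Compares the approach to four-gram back-off language models trained with modified Kneser-Ney smoothing, and usually interpolates the two models.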
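
To make the idea of estimating probabilities in a continuous space concrete, here is a minimal sketch of a feed-forward neural network language model. It assumes a common design that this page does not spell out: each context word is mapped to a learned vector, the vectors are concatenated, passed through a tanh hidden layer, and a softmax yields a distribution over the vocabulary. The class name `TinyContinuousLM`, the layer sizes, and the random untrained weights are all hypothetical and chosen only for illustration.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Return a rows x cols matrix of small random weights."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Turn raw scores into probabilities that sum to one."""
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class TinyContinuousLM:
    """Hypothetical, untrained feed-forward language model (illustration only)."""

    def __init__(self, vocab_size, context_size=3, embed_dim=4, hidden_dim=8, seed=0):
        rng = random.Random(seed)
        self.context_size = context_size
        # One continuous vector per vocabulary word, shared across context positions.
        self.embeddings = make_matrix(vocab_size, embed_dim, rng)
        self.hidden_weights = make_matrix(hidden_dim, context_size * embed_dim, rng)
        self.output_weights = make_matrix(vocab_size, hidden_dim, rng)

    def next_word_probs(self, context_ids):
        """Return P(w | context) for every word w in the vocabulary."""
        if len(context_ids) != self.context_size:
            raise ValueError("wrong number of context words")
        # Project each context word into the continuous space and concatenate.
        projected = [x for word_id in context_ids for x in self.embeddings[word_id]]
        hidden = [math.tanh(h) for h in matvec(self.hidden_weights, projected)]
        return softmax(matvec(self.output_weights, hidden))


# Three context words correspond to a four-gram model.
lm = TinyContinuousLM(vocab_size=10)
probs = lm.next_word_probs([1, 4, 7])
```

Because similar words can end up with nearby vectors, a model of this kind can assign reasonable probability to word sequences never seen in training, which is one way to read the attack on data sparseness described in the abstract.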

Abstract

This paper describes the use of a neural network language model for large vocabulary continuous speech recognition. The underlying idea of this approach is to attack the data sparseness problem by performing the language model probability estimation in a continuous space. Highly efficient learning algorithms are described that enable the use of training corpora of several hundred million words. It is also shown that this approach can be incorporated into a large vocabulary continuous speech recognizer using a lattice rescoring framework at a very low additional processing time. The neural network language model was thoroughly evaluated in a state-of-the-art large vocabulary continuous speech recognizer for several international benchmark tasks, in particular the NIST evaluations on broadcast news and conversational speech recognition. The new approach is compared to four-gram back-off language models trained with modified Kneser-Ney smoothing, which has often been reported to be the best known smoothing method. Usually the neural network language model is interpolated with the back-off language model. In that way, consistent word error rate reductions for all considered tasks and languages were achieved, ranging from 0.4% to almost 1% absolute.
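
The interpolation step mentioned in the abstract can be sketched as follows. This assumes simple linear interpolation with a single weight tuned to minimize perplexity on held-out data, which is a common choice but is not confirmed by this page. The function names and the toy probabilities are hypothetical and are not results from the paper.

```python
import math


def interpolate(p_neural, p_backoff, weight):
    """Linearly combine two probability estimates for the same word."""
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    return weight * p_neural + (1.0 - weight) * p_backoff


def perplexity(probabilities):
    """Perplexity of a word sequence given its per-word probabilities."""
    log_sum = sum(math.log(p) for p in probabilities)
    return math.exp(-log_sum / len(probabilities))


def best_weight(neural_probs, backoff_probs, candidates):
    """Pick the candidate weight with the lowest held-out perplexity."""
    def held_out_perplexity(weight):
        mixed = [interpolate(pn, pb, weight)
                 for pn, pb in zip(neural_probs, backoff_probs)]
        return perplexity(mixed)
    return min(candidates, key=held_out_perplexity)


# Made-up per-word probabilities from each model on a tiny held-out text.
neural_probs = [0.20, 0.05, 0.30, 0.10]
backoff_probs = [0.10, 0.15, 0.20, 0.02]
candidates = [i / 10 for i in range(11)]  # 0.0, 0.1, ..., 1.0
weight = best_weight(neural_probs, backoff_probs, candidates)
```

In a lattice rescoring setup, the interpolated probability would replace the original language model score on each lattice arc before the best path is re-extracted. That step is omitted here because it depends on the recognizer's lattice format.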


