2011

Empirical Evaluation and Combination of Advanced Language Modeling Techniques

Tomas Mikolov, A. Deoras, S. Kombrink, Lukas Burget, Jan Cernocky

citations

Cite Score

19

AI summary

The paper introduces a novel approach to language modeling by combining various advanced techniques such as class-based models, cache models, maximum entropy models, structured language models, random forest language models, and neural network-based language models. By using linear interpolation the model achieves state-of-the-art perplexity reduction on the Penn Treebank corpus.

Main Contributions

  • Introduces a novel combination of language modeling techniques.
  • Achieves state-of-the-art perplexity reduction on the Penn Treebank corpus.
  • Demonstrates significant improvements over individual models and traditional n-gram baselines.
  • Explores adaptive linear model combination for dynamic weight estimation.
  • Evaluates performance with increasing training data, showing continued improvements with larger datasets.

Abstract

We present results obtained with several advanced language modeling techniques, including class based model, cache model, maximum entropy model, structured language model, random forest language model and several types of neural network based language models. We show results obtained after combining all these models by using linear interpolation. We conclude that for both small and moderately sized tasks, we obtain new state of the art results with combination of models, that is significantly better than performance of any individual model. Obtained perplexity reductions against Good-Turing trigram baseline are over 50% and against modified Kneser-Ney smoothed 5-gram over 40%.

Citation Graph

Loading graph...

References [20]

Sort:
Filter:

D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams - 1986

34 papers in library cite

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Tomas Mikolov, M. Karafiat, Lukas Burget, Jan Cernocky, Sanjeev Khudanpur - 2010

36 papers in library cite

Andreas Stolcke - 2002

13 papers in library cite

Tomas Mikolov, S. Kombrink, Lukas Burget, Jan Cernocky, Sanjeev Khudanpur - 2011

16 papers in library cite

A. Mnih, Geoffrey Hinton - 2007

12 papers in library cite

Holger Schwenk - 2007

12 papers in library cite

H. S. Le, I. Oparin, A. Allauzen, Jean Luc Gauvain, F. Yvon - 2011

7 papers in library cite

J. Goodman - 2001

15 papers in library cite

S. F. Chen, J. Goodman - 1998

13 papers in library cite

D. Filimonov, M. Harper - 2009

4 papers in library cite

A. Emami, Frederick Jelinek - 2004

4 papers in library cite

Tomas Mikolov, J. Kopecky, Lukas Burget, O. Glembek, Jan Cernocky - 2009

4 papers in library cite

A. Emami, Frederick Jelinek - 2005

4 papers in library cite

P. Xu - 2005

4 papers in library cite

P. Xu, D. Karakos, Sanjeev Khudanpur - 2009

3 papers in library cite

T. Alumae, M. Kurimo - 2010

2 papers in library cite

D. Klakow - 1998

1 paper in library cites

S. Momtazi, F. Faubel, D. Klakow - 2010

1 paper in library cites

Cited by

13

papers in your library

Cites

7

papers in your library

Read

on March 28, 2025

Your review

Tags

Paper Aliases

No aliases