Cite Score
58
AI summary
This paper introduces optimized backing-off distributions for M-gram language modeling, derived from theoretical approaches, achieving a 10% improvement in perplexity and a 5% reduction in word error rate on the Verbmobil and Wall-Street-Journal corpora.
Main Contributions
Abstract
In stochastic language modeling, backing-off is a widely used method to cope with the sparse data problem. In case of unseen events this method backs off to a less specific distribution. In this paper we propose to use distributions which are especially optimized for the task of backing-off. Two different theoretical derivations lead to distributions which are quite different from the probability distributions that are usually used for backing off. Experiments show an improvement of about 10% in terms of perplexity and 5% in terms of word error rate.
Citation Graph
References [7]
S. Katz - 1987
11 papers in library cite
R. O. Duda, P. E. Hart - 1973
9 papers in library cite
I. J. Good - 1953
2 papers in library cite
X. Aubert, C. Dugast, Hermann Ney, V. Steinbiss - 1994
1 paper in library cites
Hermann Ney, U. Essen, R. Kneser - 1994
1 paper in library cites
Frederick Jelinek - 1991
1 paper in library cites
D. B. Paul, J. M. Baker - 1992
1 paper in library cites
Cited by
11
papers in your library
Cites
0
papers in your library
Read
on June 13, 2025
Your review
Tags
Paper Aliases
No aliases