2003

Training Connectionist Models for the Structured Language Model

P. Xu, A. Emami, Frederick Jelinek

citations

Cite Score

2

AI summary

This paper introduces connectionist models for the Structured Language Model (SLM), achieving improved perplexity (PPL) on the UPENN Treebank corpora by leveraging distributed representations and EM training, resulting in an 8% relative improvement over Kneser-Ney smoothing.

Main Contributions

  • Proposes connectionist models to improve Structured Language Models (SLM).
  • Uses distributed representations of items in the history to better utilize contexts.
  • Applies an EM procedure to further train the connectionist models.
  • Achieves significant improvement in PPL over interpolated and back-off models on the UPENN Treebank.
  • Demonstrates that the neural network enhanced SLM results in a language model that is much less correlated with the baseline Kneser-Ney smoothed trigram.

Abstract

We investigate the performance of the Structured Language Model (SLM) in terms of perplexity (PPL) when its components are modeled by connectionist models. The connectionist models use a distributed representation of the items in the history and make much better use of contexts than currently used interpolated or back-off models, not only because of the inherent capability of the connectionist model in fighting the data sparseness problem, but also because of the sub-linear growth in the model size when the context length is increased. The connectionist models can be further trained by an EM procedure, similar to the previously used procedure for training the SLM. Our experiments show that the connectionist models can significantly improve the PPL over the interpolated and back-off models on the UPENN Treebank corpora, after interpolating with a baseline trigram language model. The EM training procedure can improve the connectionist models further, by using hidden events obtained by the SLM parser.

Citation Graph

Loading graph...

References [14]

Sort:
Filter:

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Andreas Stolcke - 2002

13 papers in library cite

A. L. Berger, S. A. D. Pietra, Vincent J. Della Pietra - 1996

10 papers in library cite

Frederick Jelinek - 2003

6 papers in library cite

J. Goodman - 2001

15 papers in library cite

S. F. Chen, J. Goodman - 1998

13 papers in library cite

C. Chelba, Frederick Jelinek - 2000

6 papers in library cite

S. Haykin - 1999

4 papers in library cite

P. Xu, C. Chelba, Frederick Jelinek - 2002

2 papers in library cite

J. Henderson - 2003

2 papers in library cite

E. Charniak - 2001

1 paper in library cites

D. H. V. Uystel, D. V. Compernolle, P. Wambacq - 2001

1 paper in library cites

W. Kim, Sanjeev Khudanpur, Jeffrey Wu - 2001

1 paper in library cites

Cited by

3

papers in your library

Cites

4

papers in your library

Read

on April 25, 2025

Your review

Tags

Paper Aliases

No aliases