2007

Three New Graphical Models for Statistical Language Modelling

A. Mnih, Geoffrey Hinton

citations

Cite Score

35

AI summary

This paper introduces three new probabilistic language models using distributed word representations to predict the next word in a sequence. It uses real-valued distributed representations and stochastic binary hidden features, achieving significant improvements over n-gram models on a statistical language modeling task.

Main Contributions

  • Proposed three new probabilistic language models for statistical language modelling using distributed word representations.
  • Introduced a Factored Restricted Boltzmann Machine (FRBM) language model.
  • Developed a Temporal Factored RBM to capture long-range dependencies.
  • Investigated a log-bilinear language model for direct parameterization of word distribution.
  • Achieved state-of-the-art performance on the APNews dataset with one of the proposed models.

Abstract

The supremacy of n-gram models in statistical language modelling has recently been challenged by parametric models that use distributed representations to counteract the difficulties caused by data sparsity. We propose three new probabilistic language models that define the distribution of the next word in a sequence given several preceding words by using distributed representations of those words. We show how real-valued distributed representations for words can be learned at the same time as learning a large set of stochastic binary hidden features that are used to predict the distributed representation of the next word from previous distributed representations. Adding connections from the previous states of the binary hidden features improves performance as does adding direct connections between the real-valued distributed representations. One of our models significantly outperforms the very best n-gram models.

Citation Graph

Loading graph...

References [13]

Sort:
Filter:

Geoffrey E. Hinton, S. Osindero, Y. Teh - 2006

43 papers in library cite

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Geoffrey Hinton - 2002

23 papers in library cite

Andreas Stolcke - 2002

13 papers in library cite

Geoffrey E. Hinton - 1986

13 papers in library cite

F. Morin, Yoshua Bengio - 2005

19 papers in library cite

Yoshua Bengio, Jean Sebastien Senecal - 2003

11 papers in library cite

Holger Schwenk, Jean Luc Gauvain - 2005

7 papers in library cite

Frederick Jelinek - 2003

6 papers in library cite

S. F. Chen, J. Goodman - 1998

13 papers in library cite

Ilya Sutskever, Geoffrey E. Hinton - 2007

3 papers in library cite

John Blitzer, A. Globerson, Fernando Pereira - 2005

1 paper in library cites

John Blitzer, K. Weinberger, L. Saul, Fernando Pereira - 2005

1 paper in library cites

Cited by

12

papers in your library

Cites

9

papers in your library

Read

on March 22, 2025

Your review

Tags

Paper Aliases

No aliases