2009

A Scalable Hierarchical Distributed Language Model

A. Mnih, Geoffrey E. Hinton


AI summary

This paper introduces a fast hierarchical log-bilinear language model (HLBL) along with a feature-based algorithm that constructs the word tree automatically from distributed word representations. With trees built this way, HLBL outperforms non-hierarchical neural models and n-gram models on the APNews dataset, achieving state-of-the-art performance.

Main Contributions

  • Introduces a fast hierarchical language model (HLBL) based on the log-bilinear model.
  • Presents a feature-based algorithm for automatic construction of word trees from data, eliminating the need for expert knowledge.
  • Demonstrates that HLBL can handle multiple occurrences of each word in the tree.
  • Shows that HLBL can outperform non-hierarchical neural models as well as the best n-gram models.
  • Achieves state-of-the-art performance by using a carefully constructed hierarchy over words.
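
To make the tree-construction idea concrete, here is a minimal sketch of turning word feature vectors into a binary word tree. It is not the paper's algorithm, whose details this page does not give: as a simple stand-in, it recursively splits the vocabulary at the median of the highest-variance feature dimension. The function name `build_codes` and the toy 2-D feature vectors are hypothetical.

```python
def build_codes(words, features, prefix=""):
    """Recursively split `words` in two and return {word: bit-string path}.

    Stand-in splitting rule (an assumption, not the paper's method):
    sort the words along the feature dimension with the largest variance
    and cut the sorted list at its midpoint.
    """
    if len(words) == 1:
        # A single word becomes a leaf; its code is the path taken so far.
        return {words[0]: prefix}

    dims = len(features[words[0]])

    def variance(d):
        vals = [features[w][d] for w in words]
        mean = sum(vals) / len(vals)
        return sum((v - mean) ** 2 for v in vals) / len(vals)

    split_dim = max(range(dims), key=variance)
    ordered = sorted(words, key=lambda w: features[w][split_dim])
    mid = len(ordered) // 2  # both halves are non-empty when len(words) >= 2

    codes = build_codes(ordered[:mid], features, prefix + "0")
    codes.update(build_codes(ordered[mid:], features, prefix + "1"))
    return codes


# Hypothetical 2-D word features; a real model would learn these from data.
features = {
    "cat": (0.9, 0.1),
    "dog": (0.8, 0.2),
    "run": (-0.7, 0.3),
    "walk": (-0.9, 0.1),
    "the": (0.0, -1.0),
}
codes = build_codes(list(features), features)
```

Each word ends up with a distinct bit string that describes its path from the root, and words with similar features tend to share a prefix. The midpoint cut keeps the tree balanced, so the longest path grows only logarithmically with vocabulary size.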

Abstract

Neural probabilistic language models (NPLMs) have been shown to be competitive with and occasionally superior to the widely-used n-gram language models. The main drawback of NPLMs is their extremely long training and testing times. Morin and Bengio have proposed a hierarchical language model built around a binary tree of words, which was two orders of magnitude faster than the non-hierarchical model it was based on. However, it performed considerably worse than its non-hierarchical counterpart in spite of using a word tree created using expert knowledge. We introduce a fast hierarchical language model along with a simple feature-based algorithm for automatic construction of word trees from the data. We then show that the resulting models can outperform non-hierarchical neural models as well as the best n-gram models.
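
The speedup from a binary word tree can be illustrated with a small sketch. One common way to realize such a model, assumed here, is to treat every internal node as a binary left/right decision, so that a word's probability is the product of the decisions along its path and only about log2(V) nodes are evaluated for a vocabulary of V words. The node scores below are hard-coded, hypothetical numbers; in an actual model they would be computed from the context.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def word_probability(code, node_scores):
    """P(word | context) as a product of binary decisions along the path.

    code:        string of '0'/'1' bits, the path from the root to the word
    node_scores: maps each internal node, named by the path prefix that
                 reaches it, to a real-valued score for the current context
                 (hypothetical values in this sketch)
    """
    prob = 1.0
    for depth, bit in enumerate(code):
        p_left = sigmoid(node_scores[code[:depth]])
        prob *= p_left if bit == "0" else 1.0 - p_left
    return prob


# Hypothetical 4-word vocabulary on a balanced tree of depth 2.
codes = {"the": "00", "cat": "01", "sat": "10", "mat": "11"}
# One score per internal node: the root "" and its two children.
node_scores = {"": 0.3, "0": -1.2, "1": 0.8}

probs = {w: word_probability(c, node_scores) for w, c in codes.items()}
total = sum(probs.values())  # the leaf probabilities form a distribution
```

Because the two branches at every node have probabilities that sum to one, the leaf probabilities sum to one without a normalization pass over the whole vocabulary, which is where the saving over a non-hierarchical model comes from.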


References [12]


  • Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001
  • F. Morin, Yoshua Bengio - 2005
  • A. Mnih, Geoffrey Hinton - 2007
  • Yoshua Bengio, Jean-Sébastien Senécal - 2003
  • Holger Schwenk, Jean-Luc Gauvain - 2002
  • Frederick Jelinek - 2003
  • C. Fellbaum - 1998
  • J. Goodman - 2001
  • S. F. Chen, J. Goodman - 1998
  • P. F. Brown, P. V. deSouza, R. L. Mercer, Vincent J. Della Pietra, J. C. Lai - 1992
  • Fernando Pereira, N. Tishby, L. Lee - 1993
  • J. G. McMahon, F. J. Smith - 1996
