Papperoni

2002

Connectionist Language Modeling for Large Vocabulary Continuous Speech Recognition

Holger Schwenk, Jean Luc Gauvain

citations

Cite Score

AI summary

This paper introduces a connectionist language model using a neural network to project words into a continuous space for estimating n-gram probabilities. Evaluated on the DARPA HUB5 task, it demonstrates improvements in perplexity and word error rate compared to standard 3-gram models, showcasing its potential for large vocabulary speech recognition.

Main Contributions

Introduces a novel connectionist language model for large vocabulary continuous speech recognition.
Utilizes a neural network to project word indices into a continuous space, enabling smooth probability interpolation.
Evaluates the connectionist LM on the DARPA HUB5 conversational telephone speech recognition task.
Achieves consistent improvements in perplexity and word error rate compared to standard 3-gram models.
Discusses efficient decoding strategies using static and dynamic shortlists to reduce computational complexity.

Abstract

This paper describes ongoing work on a new approach for language modeling for large vocabulary continuous speech recognition. Almost all state-of-the-art systems use statistical n-gram language models estimated on text corpora. One principle problem with such language models is the fact that many of the n-grams are never observed even in very large training corpora, and therefore it is common to back-off to a lower-order model. In this paper we propose to address this problem by carrying out the estimation task in a continuous space, enabling a smooth interpolation of the probabilities. A neural network is used to learn the projection of the words onto a continuous space and to estimate the n-gram probabilities. The connectionist language model is being evaluated on the DARPA HUB5 conversational telephone speech recognition task and preliminary results show consistent improvements in both perplexity and word error rate.

Citation Graph

Loading graph...

References [6]

Sort:

Filter:

[1]A Neural Probabilistic Language Model

Yoshua Bengio, R. Ducharme, Pascal Vincent - 2001

62 papers in library cite

Google Scholar

What started it all. Very simple and elegant.

[2]Neural Networks for Pattern Recognition

C. M. Bishop - 1995

12 papers in library cite

Google Scholar

Book, 38k citations

[3]The LIMSI Broadcast News Transcription System

Jean Luc Gauvain, L. Lamel, G. Adda - 2002

2 papers in library cite

Google Scholar

[4]Fast Decoding for Indexation of Broadcast Data

Jean Luc Gauvain, L. Lamel - 2000

1 paper in library cites

Google Scholar

[5]Language Model Adaptation

R. Demori, M. Frederico - 1999

1 paper in library cites

Google Scholar

[6]The 2001 NIST Evaluation for Recognition of Conversational Speech Over the Telephone

A. Martin, M. Przybocki - 2001

1 paper in library cites

Google Scholar

Cited by

papers in your library

Cites

papers in your library

Read

on March 18, 2025

Only real relevance is being early. Otherwise not much to see.