2008

Hybrid Word-Subword Decoding for Spoken Term Detection

Lukas Burget

AI summary

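This paper introduces a hybrid word-subword recognition system for spoken term detection, in which a hybrid recognition network drives the decoder to produce hybrid word-subword lattices directly. The systems are evaluated on spoken term detection accuracy and index size. The multigram model trained on the word recognizer vocabulary is the best subword model. The hybrid system improves word recognition accuracy and, when in-vocabulary and out-of-vocabulary terms are searched separately, spoken term detection accuracy. On the full term set, detection accuracy is slightly worse, but the index is significantly smaller.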

Main Contributions

  • Introduces a hybrid word-subword recognition system for spoken term detection.
  • Employs a hybrid recognition network to produce hybrid word-subword lattices.
  • Compares one phone model and two multigram models as subword units.
  • Finds the multigram model trained on the word recognizer vocabulary to be the best subword model.
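  • Improves word recognition accuracy, and improves spoken term detection accuracy when in-vocabulary and out-of-vocabulary terms are searched separately.
  • Significantly reduces the required index size, at the cost of slightly worse spoken term detection accuracy on the full term set.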

Abstract

This paper deals with a hybrid word-subword recognition system for spoken term detection. The decoding is driven by a hybrid recognition network, and the decoder directly produces hybrid word-subword lattices. One phone model and two multigram models were tested as subword units. The systems were evaluated in terms of spoken term detection accuracy and index size. We concluded that the best subword model for hybrid word-subword recognition is the multigram model trained on the word recognizer vocabulary. We achieved an improvement in word recognition accuracy, and in spoken term detection accuracy when in-vocabulary and out-of-vocabulary terms are searched separately. Spoken term detection accuracy with the full (in-vocabulary and out-of-vocabulary) term set was slightly worse, but the required index size was significantly reduced.
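
The abstract does not spell out how a query is matched against a hybrid index, so the following is only a minimal sketch of one plausible reading of "in-vocabulary and out-of-vocabulary terms are searched separately". It assumes a toy index that maps each token (a word or a subword unit) to its occurrences. The vocabulary, the unit sequences, the index contents, and the names `find_sequence` and `search_term` are all hypothetical illustrations, not the paper's method or data.

```python
from typing import Dict, List, Set, Tuple

Hit = Tuple[str, int]  # (utterance id, token position)

# Hypothetical toy data, invented purely for illustration.
WORD_VOCAB: Set[str] = {"spoken", "term", "detection"}
SUBWORD_UNITS: Dict[str, List[str]] = {"zorp": ["z", "o", "r", "p"]}
HYBRID_INDEX: Dict[str, List[Hit]] = {
    "spoken": [("utt1", 0)],
    "term": [("utt1", 1)],
    "z": [("utt2", 3)],
    "o": [("utt1", 5), ("utt2", 4)],
    "r": [("utt2", 5)],
    "p": [("utt2", 6)],
}


def find_sequence(index: Dict[str, List[Hit]], units: List[str]) -> List[Hit]:
    """Return the start hits where `units` occur at consecutive positions."""
    if not units:
        return []
    hits = []
    for utt, start in index.get(units[0], []):
        # Every following unit must appear in the same utterance, one step later.
        if all((utt, start + i) in index.get(u, []) for i, u in enumerate(units)):
            hits.append((utt, start))
    return hits


def search_term(
    term: str,
    vocab: Set[str],
    units_by_term: Dict[str, List[str]],
    index: Dict[str, List[Hit]],
) -> Tuple[str, List[Hit]]:
    """Route an in-vocabulary term to word lookup and an OOV term to subword lookup."""
    if term in vocab:
        return "word", list(index.get(term, []))
    units = units_by_term.get(term)
    if units is None:
        # No subword representation is known for this term (assumption of the toy setup).
        return "subword", []
    return "subword", find_sequence(index, units)
```

In this toy setup, `search_term("term", WORD_VOCAB, SUBWORD_UNITS, HYBRID_INDEX)` takes the word path, while `search_term("zorp", ...)` falls back to matching its subword units in sequence. A real system would search lattices with scores and timing rather than exact token positions.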
