Discriminative keyword spotting

Joseph Keshet, David Grangier, Samy Bengio

Research output: Contribution to journalArticlepeer-review

86 Scopus citations

Abstract

This paper proposes a new approach for keyword spotting, which is based on large margin and kernel methods rather than on HMMs. Unlike previous approaches, the proposed method employs a discriminative learning procedure, in which the learning phase aims at achieving a high area under the ROC curve, as this quantity is the most common measure to evaluate keyword spotters. The keyword spotter we devise is based on mapping the input acoustic representation of the speech utterance along with the target keyword into a vector-space. Building on techniques used for large margin and kernel methods for predicting whole sequences, our keyword spotter distills to a classifier in this vector-space, which separates speech utterances in which the keyword is uttered from speech utterances in which the keyword is not uttered. We describe a simple iterative algorithm for training the keyword spotter and discuss its formal properties, showing theoretically that it attains high area under the ROC curve. Experiments on read speech with the TIMIT corpus show that the resulted discriminative system outperforms the conventional context-independent HMM-based system. Further experiments using the TIMIT trained model, but tested on both read (HTIMIT, WSJ) and spontaneous speech (OGI Stories), show that without further training or adaptation to the new corpus our discriminative system outperforms the conventional context-independent HMM-based system.

Original languageEnglish
Pages (from-to)317-329
Number of pages13
JournalSpeech Communication
Volume51
Issue number4
DOIs
StatePublished - Apr 2009
Externally publishedYes

Bibliographical note

Funding Information:
Part of this work was supported by EU project DIRAC (FP6-0027787). Most of this work has been performed while David Grangier was with the IDIAP Research Institute. The authors wish to thank the anonymous reviewers for their helpful comments, which have enhanced this paper.

Keywords

  • Discriminative models
  • Keyword spotting
  • Large margin and kernel methods
  • Speech recognition
  • Spoken term detection
  • Support vector machines

Fingerprint

Dive into the research topics of 'Discriminative keyword spotting'. Together they form a unique fingerprint.

Cite this