A session-GMM generative model using test utterance Gaussian mixture modeling for speaker verification

Hagai Aronowitz, David Burshtein, Amihood Amir

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Test utterance parameterization (TUP) using Gaussian mixture models (GMMs) has recently been shown to be beneficial for speaker indexing due to its computational efficiency and identical accuracy compared to classic GMM-based recognizers. We show that TUP can also lead to more accurate speaker recognition. On the NIST-2004 evaluation corpus, recognition error rate was reduced by 8% compared to the classic GMM-based algorithm. Furthermore, we introduce a novel generative statistical model for generation of test utterances by speakers. This model is incorporated naturally into the TUP framework and improves speaker recognition accuracy. On the NIST-2004 evaluation corpus, recognition error rate was reduced by 15% compared to the classic GMM-based algorithm.
Original languageAmerican English
Title of host publicationAcoustics, Speech, and Signal Processing, 2005. Proceedings.(ICASSP'05). IEEE International Conference on
PublisherIEEE
StatePublished - 2005

Bibliographical note

Place of conference:Philadelphia, Pennsylvania, USA

Fingerprint

Dive into the research topics of 'A session-GMM generative model using test utterance Gaussian mixture modeling for speaker verification'. Together they form a unique fingerprint.

Cite this