Automatic identification of gender from speech

Research output: Contribution to journal › Conference article › peer-review

37 Scopus citations

Abstract

Identifying the gender of a speaker from speech has a variety of applications ranging from speech analytics to personalizing human-machine interactions. While gender identification in previous work has explored the use of the statistical properties of the speaker’s pitch features, in this paper, we explore the impact of using spectral features in conjunction with pitch features on identifying gender. We present a novel approach that leverages pitch feature trajectories in the interest of identifying the speaker’s gender with as little speech as possible. We also investigate the cross-lingual robustness of a model trained on English speakers to identify the gender of German speakers. Finally, we present a model for gender detection in German that outperforms the state-of-the-art results on a benchmark data set.

Original language: English
Pages (from-to): 84-88
Number of pages: 5
Journal: Proceedings of the International Conference on Speech Prosody
Volume: 2016-January
State: Published - 2016
Externally published: Yes
Event: 8th Speech Prosody 2016 - Boston, United States
Duration: 31 May 2016 - 3 Jun 2016

Bibliographical note

Publisher Copyright:
© 2016, International Speech Communication Association. All rights reserved.

Keywords

  • Computational paralinguistics
  • Feature trajectories
  • Gender identification
  • Human-computer interaction
