Abstract
Identifying the gender of a speaker from speech has a variety of applications, ranging from speech analytics to personalizing human-machine interactions. While previous work on gender identification has explored the statistical properties of the speaker’s pitch features, in this paper we explore the impact of using spectral features in conjunction with pitch features. We present a novel approach that leverages pitch feature trajectories to identify the speaker’s gender from as little speech as possible. We also investigate the cross-lingual robustness of a model trained on English speakers when it is used to identify the gender of German speakers. Finally, we present a model for gender detection in German that outperforms the state-of-the-art results on a benchmark data set.
| Original language | English |
|---|---|
| Pages (from-to) | 84-88 |
| Number of pages | 5 |
| Journal | Proceedings of the International Conference on Speech Prosody |
| Volume | 2016-January |
| State | Published - 2016 |
| Externally published | Yes |
| Event | 8th Speech Prosody 2016, Boston, United States (Duration: 31 May 2016 → 3 Jun 2016) |
Bibliographical note
Publisher Copyright: © 2016, International Speech Communication Association. All rights reserved.
Keywords
- Computational paralinguistics
- Feature trajectories
- Gender identification
- Human-computer interaction