Abstract
In this study we address the problem of training a neural network for language identification using speech samples in the form of i-vectors. Our approach involves training a classifier and analyzing the obtained confusion matrix. We cluster the languages by simultaneously clustering the columns and the rows of the confusion matrix. The language clusters are then used to define a modified cost function for training a neural-network that focuses on distinguishing between the true language and languages within the same cluster. The results show enhanced language identification on the NIST 2015 language identification dataset.
Original language | English |
---|---|
Title of host publication | 2016 IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings |
Editors | Kostas Diamantaras, Aurelio Uncini, Francesco A. N. Palmieri, Jan Larsen |
Publisher | IEEE Computer Society |
ISBN (Electronic) | 9781509007462 |
DOIs | |
State | Published - 8 Nov 2016 |
Event | 26th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings - Vietri sul Mare, Salerno, Italy Duration: 13 Sep 2016 → 16 Sep 2016 |
Publication series
Name | IEEE International Workshop on Machine Learning for Signal Processing, MLSP |
---|---|
Volume | 2016-November |
ISSN (Print) | 2161-0363 |
ISSN (Electronic) | 2161-0371 |
Conference
Conference | 26th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2016 - Proceedings |
---|---|
Country/Territory | Italy |
City | Vietri sul Mare, Salerno |
Period | 13/09/16 → 16/09/16 |
Bibliographical note
Publisher Copyright:© 2016 IEEE.
Keywords
- Confusion matrix
- clustering
- language identification