Abstract
Sound source localization is a cumbersome task in challenging reverberation conditions. Recently, there is a growing interest in developing learning-based localization methods. In this approach, acoustic features are extracted from the measured signals and then given as input to a model that maps them to the corresponding source positions. Typically, a massive dataset of labeled samples from known positions is required to train such models.Here, we present a novel weakly-supervised deep-learning localization method that exploits only a few labeled (anchor) samples with known positions, together with a larger set of unlabeled samples, for which we only know their relative physical ordering. We design an architecture that uses a stochastic combination of triplet-ranking loss for the unlabeled samples and physical loss for the anchor samples, to learn a nonlinear deep embedding that maps acoustic features to an azimuth angle of the source. The combined loss can be optimized effectively using standard gradient-based approach.Evaluating the proposed approach on simulated data, we demonstrate its significant improvement over two previous learning-based approaches for various reverberation levels, while maintaining consistent performance with varying sizes of labeled data.
Original language | English |
---|---|
Title of host publication | 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 283-287 |
Number of pages | 5 |
ISBN (Electronic) | 9781728111230 |
DOIs | |
State | Published - Oct 2019 |
Event | 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019 - New Paltz, United States Duration: 20 Oct 2019 → 23 Oct 2019 |
Publication series
Name | IEEE Workshop on Applications of Signal Processing to Audio and Acoustics |
---|---|
Volume | 2019-October |
ISSN (Print) | 1931-1168 |
ISSN (Electronic) | 1947-1629 |
Conference
Conference | 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019 |
---|---|
Country/Territory | United States |
City | New Paltz |
Period | 20/10/19 → 23/10/19 |
Bibliographical note
Publisher Copyright:© 2019 IEEE.
Keywords
- acoustic source localization
- deep embedding learning
- relative transfer function
- triplet-loss