Abstract
This paper addresses blind audio source separation (BASS) in noisy and reverberant conditions with a novel approach, termed Global and LOcal Simplex Separation (GLOSS), which integrates full- and narrow-band simplex representations. We show that the eigenvectors of the correlation matrix between time frames in a given frequency band form a simplex that organizes the frames according to the speaker activities in that band. We propose to build two simplex representations: one global, based on a broad frequency band, and one local, based on a narrow band. The two representations are then combined to determine the dominant speaker in each time-frequency (TF) bin. From the identified dominant speakers, a spectral mask is computed and used to extract each speaker by spatial beamforming followed by spectral postfiltering. The performance of the proposed algorithm is demonstrated using real-life recordings in various noisy and reverberant conditions.
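The simplex property stated in the abstract can be illustrated with a small synthetic sketch. It is not the GLOSS algorithm itself and makes several assumptions that are not in the abstract: the frame-by-frame correlation matrix is idealized as `W = P @ P.T`, where `P` holds hypothetical per-frame speaker-activity probabilities; every speaker has at least one frame in which it is the only active source; and the simplex vertices are located with a successive-projection search chosen here purely for illustration. All names (`P`, `W`, `find_vertices`, `P_est`) are invented for this example.

```python
import numpy as np

rng = np.random.default_rng(0)
n_frames, n_speakers = 200, 3

# Hypothetical speaker-activity probabilities per frame (each row sums to 1).
P = rng.dirichlet(alpha=[0.5] * n_speakers, size=n_frames)
P[:n_speakers] = np.eye(n_speakers)  # assumption: one single-speaker frame per speaker

# Idealized, noise-free correlation matrix between frames (assumption: W = P P^T).
W = P @ P.T

# Leading eigenvectors of W; row l is the low-dimensional embedding of frame l.
eigvals, eigvecs = np.linalg.eigh(W)  # eigenvalues in ascending order
U = eigvecs[:, -n_speakers:]


def find_vertices(points, n_vertices):
    """Successive projection: repeatedly pick the row with the largest norm,
    then project all rows onto the orthogonal complement of that row."""
    residual = points.copy()
    chosen = []
    for _ in range(n_vertices):
        k = int(np.argmax(np.sum(residual**2, axis=1)))
        chosen.append(k)
        direction = residual[k] / np.linalg.norm(residual[k])
        residual = residual - np.outer(residual @ direction, direction)
    return chosen


# Rows of U are a linear map of rows of P, so they lie in a simplex whose
# vertices are the embeddings of the single-speaker frames.
idx = find_vertices(U, n_speakers)

# Barycentric coordinates with respect to the vertices recover the
# activity probabilities, up to a reordering of the speakers.
P_est = U @ np.linalg.inv(U[idx])
dominant = P_est.argmax(axis=1)  # dominant speaker label for each frame
```

In this idealized setting the columns of `P_est` match the columns of `P` up to a permutation of the speakers, so `dominant` labels each frame with its most active speaker. With real recordings the correlation matrix is noisy and the recovery is only approximate.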
Original language | English |
---|---|
Article number | 9004553 |
Pages (from-to) | 914-928 |
Number of pages | 15 |
Journal | IEEE/ACM Transactions on Audio Speech and Language Processing |
Volume | 28 |
State | Published - 2020 |
Bibliographical note
Publisher Copyright: © 2014 IEEE.
Funding
Manuscript received July 1, 2019; revised December 31, 2019 and February 9, 2020; accepted February 9, 2020. Date of publication February 20, 2020; date of current version February 12, 2020. The work of Bracha Laufer-Goldshtein was supported by the Adams Fellowship Program of the Israel Academy of Sciences and Humanities. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Stefan Bilbao. (Corresponding author: Sharon Gannot.) Bracha Laufer-Goldshtein and Sharon Gannot are with the Faculty of Engineering, Bar-Ilan University, Ramat-Gan 5290002, Israel.
Funders | Funder number |
---|---|
Israel Academy of Sciences and Humanities | |
Keywords
- Blind audio source separation (BASS)
- beamformer
- relative transfer function (RTF)
- simplex
- spectral mask