Global and local simplex representations for multichannel source separation

Bracha Laufer-Goldshtein, Ronen Talmon, Sharon Gannot

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

The problem of blind audio source separation (BASS) in noisy and reverberant conditions is addressed by a novel approach, termed Global and LOcal Simplex Separation (GLOSS), which integrates full- and narrow-band simplex representations. We show that the eigenvectors of the correlation matrix between time frames in a certain frequency band form a simplex that organizes the frames according to the speaker activities in the corresponding band. We propose to build two simplex representations: One global based on a broad frequency band and one local based on a narrow band. In turn, the two representations are combined to determine the dominant speaker in each time-frequency (TF) bin. Using the identified dominating speakers, a spectral mask is computed and is utilized for extracting each of the speakers using spatial beamforming followed by spectral postfiltering. The performance of the proposed algorithm is demonstrated using real-life recordings in various noisy and reverberant conditions.

Original languageEnglish
Article number9004553
Pages (from-to)914-928
Number of pages15
JournalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume28
DOIs
StatePublished - 2020

Bibliographical note

Publisher Copyright:
© 2014 IEEE.

Funding

Manuscript received July 1, 2019; revised December 31, 2019 and February 9, 2020; accepted February 9, 2020. Date of publication February 20, 2020; date of current version February 12, 2020. The work of Bracha Laufer-Goldshtein was supported by the Adams Fellowship Program of the Israel Academy of Sciences and Humanities. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Stefan Bilbao. (Corresponding author: Sharon Gannot.) Bracha Laufer-Goldshtein and Sharon Gannot are with the Faculty of Engineering, Bar-Ilan University, Ramat-Gan 5290002, Israel (e-mail: [email protected]; [email protected]).

FundersFunder number
Israel Academy of Sciences and Humanities

    Keywords

    • Blind audio source separation (BASS)
    • beamformer
    • relative transfer function (RTF)
    • simplex
    • spectral mask

    Fingerprint

    Dive into the research topics of 'Global and local simplex representations for multichannel source separation'. Together they form a unique fingerprint.

    Cite this