Abstract
Speech communication systems are prone to performance degradation in reverberant and noisy acoustic environments. Dereverberation and noise reduction algorithms typically require several model parameters, e.g. The speech, reverberation and noise power spectral densities (PSDs). A commonly used assumption is that the noise PSD matrix is known. However, in practical acoustic scenarios, the noise PSD matrix is unknown and should be estimated along with the speech and reverberation PSDs. In this article, we consider the case of rank-deficient noise PSD matrix, which arises when the noise signal consists of multiple directional noise sources, whose number is less than the number of microphones. We derive two closed-form maximum likelihood estimators (MLEs). The first is a non-blocking-based estimator which jointly estimates the speech, reverberation and noise PSDs, and the second is a blocking-based estimator, which first blocks the speech signal and then jointly estimates the reverberation and noise PSDs. Both estimators are analytically compared and analyzed, and mean square errors (MSEs) expressions are derived. Furthermore, Cramér-Rao Bounds (CRBs) on the estimated PSDs are derived. The proposed estimators are examined using both simulation and real reverberant and noisy signals, demonstrating the advantage of the proposed method compared to competing estimators.
Original language | English |
---|---|
Article number | 8943979 |
Pages (from-to) | 619-634 |
Number of pages | 16 |
Journal | IEEE/ACM Transactions on Audio Speech and Language Processing |
Volume | 28 |
DOIs | |
State | Published - 2020 |
Bibliographical note
Publisher Copyright:© 2014 IEEE.
Funding
Manuscript received April 9, 2019; revised November 10, 2019; accepted December 11, 2019. Date of publication December 27, 2019; date of current version January 21, 2020. The work of Y. Laufer and B. Laufer-Goldshtein was supported by the Adams Fellowship Program of the Israel Academy of Sciences and Humanities. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Hirokazu Kameoka. (Corresponding author: Sharon Gannot.) The authors are with the Faculty of Engineering, Bar-Ilan University, Ramat-Gan 5290002, Israel (e-mail: [email protected]; [email protected]; [email protected]). Digital Object Identifier 10.1109/TASLP.2019.2962689
Funders | Funder number |
---|---|
Israel Academy of Sciences and Humanities |
Keywords
- Cramér-Rao Bound (CRB)
- Dereverberation
- Maximum likelihood estimation (MLE)
- Noise reduction