Overcoming Covariance Matrix Phase Sensitivity in Single-Channel Speech Enhancement with Correlated Spectral Components

Johannes Stahl, Sean Ulrich Niethe Wood, Pejman Mowlaee Beikzadehmahaleh

Research output: Chapter in Book/Report/Conference proceeding; Conference paper; peer-reviewed

Abstract

The single-channel speech enhancement problem in the short-time Fourier transform domain is addressed. Traditional approaches assume statistical independence between signal components from different frequency regions, resulting in estimators that are functions of diagonal covariance matrices. More recent approaches drop this assumption and explicitly model dependencies between discrete Fourier transform bins. Full covariance matrices of speech and noise are required in this case to obtain optimal estimates of the clean speech spectrum, where off-diagonal entries are complex-valued in general. We show that the performance of estimators resulting from such models is highly sensitive to the phase estimation accuracy of these off-diagonal entries. Since it is non-trivial to estimate the covariance phases from noisy speech data, we propose a linear multidimensional short-time spectral amplitude estimator that circumvents the need to estimate them. We evaluate the speech enhancement performance of this approach and compare it to relevant benchmarks that also take into account inter-channel dependencies.
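The sensitivity described in the abstract can be illustrated with a toy calculation. The sketch below is not the estimator proposed in the paper. It assumes a hypothetical two-bin setup with a linear MMSE (multidimensional Wiener) gain, white noise, and made-up values for the correlation magnitude, phase, and noise power. It compares the theoretical mean squared error of a gain built from the true speech covariance, a gain built from the diagonal only, and gains built from a covariance whose off-diagonal phase is wrong by a given amount.

```python
import numpy as np

def speech_cov(rho_mag, rho_phase, power=1.0):
    """Hypothetical 2x2 Hermitian speech covariance with a complex off-diagonal entry."""
    c = power * rho_mag * np.exp(1j * rho_phase)
    return np.array([[power, c], [np.conj(c), power]])

def wiener_gain(cs_model, cn):
    """Linear MMSE gain W = Cs (Cs + Cn)^-1 built from the *modelled* speech covariance."""
    return cs_model @ np.linalg.inv(cs_model + cn)

def mse(w, cs_true, cn):
    """Theoretical MSE of s_hat = W (s + n) under the *true* covariances."""
    e = np.eye(cs_true.shape[0]) - w
    err = e @ cs_true @ e.conj().T + w @ cn @ w.conj().T
    return float(np.real(np.trace(err)))

# Assumed toy values: strong inter-bin correlation, white noise.
cs_true = speech_cov(rho_mag=0.9, rho_phase=0.3)
cn = 0.5 * np.eye(2)

# Gain from the exact full covariance (best possible linear estimator).
mse_matched = mse(wiener_gain(cs_true, cn), cs_true, cn)

# Gain that ignores inter-bin correlation (diagonal covariance model).
cs_diag = np.diag(np.diag(cs_true))
mse_diag = mse(wiener_gain(cs_diag, cn), cs_true, cn)

# Gains from a full covariance whose off-diagonal phase is off by d radians.
phase_errors = np.linspace(0.0, np.pi, 5)
mse_mismatch = [
    mse(wiener_gain(speech_cov(0.9, 0.3 + d), cn), cs_true, cn)
    for d in phase_errors
]

for d, m in zip(phase_errors, mse_mismatch):
    print(f"phase error {d:.2f} rad: MSE {m:.3f}")
print(f"matched: {mse_matched:.3f}, diagonal model: {mse_diag:.3f}")
```

In this toy setup the matched full-covariance gain gives the lowest error, while a phase error of pi radians in the off-diagonal entry gives a higher error than ignoring the correlation altogether. This is consistent with the motivation for an estimator that does not require the covariance phases.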
Original language: English
Title of host publication: ITG-Fb. 282: Speech Communication
Publisher: VDE
Pages: 286-290
Number of pages: 5
Publication status: Published - 2018
Event: 13th ITG Conference on Speech Communication - Oldenburg, Germany
Duration: 10 Oct 2018 to 12 Oct 2018
