Maximum a posteriori speech enhancement based on double spectrum

Pejman Mowlaee, Daniel Scheran, Johannes Stahl, Sean U.N. Wood, W. Bastiaan Kleijn

Research output: Chapter in Book/Report/Conference proceedingConference contributionResearchpeer-review

Abstract

While the acoustic frequency domain has been widely used for speech enhancement, usage of the modulation domain is less common. In this paper, we investigate single-channel speech enhancement in the recently proposed Double Spectrum (DS) framework and provide insights on the statistical properties of speech and noise in the DS domain. Relying on our statistical analysis in the DS, we derive a maximum a posteriori estimator of speech in the DS domain. By means of experiments, we evaluate the speech enhancement performance of the proposed method and relevant benchmarks in the acoustic frequency and modulation domains and show that the proposed method achieves a good balance between noise attenuation and speech distortion for various SNRs and noise types.

Original languageEnglish
Title of host publicationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019
Pages2738-2742
Number of pages5
DOIs
Publication statusPublished - 1 Jan 2019
Event20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019 - Graz, Austria
Duration: 15 Sep 201919 Sep 2019

Conference

Conference20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019
CountryAustria
CityGraz
Period15/09/1919/09/19

    Fingerprint

Keywords

  • Double Spectrum
  • MAP Estimator
  • Modulation Domain Processing
  • Speech Enhancement

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modelling and Simulation

Cite this

Mowlaee, P., Scheran, D., Stahl, J., Wood, S. U. N., & Bastiaan Kleijn, W. (2019). Maximum a posteriori speech enhancement based on double spectrum. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019 (pp. 2738-2742) https://doi.org/10.21437/Interspeech.2019-1197