Maximum a posteriori speech enhancement based on double spectrum

Pejman Mowlaee, Daniel Scheran, Johannes Stahl, Sean U.N. Wood, W. Bastiaan Kleijn

Publication: Contribution to book/report/conference proceedings › Contribution to conference proceedings › Peer-reviewed

Abstract

While the acoustic frequency domain has been widely used for speech enhancement, use of the modulation domain is less common. In this paper, we investigate single-channel speech enhancement in the recently proposed Double Spectrum (DS) framework and provide insights on the statistical properties of speech and noise in the DS domain. Relying on this statistical analysis, we derive a maximum a posteriori estimator of speech in the DS domain. By means of experiments, we evaluate the speech enhancement performance of the proposed method and relevant benchmarks in the acoustic frequency and modulation domains, and show that the proposed method achieves a good balance between noise attenuation and speech distortion for various SNRs and noise types.

Original language: English
Title: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019
Pages: 2738-2742
Number of pages: 5
DOIs
Publication status: Published - 1 Jan 2019
Event: 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language: INTERSPEECH 2019 - Messe Congress Graz, Graz, Austria
Duration: 15 Sept 2019 → 19 Sept 2019

Conference

Conference: 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language
Country/Territory: Austria
City: Graz
Period: 15/09/19 → 19/09/19

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modeling and Simulation
