Binaural Codebook-Based Speech Enhancement with Atomic Speech Presence Probability

Sean U.N. Wood*, Johannes K.W. Stahl, Pejman Mowlaee

*Korrespondierende/r Autor/-in für diese Arbeit

Publikation: Beitrag in einer FachzeitschriftArtikelBegutachtung

Abstract

In this work, we present a universal codebook-based speech enhancement framework that relies on a single codebook to encode both speech and noise components. The atomic speech presence probability (ASPP) is defined as the probability that a given codebook atom encodes speech at a given point in time. We develop ASPP estimators based on binaural cues including the interaural phase and level difference (IPD and ILD), the interaural coherence magnitude (ICM), as well as a combined version leveraging the full interaural transfer function (ITF). We evaluate the performance of the resulting ASPP-based speech enhancement algorithms on binaural mixtures of reverberant speech and real-world noise. The proposed approach improves both objective speech quality and intelligibility over a wide range of input SNR, as measured with PESQ and binaural STOI metrics, outperforming two binaural speech enhancement benchmark methods. We show that the proposed ITF-based ASPP approach achieves a good balance of the trade-off between binaural noise reduction and binaural cue preservation.

Originalspracheenglisch
Aufsatznummer8811601
Seiten (von - bis)2150-2161
Seitenumfang12
FachzeitschriftIEEE/ACM Transactions on Audio Speech and Language Processing
Jahrgang27
Ausgabenummer12
DOIs
PublikationsstatusVeröffentlicht - 1 Dez. 2019

ASJC Scopus subject areas

  • Informatik (sonstige)
  • Akustik und Ultraschall
  • Computational Mathematics
  • Elektrotechnik und Elektronik

Fingerprint

Untersuchen Sie die Forschungsthemen von „Binaural Codebook-Based Speech Enhancement with Atomic Speech Presence Probability“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren