Source-Filter based Single Channel Speech Separation using Pitch Information

Michael Stark; Michael Wohlmayr; Franz Pernkopf

doi:10.1109/TASL.2010.2047419

Institute of Signal Processing and Speech Communication (4420)

Research output: Contribution to journal › Article › peer-review

Abstract

In this paper, we investigate the source-filter-based approach to single-channel speech separation. We incorporate source-driven aspects into the model-driven method through multi-pitch estimation, for which a factorial HMM is used. To model the vocal tract filters, either vector quantization (VQ) or non-negative matrix factorization is considered. For both methods, combining the source and filter models yields an utterance-dependent model that enables speaker-independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ-based method, which accounts for different mixing levels, and a fast approximation of the likelihood computation. Additionally, a linear relationship between pitch-tracking performance and speech separation performance is shown.

Original language	English
Pages (from-to)	242-255
Journal	IEEE Transactions on Audio Speech and Language Processing
Volume	19
Issue number	2
DOIs	https://doi.org/10.1109/TASL.2010.2047419
Publication status	Published - 2011

Cite this

@article{d6f56ba1c6ec4eb98b68750ddfda825a,

title = "Source-Filter based Single Channel Speech Separation using Pitch Information",

abstract = "In this paper, we investigate the source-filter-based approach to single-channel speech separation. We incorporate source-driven aspects into the model-driven method through multi-pitch estimation, for which a factorial HMM is used. To model the vocal tract filters, either vector quantization (VQ) or non-negative matrix factorization is considered. For both methods, combining the source and filter models yields an utterance-dependent model that enables speaker-independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ-based method, which accounts for different mixing levels, and a fast approximation of the likelihood computation. Additionally, a linear relationship between pitch-tracking performance and speech separation performance is shown.",

author = "Michael Stark and Michael Wohlmayr and Franz Pernkopf",

year = "2011",

doi = "10.1109/TASL.2010.2047419",

language = "English",

volume = "19",

pages = "242--255",

journal = "IEEE Transactions on Audio Speech and Language Processing",

issn = "1558-7924",

publisher = "Institute of Electrical and Electronics Engineers",

number = "2",

}

TY - JOUR

T1 - Source-Filter based Single Channel Speech Separation using Pitch Information

AU - Stark, Michael

AU - Wohlmayr, Michael

AU - Pernkopf, Franz

PY - 2011

Y1 - 2011

AB - In this paper, we investigate the source-filter-based approach to single-channel speech separation. We incorporate source-driven aspects into the model-driven method through multi-pitch estimation, for which a factorial HMM is used. To model the vocal tract filters, either vector quantization (VQ) or non-negative matrix factorization is considered. For both methods, combining the source and filter models yields an utterance-dependent model that enables speaker-independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ-based method, which accounts for different mixing levels, and a fast approximation of the likelihood computation. Additionally, a linear relationship between pitch-tracking performance and speech separation performance is shown.

U2 - 10.1109/TASL.2010.2047419

DO - 10.1109/TASL.2010.2047419

M3 - Article

SN - 1558-7924

VL - 19

SP - 242

EP - 255

JO - IEEE Transactions on Audio Speech and Language Processing

JF - IEEE Transactions on Audio Speech and Language Processing

IS - 2

ER -
