The New Bionic Electro-Larynx Speech System

Anna Katharina Fuchs, Martin Hagmüller, Gernot Kubin

Research output: Contribution to journal › Article › Research › peer-review

Abstract

Persons who have lost their larynx and thus speech functionality need to use a substitution voice to regain speech. The electro-larynx (EL) is a widely used device but is known for its unnatural and monotonic speech quality. Previous research has addressed these problems, but until now no significant improvements could be reported. Moreover, the importance of human-centered computing and co-adaptation was only considered marginally. The EL speech system is a complex system including hardware (artificial excitation source or sound transducer), software (control and generation of the artificial excitation signal) and, most important of all, the human user. Taking one isolated problem into consideration is not an adequate approach here, since all aspects of the EL speech system need to be integrated into a holistic approach. We have responded by pushing forward the boundaries of the conventional EL device toward a new bionic electro-larynx speech system. In this paper, we review existing approaches and summarize the most important findings. Subsequently, we piece together an overall speech system, which integrates several parts to significantly improve EL speech: we study electromyographic signals to analyze their suitability for hands-free on/off control of the EL speech system and investigate learning effects in the training of naive users. We propose a new transducer for the EL speech system based on electromagnetic mechanisms. The technical properties of the new transducer show significant advantages over the conventional electro-dynamic transducer. Furthermore, we propose an algorithm to estimate a changing fundamental frequency from the speech spectral envelope and an optimized waveform shape for the excitation signal. Listening tests show that a changing fundamental frequency improves the perceived naturalness of EL speech. Listening tests serve as a proof of concept for the resulting EL speech system and confirm that the proposed system is very promising.

Original language: English
Article number: 7421971
Pages (from-to): 952-961
Number of pages: 10
Journal: IEEE Journal of Selected Topics in Signal Processing
Volume: 10
Issue number: 5
DOIs: 10.1109/JSTSP.2016.2535970
Publication status: Published - 1 Aug 2016

Keywords

  • Artificial excitation signal
  • bionic
  • electro-larynx
  • electro-magnetic transducer
  • electromyography
  • frequency estimation
  • speech enhancement

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Signal Processing

Cite this

The New Bionic Electro-Larynx Speech System. / Fuchs, Anna Katharina; Hagmüller, Martin; Kubin, Gernot.

In: IEEE Journal of Selected Topics in Signal Processing, Vol. 10, No. 5, 7421971, 01.08.2016, p. 952-961.

@article{b60e6606567e4522902765759e4cf3d9,
title = "The New Bionic Electro-Larynx Speech System",
abstract = "Persons who have lost their larynx and thus speech functionality need to use a substitution voice to regain speech. The electro-larynx (EL) is a widely used device but is known for its unnatural and monotonic speech quality. Previous research has addressed these problems, but until now no significant improvements could be reported. Moreover, the importance of human-centered computing and co-adaptation was only considered marginally. The EL speech system is a complex system including hardware (artificial excitation source or sound transducer), software (control and generation of the artificial excitation signal) and, most important of all, the human user. Taking one isolated problem into consideration is not an adequate approach here, since all aspects of the EL speech system need to be integrated into a holistic approach. We have responded by pushing forward the boundaries of the conventional EL device toward a new bionic electro-larynx speech system. In this paper, we review existing approaches and summarize the most important findings. Subsequently, we piece together an overall speech system, which integrates several parts to significantly improve EL speech: we study electromyographic signals to analyze their suitability for hands-free on/off control of the EL speech system and investigate learning effects in the training of naive users. We propose a new transducer for the EL speech system based on electromagnetic mechanisms. The technical properties of the new transducer show significant advantages over the conventional electro-dynamic transducer. Furthermore, we propose an algorithm to estimate a changing fundamental frequency from the speech spectral envelope and an optimized waveform shape for the excitation signal. Listening tests show that a changing fundamental frequency improves the perceived naturalness of EL speech. Listening tests serve as a proof of concept for the resulting EL speech system and confirm that the proposed system is very promising.",
keywords = "Artificial excitation signal, bionic, electro-larynx, electro-magnetic transducer, electromyography, frequency estimation, speech enhancement",
author = "Fuchs, Anna Katharina and Hagm{\"u}ller, Martin and Kubin, Gernot",
year = "2016",
month = "8",
day = "1",
doi = "10.1109/JSTSP.2016.2535970",
language = "English",
volume = "10",
pages = "952--961",
journal = "IEEE Journal of Selected Topics in Signal Processing",
issn = "1932-4553",
publisher = "Institute of Electrical and Electronics Engineers",
number = "5",

}

TY - JOUR

T1 - The New Bionic Electro-Larynx Speech System

AU - Fuchs, Anna Katharina

AU - Hagmüller, Martin

AU - Kubin, Gernot

PY - 2016/8/1

Y1 - 2016/8/1

N2 - Persons who have lost their larynx and thus speech functionality need to use a substitution voice to regain speech. The electro-larynx (EL) is a widely used device but is known for its unnatural and monotonic speech quality. Previous research has addressed these problems, but until now no significant improvements could be reported. Moreover, the importance of human-centered computing and co-adaptation was only considered marginally. The EL speech system is a complex system including hardware (artificial excitation source or sound transducer), software (control and generation of the artificial excitation signal) and, most important of all, the human user. Taking one isolated problem into consideration is not an adequate approach here, since all aspects of the EL speech system need to be integrated into a holistic approach. We have responded by pushing forward the boundaries of the conventional EL device toward a new bionic electro-larynx speech system. In this paper, we review existing approaches and summarize the most important findings. Subsequently, we piece together an overall speech system, which integrates several parts to significantly improve EL speech: we study electromyographic signals to analyze their suitability for hands-free on/off control of the EL speech system and investigate learning effects in the training of naive users. We propose a new transducer for the EL speech system based on electromagnetic mechanisms. The technical properties of the new transducer show significant advantages over the conventional electro-dynamic transducer. Furthermore, we propose an algorithm to estimate a changing fundamental frequency from the speech spectral envelope and an optimized waveform shape for the excitation signal. Listening tests show that a changing fundamental frequency improves the perceived naturalness of EL speech. Listening tests serve as a proof of concept for the resulting EL speech system and confirm that the proposed system is very promising.

AB - Persons who have lost their larynx and thus speech functionality need to use a substitution voice to regain speech. The electro-larynx (EL) is a widely used device but is known for its unnatural and monotonic speech quality. Previous research has addressed these problems, but until now no significant improvements could be reported. Moreover, the importance of human-centered computing and co-adaptation was only considered marginally. The EL speech system is a complex system including hardware (artificial excitation source or sound transducer), software (control and generation of the artificial excitation signal) and, most important of all, the human user. Taking one isolated problem into consideration is not an adequate approach here, since all aspects of the EL speech system need to be integrated into a holistic approach. We have responded by pushing forward the boundaries of the conventional EL device toward a new bionic electro-larynx speech system. In this paper, we review existing approaches and summarize the most important findings. Subsequently, we piece together an overall speech system, which integrates several parts to significantly improve EL speech: we study electromyographic signals to analyze their suitability for hands-free on/off control of the EL speech system and investigate learning effects in the training of naive users. We propose a new transducer for the EL speech system based on electromagnetic mechanisms. The technical properties of the new transducer show significant advantages over the conventional electro-dynamic transducer. Furthermore, we propose an algorithm to estimate a changing fundamental frequency from the speech spectral envelope and an optimized waveform shape for the excitation signal. Listening tests show that a changing fundamental frequency improves the perceived naturalness of EL speech. Listening tests serve as a proof of concept for the resulting EL speech system and confirm that the proposed system is very promising.

KW - Artificial excitation signal

KW - bionic

KW - electro-larynx

KW - electro-magnetic transducer

KW - electromyography

KW - frequency estimation

KW - speech enhancement

UR - http://www.scopus.com/inward/record.url?scp=84982747119&partnerID=8YFLogxK

U2 - 10.1109/JSTSP.2016.2535970

DO - 10.1109/JSTSP.2016.2535970

M3 - Article

VL - 10

SP - 952

EP - 961

JO - IEEE Journal of Selected Topics in Signal Processing

JF - IEEE Journal of Selected Topics in Signal Processing

SN - 1932-4553

IS - 5

M1 - 7421971

ER -