Measurement of fundamental frequencies in diplophonic voices

P. Aichinger, M. Hagmüller, Imme Roesner, Wolfgang Bigenzahn, Berit Schneider-Stickler, Jean Schoentgen, F. Pernkopf

Research output: Chapter in Book/Report/Conference proceedingConference contributionResearchpeer-review

Abstract

Fundamental frequency (F0) extraction in disordered voice is a prerequisite for many types of clinical analyses. In this paper, we make an approach to F0 extraction based on audio waveform modeling and evaluate it with regard to reference F0s obtained from laryngeal high-speed videos. We analyze 65 euphonic and 112 dysphonic (28 diplophonic, 84 non-diplophonic) intervals of sustained phonations during rigid telescopic high-speed video laryngoscopy. Waveform modeling has a total error rate ETotal of 46.28 % on diplophonic voices, which is a significant improvement compared to a benchmark method (95.74 %). The results illustrate that F0 extraction in diplophonic voice is challenging. Multiple F0s need to be considered in the analysis of disordered voice, which may lead towards more valid clinical voice assessment in the future.

Original languageEnglish
Title of host publicationProceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015
EditorsClaudia Manfredi
PublisherFirenze University Press
Pages21-24
Number of pages4
ISBN (Electronic)9788866557920
Publication statusPublished - 1 Jan 2015
Event9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015 - Firenze, Italy
Duration: 2 Sep 20154 Sep 2015

Publication series

NameProceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015

Conference

Conference9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015
CountryItaly
CityFirenze
Period2/09/154/09/15

Keywords

  • Diplophonia
  • Fundamental frequency measurement
  • Laryngeal high-speed videos
  • Voice disorders

ASJC Scopus subject areas

  • Computer Science Applications
  • Biomedical Engineering

Cite this

Aichinger, P., Hagmüller, M., Roesner, I., Bigenzahn, W., Schneider-Stickler, B., Schoentgen, J., & Pernkopf, F. (2015). Measurement of fundamental frequencies in diplophonic voices. In C. Manfredi (Ed.), Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015 (pp. 21-24). (Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015). Firenze University Press.

Measurement of fundamental frequencies in diplophonic voices. / Aichinger, P.; Hagmüller, M.; Roesner, Imme; Bigenzahn, Wolfgang; Schneider-Stickler, Berit; Schoentgen, Jean; Pernkopf, F.

Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015. ed. / Claudia Manfredi. Firenze University Press, 2015. p. 21-24 (Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015).

Research output: Chapter in Book/Report/Conference proceedingConference contributionResearchpeer-review

Aichinger, P, Hagmüller, M, Roesner, I, Bigenzahn, W, Schneider-Stickler, B, Schoentgen, J & Pernkopf, F 2015, Measurement of fundamental frequencies in diplophonic voices. in C Manfredi (ed.), Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015. Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015, Firenze University Press, pp. 21-24, 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015, Firenze, Italy, 2/09/15.
Aichinger P, Hagmüller M, Roesner I, Bigenzahn W, Schneider-Stickler B, Schoentgen J et al. Measurement of fundamental frequencies in diplophonic voices. In Manfredi C, editor, Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015. Firenze University Press. 2015. p. 21-24. (Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015).
Aichinger, P. ; Hagmüller, M. ; Roesner, Imme ; Bigenzahn, Wolfgang ; Schneider-Stickler, Berit ; Schoentgen, Jean ; Pernkopf, F. / Measurement of fundamental frequencies in diplophonic voices. Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015. editor / Claudia Manfredi. Firenze University Press, 2015. pp. 21-24 (Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015).
@inproceedings{e5d031b1925f4d17a2a1de519b3b8efa,
title = "Measurement of fundamental frequencies in diplophonic voices",
abstract = "Fundamental frequency (F0) extraction in disordered voice is a prerequisite for many types of clinical analyses. In this paper, we make an approach to F0 extraction based on audio waveform modeling and evaluate it with regard to reference F0s obtained from laryngeal high-speed videos. We analyze 65 euphonic and 112 dysphonic (28 diplophonic, 84 non-diplophonic) intervals of sustained phonations during rigid telescopic high-speed video laryngoscopy. Waveform modeling has a total error rate ETotal of 46.28 {\%} on diplophonic voices, which is a significant improvement compared to a benchmark method (95.74 {\%}). The results illustrate that F0 extraction in diplophonic voice is challenging. Multiple F0s need to be considered in the analysis of disordered voice, which may lead towards more valid clinical voice assessment in the future.",
keywords = "Diplophonia, Fundamental frequency measurement, Laryngeal high-speed videos, Voice disorders",
author = "P. Aichinger and M. Hagm{\"u}ller and Imme Roesner and Wolfgang Bigenzahn and Berit Schneider-Stickler and Jean Schoentgen and F. Pernkopf",
year = "2015",
month = "1",
day = "1",
language = "English",
series = "Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015",
publisher = "Firenze University Press",
pages = "21--24",
editor = "Claudia Manfredi",
booktitle = "Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015",

}

TY - GEN

T1 - Measurement of fundamental frequencies in diplophonic voices

AU - Aichinger, P.

AU - Hagmüller, M.

AU - Roesner, Imme

AU - Bigenzahn, Wolfgang

AU - Schneider-Stickler, Berit

AU - Schoentgen, Jean

AU - Pernkopf, F.

PY - 2015/1/1

Y1 - 2015/1/1

N2 - Fundamental frequency (F0) extraction in disordered voice is a prerequisite for many types of clinical analyses. In this paper, we make an approach to F0 extraction based on audio waveform modeling and evaluate it with regard to reference F0s obtained from laryngeal high-speed videos. We analyze 65 euphonic and 112 dysphonic (28 diplophonic, 84 non-diplophonic) intervals of sustained phonations during rigid telescopic high-speed video laryngoscopy. Waveform modeling has a total error rate ETotal of 46.28 % on diplophonic voices, which is a significant improvement compared to a benchmark method (95.74 %). The results illustrate that F0 extraction in diplophonic voice is challenging. Multiple F0s need to be considered in the analysis of disordered voice, which may lead towards more valid clinical voice assessment in the future.

AB - Fundamental frequency (F0) extraction in disordered voice is a prerequisite for many types of clinical analyses. In this paper, we make an approach to F0 extraction based on audio waveform modeling and evaluate it with regard to reference F0s obtained from laryngeal high-speed videos. We analyze 65 euphonic and 112 dysphonic (28 diplophonic, 84 non-diplophonic) intervals of sustained phonations during rigid telescopic high-speed video laryngoscopy. Waveform modeling has a total error rate ETotal of 46.28 % on diplophonic voices, which is a significant improvement compared to a benchmark method (95.74 %). The results illustrate that F0 extraction in diplophonic voice is challenging. Multiple F0s need to be considered in the analysis of disordered voice, which may lead towards more valid clinical voice assessment in the future.

KW - Diplophonia

KW - Fundamental frequency measurement

KW - Laryngeal high-speed videos

KW - Voice disorders

UR - http://www.scopus.com/inward/record.url?scp=85025098699&partnerID=8YFLogxK

M3 - Conference contribution

T3 - Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015

SP - 21

EP - 24

BT - Proceedings and Report - 9th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2015

A2 - Manfredi, Claudia

PB - Firenze University Press

ER -