EU - DIRHA - FP7 - Distant speech Interaction for Robust Home Applications

  • Morales Cordovilla, Juan Andrés, (Teilnehmer (Co-Investigator))
  • Hagmüller, Martin (Teilnehmer (Co-Investigator))
  • Kubin, Gernot (Projektleiter (Principal Investigator))

Projekt: Foschungsprojekt



The DIRHA project addresses the development of voice-enabled automated home environments based on distant-speech interaction in different languages. A distributed microphone network is installed in the rooms of a house in order to monitor selectively acoustic and speech activities observable inside any space, and to eventually run a spoken dialogue session with a given user in order to implement a service or to have access to appliances and other devices. The multi-microphone front-end is based on the use of arrays consisting of analog microphones or Micro Electro-Mechanical Systems (MEMS) digital microphones. The targeted system analyses the given multi-space acoustic scene in a coherent way, by processing in a parallelized fashion simultaneous activities which occur in different rooms, and in case by supporting at the same time the interaction with users who may speak in different areas of the house. These very challenging objectives require advances in different scientific and technical fields. In fact, based on the given network of microphone arrays, multi-microphone front-end processing includes, among the others, tasks as speaker localization, acoustic echo cancellation, speech enhancement, acoustic event segmentation and classification. It is then necessary to have robust technologies for distant-speech recognition and speaker identification (and verification). Effective solutions for language modeling in the selected languages, speech understanding, concurrent management of spoken dialogue interaction, together with user interface and integration between the resulting technological components, will also represent fundamental features for the implementation of the proposed smart home interface. The final prototype will be integrated in an automated home and evaluated by real users.
Tatsächlicher Beginn/ -es Ende1/01/1231/12/14


  • 3 Beitrag in einem Konferenzband
  • 1 Poster
  • 1 Artikel

A corpus of read and conversational Austrian German

Schuppler, B., Hagmüller, M. & Zahrer, A., 1 Nov 2017, in : Speech Communication. 94, S. 62-74 13 S.

Publikation: Beitrag in einer FachzeitschriftArtikelForschungBegutachtung

  • AMISCO: The Austrian German Multi-Sensor Corpus

    Pessentheiner, H., Pichler, T. C. & Hagmüller, M., 2016, Proceedings of the Tenth International Conference on Language Resources and Evaluation. European Language Resources Association

    Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandForschungBegutachtung

    CVX-Optimized Beamforming and Vector Taylor Series Compensation with German ASR Employing Star-Shaped Microphone Array

    Morales Cordovilla, J. A., Pessentheiner, H., Hagmüller, M., Gonzales, J. A. & Kubin, G., 2014, Advances in Speech and Language Technologies for Iberian Languages. Springer, Band 8854. S. 148-157 (Lecture notes in computer science; Band 8854).

    Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandForschungBegutachtung