A corpus of read and conversational Austrian German

Publikation: Beitrag in einer FachzeitschriftArtikelForschungBegutachtung

Abstract

This paper presents GRASS (Graz corpus of Read and Spontaneous Speech), the first large scale speech database for Austrian German with both read and conversational speech. In total, the corpus contains approximately 1900 min of speech in which 38 speakers produced more than 220,000 word tokens from 14,593 different word types. The corpus consists of three components. First, the Conversational Speech Component contains free conversations of one hour length between friends, colleagues, couples, or family members. Second, the Commands Component contains commands and keywords which were either read or elicited by pictures. Third, the Read Speech Component contains phonetically balanced sentences and digits. The speech of all components has been recorded at fullband quality in a soundproof recording-studio with head-mounted microphones, large-diaphragm microphones, a laryngograph, and with a video camera. The corpus was fully annotated at the orthographic level, and partly also at the segmental, sub-segmental and prosodic level. Our analysis of conversational speech characteristics such as overlapping speech, laughter, repetitions, hesitations and the use of colloquial and dialectal words allows us to conclude that the conversational speech material is highly casual in nature. The collected corpus provides conversational material for phoneticians and linguists interested in topics specific for Austrian German (e.g., pronunciation variability, prosody, syntax of spoken Austrian German), and for those studying talk in interaction in general (turn-taking, grounding, entrainment, extra-linguistic factors etc.). Furthermore, it is a valuable resource for speech technologists interested in the development of ASR and dialogue systems for different speaking styles of Austrian German.

Originalspracheenglisch
Seiten (von - bis)62-74
Seitenumfang13
FachzeitschriftSpeech Communication
Jahrgang94
DOIs
PublikationsstatusVeröffentlicht - 1 Nov 2017

Fingerprint

Microphones
Corpus
Speech
Austrian German
Prosody
Dialogue Systems
colloquial
Entrainment
Studios
Electric grounding
Video cameras
Diaphragms
humor
Digit
Linguistics
syntax
family member
Overlapping
recording
speaking

Schlagwörter

    ASJC Scopus subject areas

    • Software
    • !!Modelling and Simulation
    • Kommunikation
    • Sprache und Linguistik
    • Linguistik und Sprache
    • !!Computer Vision and Pattern Recognition
    • !!Computer Science Applications

    Dies zitieren

    A corpus of read and conversational Austrian German. / Schuppler, Barbara; Hagmüller, Martin; Zahrer, Alexander.

    in: Speech Communication, Jahrgang 94, 01.11.2017, S. 62-74.

    Publikation: Beitrag in einer FachzeitschriftArtikelForschungBegutachtung

    @article{9ef0f62901c842b79d22701691b928e9,
    title = "A corpus of read and conversational Austrian German",
    abstract = "This paper presents GRASS (Graz corpus of Read and Spontaneous Speech), the first large scale speech database for Austrian German with both read and conversational speech. In total, the corpus contains approximately 1900 min of speech in which 38 speakers produced more than 220,000 word tokens from 14,593 different word types. The corpus consists of three components. First, the Conversational Speech Component contains free conversations of one hour length between friends, colleagues, couples, or family members. Second, the Commands Component contains commands and keywords which were either read or elicited by pictures. Third, the Read Speech Component contains phonetically balanced sentences and digits. The speech of all components has been recorded at fullband quality in a soundproof recording-studio with head-mounted microphones, large-diaphragm microphones, a laryngograph, and with a video camera. The corpus was fully annotated at the orthographic level, and partly also at the segmental, sub-segmental and prosodic level. Our analysis of conversational speech characteristics such as overlapping speech, laughter, repetitions, hesitations and the use of colloquial and dialectal words allows us to conclude that the conversational speech material is highly casual in nature. The collected corpus provides conversational material for phoneticians and linguists interested in topics specific for Austrian German (e.g., pronunciation variability, prosody, syntax of spoken Austrian German), and for those studying talk in interaction in general (turn-taking, grounding, entrainment, extra-linguistic factors etc.). Furthermore, it is a valuable resource for speech technologists interested in the development of ASR and dialogue systems for different speaking styles of Austrian German.",
    keywords = "Austrian German, Automatic transcription, Conversational speech, Prosodic transcription, Read speech",
    author = "Barbara Schuppler and Martin Hagm{\"u}ller and Alexander Zahrer",
    year = "2017",
    month = "11",
    day = "1",
    doi = "10.1016/j.specom.2017.09.003",
    language = "English",
    volume = "94",
    pages = "62--74",
    journal = "Speech Communication",
    issn = "0167-6393",
    publisher = "Elsevier B.V.",

    }

    TY - JOUR

    T1 - A corpus of read and conversational Austrian German

    AU - Schuppler, Barbara

    AU - Hagmüller, Martin

    AU - Zahrer, Alexander

    PY - 2017/11/1

    Y1 - 2017/11/1

    N2 - This paper presents GRASS (Graz corpus of Read and Spontaneous Speech), the first large scale speech database for Austrian German with both read and conversational speech. In total, the corpus contains approximately 1900 min of speech in which 38 speakers produced more than 220,000 word tokens from 14,593 different word types. The corpus consists of three components. First, the Conversational Speech Component contains free conversations of one hour length between friends, colleagues, couples, or family members. Second, the Commands Component contains commands and keywords which were either read or elicited by pictures. Third, the Read Speech Component contains phonetically balanced sentences and digits. The speech of all components has been recorded at fullband quality in a soundproof recording-studio with head-mounted microphones, large-diaphragm microphones, a laryngograph, and with a video camera. The corpus was fully annotated at the orthographic level, and partly also at the segmental, sub-segmental and prosodic level. Our analysis of conversational speech characteristics such as overlapping speech, laughter, repetitions, hesitations and the use of colloquial and dialectal words allows us to conclude that the conversational speech material is highly casual in nature. The collected corpus provides conversational material for phoneticians and linguists interested in topics specific for Austrian German (e.g., pronunciation variability, prosody, syntax of spoken Austrian German), and for those studying talk in interaction in general (turn-taking, grounding, entrainment, extra-linguistic factors etc.). Furthermore, it is a valuable resource for speech technologists interested in the development of ASR and dialogue systems for different speaking styles of Austrian German.

    AB - This paper presents GRASS (Graz corpus of Read and Spontaneous Speech), the first large scale speech database for Austrian German with both read and conversational speech. In total, the corpus contains approximately 1900 min of speech in which 38 speakers produced more than 220,000 word tokens from 14,593 different word types. The corpus consists of three components. First, the Conversational Speech Component contains free conversations of one hour length between friends, colleagues, couples, or family members. Second, the Commands Component contains commands and keywords which were either read or elicited by pictures. Third, the Read Speech Component contains phonetically balanced sentences and digits. The speech of all components has been recorded at fullband quality in a soundproof recording-studio with head-mounted microphones, large-diaphragm microphones, a laryngograph, and with a video camera. The corpus was fully annotated at the orthographic level, and partly also at the segmental, sub-segmental and prosodic level. Our analysis of conversational speech characteristics such as overlapping speech, laughter, repetitions, hesitations and the use of colloquial and dialectal words allows us to conclude that the conversational speech material is highly casual in nature. The collected corpus provides conversational material for phoneticians and linguists interested in topics specific for Austrian German (e.g., pronunciation variability, prosody, syntax of spoken Austrian German), and for those studying talk in interaction in general (turn-taking, grounding, entrainment, extra-linguistic factors etc.). Furthermore, it is a valuable resource for speech technologists interested in the development of ASR and dialogue systems for different speaking styles of Austrian German.

    KW - Austrian German

    KW - Automatic transcription

    KW - Conversational speech

    KW - Prosodic transcription

    KW - Read speech

    UR - http://www.scopus.com/inward/record.url?scp=85030676590&partnerID=8YFLogxK

    U2 - 10.1016/j.specom.2017.09.003

    DO - 10.1016/j.specom.2017.09.003

    M3 - Article

    VL - 94

    SP - 62

    EP - 74

    JO - Speech Communication

    JF - Speech Communication

    SN - 0167-6393

    ER -