Quantle: Fair and Honest Presentation Coach in Your Pocket

Olga Saukh, Balz Maag

Research output: Contribution to conference › Paper › peer-review

Abstract

Great public speakers are made, not born. Practicing a presentation in front of colleagues is common practice and results in a set of subjective judgements about what could be improved. In this paper we describe the design and implementation of a mobile app which estimates the quality of a speaker’s delivery in real time in a fair, repeatable and privacy-preserving way. Quantle estimates the speaker’s pace in terms of the number of syllables, words and clauses, and computes pitch and the duration of pauses. These basic parameters are then used to estimate the talk complexity based on readability scores from the literature to help the speaker adjust the delivery to the target audience. In contrast to speech-to-text-based methods used to implement a digital presentation coach, Quantle does processing locally in real time and works in flight mode. This design has three implications: (1) Quantle does not interfere with the surrounding hardware, (2) it is power-aware, since 95.2 % of the energy used by the app on iPhone 6 is spent to operate the built-in microphone and the screen, and (3) audio data and processing results are not shared with a third party, thereby preserving the speaker’s privacy.

We evaluate Quantle on artificial, online and live data. We artificially modify an audio sample by changing the volume, speed, tempo, pitch and noise level to test the robustness of Quantle and its performance limits. We then test Quantle on 1017 TED talks held in English and compare the computed features to those extracted from the available transcripts processed by online text evaluation services. Quantle’s estimates of syllable and word counts are 85.4 % and 82.8 % accurate, and pitch is over 90 % accurate. We use the outcome of this study to extract typical ranges for each vocal characteristic. We then use Quantle on live data at a social event, and as a tool for speakers to track their delivery when rehearsing a talk. Our results confirm that Quantle is robust to different noise levels, varying distances from the sound source and phone orientation, and that it achieves performance comparable to speech-to-text methods.
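The abstract says that syllable, word and clause counts feed readability scores from the literature, but it does not name the scores. As an illustration only, the sketch below computes the well-known Flesch Reading Ease score and a simple speaking pace from such counts. The function names, the choice of Flesch Reading Ease, and the use of clause counts as a stand-in for sentence counts are assumptions made here, not details confirmed by the paper.

```python
def flesch_reading_ease(syllables: int, words: int, sentences: int) -> float:
    """Flesch Reading Ease: higher values indicate easier text.

    Assumption: for spoken audio, `sentences` could be approximated by a
    clause count; the abstract does not state how Quantle does this.
    """
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def pace_per_minute(count: int, seconds: float) -> float:
    """Speaking pace as units (syllables, words or clauses) per minute."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return count * 60.0 / seconds


if __name__ == "__main__":
    # Hypothetical counts for a two-minute excerpt of a talk.
    syllables, words, clauses, duration_s = 450, 300, 25, 120.0
    print("Reading ease:", round(flesch_reading_ease(syllables, words, clauses), 2))
    print("Words per minute:", pace_per_minute(words, duration_s))
```

Both functions need only aggregate counts, so a score of this kind can be computed without a transcript, which fits the local, offline processing described in the abstract.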
Original language: English
Number of pages: 12
Publication status: Published - 16 Apr 2019
Event: 18th ACM/IEEE International Conference on Information Processing in Sensor Networks: IPSN 2019 - Montreal, Canada
Duration: 16 Apr 2019 to 18 Apr 2019

Conference

Conference: 18th ACM/IEEE International Conference on Information Processing in Sensor Networks
Abbreviated title: IPSN
Country/Territory: Canada
City: Montreal
Period: 16/04/19 to 18/04/19

Fields of Expertise

  • Information, Communication & Computing
