ESSV Archive

Studientexte zur Sprachkommunikation, Volume 103: Elektronische Sprachsignalverarbeitung 2022

Conference proceedings of the 33rd conference in Sonderborg with 31 contributions. Editors: Oliver Niebuhr, Malin Svensson Lundmark, Heather Weston. ISBN: 978-3-95908-548-9


Models

Producing syllables: motor planning, motor programming and execution

Bernd J. Kröger, Trevor Bekolay

Improved features driving a Θ-oscillator for cortical segmentation of speech into syllables

Harald Höge

Speech intelligibility prediction with hybrid auditory model- and ML-based methods: The best of two worlds?

Birger Kollmeier, David Hülsmeier, Anna Warzybok

Towards a soft fluidic elastomer tongue for a mechanical vocal tract

Peter Birkholz, Christian Kosmas Mayer, Patrick Häsner

Articulatory Synthesis

Using semantic embeddings for initiating and planning articulatory speech synthesis

Paul Schmidt-Barbo, Sebastian Otte, Martin V. Butz, R. Harald Baayen, Konstantin Sering

Articubench - An articulatory speech synthesis benchmark

Konstantin Sering, Paul Schmidt-Barbo

Efficient exploration of articulatory dimensions

Paul Konstantin Krug, Peter Birkholz, Branislav Gerazov, Daniel Rudolph Van Niekerk, Anqi Xu, Yi Xu

Is there a hesitation bias for ambiguous color terms?

Simon Betz, Ricardo Davids, Caroline Müller, Éva Székely, Petra Wagner, Maischa Amelie Weber, Cassandra Youssef-Baronfeind, Sina Zarrieß

Interaction & Turn-taking

Analysis of phonetic/prosodic features in interaction stages

Daniel Duran, Ronald Böck

Lexical frequency and listener's response to packet loss in telephone conversations

Thilo Michael, Omnia Ibrahim

The power of conversation flow in video conference tools: evaluation of speaker change cues

Mincheng Chang, Thilo Michael, Sebastian Möller, David Schlangen

Times and turns in stimulating meetings

Ronald Böck

Voice Assistants & Speech Dialogue Systems

Upcoming new ITU-T recommendation on the evaluation of text-based chatbots

Sebastian Möller, Stefan Hillmann, Thilo Michael, Jan Nehring, Tim Polzehl

Kommunikative Komponenten sozialer Intelligenz von künstlichen kooperativen Spielenden [Communicative components of social intelligence in artificial cooperative players]

Casey C. Bennett, Benjamin Weiss, Jaeyoung Suh, Eunseo Yoon, Jihong Jeong, Sungmin Yang, Yejin Chae

Erroneous reactions of voice assistants

Lea Kisser, Ingo Siegert

The voice of creativity: Effects of pitch range in the voice of a robot facilitator

Kerstin Fischer, Oliver Niebuhr, Ali Asadi

Poster

Perceptual cues for smiled voice - An articulatory synthesis study

Simon Stone, Pia Abdul-Hak, Peter Birkholz

Perceptual categorization of breath noises in speech pauses

Raphael Werner, Jürgen Trouvain, Beeke Muhlack, Bernd Möbius

Einfluss von Entrauschungsverfahren auf die automatische Segmentierung mit WebMAUS [Influence of denoising methods on automatic segmentation with WebMAUS]

Lorenz Gutscher, Nicola Klingler, Michael Pucher

Vergleichende Evaluation von zwei Ansätzen für ein Question-Answering System [Comparative evaluation of two approaches for a question-answering system]

Katja Schreiber, Stefan Hillmann

The Charles - A new sensor device for measuring body language and stress in speech communication

Vidar Freyr Gudmundsson, Ïo Valls-Ratés, Oliver Niebuhr

F1 and F2 formant variations and inter-speaker articulatory variability: A preliminary analysis

Antoine Serrurier, Christiane Neuschaefer-Rube

The effects of the online visualization of acoustic-prosodic features of speech on speakers' productions

Kerstin Fischer, Oliver Niebuhr

Signal Processing & Comprehension

Detection of salient events in an acoustical scene

Kristian Kroschel

A window-based method for target estimation

Paul Konstantin Krug

Comparing detection methods for pause-internal particles

Mikey Elmers

Comprehension of closely related languages: A visual world eye tracking study

Jacek Kudera, Philip Georgis, Hasan Md Tusfiqur Alam, Bernd Möbius, Tania Avgustinova, Dietrich Klakow

Prosody

Prosodic characteristics of Bulgarian-accented German

Bistra Andreeva, Snezhina Dimitrova

Improving the quality of synthesized speech of a Viennese dialect speaker through speaker adaptation

Lorenz Gutscher, Michael Pucher

Emotion

Emotion preservation for one-shot speaker anonymization using McAdams

Yamini Sinha, Andreas Wendemuth, Ingo Siegert

"High on emotion?" How audio codecs interfere with the perceived charisma and emotional states of men and women

Oliver Niebuhr, Ingo Siegert
