ESSV Konferenz Elektronische Sprachsignalverarbeitung

Studientexte zur Sprachkommunikation Band 103: Elektronische Sprachsignalverarbeitung 2022


Conference proceedings of the 33st conference in Sonderborg with 31 contributions. Editor(s): Oliver Niebuhr, Malin Svensson Lundmark, Heather Weston ISBN: 978-3-95908-548-9

Models




Producing syllables: motor planning, motor programming and execution

Bernd J. Kröger, Trevor Bekolay




Improved features driving an T-oscillator for cortical segmentation of speech into syllables

Harald Höge




Speech intelligibility prediction with hybrid auditory model- and ML-based methods: The best of two worlds?

Birger Kollmeier, David Hülsmeier, Anna Warzybok




Towards a soft fluidic elastomer tongue for a mechanical vocal tract

Peter Birkholz, Christian Kosmas Mayer, Patrick Häsner

Articulatory Synthesis




Using semantic embeddings for initiating and planning articulatory speech synthesis

Paul Schmidt-Barbo, Sebastian Otte, Martin V. Butz, R. Harald Baayen, Konstantin Sering




Articubench - An articulatory speech synthesis benchmark

Konstantin Sering, Paul Schmidt-Barbo




Efficient exploration of articulatory dimensions

Paul Konstantin Krug, Peter Birkholz, Branislav Gerazov, Daniel Rudolph Van Niekerk, Anqi Xu, Yi Xu




Is there a hesitation bias for ambiguous color terms?

Simon Betz, Ricardo Davids, Caroline Müller, Éva Székely, Petra Wagner, Maischa Amelie Weber, Cassandra Youssef-Baronfeind, Sina Zarrieß

Interaction & Turn-taking




Analysis of phonetic/prosodic features in interaction stages

Daniel Duran, Ronald Böck




Lexical frequency and listener's response to packet loss in telephone conversations

Thilo Michael, Omnia Ibrahim




The power of conversation flow in video conference tools: evaluation of speaker change cues

Mincheng Chang, Thilo Michael, Sebastian Möller, David Schlangen




Times and turns in stimulating meetings

Ronald Böck

Voice Assistants & Speech Dialogue Systems




Upcoming new ITU-T recommendation on the evaluation of text-based chatbots

Sebastian Möller, Stefan Hillmann, Thilo Michael, Jan Nehring, Tim Polzehl




Kommunikative Komponenten sozialer Intelligenz von künstlichen kooperativen Spielenden

Casey C. Bennett, Benjamin Weiss, Jaeyoung Suh, Eunseo Yoon, Jihong Jeong, Sungmin Yang, Yejin Chae




Erroneous reactions of voice assistants

Lea Kisser, Ingo Siegert




The voice of creativity: Effects of pitch range in the voice of a robot facilitator

Kerstin Fischer, Oliver Niebuhr, Ali Asadi

Poster




Perceptual cues for smiled voice - An articulatory synthesis study

Simon Stone, Pia Abdul-Hak, Peter Birkholz




Perceptual categorization of breath noises in speech pauses

Raphael Werner, Jürgen Trouvain, Beeke Muhlack, Bernd Möbius




Einfluss von Entrauschungsverfahren auf die automatische Segmentierung mit WebMAUS

Lorenz Gutscher, Nicola Klingler, Michael Pucher




Vergleichende Evaluation von zwei Ansätzen für ein Question-Answering System

Katja Schreiber, Stefan Hillman




The Charles - A new sensor device for measuring body language and stress in speech communication

Vidar Freyr Gudmundsson, ïo Valls-Ratés, Oliver Niebuhr




F1 and F2 formant variations and inter-speaker articulatory variability: A preliminary analysis

Antoine Serrurier, Christiane Neuschaefer-Rube




The effects of the online visualization of acoustic-prosodic features of speech on speakers' productions

Kerstin Fischer, Oliver Niebuhr

Signal Processing & Comprehension




Detection of salient events in an acoustical scene

Kristian Kroschel




A Window-based method for target estimation

Paul Konstantin Krug




Comparing detection methods for pause-internal particles

Mikey Elmers




Comprehension of closely related languages: A visual world eye tracking study

Jacek Kudera, Philip Georgis, Hasan Md Tusfiqur Alam, Bernd Möbius, Tania Avgustinova, Dietrich Klakow

Prosody




Prosodic characteristics of Bulgarian-Accented German

Bistra Andreeva, Snezhina Dimitrova




Improving the quality of synthesized speech of a Viennese dialect speaker through speaker adaptation

Lorenz Gutscher, Michael Pucher

Emotion




Emotion preservation for one-shot speaker anonymization using McAdams

Yamini Sinha, Andreas Wendemuth, Ingo Siegert




"High on emotion?" How audio codecs interfere with the perceived charisma and emotional states of men and women

Oliver Niebuhr, Ingo Siegert