ESSV Konferenz Elektronische Sprachsignalverarbeitung

Studientexte zur Sprachkommunikation Band 107: Elektronische Sprachsignalverarbeitung 2024


Conference proceedings of the 35th conference in Regensburg with 31 contributions. Editor(s): Timo Baumann. ISBN: 978-3-95908-325-6. Some of the articles in this volume are not available as PDF files. If you are interested in these individual contributions, the volume can be bought or borrowed.

Keynote




More Than Words: Advancements and Challenges in Speech Recognition for Singing

Anna Kruspe




Linguistic Politeness in Artificial Conversational Agents

Hendrik Buschmeier

Chatbots and Dialogue Systems




Chatbot in the Museum - A Field Study of User Experience and Modality Usage

Stefan Schaffer, Eva Schwaetzer, Aaron Ruß, Oliver Gustke




Usability and User Experience of a Chatbot for Student Support

Stefan Hillmann, Philine Kowol, Adnan Ahmad, Ruochen Tang, Sebastian Möller




Interaktionsverhalten eines Avatars im digitalen sprachtherapeutischen Setting

Mathias Walther, Elisabeth Zeuner, Eugenia Rykova




Review of Usage and Potentials of Conversational Interfaces at Universities and in Students Daily Lifes

Lea Kisser, Matthias Busch, Ingo Siegert

Phonetic Studies




Perception of Formant Distortion in German Words and Non-words

Uliana Eliseeva, Ivan Yuen, Bernd Möbius




Synchrony of Glottal Area Waveform Parameters During the Production of Obstruents in Vowel Context

Joao Vitor Possamai De Menezes, Christian Kleiner, Marie-Anne Kainz, Matthias Echternach, Peter Birkholz




Computergestützte Bestimmung des Sprechflusses bei Vorschulkindern

Valentin Kany, Jürgen Trouvain




The Use of Temporal Features in Cortical Segmentation of Syllables

Harald Höge

Speech Recognition and Understanding




Epsilon-Verarbeitung bei Minimalistischen Grammatiken für Zahlen

Johannes Kuhn, Matthias Wolff, Borislav Borislavov




NoiSLU: A Noisy Speech Corpus for Spoken Language Understanding in the Public Transport Domain

Mariano Frohnmaier, Steffen Freisinger, Madeline Faye Holt, Munir Georges




Ein quantenlogisch motivierter Ansatz zur Verarbeitung von Äußerungs-Bedeutungspaaren

Markus Huber-Liebl, Günther Wirsching




Octra Backend - Eine skalierbare Infrastruktur für Transkriptionsprojekte

Christoph Draxler, Julian Pömp

Paralinguistic Analyses




An Investigation of Acoustic Features of the Lower Vocal Tract for Speaker Recognition

Peter Birkholz, Xinyu Zhang




Towards Speech Privacy Assessment for Voice Assistants: Exploring Subjective and Objective Measures for Babble Noise

Anjana Rajasekhar, Anna Leschanowsky, Nils Peters




Konzept und Evaluation eines Softwaresystems zur Unterstützung der CRM-basierten Sprechwirkungsuntersuchung

Thorben Frank Jahnke, Corinna Sonnen, Mathias Walther




In Tune With In-Poco? A New Device for Analyzing and Training the Interplay of Body Posture and Charismatic Speech Prosody

Tobias Blaabjerg Karlsen, Karl Jhon Decuzar De Castro, Emils Pipars, Iyad Ahed Abdelrahman Abdel Qader, Jose Dumitru Ilinca Sainz, Simas Srugys, Oliver Niebuhr

Large Language Models




Can Language Models Behave Like Wine Sommeliers? Using Multiple Agents To Evaluate The Quality of Wine Descriptors Generated By Llama 2

Siddarth Venkateswaran, Ronald Böck




Supervised vs. Zero-Shot Learning Automatic Classification of Comments on Educational Videos Using Pre-Trained Language Models

Benedict Kettler, Stefan Hillmann




Is there Text in Wine? - S+U Learning-Based Named Entity Recognition and Triplet Extraction from Wine Aroma Descriptors

Siddarth Venkateswaran, Abdullah Al Foysal, Nazeer Basha Shaik, Ronald Böck




Can We See Your Response Before You Speak? Exploring Linguistic Information Found in Inter-Turn Pauses

Christian Schuler, Shravan Nayak, Debjoy Saha, Timo Baumann

Speech Synthesis and Listening Preferences




Speech/Non-Speech Classification Slightly Improves Synthesis Quality in PAULE

Konstantin Sering




Evaluation of Audio Deepfakes - Systematic Review

Yamini Sinha, Jan Hintz, Ingo Siegert




Evaluating the Impact of Prosody Feature Normalization on the Controllability of Pitch in Speech Synthesis

Judith Bauer, Frank Zalkow, Meinard Müller, Christian Dittmar




Listener-Oriented Consequences of Predictability-Based Acoustic Adjustment

Omnia Ibrahim, Ivan Yuen, Wei Xue, Bistra Andreeva, Bernd Möbius

Poster




Speech Recognition Errors in ASR Engines and Their Impact on Linguistic Analysis in Psychotherapies

Martha Schubert, Yamini Sinha, Julia Krüger, Ingo Siegert




Empirical Evaluation of ASR and NLU in a Multimodal Dialogue System for Survey Answering

Philipp L. Harnisch, Stefan Hillmann




Extending HAnS: Large Language Models for Question Answering, Summarization, and Topic Segmentation in an ML-based Learning Experience Platform

Thomas Ranzenberger, Tobias Bocklet, Steffen Freisinger, Munir Georges, Kevin Glocker, Aaricia Herygers, Korbinian Riedhammer, Fabian Schneider, Christopher Simic, Khabbab Zakaria




The Influence of Signal Segmentation Methods on Rhythm-Based Speaker Recognition

Neda Mousavi, Sven Grawunder




Unsupervised Emotional Pattern Recognition Using Rhythmic and Vocal Features

Neda Mousavi, Seyyed Saeed Sarfjoo, Sven Grawunder