ESSV Archive

Studientexte zur Sprachkommunikation Band 107: Elektronische Sprachsignalverarbeitung 2024

Conference proceedings of the 35st conference in Regensburg with 31 contributions. Editor(s): Timo Baumann ISBN: 978-3-95908-325-6 Some of the articles in this volume are not available as PDF files. If you are interested in these individual contributions, the volume can be bought or borrowed below.

Cover of the ESSV 2024 proceedings book.

Hauptvortrag

More Than Words: Advancements and Challenges in Speech Recognition for Singing

Anna Kruspe

Linguistic Politeness in Artificial Conversational Agents

Hendrik Buschmeier

Chatbots und Dialogsysteme

Chatbot in the Museum - A Field Study of User Experience and Modality Usage

Stefan Schaffer, Eva Schwaetzer, Aaron Ruß, Oliver Gustke

Usability and User Experience of a Chatbot for Student Support

Stefan Hillmann, Philine Kowol, Adnan Ahmad, Ruochen Tang, Sebastian Möller

Interaktionsverhalten eines Avatars im digitalen sprachtherapeutischen Setting

Mathias Walther, Elisabeth Zeuner, Eugenia Rykova

Review of Usage and Potentials of Conversational Interfaces at Universities and in Students Daily Lifes

Lea Kisser, Matthias Busch, Ingo Siegert

Phonetische Untersuchungen

Perception of Formant Distortion in German Words and Non-words

Uliana Eliseeva, Ivan Yuen, Bernd Möbius

Synchrony of Glottal Area Waveform Parameters During the Production of Obstruents in Vowel Context

Joao Vitor Possamai De Menezes, Christian Kleiner, Marie-Anne Kainz, Matthias Echternach, Peter Birkholz

Computergestützte Bestimmung des Sprechflusses bei Vorschulkindern

Valentin Kany, Jürgen Trouvain

The Use of Temporal Features in Cortical Segmentation of Syllables

Harald Höge

Spracherkennung und -verstehen

Epsilon-Verarbeitung bei Minimalistischen Grammatiken für Zahlen .

Johannes Kuhn, Matthias Wolff, Borislav Borislavov

NoiSLU: A Noisy Speech Corpus for Spoken Language Understanding in the Public Transport Domain

Mariano Frohnmaier, Steffen Freisinger, Madeline Faye Holt, Munir Georges

Ein quantenlogisch motivierter Ansatz zur Verarbeitung von Äußerungs- Bedeutungspaaren

Markus Huber-Liebl, Günther Wirsching

Octra Backend - Eine skalierbare Infrastruktur für Transkriptionsprojekte

Christoph Draxler, Julian Pömp

Paralinguistische Analysen

An Investigation of Acoustic Features of the Lower Vocal Tract for Speaker Recognition .

Peter Birkholz, Xinyu Zhang

Towards Speech Privacy Assessment for Voice Assistants: Exploring Subjective and Objective Measures for Babble Noise

Anjana Rajasekhar, Anna Leschanowsky, Nils Peters

Konzept und Evaluation eines Softwaresystems zur Unterstützung der CRM-basierten Sprechwirkungsuntersuchung

Thorben Frank Jahnke, Corinna Sonnen, Mathias Walther

In Tune With In-Poco? A New Device for Analyzing and Training the Interplay of Body Posture and Charismatic Speech Prosody

Tobias Blaabjerg Karlsen, Karl Jhon Decuzar De Castro, Emils Pipars, Iyad Ahed Abdelrahman Abdel Qader, Jose Dumitru Ilinca Sainz, Simas Srugys, Oliver Niebuhr

Large Language Models

Can Language Models Behave Like Wine Sommeliers? Using Multiple Agents To Evaluate The Quality of Wine Descriptors Generated By Llama 2

Siddarth Venkateswaran, Ronald Böck

Supervised vs. Zero-Shot Learning Automatic Classification of Comments on Educational Videos Using Pre-Trained Language Models

Benedict Kettler, Stefan Hillmann

Is there Text in Wine? - S+U Learning-Based Named Entity Recognition and Triplet Extraction from Wine Aroma Descriptors

Siddarth Venkateswaran, Abdullah Al Foysal, Nazeer Basha Shaik, Ronald Böck

Can We See Your Response Before You Speak? Exploring Linguistic Information Found in Inter-Turn Pauses

Christian Schuler, Shravan Nayak, Debjoy Saha, Timo Baumann

Sprachsynthese und Hörpräferenzen

Speech/Non-Speech Classification Slightly Improves Synthesis Quality in PAULE

Konstantin Sering

Evaluation of Audio Deepfakes - Systematic Review

Yamini Sinha, Jan Hintz, Ingo Siegert

Evaluating the Impact of Prosody Feature Normalization on the Controllability of Pitch in Speech Synthesis

Judith Bauer, Frank Zalkow, Meinard Müller, Christian Dittmar

Listener-Oriented Consequences of Predictability-Based Acoustic Adjustment

Omnia Ibrahim, Ivan Yuen, Wei Xue, Bistra Andreeva, Bernd Möbius

Poster

Speech Recognition Errors in ASR Engines and Their Impact on Linguistic Analysis in Psychotherapies

Martha Schubert, Yamini Sinha, Julia Krüger, Ingo Siegert

Empirical Evaluation of ASR and NLU in a Multimodal Dialogue System for Survey Answering

Philipp L. Harnisch, Stefan Hillmann

Extending HAnS: Large Language Models for Question Answering, Summarization, and Topic Segmentation in an ML-based Learning Experience Platform

Thomas Ranzenberger, Tobias Bocklet, Steffen Freisinger, Munir Georges, Kevin Glocker, Aaricia Herygers, Korbinian Riedhammer, Fabian Schneider, Christopher Simic, Khabbab Zakaria

The Influence of Signal Segmentation Methods on Rhythm-Based Speaker Recognition

Neda Mousavi, Sven Grawunder

Unsupervised Emotional Pattern Recognition Using Rhythmic and Vocal Features

Neda Mousavi, Seyyed Saeed Sarfjoo, Sven Grawunder

ESSV Konferenz Elektronische Sprachsignalverarbeitung