ESSV Archive

Studientexte zur Sprachkommunikation Band 113: Elektronische Sprachsignalverarbeitung 2026

Conference proceedings of the 37st conference in Eichstätt with 33 contributions. Editor(s): Günther Wirsching ISBN: 978-3-95908-834-3

Cover of the ESSV 2026 proceedings book.

Hauptvortrag

Über den Versuch, ein totgesagtes Pferd zu reiten. Sprachtechnologie in der DDR im zeitlichen und räumlichen Kontext

Rüdiger Hoffmann

Is speech recognition “artificial intelligence”? A historical examination of academic branding

Thomas Haigh

Constructions, Computation and Creativity: How different is LLM and human linguistic creativity? How different is LLM and human linguistic creativity?

Thomas Hoffmann

Formale Semantik im offenen semantischen Raum

Hans Rudolf Straub

Speech Signal Recognition and Enhancement

Enhancing ASR for German Medical Domain without Fine-Tuning

Abdullah Al Foysal, Ronald Böck

Iterative Ambient-Signal-Aware Speech Enhancement via Cascaded DNN Processing without Retraining

Nilesh Madhu

An Approach to Improving Robustness in Dynamic Acoustic Environments: Context Noise Representation Learning for Urban Speech Emotion Recognition

Lisa Winkler, Andreas Wendemuth

Im Raum der Täuschung - Raumhall als Schwachstelle automatischer Deepfake-Erkennung

Sophie Hoppe, Anabell Hacker, Markus Brückl

Speech Analysis I

Zur Transkription mündlicher Phänomene in der politischen Sprache

Marcella Palladino

Dialektale Vielfalt in visuellen und auditiven Illustrationen: „Nordwind und Sonne“ in saarländischen Dialekten

Jürgen Trouvain

The Emotional Portrayal of an Ordinary Talk

Neda Mousavi, Felix Burkhardt

Evaluating full automation of formant extraction in the German Plapper Corpus

Robert Fromont, Jennifer Hay, Daniel Duran, Allie Osborne, Melanie Weirich, Miriam Oschkinat, Stefanie Jannedy

Speech Synthesis

From Writing to Speaking: on the Limits of Text-Trained Authorship Models for Speech Transcripts

Yamini Sinha, Ingo Siegert

A Servo-Motor-Actuated Artificial Lung for Robotic Speech Production

Ian S. Howard

Joint Estimation of Source and Filter Parameters for Speaker Adaptation in Articulatory Speech Synthesis

Tianyi Zhang, Peter Birkholz

TensorTract3: Pushing the Limits of Articulatory Speech Synthesis

Paul Kontantin Krug, Christoph Wagner, Peter Birkholz, Timo Stich

Speech Analysis II

How well can LLMs handle novel phonetic forms?

Daniel Duran, Laurens Winkler, Sina Zarrieß

ASR-based Automatic Assessment of Oral Production Tasks in Multilingual Children

Eugenia Rykova, Tanja Rinker, Angela Grimm

Measuring User Acceptance of Proactively Played Touristic Texts in an In-Car Voice Assistant

Niklas Berensmeyer, Stefan Hillmann, Wolfgang Maier

Voice, Language and Cognition

Creating Documents with Voice: Maybe it is not about Transcription but Reflection?

Matthias Busch, Jonas Schewior, Andreas Wendemuth, Ingo Siegert

Think Like a Team: Graph-based Representation of Shared Mental Models in Human-Agent Collaboration

Moinam Chatterjee, Behnam Ensan, Andreas Wendemuth, Ayoub Al-Hamadi

Ein konzeptioneller Beitrag zur Entwicklung und Nachbildung von Problemlöse- und Sprachfähigkeiten kognitiver Agenten

Ronald Römer, Johannes F. Kuhn, Markus Huber-Liebl, Peter B. Graben, Matthias Wolff

A Modular Multimodal Dialog Architecture for Digital PROM Collection

Stefan Hillmann, Philipp Harnisch, Daniel Schuhmann, Navid Ashrafi, Jan-Niklas Voigt-Antons

Posters

Towards a Brain-Computer Interface Modelling the Phonological Short-Term Memory

Harald Höge

Feature-Enhanced Consensus Graph Model for EEG-based Imagined Word Recognition

Syed Hur Abbas, Peter Birkholz, Muhammad Arif

Außerparlamentarische politische Kommunikation: Datenerhebung und Analyseperspektiven

Marcella Palladino, Vincenzo Gannuscio

Evaluation of WebRTC as a Framework for Voice Recordings in Online Surveys

Anabell Hacker, Iris Sidonie Bakker, Ingo Siegert

Automatic Detection of Disfluencies in L1 and L2 Child Speech

Martha Schubert, Valentin Kany

Assessing Speaking Modes in Radio News Using Topic Classification and Acoustic Parameters

Sven Grawunder, Ute Gradmann

Self-Supervised Multi-Task Learning for Enhanced Prosody Prediction in German Articulatory Speech Synthesis

Zihao Huang, Tianyi Zhang, Peter Birkholz

Parameter Optimization for Administration-Specific Speech Transcription with the Faster Whisper System

Robin Bitterlich, Paul Böhm, Oliver Jokisch

Show & Tell

HAnS: Multimodal RAG-based Persona Generation for Media and Documents in E-Learning

Thomas Ranzenberger, Steffen Freisinger, Tobias Bocklet, Korbinian Riedhammer

Alphaspeech Transcribe – eine autonome, containerisierte Speech-to-Text-Plattform für professionelle Transkriptions- und Dokumentationsworkflows

Felix Gräßer, Robert Wardenga, Dominik Jülg, Christian Gaida, Rico Petrick

ESSV Konferenz Elektronische Sprachsignalverarbeitung