ESSV Konferenz Elektronische Sprachsignalverarbeitung

Studientexte zur Sprachkommunikation Band 113: Elektronische Sprachsignalverarbeitung 2026


Conference proceedings of the 37st conference in Eichstätt with 33 contributions. Editor(s): Günther Wirsching ISBN: 978-3-95908-834-3
Cover of the ESSV 2026 proceedings book.

Hauptvortrag




Über den Versuch, ein totgesagtes Pferd zu reiten. Sprachtechnologie in der DDR im zeitlichen und räumlichen Kontext

Rüdiger Hoffmann




Is speech recognition “artificial intelligence”? A historical examination of academic branding

Thomas Haigh




Constructions, Computation and Creativity: How different is LLM and human linguistic creativity? How different is LLM and human linguistic creativity?

Thomas Hoffmann




Formale Semantik im offenen semantischen Raum

Hans Rudolf Straub

Speech Signal Recognition and Enhancement




Enhancing ASR for German Medical Domain without Fine-Tuning

Abdullah Al Foysal, Ronald Böck




Iterative Ambient-Signal-Aware Speech Enhancement via Cascaded DNN Processing without Retraining

Nilesh Madhu




An Approach to Improving Robustness in Dynamic Acoustic Environments: Context Noise Representation Learning for Urban Speech Emotion Recognition

Lisa Winkler, Andreas Wendemuth




Im Raum der Täuschung - Raumhall als Schwachstelle automatischer Deepfake-Erkennung

Sophie Hoppe, Anabell Hacker, Markus Brückl

Speech Analysis I




Zur Transkription mündlicher Phänomene in der politischen Sprache

Marcella Palladino




Dialektale Vielfalt in visuellen und auditiven Illustrationen: „Nordwind und Sonne“ in saarländischen Dialekten

Jürgen Trouvain




The Emotional Portrayal of an Ordinary Talk

Neda Mousavi, Felix Burkhardt




Evaluating full automation of formant extraction in the German Plapper Corpus

Robert Fromont, Jennifer Hay, Daniel Duran, Allie Osborne, Melanie Weirich, Miriam Oschkinat, Stefanie Jannedy

Speech Synthesis




From Writing to Speaking: on the Limits of Text-Trained Authorship Models for Speech Transcripts

Yamini Sinha, Ingo Siegert




A Servo-Motor-Actuated Artificial Lung for Robotic Speech Production

Ian S. Howard




Joint Estimation of Source and Filter Parameters for Speaker Adaptation in Articulatory Speech Synthesis

Tianyi Zhang, Peter Birkholz




TensorTract3: Pushing the Limits of Articulatory Speech Synthesis

Paul Kontantin Krug, Christoph Wagner, Peter Birkholz, Timo Stich

Speech Analysis II




How well can LLMs handle novel phonetic forms?

Daniel Duran, Laurens Winkler, Sina Zarrieß




ASR-based Automatic Assessment of Oral Production Tasks in Multilingual Children

Eugenia Rykova, Tanja Rinker, Angela Grimm




Measuring User Acceptance of Proactively Played Touristic Texts in an In-Car Voice Assistant

Niklas Berensmeyer, Stefan Hillmann, Wolfgang Maier

Voice, Language and Cognition




Creating Documents with Voice: Maybe it is not about Transcription but Reflection?

Matthias Busch, Jonas Schewior, Andreas Wendemuth, Ingo Siegert




Think Like a Team: Graph-based Representation of Shared Mental Models in Human-Agent Collaboration

Moinam Chatterjee, Behnam Ensan, Andreas Wendemuth, Ayoub Al-Hamadi




Ein konzeptioneller Beitrag zur Entwicklung und Nachbildung von Problemlöse- und Sprachfähigkeiten kognitiver Agenten

Ronald Römer, Johannes F. Kuhn, Markus Huber-Liebl, Peter B. Graben, Matthias Wolff




A Modular Multimodal Dialog Architecture for Digital PROM Collection

Stefan Hillmann, Philipp Harnisch, Daniel Schuhmann, Navid Ashrafi, Jan-Niklas Voigt-Antons

Posters




Towards a Brain-Computer Interface Modelling the Phonological Short-Term Memory

Harald Höge




Feature-Enhanced Consensus Graph Model for EEG-based Imagined Word Recognition

Syed Hur Abbas, Peter Birkholz, Muhammad Arif




Außerparlamentarische politische Kommunikation: Datenerhebung und Analyseperspektiven

Marcella Palladino, Vincenzo Gannuscio




Evaluation of WebRTC as a Framework for Voice Recordings in Online Surveys

Anabell Hacker, Iris Sidonie Bakker, Ingo Siegert




Automatic Detection of Disfluencies in L1 and L2 Child Speech

Martha Schubert, Valentin Kany




Assessing Speaking Modes in Radio News Using Topic Classification and Acoustic Parameters

Sven Grawunder, Ute Gradmann




Self-Supervised Multi-Task Learning for Enhanced Prosody Prediction in German Articulatory Speech Synthesis

Zihao Huang, Tianyi Zhang, Peter Birkholz




Parameter Optimization for Administration-Specific Speech Transcription with the Faster Whisper System

Robin Bitterlich, Paul Böhm, Oliver Jokisch

Show & Tell




HAnS: Multimodal RAG-based Persona Generation for Media and Documents in E-Learning

Thomas Ranzenberger, Steffen Freisinger, Tobias Bocklet, Korbinian Riedhammer




Alphaspeech Transcribe – eine autonome, containerisierte Speech-to-Text-Plattform für professionelle Transkriptions- und Dokumentationsworkflows

Felix Gräßer, Robert Wardenga, Dominik Jülg, Christian Gaida, Rico Petrick