ESSV Konferenz Elektronische Sprachsignalverarbeitung

Studientexte zur Sprachkommunikation Band 105: Elektronische Sprachsignalverarbeitung 2023


Conference proceedings of the 34st conference in München with 32 contributions. Editor(s): Christoph Draxler ISBN: 978-3-95908-303-4

Visualisation




Comparison of Object Tracking Algorithms for Larynx Phantom Movements in Ultrasound Videos

Christian Kleiner, Peter Birkholz




Anwendung des MFCC-Plotters zur Erfassung cepstraler Unterschiede in emotionaler Sprache

Frederick Kukla, Vanessa Reichel




Analysis of Transcriptions Using Octra – A Pilot Study

Christoph Draxler

Interaction & Dialogue




How May I Interrupt? Linguistic Design Guidelines for Proactive In-Car Voice Assistants

Anna-Maria Meck




Automatic User Experience Evaluation of Goal-Oriented Dialogs Using Pre- Trained Language Models

Mika Rebensburg, Stefan Hillmann, Nils Feldhus




A Comparison of Module Selection Strategies for Modular Dialog Systems

Philine Görzig, Jan Nehring, Stefan Hillmann, Sebastian Möller




Automatic Generation of Website-Based Multi-Turn Question-Answering Dialog Systems

Stefan Hillmann, Philine Görzig, Sebastian Möller

Emotion




Going Retro: Astonishingly Simple Yet Effective Rule-Based Prosody Modelling for Speech Synthesis Simulating Emotion Dimensions

Felix Burkhardt, Uwe Reichel, Florian Eyben, Björn Schuller




Cross-Reliability Benchmark Test for Preserving Emotional Content in Speech–Synthesis Related Datasets

Jan Hintz, Andreas Wendemuth, Ingo Siegert

Child Speech




Collecting and Annotating Natural Child Speech Data – Challenges and Interdisciplinary Perspectives

Hanna Ehlert, Edith Beaulac, Maren Wallbaum, Christopher Gebauer, Lars Rumberg, Jörn Ostermann, Ulrike Lüdtke




Pronunciation Modelling for Children’s Speech

Christopher Gebauer, Lars Rumberg, Jörn Ostermann

Phonetics




A First Report on a Perceptual Trainig Study Using Percy

Birgitte Poulsen, Ocke-Schwen Bohn, Christoph Draxler




Muster der Sprechatmung in verschiedenen Sprechstilen – Eine Pilotstudie

Jürgen Trouvain, Raphael Werner




An Automatic Method for Speech Breathing Annotation

Alexis Deighton Macintyre, Raphael Werner




Hesitation Lengthening Elicitation and Detection via Target Words in a Card Game Study

Simon Betz

Speech Pathology




Somatosensory Feedback in PAULE

Konstantin Sering, Paul Schmidt-Barbo




Concept for Semantic Error Analysis in a Mobile Application for Speech and Language Therapy Support

Eugenia Rykova, Mathias Walther




RehaLingo – Towards a Speech Training System for Aphasia

Hans-Günter Hirsch, Christian Neumann, Yannic Tiggelkamp, Riccardo Fiorista, Stefan Knecht, Alfons Schnitzler, Katja Biermann-Ruben, Dietmar Bothe, Günter Bleimann, Hendrike Frieg

Automatic Speech Recognition




Training a CNN to Estimate Voice Pathology from Connected Speech Using EGG to Automatically Label the Dataset for Voicing

Ian S. Howard, Julian Mcglashan, Adrian J. Fourcin




Implementing Easy-to-Use Recipes for the Switchboard Benchmark

Dominik Wagner, Sebastian P. Bayerl, Tobias Bocklet




Bias in Flemish Automatic Speech Recognition

Aaricia Herygers, Vass Verkhodanova, Matt Coler, Odette Scharenborg, Munir Georges

Show and Tell




Nkululeko: A Template Based System for Fast Machine Learning Experiments on Speaker Characteristics

Felix Burkhardt




The Hochschul-Assistenz-System HanS: an ML-Based Learning Experience Platform

Thomas Ranzenberger, Tobias Bocklet, Steffen Freisinger, Lia Frischholz, Munir Georges, Kevin Glocker, Aaricia Herygers, René Peinl, Korbinian Riedhammer, Fabian Schneider, Christopher Simic, Khabbab Zakaria




Transcription Portal – A Zero-configuration Workbench for Transcribing Spoken Language Recordings

Christoph Draxler, Julian Pömp

Speech Synthesis and Production




Articulatory Speech Synthesis in the Context of Speech Research and Speech Technology: Review and Prospect

Bernd J. Kröger




Can Deep Learning Help to Understand Speech Production Mechanisms?

Antoine Serrurier




Synchrony of Θ - Oscillations in Speech Perception and Speech Production

Harald Höge

Poster




Persian Speaker Classification Using Rhythmic Features

Neda Mousavi, Sven Grawunder




Approach to Speaker-Generalized Spectral Envelope Estimation by Deep Recurrent Neural Network for Speech Reconstruction in a Speech Enhancement System

Stefan Ciba, Mohammed Krini, Amir Rajabi




iDOKS: Ein integriertes Dokumentationssystem zur Zusammenfassung von Gesprächen und Meetings

Robert Wardenga, Daniel Vogel, Felix Gräßer, Mira Schielke, Leonard Starke, Rico Petrick, Torsten Rex, Jens Lehmann




Adapters for Resource-Efficient Deployment of NLU Models

Jan Nehring, Nils Feldhus, Akhyar Ahmed




Radlogistik als Anwendungsgebiet für Digitale Sprachassistenten – Ein Diskussionsbeitrag

Matthias Busch, Malte Kania, Tom Assmann, Ingo Siegert