ESSV Archive

Studientexte zur Sprachkommunikation Band 105: Elektronische Sprachsignalverarbeitung 2023

Conference proceedings of the 34th conference in München with 32 contributions. Editor(s): Christoph Draxler. ISBN: 978-3-95908-303-4


Visualisation

Comparison of Object Tracking Algorithms for Larynx Phantom Movements in Ultrasound Videos

Christian Kleiner, Peter Birkholz

Anwendung des MFCC-Plotters zur Erfassung cepstraler Unterschiede in emotionaler Sprache

Frederick Kukla, Vanessa Reichel

Analysis of Transcriptions Using Octra – A Pilot Study

Christoph Draxler

Interaction & Dialogue

How May I Interrupt? Linguistic Design Guidelines for Proactive In-Car Voice Assistants

Anna-Maria Meck

Automatic User Experience Evaluation of Goal-Oriented Dialogs Using Pre-Trained Language Models

Mika Rebensburg, Stefan Hillmann, Nils Feldhus

A Comparison of Module Selection Strategies for Modular Dialog Systems

Philine Görzig, Jan Nehring, Stefan Hillmann, Sebastian Möller

Automatic Generation of Website-Based Multi-Turn Question-Answering Dialog Systems

Stefan Hillmann, Philine Görzig, Sebastian Möller

Emotion

Going Retro: Astonishingly Simple Yet Effective Rule-Based Prosody Modelling for Speech Synthesis Simulating Emotion Dimensions

Felix Burkhardt, Uwe Reichel, Florian Eyben, Björn Schuller

Cross-Reliability Benchmark Test for Preserving Emotional Content in Speech-Synthesis Related Datasets

Jan Hintz, Andreas Wendemuth, Ingo Siegert

Child Speech

Collecting and Annotating Natural Child Speech Data – Challenges and Interdisciplinary Perspectives

Hanna Ehlert, Edith Beaulac, Maren Wallbaum, Christopher Gebauer, Lars Rumberg, Jörn Ostermann, Ulrike Lüdtke

Pronunciation Modelling for Children’s Speech

Christopher Gebauer, Lars Rumberg, Jörn Ostermann

Phonetics

A First Report on a Perceptual Training Study Using Percy

Birgitte Poulsen, Ocke-Schwen Bohn, Christoph Draxler

Muster der Sprechatmung in verschiedenen Sprechstilen – Eine Pilotstudie

Jürgen Trouvain, Raphael Werner

An Automatic Method for Speech Breathing Annotation

Alexis Deighton Macintyre, Raphael Werner

Hesitation Lengthening Elicitation and Detection via Target Words in a Card Game Study

Simon Betz

Speech Pathology

Somatosensory Feedback in PAULE

Konstantin Sering, Paul Schmidt-Barbo

Concept for Semantic Error Analysis in a Mobile Application for Speech and Language Therapy Support

Eugenia Rykova, Mathias Walther

RehaLingo – Towards a Speech Training System for Aphasia

Hans-Günter Hirsch, Christian Neumann, Yannic Tiggelkamp, Riccardo Fiorista, Stefan Knecht, Alfons Schnitzler, Katja Biermann-Ruben, Dietmar Bothe, Günter Bleimann, Hendrike Frieg

Automatic Speech Recognition

Training a CNN to Estimate Voice Pathology from Connected Speech Using EGG to Automatically Label the Dataset for Voicing

Ian S. Howard, Julian McGlashan, Adrian J. Fourcin

Implementing Easy-to-Use Recipes for the Switchboard Benchmark

Dominik Wagner, Sebastian P. Bayerl, Tobias Bocklet

Bias in Flemish Automatic Speech Recognition

Aaricia Herygers, Vass Verkhodanova, Matt Coler, Odette Scharenborg, Munir Georges

Show and Tell

Nkululeko: A Template Based System for Fast Machine Learning Experiments on Speaker Characteristics

Felix Burkhardt

The Hochschul-Assistenz-System HanS: an ML-Based Learning Experience Platform

Thomas Ranzenberger, Tobias Bocklet, Steffen Freisinger, Lia Frischholz, Munir Georges, Kevin Glocker, Aaricia Herygers, René Peinl, Korbinian Riedhammer, Fabian Schneider, Christopher Simic, Khabbab Zakaria

Transcription Portal – A Zero-configuration Workbench for Transcribing Spoken Language Recordings

Christoph Draxler, Julian Pömp

Speech Synthesis and Production

Articulatory Speech Synthesis in the Context of Speech Research and Speech Technology: Review and Prospect

Bernd J. Kröger

Can Deep Learning Help to Understand Speech Production Mechanisms?

Antoine Serrurier

Synchrony of Θ-Oscillations in Speech Perception and Speech Production

Harald Höge

Poster

Persian Speaker Classification Using Rhythmic Features

Neda Mousavi, Sven Grawunder

Approach to Speaker-Generalized Spectral Envelope Estimation by Deep Recurrent Neural Network for Speech Reconstruction in a Speech Enhancement System

Stefan Ciba, Mohammed Krini, Amir Rajabi

iDOKS: Ein integriertes Dokumentationssystem zur Zusammenfassung von Gesprächen und Meetings

Robert Wardenga, Daniel Vogel, Felix Gräßer, Mira Schielke, Leonard Starke, Rico Petrick, Torsten Rex, Jens Lehmann

Adapters for Resource-Efficient Deployment of NLU Models

Jan Nehring, Nils Feldhus, Akhyar Ahmed

Radlogistik als Anwendungsgebiet für Digitale Sprachassistenten – Ein Diskussionsbeitrag

Matthias Busch, Malte Kania, Tom Assmann, Ingo Siegert
