ESSV Konferenz Elektronische Sprachsignalverarbeitung

Studientexte zur Sprachkommunikation Band 99: Elektronische Sprachsignalverarbeitung 2021


Conference proceedings of the 32st conference in Berlin with 41 contributions. Editor(s): Stefan Hillmann, Benjamin Weiss, Thilo Michael, Sebastian Möller ISBN: 978-3-959082-27-3 Some of the articles in this volume are not available as PDF files. If you are interested in these individual contributions, the volume can be bought or borrowed below.

Paralinguistik




Speech Signal Compression Deteriorates Acoustic Cues to Perceived Speaker Charisma

Ingo Siegert, Oliver Niebuhr




`Alexa, who are you?´ – Analysing Alexa’s, Cortana’s and Siri’s Vocal Personality

Anabell Hacker




Pseudo-Labelling and Transfer Learning Based Speech Emotion Recognition

Siddarth Venkateswaran, Ronald Böck, Thomas Keßler, Ossmane Krini




Emotion Bias in Automatic Speech Recognition

Lara-Sophie Christmann

Postersession 1




Age Classification: Comparison of Human vs Machine in Prompted and Spontaneous Speech

Felix Burkhardt, Markus Brückl, Björn W. Schuller




Cross-Lingual Acoustic Modeling in Upper Sorbian – Preliminary Study

Ivan Kraljevski, Marek Rjelka, Frank Duckhorn, Constanze Tschöpe, Matthias Wolff




Real-Time Implementation, Comparison, and Combination of Pitch Tracking Algorithms

Janina Reuter, Merikan Koyun, Christoph Daniel Schulze, Reinhard Von Hanxleden




Human pause detection in spontaneous speech in an online experiment

Jürgen Trouvain, Raphael Werner




Formalisierung und Implementierung einer adaptiven kognitiven Architektur unter Verwendung von Strukturdiagrammen

Werner Meyer, Borislav Borislavov, Friedrich Eckert, Christian Richter, Ronald Römer, Peter beim Graben, Markus Huber, Matthias Wolff




Audio and Video Processing of UAV-Based Signals in the Harmonic Project

Oliver Jokisch, Tilo Strutz, Alexander Leipnitz, Ingo Siegert,, Andrey Ronzhin




Automatic-Subtitling: Comparison on the Performance of Forced Alignment and Automatic Speech Recognition

Mino Lee Sasse, Stefan Schaffer, Aaron Ruß




Artificial Bandwidth Extension using a Glottal Excitation Model

Sebastian Barth, Simon Stone, Peter Birkholz

Automatische Spracherkennung




Towards reliability-guided information integration in audio-visual speech recognition

Wentao Yu, Steffen Zeiler, Dorothea Kolossa




On the Optimal Set of Features and the Robustness of Classifiers in Radar-based Silent Phoneme Recognition

Pouriya Amini Digehsara, Christoph Wagner, Petr Schaffer, Michael Bärhold, Simon Stone, Dirk Plettemeier, Peter Birkholz




Investigating the scarce data and resources problem for speech recognition using transfer learning and data augmentation

Fahrettin Gökgöz, Mahmoud Hashem




Open source automatic lecture subtitling

Benjamin Milde, Robert Geislinger, Irina Lindt, Timo Baumann

Keynote 2




Towards Socially Interactive Agents with Explanatory Skill

Elisabeth André

Phonetik und Artikulation




Cortical Segmentation of Syllables

Harald Höge




TargetOptimizer 2.0: Enhanced Estimation of Articulatory Targets

Paul Konstantin Krug, Simon Stone, Alexander Wilbrandt, Peter Birkholz




Phonetic convergence evaluation based on fundamental frequency variability

Bistra Andreeva, Grazyna Demenko, Jolanta Bachan, Iona Gessinger, Karolina Jankowska, Bernd Möbius




Glottal Closure Instant Detection using Echo State Networks

Peter Steiner, Ian S. Howard, Peter Birkholz




Machine Learning analysis of speech and EGG for the diagnosis of voice pathology

Ian S. Howard, Julian Mcglashan, Adrian J. Fourcin




Predictive articulatory speech synthesis with semantic discrimination

Paul Schmidt-Barbo, Elnaz Shafaei-Bajestan, Konstantin Sering

Postersession 2




The effect of Lombard speech modifications in different information density contexts

Omnia Ibrahim, Ivan Yuen, Marjolein Van Os, Bistra Andreeva, Bernd Möbius




Intents in Sprachdialogen: Eine Praxisperspektive

Benjamin Weiss, Stefan Hillmann, Sebastian Möller




VADiMoS: A Web Tool for Designing Voice Assistant Independent and Ontology Based Dialogs

Thomas Ranzenberger, Christian Hacker




Anticipatory coarticulation in predictive articulatory speech modeling

Konstantin Sering, Fabian Tomaschek, Motoki Saito




Developing the German Pronunciation Database (DAD) - an online dictionary for spoken German

Alexandra Ebel, Johannes Förster, Mathias Walther




Untersuchung von Qualitätsunterschieden zwischen gesprochener und geschriebener Sprache bei Interaktion mit einem Chatbot

Marco Braune




Der Faktor Mensch in der Mensch-Maschine-Interaktion

Daniel Duran, Sarah Warchhold

Sprachdialog




Comparison of Training Behaviour and Performance of Reinforcement Learning based Policies for Dialogue Management Sprachdialog

Stefan Hillmann, Tilo Himmelsbach, Benjamin Weiss




Comparing BERT with an intent based question answering setup for open-ended questions in the museum domain

Md. Mahmud-Uz-Zaman, Stefan Schaffer, Tatjana Scheffler




Eine Maschinensemiotische Pertinetz-Architektur für ein menschenzentriertes User-Interface

Peter Klimczak, Markus Huber, Peter beim Graben, Günther Wirsching




Normalisierungsmethoden für Intent Erkennung Modularer Dialogsysteme

Jan Nehring, Akhyar Ahmed

Keynote 3




All Interaction is Situated, All Language is Grounded: Implications for the Design of Spoken Dialogue Systems

David Schlangen

Sprachsynthese




Natural and synthetic speech comprehension in simulated tonal and pulsatile tinnitus: A pilot study

Jacek Kudera, Marjolein Van Os, Bernd Möbius




Knock-Knock! Who’s There? The Laughter-Enhanced Virtual Real-Estate Agent

Bogdan Ludusan, Petra Wagner




Evaluating the effect of pauses on number recollection in synthesized speech

Mikey Elmers, Raphael Werner, Beeke Muhlack, Bernd Möbius, Jürgen Trouvain

Sprachsignalverarbeitung und Evaluation




Prediction of Background Noise Degradations in Fullband Speech Communication Scenarios

Sebastian Möller, Andreas Bütow




Studie zur Lösbarkeit des Problems starker Pegelschwankungen im Home-Entertainment

Georg Schmidt, Ingo Siegert




Intelligibility in Telephone Conversations with Packet Loss

Thilo Michael