ESSV Archive

Studientexte zur Sprachkommunikation Band 99: Elektronische Sprachsignalverarbeitung 2021

Conference proceedings of the 32nd conference in Berlin with 41 contributions. Editors: Stefan Hillmann, Benjamin Weiss, Thilo Michael, Sebastian Möller. ISBN: 978-3-959082-27-3

Paralinguistik

Speech Signal Compression Deteriorates Acoustic Cues to Perceived Speaker Charisma

Ingo Siegert, Oliver Niebuhr

‘Alexa, who are you?’ – Analysing Alexa’s, Cortana’s and Siri’s Vocal Personality

Anabell Hacker

Pseudo-Labelling and Transfer Learning Based Speech Emotion Recognition

Siddarth Venkateswaran, Ronald Böck, Thomas Keßler, Ossmane Krini

Emotion Bias in Automatic Speech Recognition

Lara-Sophie Christmann

Postersession 1

Age Classification: Comparison of Human vs Machine in Prompted and Spontaneous Speech

Felix Burkhardt, Markus Brückl, Björn W. Schuller

Cross-Lingual Acoustic Modeling in Upper Sorbian – Preliminary Study

Ivan Kraljevski, Marek Rjelka, Frank Duckhorn, Constanze Tschöpe, Matthias Wolff

Real-Time Implementation, Comparison, and Combination of Pitch Tracking Algorithms

Janina Reuter, Merikan Koyun, Christoph Daniel Schulze, Reinhard von Hanxleden

Human pause detection in spontaneous speech in an online experiment

Jürgen Trouvain, Raphael Werner

Formalisierung und Implementierung einer adaptiven kognitiven Architektur unter Verwendung von Strukturdiagrammen

Werner Meyer, Borislav Borislavov, Friedrich Eckert, Christian Richter, Ronald Römer, Peter beim Graben, Markus Huber, Matthias Wolff

Audio and Video Processing of UAV-Based Signals in the Harmonic Project

Oliver Jokisch, Tilo Strutz, Alexander Leipnitz, Ingo Siegert, Andrey Ronzhin

Automatic-Subtitling: Comparison on the Performance of Forced Alignment and Automatic Speech Recognition

Mino Lee Sasse, Stefan Schaffer, Aaron Ruß

Artificial Bandwidth Extension using a Glottal Excitation Model

Sebastian Barth, Simon Stone, Peter Birkholz

Automatische Spracherkennung

Towards reliability-guided information integration in audio-visual speech recognition

Wentao Yu, Steffen Zeiler, Dorothea Kolossa

On the Optimal Set of Features and the Robustness of Classifiers in Radar-based Silent Phoneme Recognition

Pouriya Amini Digehsara, Christoph Wagner, Petr Schaffer, Michael Bärhold, Simon Stone, Dirk Plettemeier, Peter Birkholz

Investigating the scarce data and resources problem for speech recognition using transfer learning and data augmentation

Fahrettin Gökgöz, Mahmoud Hashem

Open source automatic lecture subtitling

Benjamin Milde, Robert Geislinger, Irina Lindt, Timo Baumann

Keynote 2

Towards Socially Interactive Agents with Explanatory Skill

Elisabeth André

Phonetik und Artikulation

Cortical Segmentation of Syllables

Harald Höge

TargetOptimizer 2.0: Enhanced Estimation of Articulatory Targets

Paul Konstantin Krug, Simon Stone, Alexander Wilbrandt, Peter Birkholz

Phonetic convergence evaluation based on fundamental frequency variability

Bistra Andreeva, Grazyna Demenko, Jolanta Bachan, Iona Gessinger, Karolina Jankowska, Bernd Möbius

Glottal Closure Instant Detection using Echo State Networks

Peter Steiner, Ian S. Howard, Peter Birkholz

Machine Learning analysis of speech and EGG for the diagnosis of voice pathology

Ian S. Howard, Julian McGlashan, Adrian J. Fourcin

Predictive articulatory speech synthesis with semantic discrimination

Paul Schmidt-Barbo, Elnaz Shafaei-Bajestan, Konstantin Sering

Postersession 2

The effect of Lombard speech modifications in different information density contexts

Omnia Ibrahim, Ivan Yuen, Marjolein van Os, Bistra Andreeva, Bernd Möbius

Intents in Sprachdialogen: Eine Praxisperspektive

Benjamin Weiss, Stefan Hillmann, Sebastian Möller

VADiMoS: A Web Tool for Designing Voice Assistant Independent and Ontology Based Dialogs

Thomas Ranzenberger, Christian Hacker

Anticipatory coarticulation in predictive articulatory speech modeling

Konstantin Sering, Fabian Tomaschek, Motoki Saito

Developing the German Pronunciation Database (DAD) - an online dictionary for spoken German

Alexandra Ebel, Johannes Förster, Mathias Walther

Untersuchung von Qualitätsunterschieden zwischen gesprochener und geschriebener Sprache bei Interaktion mit einem Chatbot

Marco Braune

Der Faktor Mensch in der Mensch-Maschine-Interaktion

Daniel Duran, Sarah Warchhold

Sprachdialog

Comparison of Training Behaviour and Performance of Reinforcement Learning based Policies for Dialogue Management

Stefan Hillmann, Tilo Himmelsbach, Benjamin Weiss

Comparing BERT with an intent based question answering setup for open-ended questions in the museum domain

Md. Mahmud-Uz-Zaman, Stefan Schaffer, Tatjana Scheffler

Eine Maschinensemiotische Pertinenz-Architektur für ein menschenzentriertes User-Interface

Peter Klimczak, Markus Huber, Peter beim Graben, Günther Wirsching

Normalisierungsmethoden für Intent Erkennung Modularer Dialogsysteme

Jan Nehring, Akhyar Ahmed

Keynote 3

All Interaction is Situated, All Language is Grounded: Implications for the Design of Spoken Dialogue Systems

David Schlangen

Sprachsynthese

Natural and synthetic speech comprehension in simulated tonal and pulsatile tinnitus: A pilot study

Jacek Kudera, Marjolein van Os, Bernd Möbius

Knock-Knock! Who’s There? The Laughter-Enhanced Virtual Real-Estate Agent

Bogdan Ludusan, Petra Wagner

Evaluating the effect of pauses on number recollection in synthesized speech

Mikey Elmers, Raphael Werner, Beeke Muhlack, Bernd Möbius, Jürgen Trouvain

Sprachsignalverarbeitung und Evaluation

Prediction of Background Noise Degradations in Fullband Speech Communication Scenarios

Sebastian Möller, Andreas Bütow

Studie zur Lösbarkeit des Problems starker Pegelschwankungen im Home-Entertainment

Georg Schmidt, Ingo Siegert

Intelligibility in Telephone Conversations with Packet Loss

Thilo Michael
