ESSV Archive

Studientexte zur Sprachkommunikation Band 93: Elektronische Sprachsignalverarbeitung 2019

Conference proceedings of the 30st conference in Dresden with 45 contributions. Editor(s): Peter Birkholz, Simon Stone ISBN: 978-3-959081-57-3

Festvortrag

50 years Institute of Acoustics and Speech Communication – 30 years Conference Electronic Speech Signal Processing – 20 years Historic Acoustic-Phonetic Collection

Rüdiger Hoffmann

Spracherkennung und -wahrnehmung

Investigation of densely connected convolutional networks with domain adversarial learning for noise robust speech recognition

Chia Yu Li, Ngoc Thang Vu

Convolutional neural networks can learn duration for detecting pitch accents and lexical stress

Sabrina Stehwien, Antje Schweitzer, Ngoc Thang Vu

Perception of German tense and lax vowel contrast by Chinese learners

Yingmin Gao, Hongwei Ding, Peter Birkholz, Rainer Jäckel, Yi Lin

Hauptvortrag

Silent speech interfaces for speech restoration: current status and future challenges

José Andrés González López

Dialogsysteme

Semi-automatische Generierung und Reinforcement Learning basiertes Training eines Dialogmanagers

Stefan Hillmann, Klaus-Peter Engelbrecht, Benjamin Weiss

Comparing phonetic changes in computer-directed and human-directed speech

Eran Raveh, Ingmar Steiner, Ingo Siegert, Iona Gessinger, Bernd Möbius

Analysis and categorization of corrections in multilingual spoken dialogue system

Ivan Kraljevski, Diane Hirschfeld

Hauptvortrag

Speech and voice identity recognition in the human brain

Katharina von Kriegstein

Gehirn und kognitive Modelle

Lernen durch Differenz. Zur logisch-mathematischen Struktur maschinellen Lernens

Peter Klimczak, Günther Wirsching, Matthias Wolff

Extraction of the Ɵ- and ɤ-cycles active in human speech processing from an articulatory speech database

Harald Höge

Bidirektionale Utterance-Meaning-Transducer für Zahlworte durch kompositionale minimalistische Grammatiken

Peter beim Graben, Werner Meyer, Ronald Römer, Matthias Wolff

Hauptvortrag

In Articulation for Diversity

Korin Richmond

Sprachsynthese

Comparison of different methods for the voiced excitation of physical vocal tract models

Peter Birkholz, Simon Stone, Steffen Kürbis

Resynthesizing the GECO speech corpus with VocalTractLab

Konstantin Sering, Niels Stehwien, Yingming Gao, Martin V. Butz, Harald Baayen

How should Pepper sound - Preliminary investigations on robot vocalizations

Felix Burkhardt, Milenko Saponja, Julian Sessner, Benjamin Weiss

Hauptvortrag

Sprache von Produktgeräuschen – Mensch-Produkt Interaktion

Ercan Altinsoy

Medizinische Anwendungen

Influence of speech activity on vibrometer signals to extract vital parameters of humans

Kristian Kroschel, Jürgen Metzler

Schnelle Regelung eines monolithischen vollimplantierbaren Hörgeräts

Till Moritz Eßinger, Martin Koch, Matthias Bornitz, Hannes Seidler, Marcus Neudert, Thomas Zahnert

Poster und Demonstrationen

The restaurant booking corpus – content-identical comparative human-human and humancomputer simulated telephone conversations

Ingo Siegert, Jannik Nietzold, Ralph Heinemann, Andreas Wendemuth

ReTiCo: An open-source framework for modeling real-time conversations in spoken dialogue systems

Thilo Michael, Sebastian Möller

Segmenting multi-intent queries for spoken language understanding

Rohan Shet, Elena Davcheva, Christian Uhle

Exploration and assessment of proactive use cases for an in-car voice assistant

Maria Schmidt, Daniela Stier, Steffen Werner, Wolfgang Minker

Analysis of the influence of different room acoustics on acoustic emotion features

Juliane Höbel-Müller, Ingo Siegert, Ralph Heinemann, Alicia Flores Requardt, Michael Tornow, Andreas Wendemuth

Vergleich verschiedener Machine-Learning Ansätze zur kontinuierlichen Schätzung von perzeptivem Sprechtempo

Benjamin Weiss, Thilo Michael, Uwe Reichel, Oliver Pauly

IMS-speech: A speech to text tool

Pavel Denisov, Ngoc Thang Vu

Schätzung der spektralen Einhüllenden – Ein Vergleich von tiefen neuronalen Netzen und Codebüchern

Christopher Seitz, Mohammed Krini

Entscheidungstheoretische Modellierung der konsummatorischen Endhandlung – Vergleich von klassischen und quantenmechanischen Ansätzen

Ronald Römer, Peter beim Graben, Matthias Wolff

Multimodal speech segmentation using gaze data and spectrogram image features

Arif Khan, Ingmar Steiner

Design and deployment of multilingual industrial voice control applications

Ivan Kraljevski, M. Pohl, A. Gjoreski, U. Koloska, J. Wöhl, M. Wenzel, D. Hirschfeld

Drone sounds and environmental signals – a first review

Oliver Jokisch, Dominik Fischer

Surface stickiness and waviness of two-layer silicone structures for synthetic vocal folds

Falk Gabriel, Patrick Häsner, Eike Dohmen, Dmitry Borin, Peter Birkholz

A toolkit for nested multi-turn speech dialog in automotive environments

Timo Sowa, Soyuj Kumar Sahoo

Modell einer Frauenstimme für die artikulatorische Sprachsynthese mit VocalTractLab

Susanne Drechsel, Yingming Gao, Jens Frahm, Peter Birkholz

How to identify elliptical poems within a digital corpus of auditory poetry

Hussein Hussein, Burkhard Meyer-Sickendiek, Timo Baumann

Dynamic vocabulary with a Kaldi speech recognizer in a speech dialog system for automotive infotainment applications

Thomas Ranzenberger, Christian Hacker, Karl Weilhammer

Automatic vocal tract segmentation based on conditional generative adversarial neural network

Mohammad Eslami, Christiane Neuschaefer-Rube, Antoine Serrurier

Hauptvortrag

The myoelastic-aerodynamic theory of sound production in humans, mammals, and birds

Christian Herbst

Prosodie

Filled pause detection by prosodic discontinuity features

Uwe D. Reichel, Benjamin Weiss, Thilo Michael

Zur Annotation nicht-verbaler Vokalisierungen in Korpora gesprochener Sprache

Jürgen Trouvain, Malte Belz

Towards ordinal classification of voice quality features with acoustic parameters

Felix Schaeffler, Matthias Eichner, Janet Beck

Sprachproduktion

Analysis of coarticulation using EMA data with a statistical shape space model of the tongue

Alexander Hewer, Ingmar Steiner, Korin Richmond

Modelling vowel acquisition using the Birkholz synthesizer

Ian S. Howard, Peter Birkholz

Influence of the vocal tract morphology on the F1-F2 acoustic plane

Antoine Serrurier, Pierre Badin, Christiane Neuschaefer-Rube

Numerische Studie zum Einfluss laryngealer Areale auf individuelle und allgemeine akustische Eigenschaften des menschlichen Vokaltrakts bei gehaltenen Vokalen

Mario Fleischer, Alexander Mainka, Dirk Mürbe

ESSV Konferenz Elektronische Sprachsignalverarbeitung