ESSV Konferenz Elektronische Sprachsignalverarbeitung

Festvortrag



50 years Institute of Acoustics and Speech Communication – 30 years Conference Electronic Speech Signal Processing – 20 years Historic Acoustic-Phonetic Collection

Rüdiger Hoffmann

Spracherkennung und -wahrnehmung



Investigation of densely connected convolutional networks with domain adversarial learning for noise robust speech recognition

Chia Yu Li, Ngoc Thang Vu



Convolutional neural networks can learn duration for detecting pitch accents and lexical stress

Sabrina Stehwien, Antje Schweitzer, Ngoc Thang Vu



Perception of German tense and lax vowel contrast by Chinese learners

Yingmin Gao, Hongwei Ding, Peter Birkholz, Rainer Jäckel, Yi Lin

Hauptvortrag



Silent speech interfaces for speech restoration: current status and future challenges

José Andrés González López

Dialogsysteme



Semi-automatische Generierung und Reinforcement Learning basiertes Training eines Dialogmanagers

Stefan Hillmann, Klaus-Peter Engelbrecht, Benjamin Weiss



Comparing phonetic changes in computer-directed and human-directed speech

Eran Raveh, Ingmar Steiner, Ingo Siegert, Iona Gessinger, Bernd Möbius



Analysis and categorization of corrections in multilingual spoken dialogue system

Ivan Kraljevski, Diane Hirschfeld

Hauptvortrag



Speech and voice identity recognition in the human brain

Katharina von Kriegstein

Gehirn und kognitive Modelle



Lernen durch Differenz. Zur logisch-mathematischen Struktur maschinellen Lernens

Peter Klimczak, Günther Wirsching, Matthias Wolff



Extraction of the Ɵ- and ɤ-cycles active in human speech processing from an articulatory speech database

Harald Höge



Bidirektionale Utterance-Meaning-Transducer für Zahlworte durch kompositionale minimalistische Grammatiken

Peter beim Graben, Werner Meyer, Ronald Römer, Matthias Wolff

Hauptvortrag



In Articulation for Diversity

Korin Richmond

Sprachsynthese



Comparison of different methods for the voiced excitation of physical vocal tract models

Peter Birkholz, Simon Stone, Steffen Kürbis



Resynthesizing the GECO speech corpus with VocalTractLab

Konstantin Sering, Niels Stehwien, Yingming Gao, Martin V. Butz, Harald Baayen



How should Pepper sound - Preliminary investigations on robot vocalizations

Felix Burkhardt, Milenko Saponja, Julian Sessner, Benjamin Weiss

Hauptvortrag



Sprache von Produktgeräuschen – Mensch-Produkt Interaktion

Ercan Altinsoy

Medizinische Anwendungen



Influence of speech activity on vibrometer signals to extract vital parameters of humans

Kristian Kroschel, Jürgen Metzler



Schnelle Regelung eines monolithischen vollimplantierbaren Hörgeräts

Till Moritz Eßinger, Martin Koch, Matthias Bornitz, Hannes Seidler, Marcus Neudert, Thomas Zahnert

Poster und Demonstrationen



The restaurant booking corpus – content-identical comparative human-human and humancomputer simulated telephone conversations

Ingo Siegert, Jannik Nietzold, Ralph Heinemann, Andreas Wendemuth



ReTiCo: An open-source framework for modeling real-time conversations in spoken dialogue systems

Thilo Michael, Sebastian Möller



Segmenting multi-intent queries for spoken language understanding

Rohan Shet, Elena Davcheva, Christian Uhle



Exploration and assessment of proactive use cases for an in-car voice assistant

Maria Schmidt, Daniela Stier, Steffen Werner, Wolfgang Minker



Analysis of the influence of different room acoustics on acoustic emotion features

Juliane Höbel-Müller, Ingo Siegert, Ralph Heinemann, Alicia Flores Requardt, Michael Tornow, Andreas Wendemuth



Vergleich verschiedener Machine-Learning Ansätze zur kontinuierlichen Schätzung von perzeptivem Sprechtempo

Benjamin Weiss, Thilo Michael, Uwe Reichel, Oliver Pauly



IMS-speech: A speech to text tool

Pavel Denisov, Ngoc Thang Vu



Schätzung der spektralen Einhüllenden – Ein Vergleich von tiefen neuronalen Netzen und Codebüchern

Christopher Seitz, Mohammed Krini



Entscheidungstheoretische Modellierung der konsummatorischen Endhandlung – Vergleich von klassischen und quantenmechanischen Ansätzen

Ronald Römer, Peter beim Graben, Matthias Wolff



Multimodal speech segmentation using gaze data and spectrogram image features

Arif Khan, Ingmar Steiner



Design and deployment of multilingual industrial voice control applications

Ivan Kraljevski, M. Pohl, A. Gjoreski, U. Koloska, J. Wöhl, M. Wenzel, D. Hirschfeld



Drone sounds and environmental signals – a first review

Oliver Jokisch, Dominik Fischer



Surface stickiness and waviness of two-layer silicone structures for synthetic vocal folds

Falk Gabriel, Patrick Häsner, Eike Dohmen, Dmitry Borin, Peter Birkholz



A toolkit for nested multi-turn speech dialog in automotive environments

Timo Sowa, Soyuj Kumar Sahoo



Modell einer Frauenstimme für die artikulatorische Sprachsynthese mit VocalTractLab

Susanne Drechsel, Yingming Gao, Jens Frahm, Peter Birkholz



How to identify elliptical poems within a digital corpus of auditory poetry

Hussein Hussein, Burkhard Meyer-Sickendiek, Timo Baumann



Dynamic vocabulary with a Kaldi speech recognizer in a speech dialog system for automotive infotainment applications

Thomas Ranzenberger, Christian Hacker, Karl Weilhammer



Automatic vocal tract segmentation based on conditional generative adversarial neural network

Mohammad Eslami, Christiane Neuschaefer-Rube, Antoine Serrurier

Hauptvortrag



The myoelastic-aerodynamic theory of sound production in humans, mammals, and birds

Christian Herbst

Prosodie



Filled pause detection by prosodic discontinuity features

Uwe D. Reichel, Benjamin Weiss, Thilo Michael



Zur Annotation nicht-verbaler Vokalisierungen in Korpora gesprochener Sprache

Jürgen Trouvain, Malte Belz



Towards ordinal classification of voice quality features with acoustic parameters

Felix Schaeffler, Matthias Eichner, Janet Beck

Sprachproduktion



Analysis of coarticulation using EMA data with a statistical shape space model of the tongue

Alexander Hewer, Ingmar Steiner, Korin Richmond



Modelling vowel acquisition using the Birkholz synthesizer

Ian S. Howard, Peter Birkholz



Influence of the vocal tract morphology on the F1-F2 acoustic plane

Antoine Serrurier, Pierre Badin, Christiane Neuschaefer-Rube



Numerische Studie zum Einfluss laryngealer Areale auf individuelle und allgemeine akustische Eigenschaften des menschlichen Vokaltrakts bei gehaltenen Vokalen

Mario Fleischer, Alexander Mainka, Dirk Mürbe