Stephan Werner

Dr.-Ing. Stephan Werner

Head of Group (acting)

Electronic Media Technology Group

Helmholtzbau, Room H 3520
+49 3677 69-1653
stephan.werner@tu-ilmenau.de

My research, teaching, and administrative activities to date are listed below (see also the 'Research' and 'Teaching' tabs).

Research activity

I have been active in the scientific community since 2008. My research investigates the relationship between technical recording, signal processing, and playback systems on the one hand and the perception of the scene conveyed by these systems on the other. The aim is to reveal the connections between the perceiving individual and the technical system. So far the focus has been mainly, though not exclusively, on auditory perception.

The research carried out so far includes:

Investigations on technical components of spatial audio reproduction systems based on methods of binaural synthesis
Investigations on the influence of training and adaptation processes on the localization and externalization of auditory events
Investigations on spatial hearing when using hearing aids
Investigations on the influence of context-dependent quality parameters on the creation of a plausible auditory illusion
Development of methods to adjust acoustic parameters of a binaural synthesis system to create perceptual congruence between synthesized scene and listening conditions
Creation of data sets of measured and fitted binaural spatial transfer functions for use in research projects

Project activities

isoperare (project management)
RAVes (project management)
Co-Humanics (co-subproject management)
TheraTin (sub-project management)

Electronic Media Technology (EMT) Group and Institute of Media Technology (IMT)

I am committed to self-governance of the institute and the group.

Since April 2020: acting head of the EMT Group
2014 - 2020: deputy head of the EMT Group with signature authority
2014 - 2021: elected representative of the scientific staff in the Institute Council of the IMT

Conference organization

The following lists my activities for conferences held or planned. All of them were carried out in cooperation with colleagues and professors.

Publication Chair of the 9th International Conference on Quality of Multimedia Experience (QoMEX) 2017
Member of the organizing team for the 9th International Conference on Quality of Multimedia Experience (QoMEX) 2017
Organization of the International Conference on Spatial Audio (ICSA) of the Association of German Sound Engineers planned for 2019 in Ilmenau (TU Ilmenau in cooperation with Fraunhofer IDMT)

Reviewer activities

Below you will find my activities as a reviewer.

Journal of the Audio Engineering Society
Conferences of the Audio Engineering Society
International Conference on Quality of Multimedia Experience (QoMEX)

Memberships

Audio Engineering Society (AES)
German Society for Acoustics (DEGA)
Association of German Sound Engineers (VDT)

Motivation

"The most certain thing we know is undoubtedly that which we experience in our own bodies. ... Therefore the experienced world of sense forms the only unassailable basis for the work of exact science." Max Planck

Research Field

It has always been a goal of acoustic and visual recording and playback systems to create a perfect audiovisual illusion for the user. Great progress has been made in the development of audio systems, such as Ambisonics, wave field synthesis, and binaural headphone playback, and of video systems, such as display technologies, stereoscopic recording and playback, and coding techniques. The quality of synthesis is continuously improving. Nevertheless, numerous effects and limiting factors for the realization of immersion and plausibility are not understood.

Research Approach

The construction of an immersive and plausible perception for the user can be formulated as a goal for multimedia systems and services for the generation of a virtual reality and/or a reality enriched by virtual objects. The technical system should create a multimedia illusion suitable for certain requirements and expectations. If this succeeds, the mediated scene and the technical system are characterized by a high degree of viability.

A promising approach to a fundamental understanding of viability in a multimedia context is the combined investigation of the properties of the technical system as a whole, including its individual components ("physical domain" in Figure 1), the individual perceptual characteristics of the user that describe quality ("perceptional domain" in Figure 1), and the user's interactions, especially in connection with the playback environment ("environment" in Figure 1).

In this consideration, it is essential to assume that the describable perceptual quality variables and correlations are always individual and time-variable. This additional variability seems to be primarily attributable to the changing intrinsic expectations of the individual user. The user's individual internal reference is always adjusted by cognitive processes of assimilation and accommodation.

The research goal is the systematic investigation of how an audiovisual illusion forms, taking into account individual, time-variable quality characteristics and the individual's context of use of the technical system. The research is intended to make a decisive contribution to understanding how overall quality is formed. Relevant system components and parameters of the technical system will be identified, and a summary representation as a predictive model is sought. I am firmly convinced that the interrelationships between individual system components, individual perceptual quality characteristics, and individual evaluations must be considered in order to make qualified statements about the emergence of overall quality.

The main research areas are divided into:

Generation of plausible and immersive spatial auditory events
Investigation of perceptual effects in audio systems for spatial reproduction
Analysis of context-dependent quality parameters within binaural audio systems
Spatial hearing when using hearing implants
Contributions to the development of a prediction model of spatial auditory perception

The teaching activities listed below summarize the teaching carried out so far at the Institute of Media Technology (█ Master; ▒ Bachelor). As a rule, the activities were carried out in cooperation with colleagues and professors.

▒ Fundamentals of Media Technology – Lecture for Bachelor Media Technology, 2nd semester - From 2015 to 2017: supervision of the lecture; preparation of lecture notes and delivery of lectures focusing on the fundamentals of digitization, auditory perception, and audio recording; preparation of exams

▒ Multimedia Tools – Lecture and projects for Bachelor Applied Media and Communication Science, 2nd semester - From 2010 to 2017: supervision of the lecture and parts of the project work; preparation of lecture notes and delivery of lectures; preparation of exams

▒ Practical Workshop Hearing Tests – Co-organization and co-supervision of the practical workshop (Praxiswerkstatt) for Bachelor Media Technology - Since 2011: organization and implementation, in cooperation with Fraunhofer IDMT, of the workshop on hearing tests (Praxiswerkstatt Hörtests) addressing current research questions from my own and colleagues' research

▒ Main seminar – Organization and supervision for Bachelor Media Technology, 5th semester - Since 2012: organization of the main seminar of the Electronic Media Technology Group and (since 2015) the Audiovisual Technology Group; creation and supervision of main seminar topics

█ Advanced Psychoacoustics – Lecture and project for Master Media Technology - From 2010 to 2017: organization and supervision of the lecture; preparation of lecture notes; delivery of lectures and project work; preparation of exams; preparation, planning, implementation, and evaluation of the related research project in cooperation with the students

█ Audio Systems Technology – Lecture and Seminars for Master Media Technology - Since 2009 Organization and supervision of the lecture; preparation of lecture notes as well as holding lectures and seminars; preparation of exams

█ Applied and Virtual Acoustics – Lecture and Seminars for Master Media Technology - Since 2009 Organization and supervision of the lecture; preparation of lecture notes as well as giving lectures and seminars; preparation of exams

█ ▒ Master's and Bachelor's theses – Topic setting and supervision - Since 2010 scientific supervision of final theses (see tab 'Final theses')

█ ▒ Colloquium – Organization and realization - Since 2009 holding of the colloquium for students of media technology

List of supervised master's, diploma, and bachelor's theses (█ Master/Diplom; ▒ Bachelor). The list also includes co-supervised theses. Study and media projects are not listed.

2021

█ Stefan Dietrich, „Entwurf und Implementierung eines semiautomatischen Prüfsystems zur verlässlichen Artefakt-Detektion bei drahtloser Audioübertragung in digitalen Hörsystemen“, Masterarbeit, TU Ilmenau, co-supervised with Tamás Harczos (Audifon Hörgeräte GmbH), 2021.

2019

█ Dominik Zapf, „Entwicklung einer Methode zur Messung, Darstellung und Auswertung von Verhalten in einer positions-dynamischen Binauralsyntheseanwendung“, Masterarbeit, TU Ilmenau, 2019.

█ Georg Götz, „Simplified image-source modelling for dynamic rendering of virtual acoustics“, Masterarbeit, co-supervised with Ville Pulkki (Aalto University, Finland), TU Ilmenau, 2019.

▒ Jonathan Häußler, „Automatische Erkennung der Raumgröße und -geometrie auf Basis binauraler Signale unter Verwendung Künstlicher Neuronaler Netze“, Bachelorarbeit, TU Ilmenau, 2019.

▒ Clemens Müller, „Entwicklung und Evaluierung von Methoden zur Synthese von binauralen Raumimpulsantworten zur Abbildung neuer Quellpositionen“, Bachelorarbeit, TU Ilmenau, 2019.

2018

█ Reem Haider Mahdi, „Investigation on individual differences in sound localization tasks description : study inter-individual difference in audio sound localization“, Masterarbeit, TU Ilmenau, 2018.

█ Brijesh Bangalore Parappa, „Development and evaluation of an adaptive binaural synthesis system on the screen size“, Masterarbeit, TU Ilmenau, 2018.

█ Abhijatha K. Banashankarappa, „Noise-Robust Speaker Identification in Cars“, Masterarbeit in cooperation with IAV automotive engineering, TU Ilmenau, 2018.

2017

▒ Nicolas Pachatz, „Untersuchungen zur Relevanz raumakustischer Parameter bei Anpassung eines Binauralsynthesesystems an die Raumakustik des Abhörraumes“, Bachelorarbeit (in progress), TU Ilmenau, 2017.

▒ Stefan Dietrich, „Untersuchungen zur räumlichen auditiven Wahrnehmung bei Verwendung eines Knochenleitungshörers“, Bachelorarbeit (in progress), TU Ilmenau, 2017.

█ Martin Rekitt, „Virtuelle akustische Umgebung für Hörgeräte“, Masterarbeit, TU Ilmenau, 2017.

█ Leo Thieme, „Entwurf einer Untersuchungsvorrichtung zur Wiedergabe von binauralen Signalen über einen Knochenleitungshörer“, Masterarbeit, TU Ilmenau, 2017.

2016

▒ Georg Götz, „Untersuchung zum Einfluss von Head-Tracking auf die Externalisierung von Hörereignissen bei Divergenz zwischen synthetisierter Szene und Abhörraum unter Verwendung eines binauralen Kopfhörersystems“, Bachelorarbeit, TU Ilmenau, 2016.

█ Christina Mittag, „Entwicklung und Evaluierung eines Verfahrens zur Synthese von binauralen Raumimpulsantworten basierend auf räumlich dünnbesetzten Messungen in realen Räumen“, Masterarbeit, TU Ilmenau, 2016 (student awards of the DEGA 2017 and of the SEW Eurodrive Stiftung 2016).

▒ Andreas Löhner, „Implementierung eines Verfahrens zur Überblendung zwischen Raumimpulsantworten zweier verschiedener Räume zur Verwendung in einem Binauralsynthesesystem“, Bachelorarbeit, TU Ilmenau, 2016.

█ Samar Shahabi Ghahfarokhi, „On the influence of visual feedback on the externalization of the percieved sound sources“, Masterarbeit, TU Ilmenau, 2016.

▒ Ruth Schultheis, „Analyse und Definition einer geeigneten Testumgebung zur Qualitätsbeurteilung mit einem autostereoskopischen Display“, Bachelorarbeit, TU Ilmenau, 2016.

▒ Hong Ma, „Entwicklung einer Evaluierungsmethodik zur Bewertung der Position von bewegten Schallquellen“, Bachelorarbeit, TU Ilmenau, 2016.

2015

▒ Kai-Peter Jurgeit, „Qualitätsanalyse von Kugelarrayauralisationen basierend auf Open Profiling of Quality“, Bachelorarbeit, TU Ilmenau, 2015.

█ Thomas Mayenfels, „Untersuchung zum Einfluss von Training auf die Wahrnehmung von Externalität“, Masterarbeit, TU Ilmenau, 2015.

█ Anna Rüppel, „Qualitätsbewertung räumlicher Schallfelder unter Berücksichtigung realer Messbedingungen“, Masterarbeit, TU Ilmenau, 2015.

2014

▒ Martin Rekitt, „Bestimmung der Häufigkeitsverteilung von Quadrantenfehlern bei der Lokalisation von Hörereignissen unter Verwendung einer binauralen Kopfhörerwiedergabe“, Bachelorarbeit, TU Ilmenau, 2014.

▒ Tobias Brass, „Raumakustische Parameter von Räumen“, Bachelorarbeit, TU Ilmenau, 2014.

▒ Markus Anton, „Raumakustische Messungen und akustische Bewertung von Seminarräumen an der TU Ilmenau“, Bachelorarbeit, TU Ilmenau, 2014.

▒ Bernhard Fiedler, „Untersuchung zur Umsetzung einer interaktiven Raumsimulation zur Distanzdarstellung virtueller Schallquellen“, Bachelorarbeit, TU Ilmenau, 2014.

2013

▒ Daniel Richter, „Erweiterung eines Verfahrens zur Bestimmung der Phasenkohärenz von komplexen Signalen“, Bachelorarbeit, TU Ilmenau, 2013.

▒ Kai Pabst, „Evaluierung einer binauralen Simulation einer zweikanaligen Stereo-Lautsprecherwiedergabe“, Bachelorarbeit, TU Ilmenau, 2013.

█ Wie Li, „Individuelle Auswahl nicht individueller Außenohrübertragungsfunktionen auf Basis von optimierten Datensätzen“, Masterarbeit, TU Ilmenau, 2013.

▒ Henri Meißner, „Vergleich unterschiedlicher Fokussierungsverfahren mit Lautsprecheranordnungen im Mittel-/Hochtonbereich“, Bachelorarbeit, TU Ilmenau, 2013.

2012

▒ Anett Zabel, „Vergleich von Hörtestmethodiken zur Beurteilung der räumlichen Wahrnehmung bei binauraler Kopfhörerwiedergabe“, Bachelorarbeit, TU Ilmenau, 2012.

█ Georg Heise, „Untersuchungen zum Zusammenhang zwischen der Ohrkanalresonanzfrequenz und den richtungsabhängigen Merkmalen der Außenohrübertragungsfunktion“, Masterarbeit, TU Ilmenau, 2012.

█ Rebecca Sass, „Synthese binauraler Raumimpulsantworten“, Masterarbeit, TU Ilmenau, 2012.

█ Markus Hesse, „Mikrofonierung mit dem "Motion Tracked Binaural" Verfahren“, Diplomarbeit, TU Ilmenau, 2012.

▒ Sven Sammer, „Untersuchungen zu Probanden-Auswahlverfahren für auditive Qualitätsuntersuchungen“, Bachelorarbeit, TU Ilmenau, 2012.

█ Simone Füg, „Untersuchungen zur Distanzwahrnehmung von Hörereignissen bei Kopfhörerwiedergabe“, Masterarbeit, TU Ilmenau, 2012.

█ Hui Fei, „Individuelle Auswahl nicht individueller Außenohrübertragungsfunktionen“, Masterarbeit, TU Ilmenau, 2012.

▒ Matthias Hellmich, „Erstellung einer Datenbank von kopfbezogenen Impulsantworten“, Bachelorarbeit, TU Ilmenau, 2012.

█ Frank Jürgens, „Kurven gleicher Lautheit bei binauraler Kopfhörerwiedergabe“, Masterarbeit, TU Ilmenau, 2012 (3rd place, student award of the Thüringer Landesmedienanstalt 2013).

2011

▒ Ann-Christine Haddam, „Konzeptionelle Integration des PATTI-Demonstrators in die Prozesse eines Informationssystems für adaptives Assessment“, Bachelorarbeit, TU Ilmenau, 2011.

▒ Paul Fiedler, „Schallfeldanalyse und Anpassung eines Übersprechkompensationsfilters für binaurale Wiedergabe“, Bachelorarbeit, TU Ilmenau, 2011.

█ Florian Klein, „Individualisierte Entzerrung von Außenohrübertragungsfunktionen“, Diplomarbeit, TU Ilmenau, 2011.

2010

█ Johannes Post, „Evaluation of acoustic room simulation methods and implementation of a real time acoustic room simulation algorithm for reproduction via headphones“, Diplomarbeit, TU Ilmenau, 2010.

▒ Rebecca Sass, „Vergleich des Einflusses kopfbezogener Übertragungsfunktionen auf die Ausprägung binauraler Merkmale bei Verwendung unterschiedlicher Aufnahmeverfahren“, Bachelorarbeit, TU Ilmenau, 2010.

▒ Johannes Maggi, „Binaurale Wiedergabe über Lautsprecher in Stereoanordnung“, Bachelorarbeit, TU Ilmenau, 2010.

▒ Christian Neukam, „Kompensation nicht optimaler Stereoanordnungen für die Wiedergabe von stereophonen/binauralen Signalen“, Bachelorarbeit, TU Ilmenau, 2010.

█ Christoph Hahlweg, „Entwicklung eines Systems zur akustischen Positionsbestimmung der Position eines Schallobjekts“, Diplomarbeit, TU Ilmenau, 2010.

█ Christoph Mank, „Evaluation und Implementierung eines Verfahrens zur Schallquellenlokalisierung auf Basis von Pegeldifferenz-Stereophonie“, Diplomarbeit, TU Ilmenau, 2010.

Bibliography

Döring, Nicola; Mikhailova, Veronika; Brandenburg, Karlheinz; Broll, Wolfgang; Groß, Horst-Michael; Werner, Stephan; Raake, Alexander
Digital media in intergenerational communication: status quo and future scenarios for the grandparent-grandchild relationship. - In: Universal access in the information society, ISSN 1615-5297, Bd. 23 (2024), 1, S. 379-394

Communication technologies play an important role in maintaining the grandparent-grandchild (GP-GC) relationship. Based on Media Richness Theory, this study investigates the frequency of use (RQ1) and perceived quality (RQ2) of established media as well as the potential use of selected innovative media (RQ3) in GP-GC relationships with a particular focus on digital media. A cross-sectional online survey and vignette experiment were conducted in February 2021 among N = 286 university students in Germany (mean age 23 years, 57% female) who reported on the direct and mediated communication with their grandparents. In addition to face-to-face interactions, non-digital and digital established media (such as telephone, texting, video conferencing) and innovative digital media, namely augmented reality (AR)-based and social robot-based communication technologies, were covered. Face-to-face and phone communication occurred most frequently in GP-GC relationships: 85% of participants reported them taking place at least a few times per year (RQ1). Non-digital established media were associated with higher perceived communication quality than digital established media (RQ2). Innovative digital media received less favorable quality evaluations than established media. Participants expressed doubts regarding the technology competence of their grandparents, but still met innovative media with high expectations regarding improved communication quality (RQ3). Richer media, such as video conferencing or AR, do not automatically lead to better perceived communication quality, while leaner media, such as letters or text messages, can provide rich communication experiences. More research is needed to fully understand and systematically improve the utility, usability, and joy of use of different digital communication technologies employed in GP-GC relationships.

https://doi.org/10.1007/s10209-022-00957-w

Klein, Florian; Treybig, Lukas; Schneiderwind, Christian; Werner, Stephan; Sporer, Thomas
Just noticeable reverberation difference at varying loudness levels. - In: AES Europe 2023, (2023), S. 361-368

In order to successfully fuse virtual sound sources with the real acoustic environment, the acoustic properties of the real environment must be estimated and utilized for the synthesis of virtual sound sources. Often, just noticeable differences (JNDs) of room acoustic parameters are utilized to predict a good match between virtual and real acoustics. However, several studies in this domain have shown that existing JND values of room acoustic parameters are often not able to predict the perception of the listeners. This can have various reasons: differences in first reflection patterns are barely measurable with classical acoustic parameters; even if acoustic differences are above the JND, a plausible reproduction might still be possible; JNDs depend on various factors (such as sound signal, etc.) and existing studies do not cover all of them. The last factor is addressed in this research paper. A three-alternative forced choice (3AFC) test was conducted at four different loudness levels (75 dB(A), 65 dB(A), 55 dB(A), and 45 dB(A)) in a reverberation time range from 0.5 s to 0.8 s. A dependency of the loudness on the detectability of reverberation differences was found for the randomly interleaved presentation of loudness levels but not for sequential presentation. Individual hearing thresholds as well as expertise level significantly influence the JND of reverberation time.

Treybig, Lukas; Werner, Stephan; Klein, Florian; Amengual Garí, Sebastià V.
Robust reverberation time estimation for audio augmented reality applications. - In: AES Europe 2023, (2023), S. 47-55

The paper presents an alternative approach for estimating reverberation time from measurements in real rooms when the requirements of the standard DIN EN ISO 3382-1/2 for the characteristics of the sound source, receiver, and measurement positions cannot be met. The main goal is to minimize the variance of the calculated reverberation times when using a directional source and receiver, or source-receiver relative positions with very small distances. For this purpose, the energy decay curve for individual octave bands is sampled in time. The estimation starts 2 ms after the direct sound. This is followed by several estimates of the RT over a 20 dB drop, starting 1 dB later with each iteration. The best fit mean of these values gives the estimated reverberation time. A comparison with the standard reverberation time estimation shows a variance reduction of 10% to 30% for binaural room impulse responses (BRIRs). The proposed method finds its application in situations where measurements can only be made at a few positions in the room and/or only in a few areas of the room. Furthermore, the method should be better suited for measurements with receivers located near or at the head of a person.
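The sliding-window idea in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that the impulse response is already band-filtered and begins at the direct sound, that the energy decay curve is obtained by Schroeder backward integration, and that the "best fit mean" can be approximated by a plain arithmetic mean. The function names, the parameter names, and the default of ten fits are hypothetical.

```python
import numpy as np


def energy_decay_curve_db(ir):
    """Schroeder backward integration, normalised to 0 dB at the first sample."""
    energy = np.cumsum(ir[::-1] ** 2)[::-1]
    # Floor the ratio so trailing zeros do not produce log10(0).
    return 10.0 * np.log10(np.maximum(energy / energy[0], 1e-30))


def rt_from_sliding_fits(ir, fs, skip_s=0.002, drop_db=20.0, step_db=1.0, n_fits=10):
    """Average several RT estimates from line fits over shifted 20 dB ranges.

    Assumptions (not taken from the paper): `ir` starts at the direct sound,
    `n_fits` fits are used, and the estimates are combined by a plain mean.
    """
    start = int(round(skip_s * fs))      # skip the first 2 ms after the direct sound
    edc = energy_decay_curve_db(ir[start:])
    t = np.arange(edc.size) / fs
    estimates = []
    for k in range(n_fits):
        upper = -k * step_db             # each fit starts 1 dB lower than the last
        lower = upper - drop_db          # and spans a 20 dB drop
        mask = (edc <= upper) & (edc >= lower)
        if mask.sum() < 2:
            break
        slope, _ = np.polyfit(t[mask], edc[mask], 1)   # slope in dB per second
        estimates.append(-60.0 / slope)  # extrapolate the fit to a 60 dB decay
    return float(np.mean(estimates))
```

For an ideal exponential decay all fits return the same value. On measured responses the individual fits differ, and averaging them is what is meant to reduce the variance of the estimate.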

Fischedick, Söhnke B.; Richter, Kay; Wengefeld, Tim; Seichter, Daniel; Scheidig, Andrea; Döring, Nicola; Broll, Wolfgang; Werner, Stephan; Raake, Alexander; Groß, Horst-Michael
Bridging distance with a collaborative telepresence robot for older adults - report on progress in the CO-HUMANICS project. - In: ISR Europe 2023: 56th International Symposium on Robotics, (2023), S. 346-353

In an aging society, the social needs of older adults, such as regular interactions and independent living, are crucial for their quality of life. However, due to spatial separation from their family and friends, it is difficult to maintain social relationships. Our multidisciplinary project, CO-HUMANICS, aims to meet these needs, even over long distances, through the utilization of innovative technologies, including a robot-based system. This paper presents the first prototype of our system, designed to connect family members or friends virtually present through a mobile robot with an older adult. The system incorporates bi-directional video telephony, remote control capabilities, and enhanced visualization methods. A comparison is made with other state-of-the-art robotic approaches, focusing on remote control capabilities. We provide details about the hard- and software components, e.g., a projector-based pointing unit for collaborative telepresence to assist in everyday tasks. Our comprehensive scene representation is discussed, which utilizes 3D NDT maps, enabling advanced remote navigation features, such as autonomously driving to a specific object. Finally, insights about past and concepts for future evaluation are provided to assess the developed system.

https://ieeexplore.ieee.org/document/10363093

Stolz, Georg; Klein, Florian; Werner, Stephan; Treybig, Lukas; Bley, Andreas; Martin, Christian
Discussion of acoustic and perceptual optimization methods for measuring spatial room impulse responses with a mobile robotic platform. - In: 2023 Immersive and 3D Audio: from Architecture to Automotive (I3DA), (2023), insges. 7 S.

In the field of Auditory Augmented Reality (AAR), one aim is to provide a listening experience that is as close as possible to a real scenario. Measured Spatial Room Impulse Responses (SRIRs) describe the acoustics of a room and can serve as a reference for acoustic simulations or parametrization of room acoustics. In previous works, a measurement system for SRIRs using a mobile robotic platform was introduced. The system consists of a commercially available self-driving platform on which a microphone array is mounted, while the sound sources are distributed at fixed positions in the room. The system is able to conduct high spatial resolution measurements of SRIRs in a uniform grid. In applications where time is limited and/or the area to discover is large, however, a high-resolution measurement is not always feasible. Therefore, the goal of this contribution is to compare different approaches for optimizing the measurement grid. One approach is to use mathematical optimization on acoustic parameters derived from a small set of initial measurements to determine new measurement positions in an iterative manner. Another approach is to optimize the measurement grid with respect to human auditory perception, incorporating e.g. just-noticeable differences of distance and localization perception. The results show that both approaches can achieve significant reductions in the number of measurements required for an adequate acoustic spatial reproduction, with different trade-offs depending on the application scenario and the available prior information.

https://doi.org/10.1109/I3DA57090.2023.10289338

Treybig, Lukas; Höbel-Müller, Juliane; Werner, Stephan; Nürnberger, Andreas
Acoustic inter- and intra-room similarity based on room acoustic parameters. - In: Engineering for a changing world, (2023), 5.2.136, S. 1-15

This paper shows various approaches for determining acoustic (dis-)similarity based on room acoustic parameter values derived from real measurements. The similarity is calculated across different room configurations and/or between different microphone-loudspeaker positions within the same room configuration. We compare supervised (LDA, Random Forest) and unsupervised techniques (PCA, SPPA) and pre-selected visualizations in terms of their ability to exhibit inter- and intra-room (dis-)similarities. The data set generated comprises spatially high-resolution room impulse responses obtained from multiple source-receiver positions within a room configuration. The room acoustics are varied by introducing active walls and geometries accounting for specific room configurations. The results show that the separation of room configurations primarily relies on specific acoustic parameters, with the reverberation time playing an important role. Within a given room configuration, the acoustic parameters excluding the reverberation time mainly capture the orientation and distance between the source and receiver.

https://doi.org/10.22032/dbt.58929

Klein, Florian; Surdu, Tatiana; Treybig, Lukas; Werner, Stephan
The ability to memorize acoustic features in a discrimination task. - In: Journal of the Audio Engineering Society, ISSN 0004-7554, Bd. 71 (2023), 5, S. 254-266

How humans perceive, recognize, and remember room acoustics is of particular interest in the domain of spatial audio. For the creation of virtual or augmented acoustic environments, a room acoustic impression needs to be created that matches the expectations of certain room classes or a specific room. These expectations are based on the auditory memory of the acoustic room impression. In this paper, the authors present an exploratory study to evaluate the ability of listeners to recognize room acoustic features. The task of the listeners was to detect the reference room in a modified ABX double-blind stimulus test that featured a pre-defined playback order and a fixed time schedule. Furthermore, the authors explored distraction effects by employing additional nonacoustic interferences. The results show a significant decrease of the auditory memory capacity within 10 s, which is more pronounced when the listeners were distracted. However, the results suggest that auditory memory depends on what auditory cues are available.

https://doi.org/10.17743/jaes.2022.0073

Klein, Florian; Surdu, Tatiana; Aretz, Arthur; Birth, Kilian; Edelmann, Niklas; Seitelman, Florian; Ziener, Christian; Werner, Stephan; Sporer, Thomas
A dataset of measured spatial room impulse responses in different rooms including visualization. - In: AES Europe Spring 2022, (2022), S. 621-625

In this contribution, an open-source dataset of captured spatial room impulse responses (SRIRs) is presented. The data was collected in different enclosed spaces at the Technische Universität Ilmenau using an open self-built microphone array design following the spatial decomposition method (SDM) guidelines. The included rooms were selected based on their distinctive acoustical properties resulting from their general build and furnishing as required by their utility. Three different classes of spaces can be distinguished, including seminar rooms, offices, and classrooms. For each considered space different source-receiver positions were recorded, including 360° images for each condition. The dataset can be utilized for various augmented or virtual reality applications, using either a loudspeaker or headphone-based reproduction alongside the appropriate head-related transfer function sets. The complete database, including the measured impulse responses as well as the corresponding images, is publicly available.

Treybig, Lukas; Saini, Shivam; Werner, Stephan; Sloma, Ulrike; Peissig, Jürgen
Room acoustic analysis and BRIR matching based on room acoustic measurements. - In: AES International Conference on Audio for Virtual and Augmented Reality (AVAR 2022), (2022), S. 48-57

To achieve the goal of a perceptual fusion between the auralization of virtual audio objects in the room acoustics of a real listening room, an adequate adaptation of the virtual acoustics to the real room acoustics is necessary. The challenges are to describe the acoustics of different rooms by suitable parameters, to classify different rooms, and to evoke a similar auditory perception between acoustically similar rooms. An approach is presented to classify rooms based on measured BRIRs using statistical methods and to select best match BRIRs from the dataset to auralize audio objects in a new room. The results show that it is possible to separate rooms based on their room acoustic properties, that the separation also corresponds to a large extent to the perceptual distance between rooms, and that a selection of best match BRIRs is possible.

Klein, Florian; Surdu, Tatiana; Treybig, Lukas; Werner, Stephan; Aretz, Arthur; Birth, Kilian; Edelmann, Niklas; Seitelmann, Florian; Ziener, Christian; Sporer, Thomas
Auditory room identification in a memory task. - In: AES International Conference on Audio for Virtual and Augmented Reality (AVAR 2022), (2022), S. 132-141
Correct name of the author: Florian Seitelmann

How we perceive and remember room acoustics is of particular interest in the domain of spatial audio. For the creation of virtual or augmented acoustic environments, a room acoustic impression needs to be created which matches the expectations of certain room classes or a specific room. These expectations are based on the auditory memory of the acoustic room impression. In this paper, we present an exploratory study to evaluate the ability of listeners to remember specific rooms. The task of the listeners was to detect the reference room in a modified ABX double-blind stimulus test which featured a pre-defined playback order and a fixed time schedule. Furthermore, we explored distraction effects by employing additional non-acoustic interferences. The results show a significant decrease of the auditory memory capacity within ten seconds, which is more pronounced when the listeners were distracted. However, the results suggest that auditory memory depends on what auditory cues are available.
