Dr.-Ing. Stephan Werner

Fachgebietsleiter (kommissarisch)
Elektronische Medientechnik

Helmholtzbau, Raum H 3520
+49 3677 69-1653
stephan.werner@tu-ilmenau.de
 

LinkedIn

ResearchGate

Google Scholar

Literaturliste

Anzahl der Treffer: 62
Erstellt: Wed, 24 Apr 2024 23:16:07 +0200 in 0.1918 sec


Döring, Nicola; Mikhailova, Veronika; Brandenburg, Karlheinz; Broll, Wolfgang; Groß, Horst-Michael; Werner, Stephan; Raake, Alexander
Digital media in intergenerational communication: status quo and future scenarios for the grandparent-grandchild relationship. - In: Universal access in the information society, ISSN 1615-5297, Bd. 23 (2024), 1, S. 379-394

Communication technologies play an important role in maintaining the grandparent-grandchild (GP-GC) relationship. Based on Media Richness Theory, this study investigates the frequency of use (RQ1) and perceived quality (RQ2) of established media as well as the potential use of selected innovative media (RQ3) in GP-GC relationships with a particular focus on digital media. A cross-sectional online survey and vignette experiment were conducted in February 2021 among N = 286 university students in Germany (mean age 23 years, 57% female) who reported on the direct and mediated communication with their grandparents. In addition to face-to-face interactions, non-digital and digital established media (such as telephone, texting, video conferencing) and innovative digital media, namely augmented reality (AR)-based and social robot-based communication technologies, were covered. Face-to-face and phone communication occurred most frequently in GP-GC relationships: 85% of participants reported them taking place at least a few times per year (RQ1). Non-digital established media were associated with higher perceived communication quality than digital established media (RQ2). Innovative digital media received less favorable quality evaluations than established media. Participants expressed doubts regarding the technology competence of their grandparents, but still met innovative media with high expectations regarding improved communication quality (RQ3). Richer media, such as video conferencing or AR, do not automatically lead to better perceived communication quality, while leaner media, such as letters or text messages, can provide rich communication experiences. More research is needed to fully understand and systematically improve the utility, usability, and joy of use of different digital communication technologies employed in GP-GC relationships.



https://doi.org/10.1007/s10209-022-00957-w
Klein, Florian; Treybig, Lukas; Schneiderwind, Christian; Werner, Stephan; Sporer, Thomas
Just noticeable reverberation difference at varying loudness levels. - In: AES Europe 2023, (2023), S. 361-368

In order to successfully fuse virtual sound sources with the real acoustic environment, the acoustic properties of the real environment must be estimated and utilized for the synthesis of virtual sound sources. Often, just noticeable differences (JNDs) of room acoustic parameters are utilized to predict a good match between virtual and real acoustics. However, several studies in this domain have shown that existing JND values of room acoustic parameters are often not able to predict the perception of the listeners. This can have various reasons: Differences in first reflection patterns are barely measurable with classical acoustic parameters; Even if acoustic differences are above the JND, a plausible reproduction might still be possible; JNDs depend on various factors (such as sound signal, etc.) and existing studies do not cover all of them. The last factor is addressed in this research paper. A three-alternative forced (3AFC) choice test was conducted at four different loudness levels (75 dB(A), 65 dB(A), 55 dB(A), and 45 dB(A)) in a reverberation time range from 0.5 s to 0.8 s. A dependency of the loudness on the detectability of reverberation differences was found for the randomly interleaved presentation of loudness levels but not for sequential presentation. Individual hearing thresholds as well as expertise level significantly influence the JND of reverberation time.



Treybig, Lukas; Werner, Stephan; Klein, Florian; Amengual Garí, Sebastià V.
Robust reverberation time estimation for audio augmented reality applications. - In: AES Europe 2023, (2023), S. 47-55

The paper presents an alternative approach for estimating reverberation time from measurements in real rooms when the requirements of the standard DIN EN ISO 3382-1/2 for the characteristics of the sound source, receiver, and measurement positions cannot be met. The main goal is to minimize the variance of the calculated reverberation times when using a directional source and receiver, or source-receiver relative positions with very small distances. For this purpose, the energy decay curve for individual octave bands is sampled in time. The estimation starts 2 ms after the direct sound. This is followed by several estimates of the RT over a 20 dB drop, starting 1 dB later with each iteration. The best fit mean of these values gives the estimated reverberation time. A comparison with the standard reverberation time estimation shows a variance reduction of 10% to 30% for binaural room impulse responses (BRIRs). The proposed method finds its application in situations where measurements can only be made at a few positions in the room and/or only in a few areas of the room. Furthermore, the method should be better suitable for measurements with receivers located near or at the head of a person.



Fischedick, Söhnke B.; Richter, Kay; Wengefeld, Tim; Seichter, Daniel; Scheidig, Andrea; Döring, Nicola; Broll, Wolfgang; Werner, Stephan; Raake, Alexander; Groß, Horst-Michael
Bridging distance with a collaborative telepresence robot for older adults - report on progress in the CO-HUMANICS project. - In: ISR Europe 2023: 56th International Symposium on Robotics, (2023), S. 346-353

In an aging society, the social needs of older adults, such as regular interactions and independent living, are crucial for their quality of life. However, due to spatial separation from their family and friends, it is difficult to maintain social relationships. Our multidisciplinary project, CO-HUMANICS, aims to meet these needs, even over long distances, through the utilization of innovative technologies, including a robot-based system. This paper presents the first prototype of our system, designed to connect family members or friends virtually present through a mobile robot with an older adult. The system incorporates bi-directional video telephony, remote control capabilities, and enhanced visualization methods. A comparison is made with other state-of-the-art robotic approaches, focusing on remote control capabilities. We provide details about the hard- and software components, e.g., a projector-based pointing unit for collaborative telepresence to assist in everyday tasks. Our comprehensive scene representation is discussed, which utilizes 3D NDT maps, enabling advanced remote navigation features, such as autonomously driving to a specific object. Finally, insights about past and concepts for future evaluation are provided to assess the developed system.



https://ieeexplore.ieee.org/document/10363093
Stolz, Georg; Klein, Florian; Werner, Stephan; Treybig, Lukas; Bley, Andreas; Martin, Christian
Discussion of acoustic and perceptual optimization methods for measuring spatial room impulse responses with a mobile robotic platform. - In: 2023 Immersive and 3D Audio: from Architecture to Automotive (I3DA), (2023), insges. 7 S.

In the field of Auditory Augmented Reality (AAR), one aim is to provide a listening experience that is as close as possible to a real scenario. Measured Spatial Room Impulse Responses (SRIRs) describe the acoustics of a room and can serve as a reference for acoustic simulations or parametrization of room acoustics. In previous works, a measurement system for SRIRs using a mobile robotic platform was introduced. The system consists of a commercially available self-driving platform on which a microphone array is mounted, while the sound sources are distributed at fixed positions in the room. The system is able to conduct high spatial resolution measurements of SRIRs in a uniform grid. In applications where time is limited and/or the area to discover is large, however, a high-resolution measurement is not always feasible.Therefore, the goal of this contribution is to compare different approaches for optimizing the measurement grid. One approach is to use mathematical optimization on acoustic parameters derived from a small set of initial measurements to determine new measurement positions in a iterative manner. Another approach is to optimize the measurement grid in respect to human auditory perception, incorporating e.g. just-noticeable differences of distance and localization perception.The results show that both approaches can achieve significant reductions in the number of measurements required for a adequate acoustic spatial reproduction, with different trade-offs depending on the application scenario and the available prior information.



https://doi.org/10.1109/I3DA57090.2023.10289338
Treybig, Lukas; Höbel-Müller, Juliane; Werner, Stephan; Nürnberger, Andreas
Acoustic inter- and intra-room similarity based on room acoustic parameters. - In: Engineering for a changing world, (2023), 5.2.136, S. 1-15

This paper shows various approaches for determining acoustic (dis-)similarity based on room acoustic parameter values derived from real measurements. The similarity is calculated across different room configurations and/or between different microphone-loudspeaker positions within the same room configuration. We compare supervised (LDA, Random Forrest) and unsupervised techniques (PCA, SPPA) and pre-selected visualizations in terms of their ability to exhibit inter- and intra-room (dis-)similarities. The data set generated comprises spatially high-resolution room impulse responses obtained from multiple source-receiver positions within a room configuration. The room acoustics are varied by introducing active walls and geometries accounting for specific room configurations. The results show that the separation of room configurations primarily relies on specific acoustic parameters, with the reverberation time playing an important role. Within a given room configuration, the acoustic parameters excluding the reverberation time mainly capture the orientation and distance between the source and receiver.



https://doi.org/10.22032/dbt.58929
Klein, Florian; Surdu, Tatiana; Treybig, Lukas; Werner, Stephan
The ability to memorize acoustic features in a discrimination task. - In: Journal of the Audio Engineering Society, ISSN 0004-7554, Bd. 71 (2023), 5, S. 254-266

How humans perceive, recognize, and remember room acoustics is of particular interest in the domain of spatial audio. For the creation of virtual or augmented acoustic environments, a room acoustic impression matches the expectations of certain room classes or a specific room. These expectations are based on the auditory memory of the acoustic room impression. In this paper, the authors present an exploratory study to evaluate the ability of listeners to recognize room acoustic features. The task of the listeners was to detect the reference room in a modified ABX double-blind stimulus test that featured a pre-defined playback order and a fixed time schedule. Furthermore, the authors explored distraction effects by employing additional nonacoustic interferences. The results show a significant decrease of the auditory memory capacity within 10 s, which is more pronounced when the listeners were distracted. However, the results suggest that auditory memory depends on what auditory cues are available.



https://doi.org/10.17743/jaes.2022.0073
Klein, Florian; Surdu, Tatiana; Aretz, Arthur; Birth, Kilian; Edelmann, Niklas; Seitelman, Florian; Ziener, Christian; Werner, Stephan; Sporer, Thomas
A dataset of measured spatial room impulse responses in different rooms including visualization. - In: AES Europe Spring 2022, (2022), S. 621-625

In this contribution, an open-source dataset of captured spatial room impulse responses (SRIRs) is presented. The data was collected in different enclosed spaces at the Technische Universität Ilmenau using an open self-build microphone array design following the spatial decomposition method (SDM) guidelines. The included rooms were selected based on their distinctive acoustical properties resulting from their general build and furnishing as required by their utility. Three different classes of spaces can be distinguished, including seminar rooms, offices, and classrooms. For each considered space different source-receiver positions were recorded, including 360? images for each condition. The dataset can be utilized for various augmented or virtual reality applications, using either a loudspeaker or headphone-based reproduction alongside the appropriate head-related transfer function sets. The complete database, including the measured impulse responses as well as the corresponding images, is publicly available.



Treybig, Lukas; Saini, Shivam; Werner, Stephan; Sloma, Ulrike; Peissig, Jürgen
Room acoustic analysis and BRIR matching based on room acoustic measurements. - In: AES International Conference on Audio for Virtual and Augmented Reality (AVAR 2022), (2022), S. 48-57

To achieve the goal of a perceptual fusion between the auralization of virtual audio objects in the room acoustics of a real listening room, an adequate adaptation of the virtual acoustics to the real room acoustics is necessary. The challenges are to describe the acoustics of different rooms by suitable parameters, to classify different rooms, and to evoke a similar auditory perception between acoustically similar rooms. An approach is presented to classify rooms based on measured BRIRs using statistical methods and to select best match BRIRs from the dataset to auralize audio objects in a new room. The results show that it is possible to separate rooms based on their room acoustic properties, that the separation also corresponds to a large extent to the perceptual distance between rooms, and that a selection of best match BRIRs is possible.



Klein, Florian; Surdu, Tatiana; Treybig, Lukas; Werner, Stephan; Aretz, Arthur; Birth, Kilian; Edelmann, Niklas; Seitelman, Florian; Ziener, Christian; Sporer, Thomas
Auditory room identification in a memory task. - In: AES International Conference on Audio for Virtual and Augmented Reality (AVAR 2022), (2022), S. 132-141

How we perceive and remember room acoustics is of particular interest in the domain of spatial audio. For the creation of virtual or augmented acoustic environments, a room acoustic impression needs to be created which matches the expectations of certain room classes or a specific room. These expectations are based on the auditory memory of the acoustic room impression. In this paper, we present an exploratory study to evaluate the ability of listeners to remember specific rooms. The task of the listeners was to detect the reference room in a modified ABX double-blind stimulus test which featured a pre-defined playback order and a fixed time schedule. Furthermore, we explored distraction effects by employing additional non-acoustic interferences. The results show a significant decrease of the auditory memory capacity within ten seconds, which is more pronounced when the listeners were distracted. However, the results suggest that auditory memory depends on what auditory cues are available.



Döring, Nicola; Conde, Melisa; Brandenburg, Karlheinz; Broll, Wolfgang; Groß, Horst-Michael; Werner, Stephan; Raake, Alexander
Can communication technologies reduce loneliness and social isolation in older people? : a scoping review of reviews. - In: International journal of environmental research and public health, ISSN 1660-4601, Bd. 19 (2022), 18, 11310, S. 1-20

Background: Loneliness and social isolation in older age are considered major public health concerns and research on technology-based solutions is growing rapidly. This scoping review of reviews aims to summarize the communication technologies (CTs) (review question RQ1), theoretical frameworks (RQ2), study designs (RQ3), and positive effects of technology use (RQ4) present in the research field. Methods: A comprehensive multi-disciplinary, multi-database literature search was conducted. Identified reviews were analyzed according to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) framework. A total of N = 28 research reviews that cover 248 primary studies spanning 50 years were included. Results: The majority of the included reviews addressed general internet and computer use (82% each) (RQ1). Of the 28 reviews, only one (4%) worked with a theoretical framework (RQ2) and 26 (93%) covered primary studies with quantitative-experimental designs (RQ3). The positive effects of technology use were shown in 55% of the outcome measures for loneliness and 44% of the outcome measures for social isolation (RQ4). Conclusion: While research reviews show that CTs can reduce loneliness and social isolation in older people, causal evidence is limited and insights on innovative technologies such as augmented reality systems are scarce.



https://doi.org/10.3390/ijerph191811310
Döring, Nicola; Mikhailova, Veronika; Brandenburg, Karlheinz; Broll, Wolfgang; Groß, Horst-Michael; Werner, Stephan; Raake, Alexander
Saying "Hi" to grandma in nine different ways : established and innovative communication media in the grandparent-grandchild relationship. - In: Technology, Mind, and Behavior, ISSN 2689-0208, (2021), insges. 1 S.

https://doi.org/10.1037/tms0000107
Werner, Stephan; Klein, Florian; Neidhardt, Annika; Sloma, Ulrike; Schneiderwind, Christian; Brandenburg, Karlheinz
Creation of auditory augmented reality using a position-dynamic binaural synthesis system - technical components, psychoacoustic needs, and perceptual evaluation. - In: Applied Sciences, ISSN 2076-3417, Bd. 11 (2021), 3, 1150, S. 1-20

For a spatial audio reproduction in the context of augmented reality, a position-dynamic binaural synthesis system can be used to synthesize the ear signals for a moving listener. The goal is the fusion of the auditory perception of the virtual audio objects with the real listening environment. Such a system has several components, each of which help to enable a plausible auditory simulation. For each possible position of the listener in the room, a set of binaural room impulse responses (BRIRs) congruent with the expected auditory environment is required to avoid room divergence effects. Adequate and efficient approaches are methods to synthesize new BRIRs using very few measurements of the listening room. The required spatial resolution of the BRIR positions can be estimated by spatial auditory perception thresholds. Retrieving and processing the tracking data of the listener’s head-pose and position as well as convolving BRIRs with an audio signal needs to be done in real-time. This contribution presents work done by the authors including several technical components of such a system in detail. It shows how the single components are affected by psychoacoustics. Furthermore, the paper also discusses the perceptive effect by means of listening tests demonstrating the appropriateness of the approaches.



https://doi.org/10.3390/app11031150
Brandenburg, Karlheinz; Klein, Florian; Neidhardt, Annika; Sloma, Ulrike; Werner, Stephan
Creating auditory illusions with binaural technology. - In: The technology of binaural understanding, (2020), S. 623-663

It is pointed out that beyond reproducing the physically correct sound pressure at the eardrums, more effects play a significant role in the quality of the auditory illusion. In some cases, these can dominate perception and even overcome physical deviations. Perceptual effects like the room-divergence effect, additional visual influences, personalization, pose and position tracking as well as adaptation processes are discussed. These effects are described individually, and the interconnections between them are highlighted. With the results from experiments performed by the authors, the perceptual effects can be quantified. Furthermore, concepts are proposed to optimize reproduction systems with regard to those effects. One example could be a system that adapts to varying listening situations as well as individual listening habits, experience and preference.



Werner, Stephan; Klein, Florian; Müller, Clemens
Evaluation of spatial audio quality of the synthesis of binaural room impulse responses for new object positions. - In: 147th Audio Engineering Society Convention 2019, (2020), S. 972-981

The aim of auditory augmented reality is to create an auditory illusion combining virtual audio objects and scenarios with the perceived real acoustic surrounding. A suitable system like position-dynamic binaural synthesis is needed to minimize perceptual conflicts with the perceived real world. The needed binaural room impulse responses (BRIRs) have to fit the acoustics of the listening room. One approach to minimize the large number of BRIRs for all source-receiver relations is the synthesis of BRIRs using only one measurement in the listening room. The focus of the paper is the evaluation of the spatial audio quality. In most conditions differences in direct-to-reverberant-energy ratio between a reference and the synthesis is below the just noticeable difference. Furthermore, small differences are found for perceived overall difference, distance, and direction perception. Perceived externalization is comparable to the usage of measured BRIRs. Challenges are detected to synthesize more further away sources from a source position that is more close to the listening positions.



Sloma, Ulrike; Klein, Florian; Werner, Stephan; Pappachan Kannookadan, Tyson
Synthesis of binaural room impulse responses for different listening positions considering the source directivity. - In: 147th Audio Engineering Society Convention 2019, (2020), S. 377-385

Brandenburg, Karlheinz; Fiedler, Bernhard; Fischer, Georg; Klein, Florian; Neidhardt, Annika; Schneiderwind, Christian; Sloma, Ulrike; Stirnat, Claudia; Werner, Stephan
Perceptual aspects in spatial audio processing. - In: Proceedings of the 23rd International Congress on Acoustics, (2019), S. 3354-3360

Spatial audio processing includes recording, modification and rendering of multichannel audio. In all these fields there is the choice of either a physical representation or of perceptual approaches trying to achieve a target perceived audio quality. Classical microphone techniques on one hand and wave field synthesis, higher order ambisonics or certain methods of binaural rendering for headphone reproduction on the other hand target a good physical representation of sound. As it is known today, especially in the case of sound reproduction a faithful physical recreation of the sound wave forms ("correct signal at the ear drums") is neither necessary nor does it allow a fully authentic or even plausible reproduction of sound. 20 years ago, MPEG-4 standardized different modes for perception based versus physics based reproduction (called "Perceptual approach to modify natural source" and "Acoustic properties for physical based audio rendering"). In spatial rendering today, more and more the perceptual approach is used in state of the art systems. We give some examples of such rendering. The same distinction of physics based versus psychoacoustics (including cognitive effects) based rendering is used today for room simulation or artificial reverb systems. Perceptual aspects are at the heart of audio signal processing today.



https://edocs.tib.eu/files/e01mr19/1677542403.pdf
Werner, Stephan; Klein, Florian; Götz, Georg
Investigation on spatial auditory perception using non-uniform spatial distribution of binaural room impulse responses. - In: Audio for virtual, augmented and mixed realities, (2019), S. 137-144

For spatial audio reproduction in the context of virtual and augmented reality, a position-dynamic binaural synthesis can be used to reproduce the ear signals for a moving listener. A set of binaural room impulse responses (BRIRs) is required for each possible position of the listener in the room. The required spatial resolution of the BRIR positions can be estimated by spatial auditory perception thresholds. If the resolution is too low, jumps in perception of direction and distance and coloration effects occur. This contribution presents an evaluation of spatial audio quality using different spatial resolutions of the position of the used BRIRs. The evaluation is performed with a moving listener. The test persons evaluate any abnormalities in the spatial audio quality. The result is a comparison of the quality and the spatial resolution of the various conditions used.



https://doi.org/10.22032/dbt.39967
Chilian, Anja; Gadyuchko, Maria; Kátai, András; Klein, Florian; Sattel, Thomas; Skuk, Verena G.; Werner, Stephan
Innovative methods and technologies for spatial listening and speech intelligibility using hearing implants. - In: Adaptive processes in hearing, (2018), S. 343-350

Werner, Stephan;
Über den Einfluss kontextabhängiger Qualitätsparameter auf die Wahrnehmung von Externalität und Hörereignisort. - Ilmenau : Universitätsbibliothek, 2018. - 1 Online-Ressource (x, 211 Seiten)
Technische Universität Ilmenau, Dissertation 2018

Diese Arbeit umfasst meine Forschungstätigkeiten auf dem Gebiet des räumlichen Hörens. Es wird der Frage nachgegangen, welche Effekte verschiedene Kontexteinflüsse auf die Herausbildung einer auditorischen Illusion haben. Diese Effekte werden quantitativ erfasst. Es werden die Wahrnehmung von Externalität, Hörereignisrichtung u. a. bei Verwendung eines Binauralsynthesesystems untersucht. Als Ziel steht die Beschreibung und Messbarmachung der Kontexteinflüsse Raumdivergenz, Personalisierung und visuelle Merkmale. Die eigenen Untersuchungen zeigen deutlich, dass für das Zustandekommen einer sogenannten perfekten auditorischen Illusion eine ausreichende technische Realisierung der korrekten Ohrsignale nicht ausreichend ist. Vielmehr ist eine Erfüllung der intrinsischen Erwartungshaltung des Hörers notwendig. Der Prozess der Herausbildung von Qualität lässt sich durch einen Vergleich und Beurteilungsprozess beschreiben. Hierbei werden die aus den durch die Sinnesorgane aufgenommenen Reizungen abgeleiteten Qualitätsmerkmale mit den abgeleiteten Merkmalen einer inneren, individuellen Referenz verglichen. Dieser Prozess ist ausschlaggebend für die Herausbildung eines Hörereignisses. Mithilfe von qualitätsbewertenden Befragungen (Hörtests) lässt sich dieses Hörereignis als Abbildung messbar und beschreibbar machen. Die Ergebnisse meiner Arbeit zeigen, dass eine Divergenz akustischer Raumparameter zwischen einer binaural synthetisierten Szene und dem Abhörraum zu einer signifikanten Verringerung der Externalisierung von Hörereignissen führt. Eine Konvergenz raumakustischer Parameter erhöht die wahrgenommene Externalisierung. Die Sichtbarkeit von bspw. Lautsprecherattrappen im Abhörraum lässt die Externalität insgesamt ansteigen. Der sogenannte Raumdivergenzeffekt kann dadurch aber nicht aufgelöst werden. Die Personalisierung eines Binauralsynthesesystems zeigt eine Verringerung von Fehllokalisationen und einen Anstieg der Externalisierung. Es wird die Vermutung aufgestellt und untersucht, inwiefern Lokalisierungsfehler und Externalisierung korrelieren. Es wird gezeigt werden, dass eine personalisierte Binauralsynthese den Raumdivergenzeffekt nicht auflösen kann. Die Untersuchungen zum Raumdivergenzeffekt dienen als Grundlage zur Entwicklung von Methoden, welche verschiedene zeit- und energiebasierte akustische Merkmale der verwendeten Raumimpulsantworten verändern. Das Ziel ist die Anpassung einzelner raumakustischer Parameter der Synthese an die Raumakustik des Abhörraumes zur Erzeugung einer plausiblen auditorischen Illusion.



https://www.db-thueringen.de/receive/dbt_mods_00039220
Brandenburg, Karlheinz; Cano, Estefanía; Klein, Florian; Köllmer, Thomas; Lukashevich, Hanna; Neidhardt, Annika; Sloma, Ulrike; Werner, Stephan
Plausible augmentation of auditory scenes using dynamic binaural synthesis for personalized auditory realities. - In: Science, technology, design, and implementation, (2018), S. 258-267

Klein, Florian; Werner, Stephan
The relevance of auditory adaptation effects for the listening experience in virtual acoustic environments. - In: 144th Audio Engineering Society International Convention 2018, (2018), S. 575-582

Sporer, Thomas; Werner, Stephan; Klein, Florian
Adjustment of the Direct-to-Reverberant-Energy-Ratio to reach externalization within a binaural synthesis system. - In: Audio for virtual and augmented reality, ISBN 978-1-5108-4346-2, (2017), S. 120-126

Brandenburg, Karlheinz; Klein, Florian; Neidhardt, Annika; Werner, Stephan
Auditory illusion over headphones revisited. - In: The journal of the Acoustical Society of America, ISSN 1520-8524, Bd. 141 (2017), 5, S. 3997

https://doi.org/10.1121/1.4989160
Werner, Stephan; Götz, Georg; Klein, Florian
Influence of head tracking on the externalization of auditory events at divergence between synthesized and listening room using a binaural headphone system. - In: 142nd Audio Engineering Society International Convention 2017, (2017), S. 17-24

Sloma, Ulrike; Klein, Florian; Helbig, Thomas; Skowronek, Janto; Gadyuchko, Maria; Werner, Stephan; Breitbarth, Andreas; Neidhardt, Annika; Chillian, Antje; Brandenburg, Karlheinz; Raake, Alexander; Notni, Gunther; Sattel, Thomas; Witte, Hartmut; Husar, Peter
GO-LEM - Charakterisierung der auditorischen und auditorisch-visuellen Wahrnehmung des Menschen in Alltagsszenen. - In: Prävention von arbeitsbedingten Gesundheitsgefahren und Erkrankungen, (2017), S. 349-356

Klein, Florian; Werner, Stephan; Mayenfels, Thomas
Influences of training on externalization of binaural synthesis in situations of room divergence. - In: Journal of the Audio Engineering Society, ISSN 0004-7554, Bd. 65 (2017), 3, S. 178-187

https://doi.org/10.17743/jaes.2016.0072
Brandenburg, Karlheinz; Werner, Stephan; Klein, Florian; Sladeczek, Christoph
Auditory illusion through headphones: history, challenges and new solutions. - In: 22nd International Congress on Acoustics (ICA 2016), (2016), S. 3063-3072

Werner, Stephan; Klein, Florian; Mayenfels, Thomas; Brandenburg, Karlheinz
A summary on acoustic room divergence and its effect on externalization of auditory events. - In: QoMEX 2016, ISBN 978-1-5090-0354-9, (2016), insges. 6 S.

This contribution presents a summary of results from perceptual auditory experiments on context dependent quality parameters for virtual acoustic environments. The investigated quality features are influenced by divergence between synthesized scene and listening room and adaptation on congruence or divergence between the rooms. Two experiments are presented. The results from the first experiment show the room divergence effect on spatial auditory perception. A divergence between the listening room and binaurally synthesized room leads to a decrease of perceived externalization while congruence yields an increase. A more comprehensive statistical analysis regarding significance, effect size and visual influences is added to complement the original publication of this data. The second experiment shows this effect as the result of expectations of the listeners and can be shifted by adaptation and training. In the experiments we show, that training to congruent or divergent room combinations can increase or decrease the room divergence effect.



http://dx.doi.org/10.1109/QoMEX.2016.7498973
Werner, Stephan; Böhme, Martina; Mayenfels, Thomas; Klein, Florian
Variance in measured binaural room transfer functions of individuals. - In: Fortschritte der Akustik, ISBN 978-3-939296-10-2, (2016), S. 83-85

Klein, Florian; Werner, Stephan
Auditory adaptation to non-individual HRTF cues in binaural audio reproduction. - In: Journal of the Audio Engineering Society, ISSN 0004-7554, Bd. 64 (2016), 1/2, S. 45-54

http://dx.doi.org/10.17743/jaes.2015.0092
Sporer, Thomas; Liebetrau, Judith; Werner, Stephan; Kepplinger, Sara; Gabb, Timo; Siedler, Theresa
Localization of audio objects in multichannel reproduction systems. - In: The future of audio entertainment technology - cinema, television and the internet, ISBN 978-1-5108-0050-2, (2015), Paper 8-1, Seite 142-150

Klein, Florian; Werner, Stephan
Auditory adaptation in spatial listening tasks. - In: 138th Audio Engineering Society convention 2015, (2015), S. 521-530

Werner, Stephan; Klein, Florian; Brandenburg, Karlheinz
Influence of spatial complexity and room acoustic disparity on perception of quality features using a binaural synthesis system. - In: 2015 Seventh International Workshop on Quality of Multimedia Experience (QoMEX), ISBN 978-1-4799-8959-1, (2014), insges. 6 S.

This contribution presents investigations on the influence of scene complexity and room acoustic disparity on the perception of different quality features using a binaural headphone system. The quality features "spatial presence" and "listener envelopment" are investigated next to "perceived externalization" and "localization" of an auditory event. The test uses three different rooms with distinct room acoustic characteristics and several scenes with different spatial complexity. The work addresses the question if the quality features can profit by the different audio scenes or not. The results show that spatial presence is influenced by spatial complexity while room acoustic disparity influences listener envelopment. Furthermore, externalization and localization are not affected by spatial complexity regardless of the personalization method used for binaural synthesis.



http://dx.doi.org/10.1109/QoMEX.2015.7148131
Klein, Florian; Werner, Stephan
Kontextabhängige Parameter bei der Qualitätsbeurteilung binauraler Kopfhörersysteme. - In: Fortschritte der Akustik, (2015), S. 1599-1601

Werner, Stephan; Rekitt, Martin; Klein, Florian
Distribution of quadrant errors in auditory localization using a binaural headphone system. - In: Fortschritte der Akustik, (2015), S. 1094-1096

The auditory system of humans enables the perception of spatial audio in real and virtual acoustics using monaural and binaural cues. The perception of direction and distance are two prominent quality features to evaluate the quality of experience of spatial audio systems. Inaccuracies in perception can occur if physical quality elements of the synthesis system are not adequate. In this study a binaural synthesis via headphones is used to re-synthesize single sound sources on several discrete positions on a full circle around the listener. An artificial head (KEMAR) and a two channel spherical microphone setup are used to measure binaural room impulse responses in a real environment. A listening test is performed to measure the number of quadrant errors and perceived externalization of the auditory events in a localization task. The distributions of its frequencies depending on the direction of the re-synthesized sound source are investigated. The results show a continuous relationship between the investigated quality features and the direction of the sound source. Furthermore, an analysis of the relationship between quadrant errors and externalization depending on direction and used binaural room impulse responses are presented.



Werner, Stephan; Klein, Florian
Influence of context dependent quality parameters on the perception of externalization and direction of an auditory event. - In: Spatial audio, ISBN 978-1-63439-759-9, (2015), S. 305-312

Next to an adequate technical realization of an audio reproduction system, the context of usage plays a major role if a perfect auditory illusion with immersion and plausibility is aspired. This contribution presents results from perceptual experiments on context dependent quality parameters. A binaural synthesis of an acoustic scene via a personalized headphone system is used. The investigated quality parameters are influenced by divergence between synthesized scene and listening room, visibility of the scene, and personalization of the system. The plausibility of the perceived auditory scene is described by the test persons with the help of the quality features perceived externalization and direction of the auditory event. The analysis shows that there are significant differences in perceived externalization depending on the occurrence of localization errors but also on divergence or congruence between the listening and synthesized room.



Werner, Stephan; Klein, Florian
Context and plausibility in binaural synthesis. - In: Proceedings of the International Conference on Spatial Audio, ICSA 2014 in Erlangen, Germany, ISBN 978-3-9812830-4-4, (2014), S. 189-192

The development of recording and playback systems is motivated by the goal to create a perfect auditory illusion. In the last decades audio systems like ambisonics, wave field synthesis and binaural headphone reproduction gained in importance. Efforts are made to increase the quality of such systems. Despite the years of research and development, there are still many perceptual effects and psychoacoustical factors which are still unknown or not fully understood. This contribution presents a research project which deals with the systematically analysis of perceptual effects regarding to binaural playback. Based on the results of listening tests a measurement model on the perception of binaural synthesized scenes is developed. For this, a set of quality features needs to be determined and evaluated. These quality features and their weightings can be summarized as plausibility of perception. The plausibility of a binaural auralization does not only depend on physical quality elements of the synthesis system, but also depends significantly on contextual factors. During the project several different auditive scenes are created for evaluation. The contextual factors which influence the quality features are varied. Different rooms for playback and recording are considered as well as variations of the complexity of the acoustical scene, the degree of personalization and the type of visual presentation. Dependencies between these factors are expected and are going to be evaluated. This contribution presents first results of contextual factors influencing quality features.



Werner, Stephan; Liebetrau, Judith
Adjustment of direct-to-reverberant-energy-ratio and the just-noticable-difference. - In: Sixth International Workshop on Quality of Multimedia Experience (QoMEX), 2014, ISBN 978-1-4799-6536-6, (2014), insges. 3 S.

The adjustment of the direct-to-reverberant-energy-ratio (DRR) within a binaural synthesis system is assessed in a listening test. The DRR is changed until the assessors perceive congruence between the synthesized and listening room. The results of the experiment are relevant to apply adaption methods on binaural synthesis to increase plausibility involving context dependent quality parameters like room divergences between the synthesized auditory scene and the listening environment. Furthermore, a discussion is aspired about the relationship between justnoticeable- difference in DRR perception and inter-quartiledistances of the adjusted DRRs.



http://dx.doi.org/10.1109/QoMEX.2014.7138310
Klein, Florian; Werner, Stephan
Perspektiven zur Anwendung der Binauralsynthese in der Medienproduktion. - In: Medienproduktion, ISSN 2193-7699, No. V (2014), Seite 12-14

http://zs.thulb.uni-jena.de/receive/jportal_jpvolume_00202970
Brandenburg, Karlheinz; Werner, Stephan
Wohlfühlen mit dem richtigen Ton: aktuelle Forschung und Technik. - In: Akustik in Büro und Objekt, (2013), S. 7-24

Klein, Florian; Werner, Stephan
HRTF adaption under decreased immersive conditions. - In: AIA-DAGA 2013, ISBN 978-3-939296-05-8, (2013), S. 580-582

Werner, Stephan; Sass, Rebecca
Synthesis of binaural room impulse responses. - In: AIA-DAGA 2013, ISBN 978-3-939296-05-8, (2013), S. 572-575

An algorithm is developed which synthesizes binaural room impulse responses (BRIRs) by interpolation between measured data sets. Emphasis is put on the interpolation between distances for different rooms. A linear interpolation in time domain and a time adjustment of the two BRIRs by dynamic time warping prior to interpolation to generate new data sets are used. BRIRs are measured in two rooms for different distances as raw data for the synthesis. For each room an interpolation between data sets of different distances is carried out. The results of the synthesis are evaluated in a listening test. On the basis of perceived distance and externalization the general applicability of the synthesized impulse responses for binaural reproduction is investigated. We can show that there is no significant difference in distance perception between synthesized and measured BRIRs. Furthermore, the test shows that perceived externalization benefits from the amount of reverberation and from congruence between the synthesized and listening room. The positive influence of the congruence between the rooms is minor if the synthesized signals are reverberant but it becomes more dominant if there is less reverberation.



Werner, Stephan; Zabel, Anett
Effects of the Direct-to-Reverberant-Ratio and divergence between synthesized and listening room on perception of virtual acoustic environment. - In: AIA-DAGA 2013, ISBN 978-3-939296-05-8, (2013), S. 566-568

Beyond the correct technical realization of binaural signals and synthesis, perceptional effects depending on congruence and divergence between the synthesized scene and listening situation have a not neglectable influence on perception and plausibility of synthesis. This contribution deals with the perceived externalization and plausibility of virtual acoustic environment at binaural headphone synthesis. Investigations of divergence and congruence between listening room and auralized room are conducted. The Direct-to-Reverberant-energy-Ratio (DRR) as an adjustable parameter of binaural room impulse responses is adapted to evaluate externalization and perceived congruence between the synthesized scene and the listening environment. A clear influence of the listening room on perception of externality is visible in the results of the listening tests.



Werner, Stephan; Liebetrau, Judith
Effects of shaping of binaural room impulse responses on localization. - In: 2013 Fifth International Workshop on Quality of Multimedia Experience (QoMEX), ISBN 978-1-4799-0738-0, (2013), S. 88-93

http://dx.doi.org/10.1109/QoMEX.2013.6603216
Werner, Stephan; Liebetrau, Judith; Sporer, Thomas
Vertical sound source localization influenced by visual stimuli. - In: Signal Processing Research, ISSN 2327-171X, Bd. 2 (2013), 2, S. 29-38

It is well-known that the perception of the position of audio and video stimuli is not independent. In general, video dominates the position if the position offset between audio and video is small. Most previous work focused on natural listening conditions and position offsets between audio and video in the horizontal plane. There is little research concerning offsets in vertical direction and artificial, auralized sound environments. Among different approaches to auralization of spatial audio, the binaural reproduction is especially very interestingas it offers proper perception of direction, distance, and elevation of sound sources at moderate cost. This article addresses the question whether the thresholds of perceptual fusion of audio and video stimuli are the same in binaural reproduction systems and in natural listening conditions. To estimate the influence of audio-visual discrepancy on vertical sound source localization, two experiments have been designed. The test methods were optimized to improve usability and minimize rating errors. Both experiments resulted in psychometric functions of intersensory bias for competing audio and visual stimuli. For binaural reproduction, the obtained results showed an effect of similar magnitude for both the vertical and horizontal plane which is in good agreement with the results obtained from other studies in natural environments.



http://www.db-thueringen.de/servlets/DocumentServlet?id=22564
Jürgens, Frank; Werner, Stephan
Kurven gleicher Lautheit bei binauraler Kopfhörerwiedergabe. - In: Expertise in audio media, ISBN 978-3-9812830-3-7, (2013), S. 630-634

Werner, Stephan; Füg, Simone
Controlled auditory distance perception using binaural headphone reproduction - evaluation via listening tests. - In: Expertise in audio media, ISBN 978-3-9812830-3-7, (2013), S. 622-629

Füg, Simone; Werner, Stephan; Brandenburg, Karlheinz
Gesteuerte Distanzwahrnehmung von Hörereignissen bei binauraler Kopfhörerwiedergabe - Modifizierung gemessener BRIRs. - In: Expertise in audio media, ISBN 978-3-9812830-3-7, (2013), S. 621

Füg, Simone; Werner, Stephan; Brandenburg, Karlheinz
Controlled auditory distance perception using binaural headphone reproduction - algorithms and evaluation. - In: Expertise in audio media, ISBN 978-3-9812830-3-7, (2013), S. 614-620

Klein, Florian; Werner, Stephan; Siegel, André
Influence of base width of a two channel crosstalk canceller on the localization of simulated sound sources. - In: Expertise in audio media, ISBN 978-3-9812830-3-7, (2013), S. 607-613

Werner, Stephan; Sporer, Thomas; Brandenburg, Karlheinz
Untersuchung zur Wahrnehmung der Schalleinfallsrichtung virtueller Schallquellen bei Wiedergabe über Wellenfeldsynthese. - In: Fortschritte der Akustik, (2012), S. 635-636

Werner, Stephan; Liebetrau, Judith; Sporer, Thomas
Audio-visual discrepancy and the influence on vertical sound source localization. - In: Fourth International Workshop on Quality of Multimedia Experience (QoMEX), 2012, ISBN 978-1-4673-0724-6, (2012), S. 133-139

http://dx.doi.org/10.1109/QoMEX.2012.6263855
Sass, Rebecca; Werner, Stephan; Siegel, André
Comparison of recording methods for measurements of individualized head-related transfer functions. - In: Expertise in audiomedia, (2011), S. 721-726

Werner, Stephan; Klein, Florian; Siegel, André
On the influence of visual feedback on vertical sound source localization. - In: Proceedings of ICSA 2011, ISBN 978-3-9812830-2-0, (2011), S. 523-526

Klein, Florian; Werner, Stephan
Individualization of head-related transfer functions. - In: Fortschritte der Akustik, (2011), S. 375-376

Werner, Stephan; Klein, Florian; Klein, Florian *1986-*; Siegel, André
(Abhängigkeit der binauraler Auralisation über Kopfhörer von den Abhörbedingungen) :
Dependencies of binaural auralization via headphones on listening conditions. - In: Fortschritte der Akustik, (2011), S. 665-666

Maggi, Johannes; Siegel, André; Werner, Stephan;
Crosstalk-canceler for stereophonic setup in real rooms :
Übersprechkompensation stereophoner Anordnungen in realen Räumen. - In: Expertise in audiomedia, (2011), S. 727-732

Die meisten derzeitigen binauralen Wiedergabesysteme werden mit Kopfhörern realisiert. Dabei können aber Phänomene wie eine Im-Kopf-Lokalisation und eine Vorne-Hinten-Konfusion auftreten und der Entfernungseindruck kann verloren gehen. Aus diesen Gründen wurde die Wiedergabe mit Lautsprechern in einer Stereoanordnung untersucht. Das Wiedergabesystem wurde als statisch angenommen, d.h. zusätzlich zur Stereoanordnung sind die Abhörposition und die horizontale und vertikale Ausrichtung des Kopfes bekannt. Bei einer normalen Stereowiedergabe kommt es zu einem Übersprechen zwischen dem linken Lautsprecher und rechten Ohr und umgekehrt. Deshalb wurde ein Übersprechkompensator realisiert und im Wiedergabesystem implementiert. Zusätzlich zur Realisierung der Übersprechkompensation werden die Einflüsse des Mess- und Wiedergaberaumes kompensiert. Um sich von früheren Untersuchungen zu unterscheiden, wird die Wiedergabe nicht unter Freifeld- sondern unter Diffusfeldbedingungen durchgeführt.



Neukam, Christian; Siegel, André; Werner, Stephan;
Kompensation nicht optimaler Stereoanordnungen für die Wiedergabe von stereophonen/binauralen Signalen :
Compensation of non-optimal stereo setups for the sound reproduction of stereophonic/binaural signals. - In: Expertise in audiomedia, (2011), S. 715-720

If sound reproductions are carried out by only two loudspeakers, the stereophonic experience is limited to a small area. The illusion of an artificial stereo width collapses outside this area so that the correct localisation of a phantom source isn't possible anymore. Hence it is necessary to locate the errors caused by such a non-optimal stereo setup and thereon to develop a system which compensates these errors. In this paper appropriate technologies to adapt the so called sweet spot to the current listener's position are introduced. The designed system is formed by a 2-D vector field including FIR filter technology for equalizing the loudspeaker's transfer function. Finally, the obtained conclusion is evaluated by an analysis based on the results of an accomplished measurement.



Klein, Florian; Kruspe, Anna; Pfitzer, Martin; Werner, Stephan; Brandenburg, Karlheinz
Interaktiver audiovisueller Demonstrator psychoakustischer Phänomene. - In: Fortschritte der Akustik, (2010), S. 183-184

Werner, Stephan;
Trennung von Nutzsignalen und Rauschsignalen auf Basis von Vesikelfilterung in einem neuronalen auditorischen Modell. - 83 S. : Ilmenau, Techn. Univ., Diplomarbeit, 2008

Nachfolgende Arbeit beschäftigt sich mit der Lokalisation und der Trennung von Schallquellen im Raum. Als Ziel stehen die Lokalisation und die Trennung des Nutzsignals einer definierten Zielschallquelle von im Raum befindlichen Rauschsignalen anderer Schallquellen. Es wird eine Verbesserung der Sprachverständlichkeit für diese Nutzschallquelle angestrebt. Die Lokalisation einzelner Schallquellen wird durch die Gewinnung und Auswertung von interauralen Laufzeitunterschieden in einem binauralem Gehörmodell realisiert. Als Ausgangsbasis dient ein physiologisch inspiriertes monaurales Gehörmodell, welches die Amplituden-Zeit-Darstellung der Schallwellen in eine Frequenz-Zeit-Darstellung der Cochlea, das auditorische Bild, überführt. Dieses Modell wird durch eine binaurale Signalverarbeitungskette ergänzt. Als Eingangssignale der Lokalisationskette werden das Basilarmembranverhalten, die Vesikelkonzentration im synaptischen Spalt der inneren Haarzellen und die durch Houghtransformation erzeugten Darstellungen der internen Verzögerungstrajektorien, verwendet. Das erweiterte Verfahren wird durch Simulationen und Tests unter realen Bedingungen auf seine Tauglichkeit hin untersucht und abgestimmt. Die Verbesserung der Sprachverständlichkeit wird mit Hilfe eines automatischen Spracherkennungssystems basierend auf Hidden Markow Modellen überprüft.