Publikationen im Fachgebiet

Nachstehend finden Sie eine automatisierte Zusammenstellung der Veröffentlichungen des Fachgebietes. Die Veröffentlichungen der einzelnen Mitarbeiter:innen finden Sie auf deren persönlichen Seiten.

Publikationsliste

Anzahl der Treffer: 286
Erstellt: Thu, 16 May 2024 23:03:40 +0200 in 0.0983 sec


Johnson, David S.; Grollmisch, Sascha
Techniques improving the robustness of deep learning models for industrial sound analysis. - In: 28th European Signal Processing Conference (EUSIPCO 2020), (2020), S. 81-85

The field of Industrial Sound Analysis (ISA) aims to automatically identify faults in production machinery or manufactured goods by analyzing audio signals. Publications in this field have shown that the surface condition of metal balls and different types of bulk materials (screws, nuts, etc.) sliding down a tube can be classified with a high accuracy using audio signals and deep neural networks. However, these systems suffer from domain shift, or dataset bias, due to minor changes in the recording setup which may easily happen in real-world production lines. This paper aims at finding methods to increase robustness of existing detection systems to domain shift, ideally without the need to record new data or retrain the models. Through five experiments, we implement a convolutional neural network (CNN) for two publicly available ISA datasets and evaluate transfer learning, data normalization and data augmentation as approaches to deal with domain shift. Our results show that while supervised methods with additional labeled data are the best approach, an unsupervised method that implements data augmentation with adaptive normalization is able to improve the performance by a large margin without the need of retraining neural networks.



https://doi.org/10.23919/Eusipco47968.2020.9287327
Brandenburg, Karlheinz; Klein, Florian; Neidhardt, Annika; Sloma, Ulrike; Werner, Stephan
Creating auditory illusions with binaural technology. - In: The technology of binaural understanding, (2020), S. 623-663

It is pointed out that beyond reproducing the physically correct sound pressure at the eardrums, more effects play a significant role in the quality of the auditory illusion. In some cases, these can dominate perception and even overcome physical deviations. Perceptual effects like the room-divergence effect, additional visual influences, personalization, pose and position tracking as well as adaptation processes are discussed. These effects are described individually, and the interconnections between them are highlighted. With the results from experiments performed by the authors, the perceptual effects can be quantified. Furthermore, concepts are proposed to optimize reproduction systems with regard to those effects. One example could be a system that adapts to varying listening situations as well as individual listening habits, experience and preference.



Grollmisch, Sascha; Johnson, David; Liebetrau, Judith
Visualizing neural network decisions for industrial sound analysis. - In: SMSI 2020, (2020), S. 267-268

Grollmisch, Sascha; Johnson, David; Krüger, Tobias; Liebetrau, Judith
Plastic material classification using neural network based audio signal analysis. - In: SMSI 2020, (2020), S. 337-338

Werner, Stephan; Klein, Florian; Müller, Clemens
Evaluation of spatial audio quality of the synthesis of binaural room impulse responses for new object positions. - In: 147th Audio Engineering Society Convention 2019, (2020), S. 972-981

The aim of auditory augmented reality is to create an auditory illusion combining virtual audio objects and scenarios with the perceived real acoustic surrounding. A suitable system like position-dynamic binaural synthesis is needed to minimize perceptual conflicts with the perceived real world. The needed binaural room impulse responses (BRIRs) have to fit the acoustics of the listening room. One approach to minimize the large number of BRIRs for all source-receiver relations is the synthesis of BRIRs using only one measurement in the listening room. The focus of the paper is the evaluation of the spatial audio quality. In most conditions differences in direct-to-reverberant-energy ratio between a reference and the synthesis is below the just noticeable difference. Furthermore, small differences are found for perceived overall difference, distance, and direction perception. Perceived externalization is comparable to the usage of measured BRIRs. Challenges are detected to synthesize more further away sources from a source position that is more close to the listening positions.



Sloma, Ulrike; Klein, Florian; Werner, Stephan; Pappachan Kannookadan, Tyson
Synthesis of binaural room impulse responses for different listening positions considering the source directivity. - In: 147th Audio Engineering Society Convention 2019, (2020), S. 377-385

Lenzen, Lucien; Hedtke, Rolf; Christmann, Mike
HDR in consideration of the abilities of the human visual system. - In: SMPTE motion imaging journal, ISSN 2160-2492, Bd. 128 (2019), 5, S. 40-45

In recent years, high dynamic range (HDR) has been improved enormously. The capability of cameras and displays to reproduce small differences in luminance levels is constantly growing. However, we are still dealing with a limitation of the human visual system (HVS) known as the simultaneous contrast range (SCR). Compared to earlier studies, this paper focuses on real-world scenarios for evaluating the SCR. In natural images, bright highlights, especially in HDR, can limit the eyes' sensitivity to small differences in surrounding dark areas. This paper describes a test-image set developed as part of current research activities by the authors to measure the relation between the perceived SCR and the following four significant parameters: the distance, or rather, the viewing angle; the size of the bright highlight; the luminance of the highlight; and the ambient light. As a result, a mathematical formula is given that can help to evaluate and improve HDR viewing experiences as well as standard dynamic range downconversions.



https://doi.org/10.5594/JMI.2019.2907350
Nowak, Johannes; Fischer, Georg
Modeling the perception of system errors in spherical microphone array auralizations. - In: Journal of the Audio Engineering Society, ISSN 0004-7554, Bd. 67 (2019), 12, S. 994-1002

https://doi.org/10.17743/jaes.2019.0051
Neidhardt, Annika; Schneiderwind, Christian
Physical and perceptual differences of selected approaches to realize an echolocation scenario in room acoustical auralizations. - In: Proceedings of the International Symposium on Room Acoustics, (2019), S. 237

http://doi.org/10.18154/RWTH-CONV-240146
Schneiderwind, Christian; Neidhardt, Annika
Perceptual differences of position dependent room acoustics in a small conference room. - In: Proceedings of the International Symposium on Room Acoustics, (2019), S. 499-506

http://doi.org/10.18154/RWTH-CONV-240138