Google Suche
Prof. Dr.-Ing. Gerald Schuller
Fachgebietsleiter
+49 3677 69-2756 | Fax: +49 3677 69-2888
E-Mail senden
Kirchhoffbau, Raum K 3014
Sprechstunde nach Vereinbarung
Sekretariat
Marina Bondarev
Kirchhoffbau, Raum 3007
49 3677 69-2890 | Fax: +49 3677 69-2888
E-Mail senden
Besucheradresse
Gustav-Kirchhoff-Straße 1, Raum K 3007 (3. Etage)
98693 Ilmenau
Postanschrift
Technische Universität Ilmenau
Fakultät für Elektrotechnik und Informationstechnik
Fachgebiet Angewandte Mediensysteme
PF 100565
98684 Ilmenau
The goal of this Masters project is to design and evaluate a predictor using machine learning (DNN, CNN). This predictor should as precisely as possible predict for an given discrete audio signal the subsequent audio-sample or block of audio-samples. The predictor should be evaluated in a lossless audio-coder scheme where it is expected that the new predictor outperforms state-of-the art designs. Furthermore, using a feedback loop, it should be investigated which artificial audio signals the predictor can produce. This is also known as a "Deep Generative Model".
It is part of the project to chose a proper training set and evaluate the predictor on a test-set. The choice of the neural network architecture and its dimensions and the comparison of different designs is the crucial point. Programming language is Python with its neural network library Pytorch. Starting point are articles about the so-called "WaveNet" by Google, and "FFTNet".
Prof. Gerald Schuller (TU Ilmenau), Dr.-Ing. Sascha Disch, Dipl.-Math. Andreas Niedermeier (Fraunhofer IIS)
Backback