- Towards Constructing HMM Structure for Speech Recognition With Deep Neural Fenonic Baseform Growing. IEEE Access 9, 2021, 39098--39110
- Lightweight End-to-End Speech Enhancement Generative Adversarial Network Using Sinc Convolutions. Applied Sciences 11 (16), 2021, 7564
- Light-Weight Self-Attention Augmented Generative Adversarial Networks for Speech Enhancement. Electronics 10 (13), 2021, 1586
- Adversarial Joint Training with Self-Attention Mechanism for Robust End-to-End Speech Recognition. arXiv preprint arXiv:2104.01471, 2021
- Deep neural fenonic baseform growing: A novel approach to construct HMM topologies for speech recognition. 2020 International Conference on High Performance Computing & Simulation (HPCS), 2021
- A Global Discriminant Joint Training Framework for Robust Speech Recognition. 2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI), IEEE, 2021
- Induced Local Attention for Transformer Models in Speech Recognition. International Conference on Speech and Computer, 2021
- Regularized forward-backward decoder for attention models. International Conference on Speech and Computer, 2021
- Lightweight End-to-End Speech Recognition from Raw Audio Data Using Sinc-Convolutions. 2020
- Regularized Forward-Backward Decoder for Attention Models. 2020
- Review of error correction for PUFs and evaluation on state-of-the-art FPGAs. Journal of Cryptographic Engineering, 2020
- Lightweight End-to-End Speech Recognition from Raw Audio Data Using Sinc-Convolutions. Proc. Interspeech 2020, 2020, 1659--1663
- Frenet Coordinate Based Driving Maneuver Prediction at Roundabouts Using LSTM Networks. In: Computer Science in Cars Symposium. Association for Computing Machinery, 2020
- MP3 Compression to Diminish Adversarial Noise in End-to-End Speech Recognition. Speech and Computer, Springer International Publishing, 2020
- CTC-Segmentation of Large Corpora for German End-to-End Speech Recognition. Speech and Computer, Springer International Publishing, 2020
- Audio Adversarial Examples for Robust Hybrid CTC/Attention Speech Recognition. Speech and Computer, Springer International Publishing, 2020
- Synchronized Forward-Backward Transformer for End-to-End Speech Recognition. Speech and Computer, Springer International Publishing, 2020
- Small-Footprint Keyword Spotting on Raw Audio Data with Sinc-Convolutions. 2019
- Exploring Hybrid CTC/Attention End-to-End Speech Recognition with Gaussian Processes. Proc. 21st International Conference on Speech and Computer SPECOM 2019, Springer, 2019, Lecture Notes in Computer Science, pp. 258-269
- Deep Neural Network Quantizers Outperforming Continuous Speech Recognition Systems. Proc. 21st International Conference on Speech and Computer SPECOM 2019, Springer, 2019, Lecture Notes in Computer Science, pp. 530-539
- Modular PUF Coding Chain with High-Speed Reed-Muller Decoder. International Symposium on Circuits and Systems, ISCAS 2019, 2019, pp. 1-5
Research Areas
• Sequence Classification
• Speech Recognition
Publications
Teaching
• Human-Machine Communication I (Mensch-Maschine-Kommunikation I, winter semester 2019)
Student Theses and Projects
For inquiries about student theses, please include the following documents:
• Current CV
• Transcript of records
• Previous experience in the subject area
• Start date
Open
All advertised theses can be found here.
Completed
2020
• Sinc-Convolutions for End-to-End Speech Recognition (Interdisciplinary Project)
• Shallow Fusion for Attention-based Speech-Recognition (Interdisciplinary Project)
• Investigation of Learnable Filters for Discrete Wavelet Decomposition (Research Internship)
• A Lightweight Deep Learning Model for Speech Command Recognition on Raw Audio Data (Master's Thesis)
• Variational Attention for End-to-End Speech Recognition (Master's Thesis)
2019
• Performance and Robustness of Distilled Neural Networks for Hybrid Speech Recognition (Master's Thesis)
• Self-Attention-basierte Spracherkennung [Self-Attention-Based Speech Recognition] (Scientific Seminar)
• Generative Adversarial Networks for Hybrid Speech Recognition in Pytorch-kaldi (Research Internship)
• Adversarial Training for Robust Speech Recognition in Pytorch-kaldi (Research Internship)
• Adversarial Training for Improving Robustness in Hybrid Speech Recognition (Master's Thesis)
• Speech Recognition using GANs (Bachelor's Thesis)
• Performance and Robustness of Distilled Neural Networks for hybrid Speech Recognition (Master's Thesis)
• Variational Attention for End-to-End Speech Recognition (Master's Thesis)
• A Lightweight Deep Learning Model For Keyword Spotting On Raw Audio Data (Master's Thesis)
• End-to-end Speech Recognition with Attention-based Models for German (Interdisciplinary Project)
• Speech Recognition with Vector Quantized Attention-based Encoders (Interdisciplinary Project)
• Generative Adversarial Networks for hybrid Speech Recognition (Research Internship)
• Adversarial Training for Robust Hybrid Speech Recognition (Research Internship)
• Self-Attention and the Transformer (Scientific Seminar)
2018
• Keyword Detection for Personal Speech-to-Text Assistants (Master's Thesis)
• Evaluation of Recurrent Neural Networks with Connectionist Temporal Classification for End-to-End Approaches to Speech Recognition (Master's Thesis)
• Exploration of Generative Neural Networks for Hybrid Speech Recognition (Master's Thesis)
• Bag-of-Words Classification of Spoken Languages using Vector Quantizers (Master's Thesis)
• Defensive Distillation (Scientific Seminar)
• Speech Recognition using Machine Learning on a GPU Server (Research Internship)
• Adversarial Deep Learning on Speech-To-Text (Interdisciplinary Project)
• A Kaldi Speech Recognition Input Method (Interdisciplinary Project)
• Visualization of Speech Data in the Kaldi Speech Recognition Toolkit (Research Internship)
• Visualization of Attention Activations in the ESPnet Speech Recognition Toolkit (Research Internship)
• Gaussian Process Hyperparameter Optimization (Research Internship)
• Variational Autoencoders and Vector-Quantizing Autoencoders for Speech Data (Research Internship)
2017
• Post-Quantum-secure Asymmetric Encryption with QC-MDPC Codes for mbedTLS (Interdisciplinary Project)
• Implementation and Evaluation of the Post-Quantum Secure GPT Encryption Scheme for Embedded Systems (Master's Thesis)
• Post-Quantum-secure Authentication based on the Learning Parity Problem (Master's Thesis)
• Survey and Hardware Implementation of McEliece-Type Post-quantum Cryptography (Master's Thesis)
2016
• Optimierung eines auf Verkettung basierenden Decodieralgorithmus für RM-Codes [Optimization of a Concatenation-Based Decoding Algorithm for RM Codes] (Bachelor's Thesis)