Studentische Arbeiten

Am Lehrstuhl MMK sind ständig Themen für studentische Arbeiten (Bachelor- und Masterarbeiten, Forschungspraxis, IDP) zu vergeben.

Wenn Sie ein passendes Thema für Ihre studentische Arbeit gefunden haben wenden Sie sich an den zuständigen Assistenten.

Ingenieurpraxis: Das Ziel der Ingenieurpraxis ist einen Einblick in die Abläufe in der Industrie zu erhalten. Daher bieten wir keine Ingenieurspraxis bei uns an, betreuen Sie aber gerne, wenn Sie eine Stelle in einer Firma finden.

Ebenso bieten wir keine Praktikumsstellen am Lehrstuhl an! Eingehende Anfragen werden aufgrund der Menge nicht beantwortet.

Termine der aktuellen Abschlussvorträge von studentischen Arbeiten am Lehrstuhl MMK.

Themen für studentische Arbeiten

Sachgebiet: Speech Recognition

Deep Neural Networks for Speech Recognition

Thema Deep Neural Networks for Speech Recognition
Typ Forschungspraxis, IDP, Master
Betreuer Ludwig Kürzinger, Dipl.-Ing. (Univ.)
Tel.: +49 (0)89 289-28562
E-Mail: ludwig.kuerzinger@tum.de
Sachgebiet Speech Recognition
Beschreibung Motivation:
Speech Recognition enables a machine to understand human voice and convert it to text. Conventional speech recognition systems are based on a combination of neural networks and hidden markov models. With the advent of deep learning and increasing computational power, deep neural networks are able to achieve the performance of the traditional systems, but do not require complex feature crafting at the same time.
Your work will be about key concepts of deep neural nets which are not yet fully understood. For example attention [1], inspired by the human ability to concentrate on important information, is a simple but powerful technique that can directly transform any audio signal directly into a sequence of characters.

Task Description:
The main task will be about applying or examining neural networks for speech recognition. The topic can be conducted in English or German. For more information about the topic, please contact the supervisor.

References:
[1] Vaswani, Ashish, et al. Attention is all you need., 2017.
[2] Graves, Alex, et al. Connectionist temporal classification, 2006
Voraussetzung - Experience with Python and/or C++
- Experience with machine learning
- Independent work style
- Motivation to learn new concepts
Bewerbung If you are interested in this topic, we welcome the applications via the email address above. Please set the email subject to " application for topic 'XYZ'", ex. "Master’s thesis application for topic 'XYZ'", while clearly specifying why are you interested in the topic in the text of the message. Also make sure to attach your most recent CV (if you have one) and grade report.

Sachgebiet: Computer Vision

Distracted Driver Dataset

Thema Distracted Driver Dataset
Typ Master
Betreuer Okan Köpüklü, M.Sc.
Tel.: +49 (0)89 289-28554
E-Mail: okan.kopuklu@tum.de
Sachgebiet Computer Vision
Beschreibung Motivation: According to the last National Highway Traffic Safety Administration (NHTSA) report, one in ten fatal crashes and two in ten injury crashes were reported as distracted driver crashes in the United State in 2014. Therefore detecting the drivers distraction state is utmost important to reduce driver-related accidents. For this task, properly annotated dataset for drivers actions observation is necessary. With such a dataset, state-of-the art Deep Learning Architectures can be used to recognize the distraction state of the drivers.

Task: The main task is to collect a “Distracted Driver Dataset”, and use a light-weight Convolutional Neural Networks (CNN) architecture in order to detect driver’s distractive actions. The dataset should contain the following annotations:
1. Predefined distractive actions that the drivers do
2. Drivers hand states (whether they are on the wheel or not)

During the thesis, the following steps will be followed in general:
1. State-of-the-art research
2. Dataset collection and preparation (i.e. labeling and formating)
3. Light-weight CNN Architecture design
4. Evaluation of the CNN Architecture on the prepared dataset
5. Demonstration of the working system

References:
[1] Baheti, B., Gajre, S., & Talbar, S. (2018). Detection of Distracted Driver using Convolutional Neural Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (pp. 1032-1038).
[2] Hssayeni, M. D., Saxena, S., Ptucha, R., & Savakis, A. (2017). Distracted driver detection: Deep learning vs handcrafted features. Electronic Imaging, 2017(10), 20-26.
[3] G. Borghi, E. Frigieri, R. Vezzani and R. Cucchiara, "Hands on the wheel: A Dataset for Driver Hand Detection and Tracking," 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), Xi'an, 2018, pp. 564-570.
Voraussetzung 1. Excellent coding skills, preferable in Python
2. Experience in deep learning frameworks, preferably in Torch/PyTorch
3. Motivation to work on deep learning.
Bewerbung If you are interested in this topic, we welcome the applications via the email address above. Please set the email subject to " application for topic 'XYZ'", ex. "Master’s thesis application for topic 'XYZ'", while clearly specifying why are you interested in the topic in the text of the message. Also make sure to attach your most recent CV (if you have one) and grade report.

Real-time Detection and classification of Dynamic Hand Gestures

Thema Real-time Detection and classification of Dynamic Hand Gestures
Typ Forschungspraxis, Masterarbeit
Betreuer Okan Köpüklü, M.Sc.
Tel.: +49 (0)89 289-28554
E-Mail: okan.kupuklu@tum.de
Sachgebiet Computer Vision
Beschreibung Motivation : Detection and classification of dynamic hand gestures is a challenging task since there is no indication when an action starts in a video stream. However, most of the deep learning architectures which are working offline can also function online with proper adjustments. The topic of this thesis is convert an offline-working architecture to an online-working one.
Task : The main task is to bring an already working deep architecture, which can be seen below, to online functionality. Details of the architecture can be found in [1].
As a further reading, [2] also provides a detailed online detection architecture.


References :
[1] O. Köpüklü, N. Köse, and G. Rigoll. Motion fused frames: Data level fusion strategy for hand gesture recognition. arXiv preprint, arXiv:1804.07187, 2018.
[2] P. Molchanov, X. Yang, S. Gupta, K. Kim, S. Tyree, and J. Kautz. Online detection and classification of dynamic hand gestures with recurrent 3d convolutional neural network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 4207–4215, 2016.
Voraussetzung 1. Excellent coding skills in Python,
2. Experience in deep learning frameworks, preferably in Torch/PyTorch.
3. Motivation to work on deep learning.
Bewerbung If you are interested in this topic, we welcome the applications via the email address above. Please set the email subject to “ application for topic 'XYZ'”, ex. “Master’s thesis application for topic 'XYZ'”, while clearly specifying why are you interested in the topic in the text of the message. Also make sure to attach your most recent CV (if you have one) and grade report.

CNN Application to Video Saliency

Thema CNN Application to Video Saliency
Typ Master, Forschungspraxis, Bachelor, Ing.prax.
Betreuer Mikhail Startsev
Tel.: +49 (0)89 289-28550
E-Mail: mikhail.startsev@tum.de
Sachgebiet Computer Vision
Beschreibung One of the important questions in computer vision is how you determine what information in a scene (represented by an image or a video) is relevant. So-called “saliency models” [1] have been used to predict informativeness in images. However for videos the ways of incorporating the temporal component of the series of frames into an attention prediction model range from being extremely computationally intensive (ex. deep neural networks using 3D convolution operators) to the ones using hand-crafted approaches (ex. the use of optical flow or using two subsequent frames as input).

In order to avoid or reduce the “hand-engineered” aspect of the features in use, different modifications of traditional 2D CNNs can be employed. The deep learning methods have already proven their worth in the image saliency task [2] and some results related to videos are starting to appear as well. In this project the candidate will work with various CNN models that work with video data in order to compare their performance. Depending on the progress, learning several models from scratch on pre-recorded data can be beneficial.

[1] https://en.wikipedia.org/wiki/Salience_(neuroscience)#Visual_saliency_modeling
[2] http://saliency.mit.edu/results_mit300.html
Voraussetzung Understanding of machine learning concepts and solid programming skills are desirable.
Bewerbung If you are interested in this topic, we welcome the applications via the email address above. Please set the email subject to “ application for topic 'XYZ'”, ex. “Master’s thesis application for topic 'XYZ'”, while clearly specifying why are you interested in the topic in the text of the message. Also make sure to attach your most recent CV (if you have one) and grade report.