BaSaMa & HiWi

[MA with Bosch R&D] Combining Imitation & Reinforcement Learning to Solve Automated Driving

Institut

Professur für autonome Fahrzeugsysteme (TUM-ED)

Typ

Masterarbeit /

Inhalt

experimentell / theoretisch /

Beschreibung

Description

Imitation Learning (IL) and Reinforcement Learning (RL) each come with distinct strengths and weaknesses. IL is typically sample-efficient and straightforward to implement, but it requires large amounts of expert data and often suffers from distributional shift. In contrast, RL does not rely on expert demonstrations and can learn robust policies through interaction, but it faces challenges such as unstable training dynamics and the difficulty of designing appropriate reward functions. IL has since long been applied to the problem of autonomus driving. Since recently, also RL is getting more traction, among others due to the availability of extremely fast simulators and large-scale compute. Goal of this thesis is to investigate the emergent topic of combining both approaches.

During your thesis, you will conduct in-depth literature research and selectively implement existing approaches in the field of reinforcement and imitation learning.
Furthermore, you will extend established methods and ideas to create innovative solutions that go beyond mere reimplementation.
In addition, you will explore the generation of multi-step models using reinforcement learning to optimize algorithms for use in vehicles.
You will combine reinforcement learning, particularly single-step models, with imitation learning to generate precise trajectories for autonomous driving functions.
Lastly, you will carefully document your research findings and prepare them for scientific publication.

Qualifications

Education: Master studies in the field of Natural Sciences, Computer Science with a focus on AI, or comparable, with a very good GPA
Experience and Knowledge: proficient in Python and PyTorch; solid knowledge gained from lectures in Artificial Intelligence, particularly Autonomous Driving, Imitation Learning, and Reinforcement Learning; initial practical experience, ideally through internships, in these areas is preferred; initial scientific publications are advantageous
Personality and Working Practice: you are able to methodically analyze complex scientific questions and independently develop innovative solutions, while consistently seeking scientific exchange as well as communicating your results precisely
Work Routine: office attendance required
Languages: fluent in English

Additional Information

Start: according to prior agreement
Duration: 6 months
Place: Robert-Bosch-Campus 1, 71272 Renningen, Germany

If you are interested, please send a short motivation letter, CV, and transcript of records to: yuan_avs.gao@tum.de

Tags

AVS Gao

Möglicher Beginn

sofort

Kontakt

Yuan Gao
yuan_avs.gaotum.de

Navigation

Navigation

[MA with Bosch R&D] Combining Imitation & Reinforcement Learning to Solve Automated Driving