Safe Deep Reinforcement Learning based on Bayesian Optimization for Weights-varying Model Predictive Control

Institut
Lehrstuhl für Fahrzeugtechnik
Typ
Semesterarbeit / Masterarbeit /
Inhalt
experimentell / theoretisch /  
Beschreibung

One of the key challenges in autonomous vehicle control is ensuring the vehicle follows a desired trajectory while maintaining stability and safety. The use of Model Predictive Control (MPC) has been widely adopted as an effective method for trajectory following. MPC uses a prediction model of the system to optimize control actions over a finite horizon. However, the performance of MPC is highly dependent on the correct choice of the cost function and its parameters used in the optimization problem.

Determining the optimal cost function parameters to optimize different control objectives is a challenging and time-consuming task that requires hand tuning by skilled MPC experts. Furthermore, changes in the operating conditions make it necessary to repeat the tuning process. Consequently, if a system operates in a changing environment, it is desirable to adapt the controller parameters at runtime.

The goal of this thesis is to develop an automated method for online adaption of the MPC cost function. To improve the MPC performance and ensure robust parametrization, a combined framework of Bayesian Optimization (BO) and Reinforcement Learning (RL) is to be used. BO is an optimization method that searches the optimal values of a function by using prior information and a probabilistic model. In the scope of this thesis, it serves to find a preselection of suitable parametrizations in an offline setting. An RL agent is then trained to select the most beneficial parameters at runtime in order to optimize the driving performance.

 

Tasks to be completed during the thesis:

  • Conduct a state-of-the-art review on the use of Bayesian Optimization in MPC

  • Formulate the problem of optimizing the parameters of the cost function used in the MPC

  • Concept, develop and implement a Bayesian optimization algorithm to tune the MPC parameters

  • Concept, develop and implement a Reinforcement Learning algorithm to select beneficial parameters sets at runtime.

  • Test and evaluate the performance of the proposed approach in simulation and/or experiment

  • Compare the performance of the proposed approach with traditional manual tuning methods

  • Write a comprehensive thesis report documenting the research, methodology, implementation, and results

Voraussetzungen
  • Motivation to familiarize yourself with new topics and to try new ideas
  • Ideally previous theoretical knowledge in Model Predictive Control
  • Ideally previous experience with Python and Git
Verwendete Technologien
Python, C++, Programming, Autonomous Driving, Machine Learning, Reinforcement Learning, Deep Learning, Model Predictive Control, MPC, Bayesian Optimization, Gaussian Process Regression, Parameter Estimation, Heuristics, Bayesian Optimization, Pareto Optimal Sets
Tags
FTM Studienarbeit, FTM AV, FTM Zarrouki, FTM Informatik, FTM Perception
Möglicher Beginn
sofort
Kontakt
Baha Zarrouki, M.Sc.
Raum: MW3527
Tel.: +49 (89) 289 - 10498
baha.zarroukitum.de
Ausschreibung