Developing Large Language Model Search Agent for Test Data using Reinforcement Learning

Institut
Lehrstuhl für Nachhaltige Mobile Antriebssysteme (TUM-ED)
Typ
Semesterarbeit / Masterarbeit /
Inhalt
theoretisch /  
Beschreibung

In testing of electrical drives emerge large amount of data. The value of those data other than measurements are always underestimated and overlooked in the mountain of unstructured data. Retrieval-Augmented Generation (RAG) powered by Large Language Models (LLMs) arises as a solution for accessing static knowledge sources with the capability in searching for and integrating data chunks valuable for joint analysis. Nevertheless, the classical RAG framework is suffering from missing important data that are distributed in multiple documents.

To improve the search accuracy in PDF files and measurement data, the RAG needs to be implemented in an agentic framework. The LLM is supposed to be trained to decide the following search actions and the stop of search round. Different Reinforcement Learning optimization policies are supposed to be applied for training of the LLM. The training process will be conducted on open-sourced dataset and the validation of the agentic RAG with the tuned LLM is supposed to be performed on the own specific test dataset.

 

Your tasks are:

  • Literature research and adaptation of RAG projects and datasets
  • Design and implementation of reward functions
  • Training of the LLM
  • Experiment design and validation
Voraussetzungen
  • Experience with Python
  • Basic knowledge of Deep Learning / Reinforcement Learning
  • Experience with Reinforcement Learning is welcome
  • Curiosity, willingness to learn and a good general technical understanding
Möglicher Beginn
sofort
Kontakt
M.Sc. Kai Cui
Raum: 2107.EG.008
Tel.: +49 8928924108
k.cuitum.de
Ausschreibung