BaSaMa & HiWi

Developing Large Language Model Search Agent for Test Data using Reinforcement Learning

Institut

Lehrstuhl für Nachhaltige Mobile Antriebssysteme (TUM-ED)

Typ

Semesterarbeit / Masterarbeit /

Inhalt

theoretisch /

Beschreibung

In testing of electrical drives emerge large amount of data. The value of those data other than measurements are always underestimated and overlooked in the mountain of unstructured data. Retrieval-Augmented Generation (RAG) powered by Large Language Models (LLMs) arises as a solution for accessing static knowledge sources with the capability in searching for and integrating data chunks valuable for joint analysis. Nevertheless, the classical RAG framework is suffering from missing important data that are distributed in multiple documents.

To improve the search accuracy in PDF files and measurement data, the RAG needs to be implemented in an agentic framework. The LLM is supposed to be trained to decide the following search actions and the stop of search round. Different Reinforcement Learning optimization policies are supposed to be applied for training of the LLM. The training process will be conducted on open-sourced dataset and the validation of the agentic RAG with the tuned LLM is supposed to be performed on the own specific test dataset.

Your tasks are:

Literature research and adaptation of RAG projects and datasets
Design and implementation of reward functions
Training of the LLM
Experiment design and validation

Voraussetzungen

Experience with Python
Basic knowledge of Deep Learning / Reinforcement Learning
Experience with Reinforcement Learning is welcome
Curiosity, willingness to learn and a good general technical understanding

Möglicher Beginn

sofort

Kontakt

M.Sc. Kai Cui
Raum: 2107.EG.008
Tel.: +49 8928924108
k.cuitum.de

Ausschreibung

Navigation

Navigation

Developing Large Language Model Search Agent for Test Data using Reinforcement Learning