Large Language Models for Export Control Compliant Research
- Institut
- Lehrstuhl für Raumfahrtantriebe
- Typ
- Semesterarbeit
- Inhalt
- Beschreibung
One of the greatest yet most underestimated challenges of any engineering project is knowledge management. This is doubly the case in academia, where data is constantly produced by undergraduate students, professors, and everyone in between. Large Language Models (LLMs) have given rise to a suite of research frameworks that help synthesize, interpret, and even critically review the written word as well as numerical data. Tools like Deep Research and CO-STORM allow researchers to cover ground at breakneck speeds, while the collation of data into ranked vector databases allows for existing research to be quickly indexed and searched.
In this new paradigm, researchers working with sensitive or export-controlled research are immediately placed at a major disadvantage - their data cannot leave the country. As a result, the vast majority of existing tools are out of the question, as is working with most commercial partners. The solution to this problem must be local, and better yet tailored to the particular kind of data produced and processed at the Chair.
While open-source frameworks like Ollama and OpenWebUI have made local LLMs increasingly more accessible, they still lack tuning and optimization compared to commercial software. Out of the box, they do not handle content extraction well and are poorly optimized for the hardware and data present at the Chair. Your task would be to propose, implement, and optimize an open-source versatile research and knowledge management platform based on open-source software and models.
- Voraussetzungen
- Interest in and passion for informatics/data science.
- Knowledge of machine learning fundamentals.
- Experience with Linux desirable.
- Möglicher Beginn
- sofort
- Kontakt
-
Tomas Mrazek, M.Eng.
Tel.: +33695349960
tomas.mrazektum.de - Ausschreibung
-