18 September 2025
Universität Potsdam am Campus III - Griebnitzsee
Europe/Berlin timezone

Improving Accessibility and Reproducibility by Guiding Large Language Models

18 Sept 2025, 11:20
20m
Raum 26 (Universität Potsdam am Campus III - Griebnitzsee)

Raum 26

Universität Potsdam am Campus III - Griebnitzsee

Am Neuen Palais 10 14469 Potsdam

Speaker

Florian Marwitz (Universität Hamburg)

Description

Research data repositories store numerous entries of research data, to among other advantages one goal is allowing to store us all data to reproduce experiments.
Working with large corpora of texts is made significantly easier with Large Language Models.
However, Large Language Models are trained for general purposes and are note finetuned for the data originating from different kinds of projects.
But the creators of such texts have an expert viewpoint on the data.
Therefore, we propose to leverage the expert viewpoints of creators to obtain better answers from a Large Language Model.
When creating an entry for the Research Data Repository, the creators have the possibility to add a so-called interpretation prompt.
The interpretation prompt contains their expert viewpoint and be of any textual form to guide the Large Language Model to interpret the project-specific data.
In particular, the interpretation prompt may contain instructions on how to reproduce experiments right inside the LLM invocation.
Afterward, the interpretation prompt is prepended to the query of the Large Language Model.
In our examples, we show how the interpretation prompt helps to receive more tailored answers.

Authors

Florian Marwitz (Universität Hamburg) Marcel Gehrke (Universität Hamburg)

Presentation materials

There are no materials yet.