Conveners
Poster Session
- Katrin Schöning-Stierand (Hub of Computing and Data Science (HCDS))
Recent developments in open data policies of meteorological agencies have much expanded the set of up-to-date weather observation and forecast data that is publicly available to meteorological research and education. To improve use of this open data, we have developed 3-D visualization products that extract and display meteorological information in novel ways. In this demo, we present...
The poster presents findings from the DFG project InterSaME (2020–2023) focused on vowel-dots in early Qur’anic manuscripts. It highlights a vector-based transcription system as there is no current encoding or transcription tool for vowel-dots. Using a customised Archetype software instance, the team developed a pointer-based encoding method that describes each dot's position relative to...
The multidisciplinary nature of manuscript study at the CSMC results in an ever-increasing volume of digital data in various modalities, ranging from raw images of artefacts to automatically generated data from advanced acquisition techniques. The manual analysis of this data is typically time-consuming and susceptible to human error and bias. Therefore, a set of Pattern Analysis Software...
bAIome is the center for biomedical AI at University Medical Center Hamburg-Eppendorf (UKE). The center consists of faculty and staff from various institutes within the UKE who are engaged in research and education in broad areas of biomedical AI. bAIome serves as a competence center bundling knowledge, expertise, & resources to provide a portfolio of services to help students, researchers and...
Prostate cancer relapse prediction is a challenging task within computational pathology as tissue preparation and digitization are not standardized. The different protocols lead to domain shifts, against which a deep learning model must be robust and focus on biological information rather than variations in appearance. We address this challenge through the usage of vision foundation models...
High-throughput scientific experiments generate massive data streams requiring near real-time processing for time-critical decision making. However, developing robust streaming workflows presents significant challenges in distributed computing environments.
We present AsapoWorker [1], a Python library that simplifies the development of processing workers on top of the Asapo [2] streaming...
In this paper, we explore using multi-modal agents based on Large-Vision-Language-Models (LVLMs) what a scholarly collections portal can be beyond a digital showcase of the university’s collections . We focus on the interactive exploration of scientific collections. Collection data is valued differently from different perspectives. For the university administrators, it is an item to be...
Processing large amount of data in near real-time during experiments at synchrotrons is enabling scientists to make the best use of limited beamtime [1]. However, building systems capable of handling data rates of several gigabytes per second over long periods of time requires specialized expertise in distributed computing [2], which limits the broader adoption of such systems at...
The poster to be presented addresses the problem of incorporating a steadily growing number of research software applications into an existing RDM infrastructure as well as transferring their diverse outputs to the existing storage systems using interface definitions. A subprocess in the general RDM infrastructure is proposed integrating a new software component, the data transfer facilitator...
Social media increasingly fuel extremism and disinformation, especially in the right-wing agenda, and enable the rapid spread of antidemocratic narratives. Although there is plenty of research being done in the socio/political fields against these phenomena, there is a considerable gap between it and putting policy into practice. Our conjoined software engineer project called KI4Demo supports...
Research groups in the humanities generate a substantial number of publications, contributing to an ever-expanding body of scholarly work. When a scholar is interested in the topics covered or has specific questions about (subsets of) publications, they must overcome the big number of publications to read. We demonstrate the use of language models in the humanities by showcasing two...
Large language models (LLMs) bear great potential for automating tedious development tasks, like creating and maintaining source code documentation. We assist software developers of European XFEL (EuXFEL) with LLM-powered tools that facilitate knowledge and documentation management. We present findings from two controlled experiments conducted with EuXFEL’s Data department, focusing on...
Scholars in the humanities working with datasets face two challenges: Discovering relevant datasets and publishing their own dataset after their research is completed. We propose a new filetype, namely CSMC (Computer Science Metadata Container), to bundle the raw research data alongside a visualization of the data. Scholars can view the visualization of a dataset before downloading the whole...
The consent management platform, Conseydo, developed in the Flutter framework and funded by the funding program Calls4Transfer, uses a privacy by design approach to enable the GDPR-compliant digital creation, documentation, management and tracking of consent for research, for example within the stakeholder triad of teachers, parents and researchers. The plattform solves organizational...
Our poster presents Protokolibri, a distributed application for logging the browsing behavior of large groups of students on iPads. The developed browser plugin records tab events via Javascript and sends them asynchronously to the Protokolibri node.js server, which stores the data sorted by device name and timestamp.
The focus of the tool is on simplifying data collection. Previously,...
This paper presents UHH’s approach developed for the AVeriTeC shared task. The goal of the challenge is to verify given real-world claims with evidences from the Web. In this shared task, we investigate a Retrieval-Augmented Generation (RAG) model, which mainly contains retrieval, generation, and augmentation components. We start with the selection of the top 10k evidences via BM25 scores, and...
This project presents a browsable digital exploration environment for a multilingual private guestbook from 20th-century Jerusalem. The goal is to investigate curiosity-driven browsing strategies in archival contexts, going beyond systematic searches. By providing intuitive, user-friendly visualization solutions, the project aims to facilitate an exploratory approach and increase serendipitous...
Contemporary earth system models (ESM) perform simulations at kilometer scale resolution at various HPC centers. The data from these simulations aid in research and policy making. Hence the design of the data access system for a federated setup should consider the data, analysis tools and computing resources at each center. Also for efficient discoverability, the data management at each center...