Aug 12 – 16, 2024
Von-Melle-Park 8
Europe/Berlin timezone

On the continuity of the value function in reinforcement learning and optimal control

Aug 13, 2024, 11:00 AM
30m
Seminarraum 205 (Von-Melle-Park 8)

Minisymposium Contribution, MS 01: Optimal Control and Machine Learning

Speaker

Hans Harder (University of Paderborn)

Description

In both reinforcement learning and optimal control, the value function plays a crucial role as a measure of the cumulative future reward an agent receives. It is therefore of interest to study how similar the values of neighboring states are, i.e., to investigate the continuity of the value function. We do so by providing and verifying upper bounds on the value function's modulus of continuity. Additionally, we show that the value function is always Hölder continuous under relatively weak assumptions on the underlying system.
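
To make the notions of modulus of continuity and Hölder continuity concrete, here is a minimal sketch on a toy problem. It is not the speakers' construction or bound. Everything in it is an assumption chosen for illustration: the uncontrolled dynamics `step(x) = min(L * x, 1)` on [0, 1], the reward r(x) = x, the discount factor `GAMMA = 0.6`, and the truncation horizon. For this toy system with `GAMMA * L > 1`, an elementary estimate gives |V(x) - V(y)| <= C |x - y|^ALPHA with ALPHA = log(1/GAMMA) / log(L); the code compares an empirical modulus of continuity on a grid against that bound.

```python
import math

GAMMA = 0.6     # discount factor (assumed for illustration)
L = 2.0         # Lipschitz constant of the toy dynamics (assumed)
HORIZON = 200   # truncation horizon; GAMMA**HORIZON is negligible
N = 400         # number of grid cells on [0, 1]

def step(x):
    """Toy dynamics on [0, 1]: expand by the factor L, saturate at 1."""
    return min(L * x, 1.0)

def value(x):
    """Truncated discounted return for the reward r(x) = x."""
    total, discount = 0.0, 1.0
    for _ in range(HORIZON):
        total += discount * x
        discount *= GAMMA
        x = step(x)
    return total

def empirical_modulus(values, k):
    """Largest |V(x_i) - V(x_j)| over grid points at most k cells apart."""
    n = len(values)
    return max(
        abs(values[i] - values[j])
        for i in range(n)
        for j in range(i + 1, min(i + k, n - 1) + 1)
    )

# Hoelder exponent and constant for this toy system (valid when GAMMA * L > 1).
ALPHA = math.log(1.0 / GAMMA) / math.log(L)
C = L / (GAMMA * L - 1.0) + 1.0 / (1.0 - GAMMA)

grid_values = [value(i / N) for i in range(N + 1)]
ks = [1, 2, 4, 8, 16, 32, 64]
deltas = [k / N for k in ks]
omegas = [empirical_modulus(grid_values, k) for k in ks]

for d, w in zip(deltas, omegas):
    # Empirical modulus next to the Hoelder bound C * delta**ALPHA.
    print(f"delta={d:.4f}  omega={w:.4f}  bound={C * d ** ALPHA:.4f}")
```

Because `GAMMA * L > 1` here, nearby trajectories separate faster than the discount suppresses them, so this toy value function is not Lipschitz near 0 but only Hölder continuous with exponent `ALPHA` < 1. Choosing `GAMMA * L < 1` instead would make it Lipschitz, and the constant `C` above would no longer apply.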

Authors

Hans Harder (University of Paderborn)

Dr Sebastian Peitz (University of Paderborn)
