Speaker
Hans Harder
(University of Paderborn)
Description
The value function plays a crucial role as a measure of the cumulative future reward an agent receives, both in reinforcement learning and in optimal control. It is therefore of interest to study how similar the values of neighboring states are, i.e., to investigate the continuity of the value function. We do so by providing and verifying upper bounds on the value function's modulus of continuity. Additionally, we show that the value function is always Hölder continuous under relatively weak assumptions on the underlying system.
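For reference, the two continuity notions mentioned in the abstract can be stated as follows. These are the standard textbook definitions in generic notation (a value function $V$, a state-space metric $d$, constants $C$ and $\alpha$); the talk's own notation and assumptions may differ.

```latex
% V admits a modulus of continuity \omega if, for all states x, y,
%   |V(x) - V(y)| <= \omega(d(x, y)),
% where \omega : [0, \infty) \to [0, \infty) is nondecreasing with
% \omega(r) \to 0 as r \to 0^+.
\[
  |V(x) - V(y)| \le \omega\bigl(d(x, y)\bigr)
  \quad \text{for all states } x, y.
\]
% Hölder continuity is the special case \omega(r) = C r^{\alpha}:
\[
  |V(x) - V(y)| \le C\, d(x, y)^{\alpha}
  \quad \text{for some } C > 0,\ \alpha \in (0, 1].
\]
```

With $\alpha = 1$ this reduces to Lipschitz continuity, so Hölder continuity is the weaker, more broadly attainable guarantee.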
Authors
Hans Harder
(University of Paderborn)
Dr.
Sebastian Peitz
(University of Paderborn)