Aug 12 – 16, 2024
Von-Melle-Park 8
Europe/Berlin timezone

Adaptive Step Sizes for Preconditioned Stochastic Gradient Descent

Aug 12, 2024, 4:30 PM
30m
Seminarraum 205 (Von-Melle-Park 8)

Minisymposium Contribution MS 01: Optimal Control and Machine Learning

Speaker

Frederik Köhne (Universität Bayreuth)

Description

The choice of the step size (or learning rate) in stochastic optimization algorithms, such as stochastic gradient descent, plays a central role in the training of machine learning models. Both theoretical investigations and empirical analyses emphasize that an optimal step size depends not only on the nonlinearity of the underlying problem but also on the local variance of the search directions. In this presentation, we introduce a novel method that estimates these fundamental quantities and uses the estimates to derive an adaptive step size for stochastic gradient descent. The proposed approach leads to a nearly hyperparameter-free variant of stochastic gradient descent. We provide a theoretical convergence analysis in the special case of stochastic quadratic, strongly convex problems. In addition, we perform numerical experiments on classical image classification tasks. Remarkably, our algorithm exhibits truly problem-adaptive behavior on these problems, which lie outside the scope of the theoretical analysis. Moreover, our framework accommodates a preconditioner, enabling adaptive step sizes for stochastic second-order optimization methods.
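
The abstract does not disclose the estimators used; the following is only a minimal, hypothetical sketch of how an adaptive step size of this general flavor can be computed from mini-batch gradient statistics, here on a stochastic linear least-squares problem. The variance-corrected gradient-norm estimate, the batch curvature term, and the name `estimate_step_size` are illustrative assumptions, not the authors' algorithm.

```python
# Illustrative sketch (not the authors' method): SGD on a strongly convex
# least-squares problem, with the step size estimated per iteration from
# mini-batch gradient statistics, mimicking alpha* = ||grad f||^2 / E[g^T H g].
import numpy as np

rng = np.random.default_rng(0)

# Synthetic problem: f(x) = 1/(2n) * ||A x - b||^2
n, d = 2000, 20
A = rng.normal(size=(n, d))
x_true = rng.normal(size=d)
b = A @ x_true + 0.1 * rng.normal(size=n)


def estimate_step_size(A_batch, g_bar, G):
    """Estimate a locally near-optimal SGD step size from batch statistics."""
    B = G.shape[0]
    # Variance-corrected estimate of ||grad f||^2: subtract the sampling
    # variance of the mini-batch mean gradient (accounts for local variance
    # in the search direction).
    var_term = np.sum((G - g_bar) ** 2) / (B * (B - 1))
    grad_sq_est = max(np.dot(g_bar, g_bar) - var_term, 1e-12)
    # Curvature of the batch loss along g_bar: g^T (A_B^T A_B / B) g.
    curvature = np.dot(A_batch @ g_bar, A_batch @ g_bar) / B
    return grad_sq_est / max(curvature, 1e-12)


x = np.zeros(d)
batch_size = 32
for step in range(500):
    idx = rng.integers(0, n, size=batch_size)
    A_b, b_b = A[idx], b[idx]
    r = A_b @ x - b_b                 # per-sample residuals
    G = A_b * r[:, None]              # per-sample gradients, shape (B, d)
    g_bar = G.mean(axis=0)            # mini-batch search direction
    alpha = estimate_step_size(A_b, g_bar, G)
    x -= alpha * g_bar

print("final loss:", 0.5 * np.mean((A @ x - b) ** 2))
```

In the same spirit, a preconditioner P could presumably be incorporated by replacing g_bar with P g_bar in both the update and the curvature term; this is the kind of extension toward stochastic second-order methods that the abstract alludes to.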

Authors

Prof. Anton Schiela (Universität Bayreuth), Frederik Köhne (Universität Bayreuth), Leonie Kreis (Heidelberg University), Roland Herzog (Heidelberg University)
