Description
As the use cases for neural networks become more complex, modern network architectures must grow deeper and more intricate to keep pace, which calls for more efficient training algorithms. Multilevel methods, traditionally used to solve differential equations on a hierarchy of discretizations, offer the potential to reduce this computational effort.
In this talk, we combine both concepts and introduce a multilevel stochastic gradient descent algorithm that accelerates training through a hierarchy of coarser levels. A gradient correction term is required to establish first-order consistency between the levels.
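To fix ideas, one standard way to enforce first-order consistency in multilevel optimization (e.g. in MG/OPT-type schemes; the notation below is ours and only a sketch of the construction discussed in the talk) is to augment the coarse objective with a linear correction term:

\[
  \psi_H(x_H) \;=\; f_H(x_H) + \langle v_H,\, x_H \rangle,
  \qquad
  v_H \;=\; R\,\nabla f_h(x_h^k) - \nabla f_H(R\,x_h^k),
\]

where \(f_h\) and \(f_H\) denote the fine- and coarse-level objectives, \(R\) the restriction operator, and \(x_h^k\) the current fine iterate. By construction, \(\nabla \psi_H(R\,x_h^k) = R\,\nabla f_h(x_h^k)\), i.e. the corrected coarse gradient at the restricted iterate matches the restricted fine gradient.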
We discuss convergence of the method for a deterministic gradient correction, as well as for a stochastic gradient correction under additional assumptions, including a step size regularization and an angle condition.
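As an illustration, an angle condition in this context typically requires the prolongated coarse correction to be a sufficiently good descent direction; a generic form (again in our notation, not necessarily the exact condition used in the talk) is

\[
  -\,\langle \nabla f_h(x_h^k),\, P\,d_H^k \rangle \;\ge\; \theta\, \|\nabla f_h(x_h^k)\|\, \|P\,d_H^k\|
\]

for some fixed \(\theta \in (0,1]\), where \(P\) denotes the prolongation operator and \(d_H^k\) the correction computed on the coarse level; steps violating such a condition are typically discarded or damped.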
To demonstrate the usefulness of our approach, we apply it to residual neural networks for image classification. The image resolution is used to generate data sets of varying complexity, from which we build a hierarchy of neural networks with a decreasing number of variables, together with corresponding prolongation and restriction operators. Numerical results are presented.
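For concreteness, the following PyTorch sketch illustrates how such a hierarchy could be assembled; it is a minimal toy construction under our own assumptions (average pooling as the image restriction, a depth-based network hierarchy, weight copying as the prolongation) and not the implementation presented in the talk. The names restrict_images, SmallResNet, and prolongate are illustrative.

import torch
import torch.nn as nn
import torch.nn.functional as F

def restrict_images(x):
    # Restriction on the data level: halve the image resolution by average pooling.
    return F.avg_pool2d(x, kernel_size=2)

class ResBlock(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.conv = nn.Conv2d(channels, channels, kernel_size=3, padding=1)

    def forward(self, x):
        return x + F.relu(self.conv(x))

class SmallResNet(nn.Module):
    # A toy residual network; the number of blocks controls the number of variables.
    def __init__(self, channels=16, num_blocks=4, num_classes=10):
        super().__init__()
        self.stem = nn.Conv2d(3, channels, kernel_size=3, padding=1)
        self.blocks = nn.ModuleList([ResBlock(channels) for _ in range(num_blocks)])
        self.head = nn.Linear(channels, num_classes)

    def forward(self, x):
        x = F.relu(self.stem(x))
        for block in self.blocks:
            x = block(x)
        x = x.mean(dim=(2, 3))  # global average pooling
        return self.head(x)

@torch.no_grad()
def prolongate(coarse, fine):
    # Prolongation on the parameter level: copy stem and head, and map each
    # coarse block onto two consecutive fine blocks (assumes twice as many fine blocks).
    fine.stem.load_state_dict(coarse.stem.state_dict())
    fine.head.load_state_dict(coarse.head.state_dict())
    for i, fine_block in enumerate(fine.blocks):
        fine_block.load_state_dict(coarse.blocks[i // 2].state_dict())

# Two-level example: restrict 32x32 images to 16x16 and map coarse weights to the fine network.
fine_net = SmallResNet(num_blocks=4)
coarse_net = SmallResNet(num_blocks=2)
x_fine = torch.randn(8, 3, 32, 32)
x_coarse = restrict_images(x_fine)   # shape (8, 3, 16, 16)
prolongate(coarse_net, fine_net)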