Geometry and Symmetry in Deep Learning: From Mathematical Foundations to Vision Applications
Doktorsavhandling, 2025

Deep learning — particularly neural networks — has profoundly transformed both industry and academia. However, designing and training effective networks remains challenging, often requiring extensive data and compute. This barrier limits applicability of deep learning in domains with scarce labelled data or limited computational budgets. One way to overcome these barriers is to bake prior knowledge — such as geometry or symmetry — directly into network architectures.

Geometric deep learning focuses on model and data design by leveraging the knowledge of problem specific geometry and symmetries. Encoding this into the pipeline can reduce sample complexity as the models do not need to learn these structures directly from the data. Two common examples of this is equivariant and invariant networks. Equivariant networks guarantee that when the input is transformed the output transforms in a predictable way. On the other hand, an invariant network is a network where the output does not change if the input is transformed.

In this thesis we study both applied and mathematical perspectives on parts of the geometric deep learning field. On the mathematical side I show a theory for equivariant CNNs on (bi)principal bundles and a novel framework for equivariant non-linear maps. On the applied side the I present a study of the effects of imposed equivariance on the data requirements and the increased data efficiency as well as the benefits of using a grid well suited for the underlying geometry of the data.

Pascal, Department of Mathematical Sciences, Chalmers University of Technology
Opponent: Professor Remco Duits, Eindhoven University of Technology, Eindhoven, Netherlands

Författare

Oscar Carlsson

Chalmers, Matematiska vetenskaper, Algebra och geometri

Geometric deep learning and equivariant neural networks

Artificial Intelligence Review,;Vol. 56(2023)p. 14605-14662

Artikel i vetenskaplig tidskrift

Equivariance versus Augmentation for Spherical Images

Proceedings of Machine Learning Research,;Vol. 162(2022)p. 7404-7421

Paper i proceeding

HEAL-SWIN: A Vision Transformer on the Sphere

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition,;(2024)p. 6067-6077

Paper i proceeding

In geometric deep learning (GDL) — an active and growing subfield of machine learning — one improves machine learning methods by using knowledge of geometry or symmetries in the problem. For example, using this knowledge can make machine learning models more data efficient as the geometrical structure is built in and does not have to be learned from data. The standard example is a convolutional neural network (CNN) which uses the fact that objects can appear anywhere in an image. Hence, a CNN is designed so tat shifting an object in an image does not change how the network detects it — this property is called translation equivariance and works naturally for flat images. However, when dealing with curved data — such as spherical images — one must use other methods. In this thesis we study geometrical aspects of machine learning both from a mathematical perspective and an applied perspective. Mathematically, we use differential geometry and fibre bundles to investigate properties of layers and to construct a new framework for non-linear maps. On the applied side we examine how imposing rotation equivariance improves performance and data efficiency for models on spherical data and adapt a high performing model to work natively with spherical images.

Ämneskategorier (SSIF 2025)

Datorgrafik och datorseende

Geometri

Artificiell intelligens

Fundament

Grundläggande vetenskaper

ISBN

978-91-8103-249-9

Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie: 5707

Utgivare

Chalmers

Pascal, Department of Mathematical Sciences, Chalmers University of Technology

Online

Opponent: Professor Remco Duits, Eindhoven University of Technology, Eindhoven, Netherlands

Mer information

Senast uppdaterat

2025-08-28