Robust Learning with Limited Labels

Erik Wallin

doi:10.63959/chalmers.dt/5820

Robust Learning with Limited Labels
Doctoral thesis, 2026

Deep learning-based classification systems commonly rely on conditions that are difficult to satisfy for real-world applications. One such requirement is the availability of large-scale, curated, and labeled training data. Another is the absence of unknown classes during training and deployment. Furthermore, many classification systems treat classes as independent, even when they form structured relationships that are important to account for. Overcoming these limitations is central to the practical deployment of these systems.

We address these challenges through five papers that study deep classification under limited supervision, the presence of unknown classes, hierarchical class structures, and combinations thereof. Paper A studies semi-supervised learning, where labeled and unlabeled training data are combined, and proposes a self-supervised component for better utilization of unlabeled data. Papers B and C address unknown classes within semi-supervised learning, enabling learning from realistic, uncurated, unlabeled data. In particular, Paper C proposes a probabilistic method that improves accuracy and uncertainty quantification when detecting unknown samples in this setting. Finally, Papers D and E study hierarchical open-set classification, i.e., assigning unknown classes to appropriate high-level categories of a hierarchy, and propose a method that approximates the predictive distribution over both known classes and higher-level categories. This enables more expressive predictions of unknown samples than binary rejection.

In summary, the included papers propose methods that advance performance on benchmarks for their respective problem settings, while providing empirical analyses that improve understanding of the underlying challenges. Overall, this thesis contributes to more robust and accurate deep classification systems for real-world deployment.

semi-supervised learning

Deep learning

open-set recognition

hierarchical classification

HC1, Hörsalsvägen 14

Opponent: Prof. Yuki M. Asano, Fundamental AI Lab, University of Technology Nuremberg, Germany

Online defence

Author

Erik Wallin

Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering

Other publications Research

DoubleMatch: Improving Semi-Supervised Learning with Self-Supervision

Proceedings - International Conference on Pattern Recognition,;Vol. 2022-August(2022)p. 2871-2877

Paper in proceeding

Improving Open-Set Semi-Supervised Learning with Self-Supervision

Proceedings - 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024,;(2024)p. 2345-2354

Paper in proceeding

ProHOC: Probabilistic Hierarchical Out-of-Distribution Classification via Multi-Depth Networks

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition,;(2025)p. 20612-20621

Paper in proceeding

ProSub: Probabilistic Open-Set Semi-supervised Learning with Subspace-Based Out-of-Distribution Detection

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics),;Vol. 15119 LNCS(2025)p. 129-147

Paper in proceeding

Wallin, E., Kahl, F., Hammarstrand, L. Semi-Supervised Hierarchical Open-Set Classification

Deep learning has led to major advances in systems for the automatic classification of data. These systems are now widely applied to images, video, audio, and data collected from a wide range of sensing systems. For example, modern deep learning models can accurately recognize thousands of object categories in images, matching human-level performance in controlled settings. Despite this progress, many successful approaches overlook real-world conditions that limit their practical applicability. For example, they often rely on large amounts of labeled training data, making them costly to deploy. They may also fail to account for unexpected data that can appear during real-world use, potentially leading to safety risks. Moreover, such approaches typically ignore important relationships between data categories. This thesis addresses these challenges by developing methods that learn from both labeled and unlabeled data, can detect and handle previously unseen categories, and make use of hierarchical relationships between classes. Through a series of methodological contributions and empirical studies, we contribute toward more reliable and robust learning systems that are better suited for deployment in practical settings.

Robust and precise Semi-Supervised Learning schemes

Wallenberg AI, Autonomous Systems and Software Program, 2020-08-25 -- 2024-08-23.

Show Project

Subject Categories (SSIF 2025)

Computer graphics and computer vision

Signal Processing

Artificial Intelligence

Infrastructure

C3SE (-2020, Chalmers Centre for Computational Science and Engineering)

Chalmers e-Commons (incl. C3SE, 2020-)

DOI

10.63959/chalmers.dt/5820

Publication data connected to DOI

ISBN

978-91-8103-363-2

Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie: 5820

Publisher

Chalmers

HC1, Hörsalsvägen 14

Online

Opponent: Prof. Yuki M. Asano, Fundamental AI Lab, University of Technology Nuremberg, Germany

More information

Latest update

1/27/2026

Robust Learning with Limited Labels Doctoral thesis, 2026

Author

Erik Wallin

DoubleMatch: Improving Semi-Supervised Learning with Self-Supervision

Improving Open-Set Semi-Supervised Learning with Self-Supervision

ProHOC: Probabilistic Hierarchical Out-of-Distribution Classification via Multi-Depth Networks

ProSub: Probabilistic Open-Set Semi-supervised Learning with Subspace-Based Out-of-Distribution Detection

Wallin, E., Kahl, F., Hammarstrand, L. Semi-Supervised Hierarchical Open-Set Classification

Robust and precise Semi-Supervised Learning schemes

Subject Categories (SSIF 2025)

Infrastructure

DOI

ISBN

Publisher

More information

Latest update

Robust Learning with Limited Labels
Doctoral thesis, 2026