Theoretical Understanding of Gaussian Process Bandits in Practical Applications
Licentiatavhandling, 2026

Bayesian optimization (BO) provides a principled framework for optimizing blackbox functions with noisy outputs, with applications ranging from aircraft design to hyperparameter tuning. BO algorithms can be theoretically analyzed through the lens of Gaussian process (GP) bandits, providing theoretical guarantees of their efficiency. This thesis provides theoretical analyses and experimental evaluation of GP bandit algorithms with relevance to practical applications.

The first part of the thesis is motivated by the need for navigation systems that prioritize energy-efficiency and adapt to collected data. As such, we develop a combinatorial GP bandit framework for online energy-efficient navigation of electric vehicles. We theoretically analyze three algorithms under this framework, providing bounds on their regret. The algorithms are evaluated on real-world road networks, demonstrating that they explore the road network more efficiently compared to previous work.

The second part of the thesis is focused on addressing a discrepancy between theory and practice. The theoretical literature commonly assumes that important characteristics (the prior) of the blackbox function being optimized is known before the optimization process starts. However in practice, the prior must often be inferred from the data, which invalidates any theoretical guarantees. To address this issue, we theoretically analyze two algorithms that simultaneously learn the prior and optimize the unknown function. We identify issues in previous theoretical analyses, correct and improve upon their results. Finally, we experimentally evaluate the algorithms on synthetic and real-world data and demonstrate their effectiveness at simultaneous optimization and prior identification.

machine learning

Thompson sampling

combinatorial bandits

Gaussian processes

Bayesian optimization

energy-efficient navigation

Multi-armed bandits

Analysen, EDIT, Rännvägen 6B
Opponent: Prof. Tobias Oechtering, KTH Royal Institute of Technology, Sweden

Författare

Jack Sandberg

Chalmers, Data- och informationsteknik, Data Science och AI

Bayesian Analysis of Combinatorial Gaussian Process Bandits

13th International Conference on Learning Representations Iclr 2025,;(2025)p. 8895-8928

Paper i proceeding

J. Sandberg, M. Haghir Chehreghani. Comments on ''Surrogate Modelling for Bayesian Optimization Beyond a Single Gaussian Process"

J. Sandberg, M. Haghir Chehreghani. Adaptive Prior Selection in Gaussian Process Bandits with Thompson Sampling

Styrkeområden

Informations- och kommunikationsteknik

Transport

Drivkrafter

Hållbar utveckling

Infrastruktur

C3SE (-2020, Chalmers Centre for Computational Science and Engineering)

Ämneskategorier (SSIF 2025)

Algoritmer

Artificiell intelligens

Utgivare

Chalmers

Analysen, EDIT, Rännvägen 6B

Online

Opponent: Prof. Tobias Oechtering, KTH Royal Institute of Technology, Sweden

Mer information

Senast uppdaterat

2026-05-11