Bayesian Analysis of Combinatorial Gaussian Process Bandits
Paper in proceeding, 2025

We consider the combinatorial volatile Gaussian process (GP) semi-bandit problem. Each round, an agent is provided a set of available base arms and must select a subset of them to maximize the long-term cumulative reward. We study the Bayesian setting and provide novel Bayesian cumulative regret bounds for three GP-based algorithms: GP-UCB, GP-BayesUCB and GP-TS. Our bounds extend previous results for GP-UCB and GP-TS to the infinite, volatile and combinatorial setting, and to the best of our knowledge, we provide the first regret bound for GP-BayesUCB. Volatile arms encompass other widely considered bandit problems such as contextual bandits. Furthermore, we employ our framework to address the challenging real-world problem of online energy-efficient navigation, where we demonstrate its effectiveness compared to the alternatives.

Author

Jack Sandberg

University of Gothenburg

Chalmers, Computer Science and Engineering (Chalmers)

Niklas Åkerblom

Volvo Group

Chalmers, Computer Science and Engineering (Chalmers)

University of Gothenburg

Morteza Haghir Chehreghani

University of Gothenburg

Chalmers, Computer Science and Engineering (Chalmers)

13th International Conference on Learning Representations Iclr 2025

8895-8928
9798331320850 (ISBN)

13th International Conference on Learning Representations, ICLR 2025
Singapore, Singapore,

Energy Effective Navigation for EVs (EENE)

VINNOVA (2018-01937), 2019-01-01 -- 2022-12-31.

EENE: Energy Effective Navigation for EVs

FFI - Strategic Vehicle Research and Innovation (2018-01937), 2019-01-01 -- 2022-12-31.

Subject Categories (SSIF 2025)

Computer Sciences

Other Computer and Information Science

More information

Latest update

7/21/2025