Offline and Online Models for Learning Pairwise Relations in Data
Doktorsavhandling, 2023
The first part of the thesis focuses on offline learning by starting with an investigation of the performance modeling of a synchronization method in concurrent programming using a Markov chain whose state transition matrix models pairwise relations between involved cores in a computer process.
Then the thesis focuses on a particular pairwise distance measure, the minimax distance, and explores memory-efficient approaches to computing this distance by proposing a hierarchical representation of the data with a linear memory requirement with respect to the number of data points, from which the exact pairwise minimax distances can be derived in a memory-efficient manner. Then, a memory-efficient sampling method is proposed that follows the aforementioned hierarchical representation of the data and samples the data points in a way that the minimax distances between all data points are maximally preserved. Finally, the thesis proposes a practical non-parametric clustering of vehicle motion trajectories to annotate traffic scenarios based on transitive relations between trajectories in an embedded space.
The second part of the thesis takes an online learning perspective, and starts by presenting an online learning method for identifying bottlenecks in a road network by extracting the minimax path, where bottlenecks are considered as road segments with the highest cost, e.g., in the sense of travel time. Inspired by real-world road networks, the thesis assumes a stochastic traffic environment in which the road-specific probability distribution of travel time is unknown. Therefore, it needs to learn the parameters of the probability distribution through observations by modeling the bottleneck identification task as a combinatorial semi-bandit problem. The proposed approach takes into account the prior knowledge and follows a Bayesian approach to update the parameters. Moreover, it develops a combinatorial variant of Thompson Sampling and derives an upper bound for the corresponding Bayesian regret. Furthermore, the thesis proposes an approximate algorithm to address the respective computational intractability issue.
Finally, the thesis considers contextual information of road network segments by extending the proposed model to a contextual combinatorial semi-bandit framework and investigates and develops various algorithms for this contextual combinatorial setting.
Mulit-Armed Bandit
Pairwise Relations
Performance Modeling
Motion Trajectory Clustering
Minimax Distance
Bottleneck Identification
Online Learning
Representation Learning
Memory Efficiency
Concurrent Programming
Thompson Sampling
Författare
Fazeleh Sadat Hoseini
Nätverk och System
Modeling the performance of atomic primitives on modern architectures
ACM International Conference Proceeding Series,;(2019)
Paper i proceeding
Memory-Efficient Minimax Distance Measures
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics),;Vol. 13280 LNAI(2022)p. 419-431
Paper i proceeding
Vehicle Motion Trajectories Clustering via Embedding Transitive Relations
IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC,;Vol. 2021-September(2021)p. 1314-1321
Paper i proceeding
Online Learning of Network Bottlenecks via Minimax Paths
Machine Learning,;Vol. 112(2023)p. 131-150
Artikel i vetenskaplig tidskrift
A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification
International Conference on Information and Knowledge Management, Proceedings,;(2024)p. 3782-3786
Paper i proceeding
Ämneskategorier
Annan data- och informationsvetenskap
Sannolikhetsteori och statistik
Datavetenskap (datalogi)
ISBN
978-91-7905-820-3
Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie: 5286
Utgivare
Chalmers