Geometry and Learning in 3D Computer Vision
Doktorsavhandling, 2025
The thesis is structured around three problems: (1) camera calibration, (2) rotation averaging, and (3) motion segmentation. For each of these problems, we analyze the weak points and failure modes of existing methods and propose new algorithms that leverage standard techniques from geometry and optimization or hybrid learning pipelines that aim to retain the interpretability of geometric models while benefiting from the expressivity and adaptability of deep neural networks.
Our contributions include: (i) a versatile pipeline for calibrating central cameras with various lens configurations that relies on simple techniques and solvers and proves to be very stable, (ii) a semidefinite program for anisotropic rotation averaging that leverages the readily-available uncertainties of the relative estimates and relies on a new convex relaxation, leading to improved reconstruction accuracy, (iii) a fast block-coordinate descent solver for anisotropic rotation averaging that achieves similar reconstruction accuracy while significantly reducing the runtime, (iv) robustification pipelines for anisotropic rotation averaging allowing gross outliers in the data, and (v) a metric learning approach addressing the challenging chicken-and-egg problem of motion segmentation via clustering in the space of trajectory feature representations, where inference is done in a fraction of a second.
computer vision
robust optimization
minimal solvers
rotation averaging
3D reconstruction
camera calibration
global structure from motion
motion segmentation
trajectory clustering
Författare
Yaroslava Lochman
Chalmers, Elektroteknik, Signalbehandling och medicinsk teknik
BabelCalib: A Universal Approach to Calibrating Central Cameras
Proceedings of the IEEE International Conference on Computer Vision,;(2021)p. 15233-15242
Paper i proceeding
Certifiably Optimal Anisotropic Rotation Averaging
Proceedings of the 2025 IEEE/CVF International Conference on Computer Vision,;(2025)p. 14856-14865
Paper i proceeding
Making Rotation Averaging Fast and Robust with Anisotropic Coordinate Descent
Proceedings of the 36th British Machine Vision Conference 2025,;(2025)
Paper i proceeding
Learned Trajectory Embedding for Subspace Clustering
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition,;(2024)p. 19092-19102
Paper i proceeding
Ämneskategorier (SSIF 2025)
Datorgrafik och datorseende
DOI
10.63959/chalmers.dt/5766
ISBN
978-91-8103-309-0
Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie: 5766
Utgivare
Chalmers
HB2, Hörsalsvägen 8, Chalmers
Opponent: Docent and Associate Professor, Per-Erik Forssén, Linkoping University, Sweden