GASP: Unifying Geometric and Semantic Self-Supervised Pretraining for Autonomous Driving
Paper i proceeding, 2026
Similarly, autonomous driving generates vast amounts of spatiotemporal data, alluding to the possibility of harnessing scale to learn the underlying geometric and semantic structure of the environment and its evolution over time.
In this direction, we propose a geometric and semantic self-supervised pre-training method, GASP, that learns a unified representation by predicting, at any queried future point in spacetime, (1) general occupancy, capturing the evolving structure of the 3D scene; (2) ego occupancy, modeling the ego vehicle path through the environment; and (3) distilled high-level features from a vision foundation model.
By modeling geometric and semantic 4D occupancy fields instead of raw sensor measurements, the model learns a structured, generalizable representation of the environment and its evolution through time.
We validate GASP on multiple autonomous driving benchmarks, demonstrating significant improvements in semantic occupancy forecasting, online mapping, and ego trajectory prediction.
Our results demonstrate that continuous 4D geometric and semantic occupancy prediction provides a scalable and effective pre-training paradigm for autonomous driving.
Författare
William Ljungbergh
Adam Lilja
Chalmers, Elektroteknik, Signalbehandling och medicinsk teknik
Adam Tonderski
Zenseact AB
Arvid Laveno Ling
Chalmers, Elektroteknik, System- och reglerteknik
Carl Lindström
Chalmers, Elektroteknik, Signalbehandling och medicinsk teknik
Willem Verbeke
Zenseact AB
Junsheng Fu
Zenseact AB
Christoffer Petersson
Chalmers, Matematiska vetenskaper, Algebra och geometri
Lars Hammarstrand
Chalmers, Elektroteknik, Signalbehandling och medicinsk teknik
Michael Felsberg
Linköpings universitet
Proceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025
Tuscon, USA,
Ämneskategorier (SSIF 2025)
Robotik och automation
Datorgrafik och datorseende
Datavetenskap (datalogi)