QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy

Adam Lilja; Ji Lan; Junsheng Fu; Lars Hammarstrand

QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy
Paper in proceeding, 2026

Learning 3D scene geometry and semantics from images is a core challenge in computer vision and a key capability for autonomous driving.
Since large-scale 3D annotation is prohibitively expensive, recent work explores self-supervised learning directly from sensor data without manual labels.
Existing approaches either rely on 2D rendering consistency, where 3D structure emerges only implicitly, or on discretized voxel grids from accumulated lidar point clouds, limiting spatial precision and scalability.
We introduce QueryOcc, a query-based self-supervised framework that learns continuous 3D semantic occupancy directly through independent 4D spatio-temporal queries sampled across adjacent frames.
The framework supports supervision from either pseudo-point clouds derived from vision foundation models or raw lidar data.
To enable long-range supervision and reasoning under constant memory, we introduce a contractive scene representation that preserves near-field detail while smoothly compressing distant regions.
QueryOcc surpasses previous camera-based methods by 26% in semantic RayIoU on the self-supervised Occ3D-nuScenes benchmark while running at 11.6 FPS, demonstrating that direct 4D query supervision enables strong self-supervised occupancy learning.

Author

Adam Lilja

Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering

Other publications Research

Ji Lan

Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering

Other publications Research

Junsheng Fu

Zenseact AB

Lars Hammarstrand

Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering

Other publications Research

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

10636919 (ISSN)

The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026
Denver, USA,

Subject Categories (SSIF 2025)

Robotics and automation

Computer graphics and computer vision

Computer Sciences

More information

Latest update

6/22/2026

QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy Paper in proceeding, 2026

Author

Adam Lilja

Ji Lan

Junsheng Fu

Lars Hammarstrand

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

Subject Categories (SSIF 2025)

More information

Latest update

QueryOcc: Query-based Self-Supervision for 3D Semantic Occupancy
Paper in proceeding, 2026