HMambaOcc: Hierarchical Mamba for occupancy flow field prediction in autonomous driving under mixed traffic environments
Artikel i vetenskaplig tidskrift, 2026

Occupancy flow field prediction is essential for autonomous driving under mixed traffic environments, as it provides pivotal information for safe decision-making and planning. Although existing approaches show promise in forecasting surrounding agents, they suffer from limited computational efficiency and challenges in modeling complex interactions, making it difficult to fully capture the intrinsic relationships among various factors. To this end, we propose HMambaOcc, a hierarchical Mamba-based framework for occupancy and flow prediction. Specifically, to fully leverage available multimodal scene information, we introduce the dual-channel input module, incorporating both vectorized and visual channels. Then, the hierarchical Mamba encoder module is developed to efficiently extract latent features from dual-channel inputs with linear computational complexity, significantly improving computational efficiency. Next, we design the cascade interaction module based on the attention mechanism, aiming to promote the model's ability to extract both fine-grained local interaction and high-level global interaction across different input factors. Finally, the multi-branch decoder is presented to generate future occupancy and flow predictions. We conduct extensive experiments on the Waymo Open Motion Dataset to evaluate the effectiveness of our model. Quantitative results demonstrate that our framework exhibits competitive prediction accuracy with high computation efficiency against the state-of-the-art methods. Specifically, HMambaOcc achieves the highest score on AUCb (0.8164), ranks second on Soft-IoUb (0.4937) and Soft-IoUfgo (0.5506), and ranks third on Soft-IoUc (04001) and AUCfgo (0.7928), respectively. Moreover, it reduces inference time to 198.8ms, improving efficiency by 17.1 % and 8.8% over StrajNet and OFMPNet, respectively. Qualitative results demonstrate a consistent correspondence between occupancy prediction and flow prediction, further confirming that HMambaOcc produces effective and reasonable predictions. Additionally, HMambaOcc exhibits a decrease of only 6.38% and 5.11% in the primary metric AUCfgo when facing missing trajectories and unseen scenarios, respectively, demonstrating its robustness and generalization capability.

Mamba

Occupancy flow field prediction

Mixed traffic environments

Autonomous driving

Vision Mamba

Författare

Rui He

Jilin University

Zhiwei Meng

Jilin University

Ruo Jia

Chalmers, Arkitektur och samhällsbyggnadsteknik, Geologi och geoteknik

Shaohua Cui

Chalmers, Arkitektur och samhällsbyggnadsteknik, Geologi och geoteknik

Sumin Zhang

Jilin University

Yupeng Chang

Jilin University

Expert Systems with Applications

0957-4174 (ISSN)

Vol. 299 130056

Ämneskategorier (SSIF 2025)

Datorgrafik och datorseende

Datavetenskap (datalogi)

DOI

10.1016/j.eswa.2025.130056

Mer information

Senast uppdaterat

2025-11-27