Meta-MSCC: A foundation model for adaptive CAV control in highway weaving segments☆,☆
Artikel i vetenskaplig tidskrift, 2025

The connected and automated vehicle (CAV) technology offers new opportunities for active traffic management in highway weaving segments. Current CAV-based control strategies have not fully accounted for their coupling impacts on safety and efficiency in highway weaving segments. Moreover, most existing active traffic control models still have limitations in adaptability and transferability across diverse weaving scenarios. To address these issues, this study proposes a novel multi-strategy cooperative control model based on meta-deep reinforcement learning algorithm, named Meta-MSCC, which serves as a lightweight domain-specific foundation model for optimizing both safety and efficiency in highway weaving segments under mixed connected automated traffic. A total of 22,306 real-world vehicle trajectories, including 1,488 near-crash events, are used to determine an acceptable collision risk level as the safety benchmark. Additionally, 20 high-risk and congested weaving scenarios are constructed through microscopic simulations to train and validate the Meta-MSCC. This study further develops a multi-level CAVbased control strategy by integrating microscopic (trajectory), mesoscopic (speed), and macroscopic (flow) control approaches. Then, a multi-strategy cooperative control model is put forward based on the actor-critic framework, which can effectively capture coupling impacts of this strategy on safety and efficiency. After incorporating a meta-learning algorithm (i.e., meta-deep deterministic policy gradient) into the actor-critic framework, the Meta-MSCC is finally formed as a lightweight domain-specific foundation model, enhancing adaptability and transferability across diverse weaving scenarios. The results show that the multi-level CAV-based control strategy achieves greater improvements in safety and efficiency than single-level strategies. Furthermore, the proposed Meta-MSCC outperforms other classical DRL-based models, with collision risk reduced by 49.1% and time delay by 12.6%, while maintaining stable optimization performance at CAV penetration rates exceeding 60%. The findings of this study might contribute to the enhancement of active traffic management and the optimization of intelligent road infrastructures.

Weaving segments

Foundation model

Meta-learning

Deep reinforcement learning

Multi-strategy cooperative control

Connected and automated vehicles

Författare

Jianqiang Gao

Tongji Univ

Bo Yu

Tongji Univ

Yuren Chen

Tongji Univ

Junhua Wang

Tongji Univ

Zhe Dai

Changan University

Kun Gao

Chalmers, Arkitektur och samhällsbyggnadsteknik, Geologi och geoteknik

Transportation Research, Part C: Emerging Technologies

0968-090X (ISSN)

Vol. 181 105397

Ämneskategorier (SSIF 2025)

Transportteknik och logistik

DOI

10.1016/j.trc.2025.105397

Mer information

Senast uppdaterat

2025-11-14