Efficient Tactical Decision Making for Trucks in Highway Traffic with Deep Reinforcement Learning
Licentiatavhandling, 2026
We propose a hierarchical control architecture in which reinforcement learning is used for high-level tactical decision making, while low-level tactical actions are handled by physics-based controllers. This separation is found to improve the performance by reducing safety risks and facilitates the integration of learning-based decision making with established control methods. A realistic reward function is designed to jointly capture safety, efficiency, and operational costs, and advanced training strategies such as curriculum learning are investigated to handle conflicting objectives within a scalarized framework.
We further explore a multi-objective reinforcement learning formulation to explicitly represent trade-offs between competing objectives, enabling the learning of interpretable Pareto frontiers. The results demonstrate that learning-based tactical decision making policies can achieve meaningful trade-offs between safety and various operational costs in abstracted highway scenarios, and that multi-objective formulations provide valuable insight into the structure of these trade-offs. Overall, this work contributes to methodological foundations and evaluation tools for economically meaningful and extensible learning-based tactical decision making for heavy-duty trucks.
Deep Reinforcement Learning
Tactical Decision making
Autonomous driving
Multi-Objective Reinforcement Learning
Författare
Deepthi Pathare
Chalmers, Data- och informationsteknik, Data Science och AI
Tactical decision making for autonomous trucks by deep reinforcement learning with total cost of operation based reward
Artificial Intelligence Review,;Vol. 59(2026)
Artikel i vetenskaplig tidskrift
Deepthi Pathare, Leo Laine, Morteza Haghir Chehreghani. Multi-Objective Reinforcement Learning for Efficient Tactical Decision Making for Trucks in Highway Traffic
Styrkeområden
Informations- och kommunikationsteknik
Transport
Ämneskategorier (SSIF 2025)
Datavetenskap (datalogi)
Artificiell intelligens
Utgivare
Chalmers
Room Analysen, EDIT building, Rännvägen 6, Chalmers
Opponent: Assistant Professor, Farnaz Adib Yaghmaie, Linköping University, Sweden