Efficient Tactical Decision Making for Trucks in Highway Traffic with Deep Reinforcement Learning
Licentiatavhandling, 2026

This thesis investigates tactical decision making for autonomous heavy-duty trucks in highway traffic using deep reinforcement learning, with a particular emphasis on optimizing safety, efficiency and costs. The key aspects of decision making include Adaptive Cruise Control (ACC) and lane changes, which strongly influence energy consumption, travel time, and traffic interactions. To support a systematic study of this problem, we develop a scalable traffic model on a simulation platform, providing a controlled and extensible environment for autonomous truck driving in multi-lane highways.

We propose a hierarchical control architecture in which reinforcement learning is used for high-level tactical decision making, while low-level tactical actions are handled by physics-based controllers. This separation is found to improve the performance by reducing safety risks and facilitates the integration of learning-based decision making with established control methods. A realistic reward function is designed to jointly capture safety, efficiency, and operational costs, and advanced training strategies such as curriculum learning are investigated to handle conflicting objectives within a scalarized framework.

We further explore a multi-objective reinforcement learning formulation to explicitly represent trade-offs between competing objectives, enabling the learning of interpretable Pareto frontiers. The results demonstrate that learning-based tactical decision making policies can achieve meaningful trade-offs between safety and various operational costs in abstracted highway scenarios, and that multi-objective formulations provide valuable insight into the structure of these trade-offs. Overall, this work contributes to methodological foundations and evaluation tools for economically meaningful and extensible learning-based tactical decision making for heavy-duty trucks.

Deep Reinforcement Learning

Tactical Decision making

Autonomous driving

Multi-Objective Reinforcement Learning

Room Analysen, EDIT building, Rännvägen 6, Chalmers
Opponent: Assistant Professor, Farnaz Adib Yaghmaie, Linköping University, Sweden

Författare

Deepthi Pathare

Chalmers, Data- och informationsteknik, Data Science och AI

Tactical decision making for autonomous trucks by deep reinforcement learning with total cost of operation based reward

Artificial Intelligence Review,;Vol. 59(2026)

Artikel i vetenskaplig tidskrift

Deepthi Pathare, Leo Laine, Morteza Haghir Chehreghani. Multi-Objective Reinforcement Learning for Efficient Tactical Decision Making for Trucks in Highway Traffic

Styrkeområden

Informations- och kommunikationsteknik

Transport

Ämneskategorier (SSIF 2025)

Datavetenskap (datalogi)

Artificiell intelligens

Utgivare

Chalmers

Room Analysen, EDIT building, Rännvägen 6, Chalmers

Online

Opponent: Assistant Professor, Farnaz Adib Yaghmaie, Linköping University, Sweden

Mer information

Senast uppdaterat

2026-03-19