Combining Planning and Deep Reinforcement Learning in Tactical Decision Making for Autonomous Driving

Carl-Johan Hoel; Katherine Driggs-Campbell; Krister Wolff; Leo Laine; Mykel J. Kochenderfer

doi:10.1109/TIV.2019.2955905

Combining Planning and Deep Reinforcement Learning in Tactical Decision Making for Autonomous Driving
Journal article, 2020

Tactical decision making for autonomous driving is challenging due to the diversity of environments, the uncertainty in the sensor information, and the complex interaction with other road users. This article introduces a general framework for tactical decision making, which combines the concepts of planning and learning, in the form of Monte Carlo tree search and deep reinforcement learning. The method is based on the AlphaGo Zero algorithm, which is extended to a domain with a continuous state space where self-play cannot be used. The framework is applied to two different highway driving cases in a simulated environment and it is shown to perform better than a commonly used baseline method. The strength of combining planning and learning is also illustrated by a comparison to using the Monte Carlo tree search or the neural network policy separately.

Monte Carlo tree search

tactical decision making

Autonomous driving

reinforcement learning

Author

Carl-Johan Hoel

Volvo Cars

Katherine Driggs-Campbell

University of Illinois

Krister Wolff

Chalmers, Mechanics and Maritime Sciences (M2), Vehicle Engineering and Autonomous Systems

Other publications Research

Leo Laine

Volvo Cars

Other publications Research

Mykel J. Kochenderfer

Stanford University

IEEE Transactions on Intelligent Vehicles

23798858 (eISSN)

Vol. 5 2 294-305 8911507

Subject Categories

Vehicle Engineering

Computer Vision and Robotics (Autonomous Systems)

DOI

10.1109/TIV.2019.2955905

Publication data connected to DOI

More information

Latest update

1/17/2022

Combining Planning and Deep Reinforcement Learning in Tactical Decision Making for Autonomous Driving Journal article, 2020