Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections

Carl-Johan E Hoel; Tommy Tram; Jonas Sjöberg

doi:10.1109/ITSC45102.2020.9294407

Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections
Paper i proceeding, 2020

This paper investigates how a Bayesian reinforcement learning method can be used to create a tactical decision-making agent for autonomous driving in an intersection scenario, where the agent can estimate the confidence of its decisions. An ensemble of neural networks, with additional randomized prior functions (RPF), are trained by using a bootstrapped experience replay memory. The coefficient of variation in the estimated Q-values of the ensemble members is used to approximate the uncertainty, and a criterion that determines if the agent is sufficiently confident to make a particular decision is introduced. The performance of the ensemble RPF method is evaluated in an intersection scenario and compared to a standard Deep Q-Network method, which does not estimate the uncertainty. It is shown that the trained ensemble RPF agent can detect cases with high uncertainty, both in situations that are far from the training distribution, and in situations that seldom occur within the training distribution. This work demonstrates one possible application of such a confidence estimate, by using this information to choose safe actions in unknown situations, which removes all collisions from within the training distribution, and most collisions outside of the distribution.

Författare

Carl-Johan E Hoel

Chalmers, Mekanik och maritima vetenskaper, Fordonsteknik och autonoma system

AI Sweden

Volvo Group

Forskning Andra publikationer

Tommy Tram

Zenuity AB

Chalmers, Elektroteknik, System- och reglerteknik

AI Sweden

Forskning Andra publikationer

Jonas Sjöberg

Chalmers, Elektroteknik, System- och reglerteknik

Forskning Andra publikationer

IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC

21530009 (ISSN) 21530017 (eISSN)

9294407
9781728141497 (ISBN)

2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)
Rhodes, Greece,

WASP SAS

Wallenberg AI, Autonomous Systems and Software Program, 2018-01-01 -- 2023-01-01.

Visa projekt

Styrkeområden

Transport

Ämneskategorier (SSIF 2011)

Elektroteknik och elektronik

Sannolikhetsteori och statistik

Datorsystem

DOI

10.1109/ITSC45102.2020.9294407

Publikationsdata kopplat till DOI

Mer information

Senast uppdaterat

2025-04-28

Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections Paper i proceeding, 2020

Författare

Carl-Johan E Hoel

Tommy Tram

Jonas Sjöberg

IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC

WASP SAS

Styrkeområden

Ämneskategorier (SSIF 2011)

DOI

Mer information

Senast uppdaterat

Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections
Paper i proceeding, 2020