Reinforcement Learning as an Alternative to Reachability Analysis for Falsification of AD Functions
Övrigt konferensbidrag, 2021

Reachability analysis (RA) is one of the classical approaches to study the safety of autonomous systems, for example through falsification, the identification of initial system states which can under the right disturbances lead to unsafe or undesirable outcome states. The advantage of obtaining exact answers via RA requires analytical system models often unavailable for simulation environments for autonomous driving (AD) sys- tems. RA suffers from rapidly rising computational costs as the dimensionality increases and ineffectiveness in dealing with nonlinearities such as saturation. Here we present an alterna- tive in the form of a reinforcement learning (RL) approach which empirically shows good agreement with RA falsification for an Adaptive Cruise Controller, it can deal with saturation, and, in preliminary data, compares favorably in computational effort against RA. Due to the choice of reward function, the RL’s estimated value function provides insights into the ease of causing unsafe outcomes and allows for direct comparison with the RA falsification results.

Författare

Tobias Johansson

Data Science och AI 1

Angel Molina Acosta

Chalmers, Elektroteknik, System- och reglerteknik

Alexander Schliep

Göteborgs universitet

Paolo Falcone

Chalmers, Elektroteknik, System- och reglerteknik

Machine Learning for Autonomous Driving Workshop at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)
Sydney, Australia,

Styrkeområden

Transport

Ämneskategorier

Reglerteknik

Datorsystem

Mer information

Senast uppdaterat

2023-07-18