Reinforcement Learning as an Alternative to Reachability Analysis for Falsification of AD Functions
Other conference contribution, 2021

Reachability analysis (RA) is one of the classical approaches to study the safety of autonomous systems, for example through falsification, the identification of initial system states which can under the right disturbances lead to unsafe or undesirable outcome states. The advantage of obtaining exact answers via RA requires analytical system models often unavailable for simulation environments for autonomous driving (AD) sys- tems. RA suffers from rapidly rising computational costs as the dimensionality increases and ineffectiveness in dealing with nonlinearities such as saturation. Here we present an alterna- tive in the form of a reinforcement learning (RL) approach which empirically shows good agreement with RA falsification for an Adaptive Cruise Controller, it can deal with saturation, and, in preliminary data, compares favorably in computational effort against RA. Due to the choice of reward function, the RL’s estimated value function provides insights into the ease of causing unsafe outcomes and allows for direct comparison with the RA falsification results.

Author

Tobias Johansson

Data Science and AI 1

Angel Molina Acosta

Chalmers, Electrical Engineering, Systems and control

Alexander Schliep

University of Gothenburg

Paolo Falcone

Chalmers, Electrical Engineering, Systems and control

Machine Learning for Autonomous Driving Workshop at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)
Sydney, Australia,

Areas of Advance

Transport

Subject Categories (SSIF 2011)

Control Engineering

Computer Systems

More information

Latest update

7/18/2023