Learning faster to perform autonomous lane changes by constructing maneuvers from shielded semantic actions
Paper in proceedings, 2019

This paper introduces a new method to solve tactical decision making problems for highway lane changes. In the system design, reference sets for low level controllers are employed to formulate semantic meaningful actions used by reinforcement learning algorithm. Safety is ensured by preemptively shielding the Markov decision process (MDP) from unsafe actions. This frees the agent to focus on learning how to interact efficiently with the surrounding traffic. By introducing human demonstration with supervised loss as better exploration strategy, the learning process and initial performance are boosted further. © 2019 IEEE.

Author

Dapeng Liu

AI Sweden

Zenuity AB

Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering, Signal Processing

Mattias Brännström

Zenuity AB

Andrew Backhouse

Zenuity AB

Lennart Svensson

Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering, Signal Processing

2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019

1838-1844 8917221

Subject Categories

Other Computer and Information Science

Information Science

Computer Science

DOI

10.1109/ITSC.2019.8917221

More information

Latest update

6/18/2020