Learning faster to perform autonomous lane changes by constructing maneuvers from shielded semantic actions
Paper in proceedings, 2019

This paper introduces a new method to solve tactical decision making problems for highway lane changes. In the system design, reference sets for low level controllers are employed to formulate semantic meaningful actions used by reinforcement learning algorithm. Safety is ensured by preemptively shielding the Markov decision process (MDP) from unsafe actions. This frees the agent to focus on learning how to interact efficiently with the surrounding traffic. By introducing human demonstration with supervised loss as better exploration strategy, the learning process and initial performance are boosted further. © 2019 IEEE.

Author

Dapeng Liu

AI Innovation of Sweden

Zenuity

Chalmers, Electrical Engineering, Signalbehandling och medicinsk teknik, Signal Processing

Mattias Brännström

Zenuity

Andrew Backhouse

Zenuity

Lennart Svensson

Chalmers, Electrical Engineering, Signalbehandling och medicinsk teknik, Signal Processing

2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019

1838-1844 8917221

Subject Categories

Other Computer and Information Science

Information Science

Computer Science

DOI

10.1109/ITSC.2019.8917221

More information

Latest update

6/18/2020