Learning faster to perform autonomous lane changes by constructing maneuvers from shielded semantic actions

Dapeng Liu; Mattias Brännström; Andrew Backhouse; Lennart Svensson

doi:10.1109/ITSC.2019.8917221

Learning faster to perform autonomous lane changes by constructing maneuvers from shielded semantic actions
Paper in proceeding, 2019

This paper introduces a new method to solve tactical decision making problems for highway lane changes. In the system design, reference sets for low level controllers are employed to formulate semantic meaningful actions used by reinforcement learning algorithm. Safety is ensured by preemptively shielding the Markov decision process (MDP) from unsafe actions. This frees the agent to focus on learning how to interact efficiently with the surrounding traffic. By introducing human demonstration with supervised loss as better exploration strategy, the learning process and initial performance are boosted further. © 2019 IEEE.

Author

Dapeng Liu

AI Sweden

Zenuity AB

Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering

Other publications Research

Mattias Brännström

Zenuity AB

Other publications Research

Andrew Backhouse

Zenuity AB

Lennart Svensson

Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering

Other publications Research

2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019

1838-1844 8917221

Subject Categories (SSIF 2011)

Other Computer and Information Science

Information Science

Computer Science

DOI

10.1109/ITSC.2019.8917221

Publication data connected to DOI

More information

Latest update

1/3/2024 9

Learning faster to perform autonomous lane changes by constructing maneuvers from shielded semantic actions Paper in proceeding, 2019

Author

Dapeng Liu

Mattias Brännström

Andrew Backhouse

Lennart Svensson

2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019

Subject Categories (SSIF 2011)

DOI

More information

Latest update

Learning faster to perform autonomous lane changes by constructing maneuvers from shielded semantic actions
Paper in proceeding, 2019