Learning faster to perform autonomous lane changes by constructing maneuvers from shielded semantic actions

Dapeng Liu; Mattias Brännström; Andrew Backhouse; Lennart Svensson

doi:10.1109/ITSC.2019.8917221

Paper in proceedings, 2019

This paper introduces a new method for solving tactical decision-making problems in highway lane changes. In the system design, reference sets for low-level controllers are used to formulate semantically meaningful actions for a reinforcement learning algorithm. Safety is ensured by preemptively shielding the Markov decision process (MDP) from unsafe actions, which frees the agent to focus on learning how to interact efficiently with the surrounding traffic. Introducing human demonstrations with a supervised loss as a better exploration strategy further boosts the learning process and the initial performance. © 2019 IEEE.
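The shielding idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the action names, the `safe_flags` dictionary (standing in for whatever safety check the shield performs), and the epsilon-greedy selection are all assumptions made for illustration. The sketch only shows the general pattern of removing unsafe actions before the agent chooses among the remaining ones.

```python
import random

# Hypothetical semantic actions; the abstract does not list the actual action set.
ACTIONS = ["keep_lane", "change_left", "change_right"]


def shielded_actions(safe_flags):
    """Return the actions the (assumed) shield marks as safe in the current state."""
    return [a for a in ACTIONS if safe_flags.get(a, False)]


def select_action(q_values, safe_flags, epsilon=0.0, rng=random):
    """Epsilon-greedy choice restricted to shielded (safe) actions.

    q_values:   dict mapping action name -> estimated value (assumed given)
    safe_flags: dict mapping action name -> bool, output of an assumed safety check
    """
    allowed = shielded_actions(safe_flags)
    if not allowed:
        raise ValueError("shield left no safe action")
    # Explore only among safe actions, so exploration never violates the shield.
    if rng.random() < epsilon:
        return rng.choice(allowed)
    # Otherwise pick the safe action with the highest estimated value.
    return max(allowed, key=lambda a: q_values[a])


q = {"keep_lane": 0.2, "change_left": 0.9, "change_right": 0.5}
safe = {"keep_lane": True, "change_left": False, "change_right": True}
chosen = select_action(q, safe)
```

Here `change_left` has the highest estimated value but is flagged unsafe, so the greedy choice falls back to the best remaining safe action. Because unsafe actions never reach the agent, the learning signal only has to cover efficiency, which is the division of labor the abstract describes.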

Authors

Dapeng Liu

AI Sweden

Zenuity AB

Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering


Mattias Brännström

Zenuity AB


Andrew Backhouse

Zenuity AB

Lennart Svensson

Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering


2019 IEEE Intelligent Transportation Systems Conference, ITSC 2019

pp. 1838-1844, article no. 8917221

Subject categories (SSIF 2011)

Other Computer and Information Science

Information Systems

Computer Science

DOI

10.1109/ITSC.2019.8917221


Last updated

2024-01-03
