Constrained Policy Gradient Method for Safe and Fast Reinforcement Learning: a Neural Tangent Kernel Based Approach
Preprint, 2021
RL
Neural Tangent Kernel
ML
Author
Balázs Varga
Chalmers, Electrical Engineering, Systems and control
Balázs Adam Kulcsár
Chalmers, Electrical Engineering, Systems and control
Morteza Haghir Chehreghani
Chalmers, Computer Science and Engineering (Chalmers), Data Science
Real-Time Robust and AdaptIve Learning in ElecTric VEhicles (RITE)
Chalmers, 2020-01-01 -- 2021-12-31.
Chalmers AI Research Centre (CHAIR), 2020-01-01 -- 2021-12-31.
Areas of Advance
Information and Communication Technology
Transport
Subject Categories
Learning
Information Science
Computer Science
Related datasets
URI: https://arxiv.org/abs/2107.09139