Data-Efficient and Robust Reinforcement Learning for Moving Devices
Other text in scientific journal, 2026

One main conclusion in this paper is that model-based RL is much more data-efficient, but also more robust against neglected high-frequency dynamics. A modularized model-based RL strategy is therefore proposed where a nonlinear state-space model is estimated. In this model some minor physical knowledge can be easily introduced. Combining feedback and feedforward control with temporal optimization based on the estimated model, it is shown that energy and peak power for moving devices can be significantly reduced, utilizing much less data compared to standard model-free RL.

Author

Bengt Lennartson

Chalmers, Electrical Engineering, Systems and control

Engineering

2095-8099 (ISSN)

Vol. In Press

Subject Categories (SSIF 2025)

Probability Theory and Statistics

Robotics and automation

Computer Sciences

Control Engineering

DOI

10.1016/j.eng.2026.02.005

More information

Latest update

5/27/2026