Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming

Jonathan Lock; Tomas McKelvey

doi:10.1080/00207179.2021.1939892

Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming
Artikel i vetenskaplig tidskrift, 2022

We present a numerical method for generating the state-feedback control policy associated with general undiscounted, constant-setpoint, infinite-horizon, nonlinear optimal control problems with continuous state variables. The method is based on approximate dynamic programming, and is closely related to approximate policy iteration. Existing methods typically terminate based on the convergence of the control policy and either require a discounted problem formulation or demand the cost function to lie in a specific subclass of functions. The presented method extends on existing termination criteria by requiring both the control policy and the resulting system state to converge, allowing for use with undiscounted cost functions that are bounded and continuous. This paper defines the numerical method, derives the relevant underlying mathematical properties, and validates the numerical method with representative examples. A MATLAB implementation with the shown examples is freely available.

optimal control

undiscounted infinite-horizon

control policy

Approximate dynamic programming

Författare

Jonathan Lock

Chalmers, Elektroteknik, Signalbehandling och medicinsk teknik

Forskning Andra publikationer

Tomas McKelvey

Chalmers, Elektroteknik, Signalbehandling och medicinsk teknik

Forskning Andra publikationer

International Journal of Control

0020-7179 (ISSN) 13665820 (eISSN)

Vol. 95 10 2854-2864

Styrkeområden

Informations- och kommunikationsteknik

Energi

Ämneskategorier (SSIF 2011)

Beräkningsmatematik

Reglerteknik

DOI

10.1080/00207179.2021.1939892

Publikationsdata kopplat till DOI

Relaterade dataset

UCPADP Matlab implementation [dataset]

URI: https://gitlab.com/lerneaen_hydra/ucpadp

Hämta data

Mer information

Senast uppdaterat

2026-03-26

Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming Artikel i vetenskaplig tidskrift, 2022