Details zur Publikation

Kategorie Textpublikation
Referenztyp Zeitschriften
DOI 10.1016/j.automatica.2025.112372
Lizenz creative commons licence
Titel (primär) A data-driven practical stabilization approach for solving stochastic dynamic programming problems
Autor Tipaldi, M.; Iervolino, R.; Massenio, P.R.; Forootani, A.
Quelle Automatica
Erscheinungsjahr 2025
Department BIOENERGIE
Band/Volume 178
Seite von art. 112372
Sprache englisch
Topic T5 Future Landscapes
Keywords Stochastic dynamic programming; Bellman operator; Switched affine systems; Robust practical stabilization; Data-driven control design
Abstract This paper presents a data-driven practical stabilization approach for solving stochastic Dynamic Programming problems with unknown Markov Decision Process models over an infinite time horizon. The Bellman operator is modeled as a discrete-time switched affine system, with each mode representing a specific stationary stochastic policy and an external bounded disturbance term to account for such modeling issue. A two-step approach is followed. First, a model-based robust practical stabilization problem is solved to derive stabilization conditions which enable the practical convergence of the resulting closed-loop system trajectories towards a chosen reference value function. Then, by exploiting recent model-to-data Linear Matrix Inequality transformation tools, these results are further developed to obtain data-driven robust stabilization conditions for addressing the case of model-free problems. Such data-driven stabilization conditions are deployed into the Value Iteration algorithm, and finally tested on the recycling robot and the parking lot management problems to demonstrate the effectiveness of the proposed method.
dauerhafte UFZ-Verlinkung https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=30822
Tipaldi, M., Iervolino, R., Massenio, P.R., Forootani, A. (2025):
A data-driven practical stabilization approach for solving stochastic dynamic programming problems
Automatica 178 , art. 112372 10.1016/j.automatica.2025.112372