Publication Details

Category Text Publication
Reference Category Journals
DOI 10.1016/j.automatica.2025.112372
Licence creative commons licence
Title (Primary) A data-driven practical stabilization approach for solving stochastic dynamic programming problems
Author Tipaldi, M.; Iervolino, R.; Massenio, P.R.; Forootani, A.
Source Titel Automatica
Year 2025
Department BIOENERGIE
Volume 178
Page From art. 112372
Language englisch
Topic T5 Future Landscapes
Keywords Stochastic dynamic programming; Bellman operator; Switched affine systems; Robust practical stabilization; Data-driven control design
Abstract This paper presents a data-driven practical stabilization approach for solving stochastic Dynamic Programming problems with unknown Markov Decision Process models over an infinite time horizon. The Bellman operator is modeled as a discrete-time switched affine system, with each mode representing a specific stationary stochastic policy and an external bounded disturbance term to account for such modeling issue. A two-step approach is followed. First, a model-based robust practical stabilization problem is solved to derive stabilization conditions which enable the practical convergence of the resulting closed-loop system trajectories towards a chosen reference value function. Then, by exploiting recent model-to-data Linear Matrix Inequality transformation tools, these results are further developed to obtain data-driven robust stabilization conditions for addressing the case of model-free problems. Such data-driven stabilization conditions are deployed into the Value Iteration algorithm, and finally tested on the recycling robot and the parking lot management problems to demonstrate the effectiveness of the proposed method.
Persistent UFZ Identifier https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=30822
Tipaldi, M., Iervolino, R., Massenio, P.R., Forootani, A. (2025):
A data-driven practical stabilization approach for solving stochastic dynamic programming problems
Automatica 178 , art. 112372 10.1016/j.automatica.2025.112372