Details zur Publikation |
Kategorie | Textpublikation |
Referenztyp | Zeitschriften |
DOI | 10.1016/j.automatica.2025.112372 |
Lizenz ![]() |
|
Titel (primär) | A data-driven practical stabilization approach for solving stochastic dynamic programming problems |
Autor | Tipaldi, M.; Iervolino, R.; Massenio, P.R.; Forootani, A. |
Quelle | Automatica |
Erscheinungsjahr | 2025 |
Department | BIOENERGIE |
Band/Volume | 178 |
Seite von | art. 112372 |
Sprache | englisch |
Topic | T5 Future Landscapes |
Keywords | Stochastic dynamic programming; Bellman operator; Switched affine systems; Robust practical stabilization; Data-driven control design |
Abstract | This paper presents a data-driven practical stabilization approach for solving stochastic Dynamic Programming problems with unknown Markov Decision Process models over an infinite time horizon. The Bellman operator is modeled as a discrete-time switched affine system, with each mode representing a specific stationary stochastic policy and an external bounded disturbance term to account for such modeling issue. A two-step approach is followed. First, a model-based robust practical stabilization problem is solved to derive stabilization conditions which enable the practical convergence of the resulting closed-loop system trajectories towards a chosen reference value function. Then, by exploiting recent model-to-data Linear Matrix Inequality transformation tools, these results are further developed to obtain data-driven robust stabilization conditions for addressing the case of model-free problems. Such data-driven stabilization conditions are deployed into the Value Iteration algorithm, and finally tested on the recycling robot and the parking lot management problems to demonstrate the effectiveness of the proposed method. |
dauerhafte UFZ-Verlinkung | https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=30822 |
Tipaldi, M., Iervolino, R., Massenio, P.R., Forootani, A. (2025): A data-driven practical stabilization approach for solving stochastic dynamic programming problems Automatica 178 , art. 112372 10.1016/j.automatica.2025.112372 |