Details

Author(s) / Contributors
Title
Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems
Is part of
  • IEEE Transactions on Neural Networks and Learning Systems, 2018-07, Vol.29 (7), p.2794-2807
Place / Publisher
United States: IEEE
Year of publication
2018
Link to full text
Source
IEEE Xplore Digital Library
Descriptions/Notes
  • Policy iteration approximate dynamic programming (DP) is an important algorithm for solving optimal decision and control problems. In this paper, we focus on the problem of policy approximation in policy iteration approximate DP for discrete-time nonlinear systems with infinite-horizon undiscounted value functions. Taking the policy approximation error into account, we demonstrate asymptotic stability of the control policy under our problem setting, show boundedness of the value function during each policy iteration step, and introduce a new sufficient condition for the value function to converge to a bounded neighborhood of the optimal value function. Aiming at practical implementation of an approximate policy, we consider using the Volterra series, which has been extensively studied in the controls literature for its good theoretical properties and its success in practical applications. We illustrate the effectiveness of the main ideas developed in this paper through several examples, including a practical problem of excitation control of a hydrogenerator.
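
The abstract describes a policy-iteration loop with a parameterized (Volterra-series-style) policy. Below is a minimal, self-contained Python sketch of that general idea. Every specific in it is an illustrative assumption rather than the paper's method: the scalar dynamics f and g, the quadratic stage cost, the truncated-rollout policy evaluation, the grid of initial states, the second-order polynomial policy u(x) = theta[0]*x + theta[1]*x**2 standing in for a Volterra series, and a crude finite-difference parameter update standing in for an exact policy-improvement step.

```python
import numpy as np

# Assumed toy system: x_{k+1} = f(x_k) + g(x_k) * u_k (not from the paper).
f = lambda x: 0.8 * np.sin(x)      # assumed nonlinear drift
g = lambda x: 1.0 + 0.1 * x**2     # assumed input gain

def cost(x, u):
    # Undiscounted quadratic stage cost (an assumption).
    return x**2 + u**2

def rollout_value(x0, theta, horizon=200):
    # Truncated-rollout stand-in for the infinite-horizon value of the
    # polynomial policy u(x) = theta[0]*x + theta[1]*x**2.
    x, v = x0, 0.0
    for _ in range(horizon):
        u = theta[0] * x + theta[1] * x**2
        v += cost(x, u)
        # Clip only to keep this toy rollout numerically bounded.
        x = np.clip(f(x) + g(x) * u, -10.0, 10.0)
    return v

def policy_iteration(theta, grid, n_iters=20, step=1e-3, eps=1e-4):
    for _ in range(n_iters):
        # "Evaluation": average rollout value over a grid of initial states.
        base = np.mean([rollout_value(x, theta) for x in grid])
        # "Improvement": finite-difference descent on the policy parameters,
        # a crude substitute for an exact policy-improvement step.
        grad = np.zeros_like(theta)
        for i in range(len(theta)):
            pert = theta.copy()
            pert[i] += eps
            grad[i] = (np.mean([rollout_value(x, pert) for x in grid]) - base) / eps
        theta = theta - step * grad
    return theta

grid = np.linspace(-1.0, 1.0, 11)   # assumed set of initial states
theta0 = np.array([-0.5, 0.0])      # initial stabilizing guess
print("improved policy parameters:", policy_iteration(theta0, grid))
```

In the paper's setting, the policy is drawn from a Volterra-series class and the analysis accounts for the resulting approximation error; the sketch above only mirrors the overall evaluate-then-improve structure of policy iteration.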
