TY - GEN
T1 - On Reverse-Ordered Implementation of Differential Dynamic Programming for Unconstrained Optimal Control Problem
AU - Park, Wonyoung
AU - Park, Gyunghoon
N1 - Publisher Copyright:
© 2023 ICROS.
PY - 2023
Y1 - 2023
N2 - Differential dynamic programming (DDP) has been widely employed to numerically solve finite-time nonlinear optimal control problem. The core of the DDP is to update state and input trajectories at each iteration by repeating two processes termed the backward pass (BP) and the forward pass (FP). While the conventional order of the passes, first-BP-second-FP, explicitly reveals a feedback control structure in the trajectory update, this ordering also introduces extra difficulty when the DDP is applied to constrained problems. In this paper, we study a reverse-ordered implementation of the DDP algorithm in a sense of the first-FP-second-BP, which brings a structural simplicity of the algorithm. It is seen in simulations for a simple example that this reverse-ordered DDP is able to find the optimal trajectory as the conventional DDP, and often experiences a larger transient or a dynamic exploration, while mathematical analysis for these phenomenons is left to future works.
AB - Differential dynamic programming (DDP) has been widely employed to numerically solve finite-time nonlinear optimal control problem. The core of the DDP is to update state and input trajectories at each iteration by repeating two processes termed the backward pass (BP) and the forward pass (FP). While the conventional order of the passes, first-BP-second-FP, explicitly reveals a feedback control structure in the trajectory update, this ordering also introduces extra difficulty when the DDP is applied to constrained problems. In this paper, we study a reverse-ordered implementation of the DDP algorithm in a sense of the first-FP-second-BP, which brings a structural simplicity of the algorithm. It is seen in simulations for a simple example that this reverse-ordered DDP is able to find the optimal trajectory as the conventional DDP, and often experiences a larger transient or a dynamic exploration, while mathematical analysis for these phenomenons is left to future works.
KW - Optimal control
KW - differential dynamic programming
KW - dynamic programming
KW - optimization
UR - http://www.scopus.com/inward/record.url?scp=85179179128&partnerID=8YFLogxK
U2 - 10.23919/ICCAS59377.2023.10316997
DO - 10.23919/ICCAS59377.2023.10316997
M3 - Conference contribution
AN - SCOPUS:85179179128
T3 - International Conference on Control, Automation and Systems
SP - 1238
EP - 1243
BT - 23rd International Conference on Control, Automation and Systems, ICCAS 2023
PB - IEEE Computer Society
T2 - 23rd International Conference on Control, Automation and Systems, ICCAS 2023
Y2 - 17 October 2023 through 20 October 2023
ER -