Gr-GDHP: A New Architecture for Globalized Dual Heuristic Dynamic Programming
A goal representation globalized dual heuristic dynamic programming (Gr-GDHP) method is proposed in this paper. A goal neural network is integrated into the traditional GDHP method to provide an internal reinforcement signal and its derivatives, which assist the control and learning process. It is shown that, with the proposed architecture, the internal reinforcement signal and its derivatives can adjust themselves online over time, rather than being a fixed or predefined function as in the literature. Furthermore, the obtained derivatives contribute directly to the objective function of the critic network, whose learning process is thus simplified. Numerical simulation studies demonstrate the performance of the proposed Gr-GDHP method and compare it with other existing adaptive dynamic programming designs. The method is also investigated on a ball-and-beam balancing system. Statistical simulation results are presented for both the Gr-GDHP and GDHP methods to demonstrate the improved learning and control performance.
IEEE Transactions on Cybernetics
Zhong, Xiangnan, Zhen Ni, and Haibo He. "Gr-GDHP: A New Architecture for Globalized Dual Heuristic Dynamic Programming." IEEE Transactions on Cybernetics 47, no. 10 (2017): 3318-3330. doi:10.1109/TCYB.2016.2598282.