"Gr-GDHP: A New Architecture for Globalized Dual Heuristic Dynamic Prog" by Xiangnan Zhong, Zhen Ni et al.
 

Gr-GDHP: A New Architecture for Globalized Dual Heuristic Dynamic Programming

Document Type

Article

Date of Original Version

10-1-2017

Abstract

Goal representation globalized dual heuristic dynamic programming (Gr-GDHP) method is proposed in this paper. A goal neural network is integrated into the traditional GDHP method providing an internal reinforcement signal and its derivatives to help the control and learning process. From the proposed architecture, it is shown that the obtained internal reinforcement signal and its derivatives can be able to adjust themselves online over time rather than a fixed or predefined function in literature. Furthermore, the obtained derivatives can directly contribute to the objective function of the critic network, whose learning process is thus simplified. Numerical simulation studies are applied to show the performance of the proposed Gr-GDHP method and compare the results with other existing adaptive dynamic programming designs. We also investigate this method on a ball-and-beam balancing system. The statistical simulation results are presented for both the Gr-GDHP and the GDHP methods to demonstrate the improved learning and controlling performance.

Publication Title, e.g., Journal

IEEE Transactions on Cybernetics

Volume

47

Issue

10

Plum Print visual indicator of research metrics
PlumX Metrics
  • Citations
    • Citation Indexes: 46
  • Captures
    • Readers: 15
see details

Share

COinS