Electrical, Computer, and Biomedical Engineering Faculty Publications

Model-Free Dual Heuristic Dynamic Programming

Zhen Ni, University of Rhode Island
Haibo He, University of Rhode Island
Xiangnan Zhong, University of Rhode Island
Danil V. Prokhorov, Toyota Motor Corporation

Document Type

Article

Date of Original Version

8-1-2015

Abstract

Model-based dual heuristic dynamic programming (MB-DHP) is a popular approach to approximating optimal solutions in control problems. However, it usually requires offline training of the model network, which incurs extra computational cost. In this brief, we propose a model-free DHP (MF-DHP) design based on a finite-difference technique. In particular, we adopt a multilayer perceptron with one hidden layer for both the action network and the critic network, and use delayed objective functions to train both networks online over time. We test the MF-DHP and MB-DHP approaches on a discrete-time example and a continuous-time example under the same parameter settings. Our simulation results demonstrate that the MF-DHP approach can achieve control performance competitive with that of the traditional MB-DHP approach while requiring fewer computational resources.

Publication Title

IEEE Transactions on Neural Networks and Learning Systems

Volume

26

Issue

8

Citation/Publisher Attribution

Ni, Zhen, Haibo He, Xiangnan Zhong, and Danil V. Prokhorov. "Model-Free Dual Heuristic Dynamic Programming." IEEE Transactions on Neural Networks and Learning Systems 26, 8 (2015): 1834-1839. doi: 10.1109/TNNLS.2015.2424971.

DOI

https://doi.org/10.1109/TNNLS.2015.2424971
