Electrical, Computer, and Biomedical Engineering Faculty Publications

An online actor-critic learning approach with Levenberg-Marquardt algorithm

Zhen Ni, University of Rhode Island
Haibo He, University of Rhode IslandFollow
Danil V. Prokhorov, Toyota Motor Corporation
Jian Fu, Wuhan University of Technology

Document Type

Conference Proceeding

Date of Original Version

10-24-2011

Abstract

This paper focuses on the efficiency improvement of online actor-critic design base on the Levenberg-Marquardt (LM) algorithm rather than traditional chain rule. Over the decades, several generations of adaptive/approximate dynamic programming (ADP) structures have been proposed in the community and demonstrated many successfully applications. Neural network with backpropagation has been one of the most important approaches to tune the parameters in such ADP designs. In this paper, we aim to study the integration of Levenberg-Marquardt method into the regular actor-critic design to improve weights updating and learning for a quadratic convergence under certain condition. Specifically, for the critic network design, we adopt the LM method targeting improved learning performance, while for the action network, we use the neural network with backpropagation to provide an appropriate control action. A detailed learning algorithm is presented, followed by benchmark tests of pendulum swing up and balance and cart-pole balance tasks. Various simulation results and comparative study demonstrated the effectiveness of this approach. © 2011 IEEE.

Publication Title, e.g., Journal

Proceedings of the International Joint Conference on Neural Networks

Citation/Publisher Attribution

Ni, Zhen, Haibo He, Danil V. Prokhorov, and Jian Fu. "An online actor-critic learning approach with Levenberg-Marquardt algorithm." Proceedings of the International Joint Conference on Neural Networks (2011): 2333-2340. doi: 10.1109/IJCNN.2011.6033520.

Link to Full Text

COinS

DOI

https://doi.org/10.1109/IJCNN.2011.6033520

Electrical, Computer, and Biomedical Engineering Faculty Publications

An online actor-critic learning approach with Levenberg-Marquardt algorithm

Document Type

Date of Original Version

Abstract

Publication Title, e.g., Journal

Citation/Publisher Attribution

DOI

Search

Browse

Author Corner

Electrical, Computer, and Biomedical Engineering Faculty Publications

An online actor-critic learning approach with Levenberg-Marquardt algorithm

Authors

Document Type

Date of Original Version

Abstract

Publication Title, e.g., Journal

Citation/Publisher Attribution

Share

DOI

Search

Browse

Author Corner