A neural network based online learning and control approach for Markov jump systems

Document Type

Article

Date of Original Version

2-3-2015

Abstract

In this paper, we propose an optimal online control method for discrete-time nonlinear Markov jump systems (MJSs). The Markov chain and the weighted sum technique are introduced to convert the Markov jumping problem into an optimal control problem. We then use adaptive dynamic programming (ADP) to accomplish online learning and control with specific learning algorithm and detailed stability analysis, including the convergence of the performance index function sequence and the existence of the corresponding admissible control input. Neural networks are applied to implement this ADP approach and online learning method is used to tune the weights of the critic and the action networks. Two different numerical examples are given to demonstrate the effectiveness of the proposed method.

Publication Title, e.g., Journal

Neurocomputing

Volume

149

Issue

Part A

Share

COinS