A neural network based online learning and control approach for Markov jump systems
Date of Original Version
In this paper, we propose an optimal online control method for discrete-time nonlinear Markov jump systems (MJSs). The Markov chain and the weighted sum technique are introduced to convert the Markov jumping problem into an optimal control problem. We then use adaptive dynamic programming (ADP) to accomplish online learning and control with specific learning algorithm and detailed stability analysis, including the convergence of the performance index function sequence and the existence of the corresponding admissible control input. Neural networks are applied to implement this ADP approach and online learning method is used to tune the weights of the critic and the action networks. Two different numerical examples are given to demonstrate the effectiveness of the proposed method.
Zhong, Xiangnan, Haibo He, Huaguang Zhang, and Zhanshan Wang. "A neural network based online learning and control approach for Markov jump systems." Neurocomputing 149, Part A (2015): 116-123. doi:10.1016/j.neucom.2014.01.060.