The increasing complexity of engineering systems has motivated continuing research on computational learning methods toward making autonomous intelligent systems that can learn how to improve their performance over time while interacting with their environment. These systems need not only to sense their environment, but also to integrate information from the environment into all decision-makings. The evolution of such systems is modeled as an unknown controlled Markov chain. In a previous research, the predictive optimal decision-making (POD) model was developed, aiming to learn in real time the unknown transition probabilities and associated costs over a varying finite time horizon. In this paper, the convergence of the POD to the stationary distribution of a Markov chain is proven, thus establishing the POD as a robust model for making autonomous intelligent systems. This paper provides the conditions that the POD can be valid, and be an interpretation of its underlying structure.
Skip Nav Destination
Article navigation
July 2009
Research Papers
Convergence Properties of a Computational Learning Model for Unknown Markov Chains
Andreas A. Malikopoulos
Andreas A. Malikopoulos
Department of Mechanical Engineering,
amaliko@umich.edu
University of Michigan
, Ann Arbor, MI 48109
Search for other works by this author on:
Andreas A. Malikopoulos
J. Dyn. Sys., Meas., Control. Jul 2009, 131(4): 041011 (7 pages)
Published Online: May 20, 2009
Article history
Received:
March 18, 2008
Revised:
February 4, 2009
Published:
May 20, 2009
Citation
Malikopoulos, A. A. (May 20, 2009). "Convergence Properties of a Computational Learning Model for Unknown Markov Chains." ASME. J. Dyn. Sys., Meas., Control. July 2009; 131(4): 041011. https://doi.org/10.1115/1.3117202
Download citation file:
Get Email Alerts
Cited By
Robust Periodical Tracking for Fast Tool Servo Systems With Selective Disturbance Compensation
J. Dyn. Sys., Meas., Control (August 2022)
Feasibility of a Wearable Cold-Gas Thruster for Fall Prevention
J. Dyn. Sys., Meas., Control (August 2022)
Comparing Three Different Decoupling Control Approaches for Roll-To-Roll Printing Systems
J. Dyn. Sys., Meas., Control
Saturated Output Feedback Control for Robot Manipulators with Joints of Arbitrary Flexibility
J. Dyn. Sys., Meas., Control
Related Articles
A Real-Time Computational Learning Model for Sequential Decision-Making Problems Under Uncertainty
J. Dyn. Sys., Meas., Control (July,2009)
An Approach for Testing Methods for Modeling Uncertainty
J. Mech. Des (September,2006)
Nonlinear Parameters and State Estimation for Adaptive Nonlinear Model Predictive Control Design
J. Dyn. Sys., Meas., Control (April,2016)
Related Proceedings Papers
Related Chapters
Predicting the Learning Performance of Artificial Intelligent Systems Using Non-Homogeneous Poisson Process Models
Intelligent Engineering Systems Through Artificial Neural Networks, Volume 17
Model-Building for Robust Reinforcement Learning
Intelligent Engineering Systems through Artificial Neural Networks, Volume 20
An Approach for System Development Using Evolutionary Probabilistic Strategy and Grammar Rules
Intelligent Engineering Systems through Artificial Neural Networks, Volume 16