The increasing complexity of engineering systems has motivated continuing research on computational learning methods toward making autonomous intelligent systems that can learn how to improve their performance over time while interacting with their environment. These systems need not only to sense their environment, but also to integrate information from the environment into all decision-makings. The evolution of such systems is modeled as an unknown controlled Markov chain. In a previous research, the predictive optimal decision-making (POD) model was developed, aiming to learn in real time the unknown transition probabilities and associated costs over a varying finite time horizon. In this paper, the convergence of the POD to the stationary distribution of a Markov chain is proven, thus establishing the POD as a robust model for making autonomous intelligent systems. This paper provides the conditions that the POD can be valid, and be an interpretation of its underlying structure.
Skip Nav Destination
e-mail: amaliko@umich.edu
Article navigation
July 2009
Research Papers
Convergence Properties of a Computational Learning Model for Unknown Markov Chains
Andreas A. Malikopoulos
Andreas A. Malikopoulos
Department of Mechanical Engineering,
e-mail: amaliko@umich.edu
University of Michigan
, Ann Arbor, MI 48109
Search for other works by this author on:
Andreas A. Malikopoulos
Department of Mechanical Engineering,
University of Michigan
, Ann Arbor, MI 48109e-mail: amaliko@umich.edu
J. Dyn. Sys., Meas., Control. Jul 2009, 131(4): 041011 (7 pages)
Published Online: May 20, 2009
Article history
Received:
March 18, 2008
Revised:
February 4, 2009
Published:
May 20, 2009
Citation
Malikopoulos, A. A. (May 20, 2009). "Convergence Properties of a Computational Learning Model for Unknown Markov Chains." ASME. J. Dyn. Sys., Meas., Control. July 2009; 131(4): 041011. https://doi.org/10.1115/1.3117202
Download citation file:
Get Email Alerts
Modeling and Experimental Validation of Flow Ripple in a Variable Displacement Linkage Pump
J. Dyn. Sys., Meas., Control (July 2025)
Parametrized Maximal Admissible Sets With Application to Constraint Management of Systems With Slowly Varying Parameters
J. Dyn. Sys., Meas., Control (July 2025)
An Integrated Sensor Fault Estimation and Fault-Tolerant Control Design Approach for Continuous-Time Switched Systems
J. Dyn. Sys., Meas., Control (July 2025)
Stabilization of a Rigid Body in Two Fluids With Surface Tension
J. Dyn. Sys., Meas., Control (September 2025)
Related Articles
A Real-Time Computational Learning Model for Sequential Decision-Making Problems Under Uncertainty
J. Dyn. Sys., Meas., Control (July,2009)
An Approach for Testing Methods for Modeling Uncertainty
J. Mech. Des (September,2006)
A Set of Estimation and Decision Preference Experiments for Exploring Risk Assessment Biases in Engineering Students
ASME J. Risk Uncertainty Part B (March,2023)
Related Proceedings Papers
Related Chapters
Decision Making in Two-Dimensional Warranty Planning (PSAM-0186)
Proceedings of the Eighth International Conference on Probabilistic Safety Assessment & Management (PSAM)
Predicting the Learning Performance of Artificial Intelligent Systems Using Non-Homogeneous Poisson Process Models
Intelligent Engineering Systems Through Artificial Neural Networks, Volume 17
Model-Building for Robust Reinforcement Learning
Intelligent Engineering Systems through Artificial Neural Networks, Volume 20