| Markov Decision Process |
Article Index for Markov |
Website Links For Markov |
Information About ™Markov Decision Process |
| CATEGORIES ABOUT MARKOV DECISION PROCESS | |
| stochastic processes | |
| machine learning | |
| SHOPPER'S DELIGHT | |
|
A Markov Decision Process (MDP) is a Discrete Time Stochastic Control Process characterized by a set of states, actions, and a state transition function (usually a Transition Probability Matrix for discrete state- and action-spaces). An MDP also posesses the Markov Property . This means that if the current state of the MDP at time is known, transitions to a new state at time are independent of all previous states. MDPs are useful for studying a wide range of Optimization Problem s solved via Dynamic Programming and Reinforcement Learning . They are used in a variety of areas, including robotics, automated control, economics and in fabrication/fulfillment processes. DEFINITION A Markov Decision Process is a tuple , where
|