m v2.04b - Bot T20 CW#61 - Fix errors for CW project (Reference before punctuation) |
→Markov decision process: MDPs are not "closely related" to "reinforcement learning". Reinforcement learning is simply a non-mathematically rigorous application which tries to emulate MDPs. |
Line 31:
{{main|Markov decision process}}
A [[Markov decision process]] is a Markov chain in which state transitions depend on the current state and an action vector that is applied to the system. Typically, a Markov decision process is used to compute a policy of actions that will maximize some utility with respect to expected rewards.
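As an illustration of computing such a policy, the following is a minimal sketch of value iteration, one standard way to solve a small Markov decision process. The two-state, two-action model, its transition probabilities, its rewards, and the discount factor <code>GAMMA</code> are all hypothetical and chosen only for the example. The utility is assumed here to be the expected discounted sum of rewards.

```python
# Hypothetical two-state, two-action MDP, used only for illustration.
# P[s][a] is a list of (probability, next_state, reward) triples.
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(0.8, 1, 1.0), (0.2, 0, 0.0)]},
    1: {0: [(1.0, 1, 2.0)], 1: [(1.0, 0, 0.0)]},
}
GAMMA = 0.9  # assumed discount factor


def q_value(V, s, a):
    """Expected discounted return of taking action a in state s, then following V."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in P[s][a])


def value_iteration(tol=1e-9):
    """Apply the Bellman optimality update until the values stop changing."""
    V = {s: 0.0 for s in P}
    while True:
        new_V = {s: max(q_value(V, s, a) for a in P[s]) for s in P}
        if max(abs(new_V[s] - V[s]) for s in P) < tol:
            return new_V
        V = new_V


def greedy_policy(V):
    """Pick, in each state, the action with the highest expected return under V."""
    return {s: max(P[s], key=lambda a: q_value(V, s, a)) for s in P}


V = value_iteration()
policy = greedy_policy(V)
```

In this toy model the resulting policy takes action 1 in state 0 (to try to reach state 1) and action 0 in state 1 (to stay and keep collecting the reward of 2).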
==Partially observable Markov decision process==