Markov decision process: Difference between revisions

Content deleted Content added
Arachnidly (talk | contribs)
deleted repetitive history info related to Markov chain
Arachnidly (talk | contribs)
Changing short description from "Mathematical model" to "Mathematical model for sequential decision making"
Line 1:
{{Short description|Mathematical model for sequential decision making}}
'''Markov decision process''' ('''MDP'''), also called a [[Stochastic dynamic programming|stochastic dynamic program]] or stochastic control problem, is a model for [[sequential decision making]] when [[Outcome (probability)|outcomes]] are uncertain.<ref>{{Cite book |last=Puterman |first=Martin L. |title=Markov decision processes: discrete stochastic dynamic programming |date=1994 |publisher=Wiley |isbn=978-0-471-61977-2 |series=Wiley series in probability and mathematical statistics. Applied probability and statistics section |___location=New York}}</ref>