Markov decision process: Difference between revisions

Content deleted Content added
m Fix typo
Fixed typo "agle" to "angle".
Line 51:
In this example, we have
 
* <math>S</math> is the set of ordered tuples <math>(\theta,\dot \theta, x, \dot x )\subset \mathbb R^4</math> given by pole agleangle, angular velocity, position of the cart and its speed.
* <math>A</math> is <math>\{-1,1\}</math>, corresponding to applying a force on the left (right) on the cart.
* <math>P_a(s, s')</math> is the transition of the system, which in this case is going to be deterministic and driven by the laws of mechanics.