Policy gradient method: Difference between revisions

Content deleted Content added
Line 221:
</math>So far, this is essentially the same as natural gradient method. However, TRPO improves upon it by two modifications:
 
* Use [[conjugate gradient method]] to solve for <math>
x
</math> in <math>Fx = g</math> iteratively without explicit matrix inversion.