Content deleted Content added
Line 221:
</math>So far, this is essentially the same as natural gradient method. However, TRPO improves upon it by two modifications:
* Use [[conjugate gradient method]] to solve for <math>
x
</math> in <math>Fx = g</math> iteratively without explicit matrix inversion.
|