unify notation
Changing short description from "Class of reinforcement learning algorithms that directly optimize policy parameters by gradient ascent" to "Class of reinforcement learning algorithms"
Line 1:
{{Short description|Class of reinforcement learning algorithms}}
'''Policy gradient methods''' are a class of [[reinforcement learning]] algorithms.