Policy gradient method: Difference between revisions

unify notation
Changing short description from "Class of reinforcement learning algorithms that directly optimize policy parameters by gradient ascent" to "Class of reinforcement learning algorithms"
Line 1:
{{Short description|Class of reinforcement learning algorithms that directly optimize policy parameters by gradient ascent}}
 
'''Policy gradient methods''' are a class of [[reinforcement learning]] algorithms that directly optimize the parameters of a policy by gradient ascent on the expected return.
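
As a rough illustration, the sketch below applies a REINFORCE-style update (one simple policy gradient method) to a two-armed bandit. The bandit setting, the softmax parameterization, the noise level, and all names (<code>softmax</code>, <code>reinforce_bandit</code>, <code>mean_rewards</code>, <code>lr</code>) are assumptions chosen for this example; it is a minimal sketch, not a reference implementation.

```python
import math
import random


def softmax(prefs):
    """Turn a list of real-valued preferences into action probabilities."""
    m = max(prefs)  # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_bandit(mean_rewards, steps=2000, lr=0.1, seed=0):
    """Hypothetical helper: REINFORCE-style gradient ascent on a bandit.

    The policy is softmax(theta), with one parameter per arm (an assumption).
    Returns the action probabilities after training.
    """
    rng = random.Random(seed)
    theta = [0.0] * len(mean_rewards)
    for _ in range(steps):
        probs = softmax(theta)
        # Sample an action from the current policy.
        action = rng.choices(range(len(theta)), weights=probs)[0]
        # Assumed reward model: arm mean plus small Gaussian noise.
        reward = rng.gauss(mean_rewards[action], 0.1)
        # For a softmax policy, d/d theta_k of log pi(action)
        # equals 1[k == action] - probs[k].
        for k in range(len(theta)):
            grad_log_pi = (1.0 if k == action else 0.0) - probs[k]
            # Gradient ascent step on a sample estimate of expected reward.
            theta[k] += lr * reward * grad_log_pi
    return softmax(theta)


if __name__ == "__main__":
    # Arm 1 has the higher mean reward, so its probability should grow.
    print(reinforce_bandit([0.0, 1.0]))
```

The update adjusts the policy parameters directly, without first learning a value function; practical methods usually add a baseline to reduce the variance of the gradient estimate, which this sketch omits.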