Policy gradient method: Difference between revisions

Content deleted Content added
Changing short description from "Class of reinforcement learning algorithms that directly optimize policy parameters by gradient ascent" to "Class of reinforcement learning algorithms"
Added tags to the page using Page Curation (unreliable sources)
Tags: Reverted PageTriage
Line 1:
{{unreliable sources|date=January 2025}}
{{Short description|Class of reinforcement learning algorithms}}