Revision as of 05:20, 21 January 2025 edit Cosmia Nebula (talk \| contribs) Extended confirmed users 11,304 edits unify notation Tag: 2017 wikitext editor ← Previous edit		Revision as of 08:14, 21 January 2025 edit undo GhostInTheMachine (talk \| contribs) Extended confirmed users, Page movers 106,441 edits Changing short description from "Class of reinforcement learning algorithms that directly optimize policy parameters by gradient ascent" to "Class of reinforcement learning algorithms" Tag: Shortdesc helper Next edit →
Line 1: {{Short description\|Class of reinforcement learning algorithms ~~that directly optimize policy parameters by gradient ascent~~}} '''Policy gradient methods''' are a class of [[reinforcement learning]] algorithms.

Policy gradient method: Difference between revisions