Policy gradient method: Difference between revisions

Policy gradient: anchor REINFORCE
Line 27:
 
== REINFORCE ==
{{Anchor|REINFORCE}}
 
=== Policy gradient ===
Line 196 ⟶ 197:
* [[Reinforcement learning]]
* [[Deep reinforcement learning]]
* [[REINFORCE algorithm]]
* [[Actor-critic method]]