Policy gradient method: Difference between revisions

Content deleted Content added
infobox
Line 340:
* {{Cite web |last=Weng |first=Lilian |date=2018-04-08 |title=Policy Gradient Algorithms |url=https://lilianweng.github.io/posts/2018-04-08-policy-gradient/ |access-date=2025-01-25 |website=lilianweng.github.io |language=en}}
* {{Cite web |title=Vanilla Policy Gradient — Spinning Up documentation |url=https://spinningup.openai.com/en/latest/algorithms/vpg.html |access-date=2025-01-25 |website=spinningup.openai.com}}
{{Artificial intelligence navbox}}
 
[[Category:Reinforcement learning]]
[[Category:Machine learning algorithms]]