Revision as of 10:32, 12 February 2020 by 194.39.218.10 (talk): No edit summary
Revision as of 21:37, 24 February 2020 by Citation bot (talk | contribs), minor edit: Alter: template type. Add: publisher, isbn, doi. Removed URL that duplicated unique identifier. Removed parameters. | Activated by User:AManWithNoPlan | via #UCB_toolbar
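Line 172:
== Monte Carlo version ==
− Sampled differential dynamic programming (SaDDP) is a Monte Carlo variant of differential dynamic programming.<ref>{{Cite web|url=https://ieeexplore.ieee.org/document/7759229|title=Sampled differential dynamic programming - IEEE Conference Publication|website=ieeexplore.ieee.org|language=en-US|access-date=2018-10-19}}</ref><ref>{{Cite web|url=https://ieeexplore.ieee.org/document/8430799|title=Regularizing Sampled Differential Dynamic Programming - IEEE Conference Publication|website=ieeexplore.ieee.org|language=en-US|access-date=2018-10-19}}</ref><ref>{{Cite journal|last=Joose|first=Rajamäki|date=2018|title=Random Search Algorithms for Optimal Control|url=http://urn.fi/URN:ISBN:978-952-60-8156-4|language=en|issn=1799-4942}}</ref> It is based on treating the quadratic cost of differential dynamic programming as the energy of a [[Boltzmann distribution]]. This way the quantities of DDP can be matched to the statistics of a [[Multivariate normal distribution|multidimensional normal distribution]]. The statistics can be recomputed from sampled trajectories without differentiation.
+ Sampled differential dynamic programming (SaDDP) is a Monte Carlo variant of differential dynamic programming.<ref>{{Cite document|title=Sampled differential dynamic programming - IEEE Conference Publication|language=en-US|doi=10.1109/IROS.2016.7759229}}</ref><ref>{{Cite web|url=https://ieeexplore.ieee.org/document/8430799|title=Regularizing Sampled Differential Dynamic Programming - IEEE Conference Publication|website=ieeexplore.ieee.org|language=en-US|access-date=2018-10-19}}</ref><ref>{{Cite book|last=Joose|first=Rajamäki|date=2018|title=Random Search Algorithms for Optimal Control|url=http://urn.fi/URN:ISBN:978-952-60-8156-4|language=en|issn=1799-4942|isbn=9789526081564|publisher=Aalto University}}</ref> It is based on treating the quadratic cost of differential dynamic programming as the energy of a [[Boltzmann distribution]]. This way the quantities of DDP can be matched to the statistics of a [[Multivariate normal distribution|multidimensional normal distribution]]. The statistics can be recomputed from sampled trajectories without differentiation.
== See also ==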
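
The idea stated in the Monte Carlo section can be illustrated with a small numerical sketch: a quadratic cost, read as the energy of a Boltzmann distribution at unit temperature, defines a multivariate normal distribution whose mean is the cost minimum and whose covariance is the inverse Hessian. The example below is not the SaDDP algorithm from the cited works. It is a minimal sketch using self-normalized importance sampling, and the names `boltzmann_statistics`, `quadratic_cost`, `H`, `m`, and `sigma` are hypothetical choices made for illustration only.

```python
import numpy as np

def boltzmann_statistics(costs, samples, proposal_logpdf, temperature=1.0):
    """Weighted mean and covariance of samples under p(x) ~ exp(-cost(x) / T).

    Only cost *values* at the sampled points are used; no derivatives of
    the cost are needed. Additive constants in proposal_logpdf cancel
    because the weights are normalized.
    """
    log_w = -costs / temperature - proposal_logpdf
    log_w -= log_w.max()              # stabilize the exponentials
    w = np.exp(log_w)
    w /= w.sum()                      # self-normalized importance weights
    mean = w @ samples
    centered = samples - mean
    cov = (centered * w[:, None]).T @ centered
    return mean, cov

# Assumed toy problem: cost(x) = 0.5 * (x - m)^T H (x - m)
H = np.array([[2.0, 0.5],
              [0.5, 1.0]])
m = np.array([1.0, -1.0])

def quadratic_cost(xs):
    """Evaluate the quadratic cost for a batch of points of shape (N, 2)."""
    d = xs - m
    return 0.5 * np.einsum("ni,ij,nj->n", d, H, d)

rng = np.random.default_rng(0)
sigma = 3.0                            # broad isotropic Gaussian proposal
samples = rng.normal(0.0, sigma, size=(100_000, 2))
logq = -0.5 * np.sum(samples**2, axis=1) / sigma**2   # up to a constant

mean_est, cov_est = boltzmann_statistics(quadratic_cost(samples), samples, logq)
hessian_est = np.linalg.inv(cov_est)   # inverse covariance approximates H
```

Under these assumptions `mean_est` approximates `m` and `hessian_est` approximates `H`, so the second-order quantities are recovered from sampled cost values alone, without differentiating the cost.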

Differential dynamic programming: Difference between revisions