==Efficiency in robotics==
A high sample complexity means that many calculations are needed to run a [[Monte Carlo tree search]].<ref>{{cite conference |title=Monte-Carlo tree search by best arm identification |author=Kaufmann, Emilie and Koolen, Wouter M |conference=Advances in Neural Information Processing Systems |pages=4897–4906 |year=2017 }}</ref> This is equivalent to a [[Model-free (reinforcement learning)|model-free]] brute-force search of the state space. In contrast, a highly efficient algorithm has a low sample complexity.<ref>{{cite conference |title=The chin pinch: A case study in skill learning on a legged robot |author=Fidelman, Peggy and Stone, Peter |conference=Robot Soccer World Cup |pages=59–71 |year=2006 |publisher=Springer }}</ref> Possible techniques for reducing the sample complexity are [[metric learning]]<ref>{{cite conference |title=Sample complexity of learning Mahalanobis distance metrics |author=Verma, Nakul and Branson, Kristin |conference=Advances in Neural Information Processing Systems |pages=2584–2592 |year=2015 }}</ref> and model-based reinforcement learning,<ref>{{cite arXiv |title=Model-ensemble trust-region policy optimization |author=Kurutach, Thanard and Clavera, Ignasi and Duan, Yan and Tamar, Aviv and Abbeel, Pieter |eprint=1802.10592 |year=2018 |class=cs.LG }}</ref> as illustrated by the sketch below.
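The contrast can be made concrete with a small simulation. The following sketch is illustrative only: the toy environment, the function and variable names, and the sample counts are assumptions for exposition, not taken from the cited works. It counts how many real-environment transitions a model-free random search consumes compared with a model-based variant that fits a dynamics model from a handful of transitions and then plans entirely in simulation.

<syntaxhighlight lang="python">
# Illustrative sketch (assumed setup, not from the cited references): compare the
# number of real-environment samples used by model-free search versus model-based
# planning on a toy 1-D control problem.

import random

class LineWorld:
    """Toy 1-D environment: the state moves by the chosen action each step."""
    def __init__(self):
        self.samples_used = 0  # counts real-environment transitions

    def step(self, state, action):
        self.samples_used += 1
        return state + action  # deterministic dynamics, unknown to the agent

GOAL, ACTIONS, HORIZON = 10.0, [-1.0, 0.0, 1.0], 12

def rollout(dynamics, plan):
    """Return the final distance to the goal after following a fixed action plan."""
    state = 0.0
    for action in plan:
        state = dynamics(state, action)
    return abs(state - GOAL)

# Model-free: every candidate plan is evaluated on the real environment.
env = LineWorld()
best = min((random.choices(ACTIONS, k=HORIZON) for _ in range(200)),
           key=lambda p: rollout(env.step, p))
print("model-free samples:", env.samples_used)        # 200 plans x 12 steps = 2400

# Model-based: a few real transitions fit a model, planning happens in simulation.
env = LineWorld()
data = [(0.0, a, env.step(0.0, a)) for a in ACTIONS]  # 3 real transitions
effect = {a: s2 - s1 for s1, a, s2 in data}           # learned per-action effect
model = lambda s, a: s + effect[a]                    # simulated step, costs no samples
best = min((random.choices(ACTIONS, k=HORIZON) for _ in range(200)),
           key=lambda p: rollout(model, p))
print("model-based samples:", env.samples_used)       # only 3 real transitions
</syntaxhighlight>

Both searches evaluate the same number of candidate plans, but the model-based variant pays for real-world data only while fitting its model, which is the sense in which such methods lower sample complexity.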
==See also==
* [[Active learning]]
==References==