Revision as of 22:41, 22 November 2024 edit Cosmia Nebula (talk \| contribs) Extended confirmed users 11,304 edits m →Example Tag: Visual edit ← Previous edit		Revision as of 22:49, 22 November 2024 edit undo Cosmia Nebula (talk \| contribs) Extended confirmed users 11,304 edits →Example: weighted average Tag: Visual edit Next edit →
Line 27: Consider the problem of estimating the mean <math>\theta^</math> of a probability distribution from a stream of independent samples <math>X_1, X_2, \dots</math>. Let <math>N(\theta) := \theta - X</math>, then the unique solution to <math display="inline">\operatorname E[N(\theta)] = 0</math> is the desired mean <math>\theta^</math>. The RM algorithm gives us<math display="block">\theta_{n+1}=\theta_n - a_n(\theta_n - X_n) </math>This is equivalent to [[stochastic gradient descent]] with loss function <math>L(\theta) = \frac 12 \\|X - \theta\\|^2 </math>. It is also equivalent to a weighted average:<math display="block">\theta_{n+1}=(1-a_n)\theta_n + a_n X_n </math>In general, if there exists some function <math>L</math> such that <math>\nabla L(\theta) = N(\theta) - \alpha </math>, then the Robbins–Monro algorithm is equivalent to stochastic gradient descent with loss function <math>L(\theta)</math>. However, the RM algorithm does not require <math>L</math> to exist in order to converge. In general, if there exists some function <math>L</math> such that <math>\nabla L(\theta) = N(\theta) - \alpha </math>, then the Robbins–Monro algorithm is equivalent to stochastic gradient descent with loss function <math>L(\theta)</math>. However, the RM algorithm does not require <math>L</math> to exist in order to converge. ===Complexity results===

Stochastic approximation: Difference between revisions