Stochastic approximation: Difference between revisions

Consider the problem of estimating the mean <math>\theta^*</math> of a probability distribution from a stream of independent samples <math>X_1, X_2, \dots</math>.
 
Let <math>N(\theta) := \theta - X</math>; then the unique solution of <math display="inline">\operatorname E[N(\theta)] = 0</math> is the desired mean <math>\theta^*</math>. The Robbins–Monro algorithm gives the update<math display="block">\theta_{n+1}=\theta_n - a_n(\theta_n - X_n). </math>This is equivalent to [[stochastic gradient descent]] with loss function <math>L(\theta) = \frac 12 \|X - \theta\|^2 </math>, since <math>\nabla L(\theta) = \theta - X = N(\theta)</math>.
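As a minimal sketch of this update (in Python, assuming the common step sizes <math>a_n = 1/n</math>, which satisfy the Robbins–Monro conditions <math>\sum_n a_n = \infty</math> and <math>\sum_n a_n^2 < \infty</math>), the iteration reduces to the running sample mean:

```python
def rm_mean_estimate(samples, theta0=0.0):
    """Robbins–Monro iteration for mean estimation:
    theta_{n+1} = theta_n - a_n * (theta_n - x_n), with a_n = 1/n."""
    theta = theta0
    for n, x in enumerate(samples, start=1):
        a_n = 1.0 / n
        theta = theta - a_n * (theta - x)  # step toward the current sample
    return theta

# With a_n = 1/n the iterate after n samples is exactly their average:
# rm_mean_estimate([1.0, 2.0, 3.0]) == 2.0
```

With this step-size choice the first step sets <math>\theta_1 = X_1</math>, and each later step averages the new sample in, so the iterate coincides with the empirical mean; other admissible step sizes trade off how quickly old samples are forgotten.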
 
In general, if there exists some function <math>L</math> such that <math>\nabla L(\theta) = N(\theta) - \alpha </math>, then the Robbins–Monro algorithm is equivalent to stochastic gradient descent with loss function <math>L(\theta)</math>.