Revision as of 20:34, 8 October 2024 edit Cosmia Nebula (talk \| contribs) Extended confirmed users 11,304 edits →Interpretation: improvement Tag: Visual edit ← Previous edit		Revision as of 20:35, 8 October 2024 edit undo Cosmia Nebula (talk \| contribs) Extended confirmed users 11,304 edits →Improvements Tag: Visual edit Next edit →
Line 108: \sigma^2 &= (\alpha E[x]^2 + (1 - \alpha) \mu_{x^2, \text{train}}) - \mu^2 \end{aligned} </math>where <math>\alpha</math> is a hyperparameter to be optimized on a validation set. ~~</math>~~ == Layer normalization ==

Normalization (machine learning): Difference between revisions