Content deleted Content added
Removed a citation needed, The number of bins k, assuming equal bin width, will always be k = range of data / bin width no citation should be needed. |
fixing references |
||
Line 147:
:<math> \sigma_{g_1} = \sqrt { \frac { 6(n-2) }{ (n+1)(n+3) } } </math>
;Scott's normal reference rule:
:<math>h = \frac{3.5 \hat \sigma}{n^{1/3}},</math>
where <math>\hat \sigma</math> is the sample [[standard deviation]]. Scott's normal reference rule<ref name=scott79>{{cite journal |last=Scott |first=David W. |year=1979 |title=On optimal and data-based histograms |journal=Biometrika |volume=66 |issue=3|pages=605–610 |doi=10.1093/biomet/66.3.605}}</ref> is optimal for random samples of normally distributed data, in the sense that it minimizes the integrated mean squared error of the density estimate.<ref name=scott92>{{cite book|last=Scott|first=
;Freedman–Diaconis' choice
Line 161 ⟶ 162:
which is based on the [[interquartile range]], denoted by IQR. It replaces 3.5σ of Scott's rule with 2 IQR, which is less sensitive than the standard deviation to outliers in data.
; Choice based on minimization of an estimated ''L''<sup>2</sup>
:<math> \underset{h}{\operatorname{arg\,min}} \frac{ 2 \bar{m} - v } {h^2} </math>
|