Multivariate kernel density estimation: Difference between revisions

Content deleted Content added
m Objective and data-driven kernel selection: task, replaced: journal = Journal of the Royal Statistical Society, Series B (Statistical Methodology) → journal = Journal of the Royal Statist using AWB
m fix tag issue
Line 50:
: <math>\operatorname{MISE} (\bold{H}) = \operatorname{AMISE} (\bold{H}) + o(n^{-1} |\bold{H}|^{-1/2} + \operatorname{tr} \, \bold{H}^2)</math>
 
where ''o'' indicates the usual [[big O notation|small o notation]]. Heuristically this statement implies that the AMISE is a 'good' approximation of the MISE as the sample size <var>n</var> → ∞.
 
It can be shown that any reasonable bandwidth selector '''H''' has '''H''' = ''O''(''n''<sup>−2/(''d''+4)</sup>) where the [[big O notation]] is applied elementwise. Substituting this into the MISE formula yields that the optimal MISE is ''O''(''n''<sup>−4/(''d''+4)</sup>).<ref name="WJ1995"/>{{rp|99–100}} Thus as ''n'' → ∞, the MISE → 0, i.e. the kernel density estimate [[convergence in mean|converges in mean square]] and thus also in probability to the true density ''f''. These modes of convergence are confirmation of the statement in the motivation section that kernel methods lead to reasonable density estimators. An ideal optimal bandwidth selector is