Kernel density estimation

This is an old revision of this page, as edited by Oleg Alexandrov (talk | contribs) at 19:06, 3 February 2005 (more introduction). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The Parzen window method is a way of estimating the probability density function of a random variable. As an illustration, given some data about a sample of a population, the Parzen window method makes it possible to extrapolate the data to the entire population.

If x1, x2, ..., xN is a sample of a random variable, then the Parzen window approximation of its probability density function is

where W is some kernel. Quite often W is taken to be a Gaussian function with mean zero and variance σ2:

The Parzen window density estimate ρ(x) is in blue; the Gaussians which add up to ρ(x) are in red. Six sample points were considered. The variance of the Gaussians was set to 0.5. Note that where the points are denser, the density estimate has higher values.

See also

References

  • Parzen E. (1962). On estimation of a probability density function and mode, Ann. Math. Stat. 33, pp. 1065-1076.
  • Duda, R. and Hart, P. (1973). Pattern Classification and Scene Analysis. John Wiley & Sons. ISBN 0471223611.