Pyramid (image processing)

This is an old revision of this page, as edited by Tpl (talk | contribs) at 16:07, 26 February 2008 (Minor update with references). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Pyramids or 'pyramid representation' is a type of multi-scale signal representation developed by the computer vision, image processing and signal processing communities in which a signal or an image is subject to repeated smoothing and subsampling. Historically, pyramid representation is a predecessor to scale-space representation and multiresolution analysis.

Pyramid generation

There are two main types of pyramids; lowpass pyramids and bandpass pyramids. A lowpass pyramid is generated by first smoothing the image with an appropriate smoothing filter and then subsampling the smoothed image, usually by a factor of two along each coordinate direction. As this process proceeds, the result will be a set of gradually more smoothed images where in addition the spatial sampling density decreases level by level. If illustrated graphically, this multi-scale representation will look like a pyramid, from which the name has been obtained. A bandpass pyramid is obtained by forming the difference between adjacent levels in a pyramid, where in addition some kind of interpolation is performed between representations at different resolution, enabling the computation of differences.

Pyramid generation kernels

A variety of different smoothing kernels have proposed for generating pyramids (Burt 1981; Crowley 1981; Burt and Adelson 1983; Crowley and Sanderson 1984; Meer et al 1987). . Among the suggestions that have been given, the binomial kernels arising from the binomial coefficient stand out as a particularly useful and theoretically well-founded class (Crowley 1981; Lindeberg 1990, 1994).

References

Lindeberg, T., "Scale-space for discrete signals," PAMI(12), No. 3, March 1990, pp. 234-254.


See also