Content deleted Content added
Seliseli88 (talk | contribs) No edit summary |
Seliseli88 (talk | contribs) No edit summary |
||
Line 3:
{{FeatureDetectionCompVisNavbox}}
Let the term image represent a function
Generalized Structure Tensor, GST, is an extension of the Cartesian [[Structure Tensor]] to the [[Curvilinear coordinates]]
To be precise, GST represents the direction along which the image <math>f</math> can undergo an infinitesimal translation with minimal error, along the "lines" fulfilling the following conditions<ref name=bigun04pami3> {{cite article|
author = J. Bigun and T. Bigun and K. Nilsson|▼
title = Recognition by symmetry derivatives and the generalized structure tensor|▼
journal = IEEE trans. Pattern Analysis and Machine Intelligence|▼
pages = 1590--1605|▼
volume = 26|▼
year = 2004|▼
}}</ref>▼
:
1. The "lines" are ordinary lines in the curvilinear coordinate basis
$$ \cos(\theta) \xi(x,y)+\sin(\theta) \eta(x,y)= Constant$$▼
which are curves in Cartesian coordinates as depicted by the equation above. The error is measured in the $L^2$ norm and the minimality of the error refers to [[$L^2$ norm]]. ▼
2. The functions $\xi(x,y), \eta(x,y)$ constitute a harmonic pair, i.e. they fulfill Cauchy-Riemann conditions. Thus, the curvilinear coordinates, of the Generalized Structure Tensor are locally orthogonal coordinates. ▼
▲which are curves in Cartesian coordinates as depicted by the equation above. The error is measured in the
▲2. The functions
The ordinary structure tensor is thus a representation of a translation too. Here the physical process is the ordinary translation, i.e. $\xi,\eta$ are the trival identity transformations, $\xi=x$, $\eta=y$. ▼
Efficient detection of <math>\theta</math> in images is possible by image processing, if the pair <math>\xi</math>, <math>\eta</math> is given. Logarithmic spirals, including circles, can for instance be detected by (complex) convolutions<ref name=bigun04pami3 />. The spirals can be iso-curves in a gray valued image i.e. the image must not be a binary image, nor must its edges be marked.
Image in the context of the GST means both an ordinary image and an image neighborhood therein (local image), the context determining. For example, a photograph as well as any neighborhood of it are images. ▼
*With one template multiple patterns belonging to the same family can be detected
The curvilinear coordinates of GST can explain physical processes applied to images. Two of the most known such processes consist in rotation, and zooming. The first process is related to the transformation <math>\xi=\log(\sqrt{x^2+y^2})</math>. If an image <math>f</math> consists in iso-curves that can be explained by such a transformation, i.e. its iso-curves consist in circles <math>f(\xi,\eta)=g(\xi)</math>, where <math>g </math> is any real valued function defined on 1D, the image is invariant to rotations (around the origin).
Likewise the second process, zooming (comprising unzooming) is explained by <math>f(\xi,\eta)=g(\eta)</math> with <math>\eta=\tan^{-1}(x,y)</math>. Such a function <math>f</math> is invariant to scaling, i.e. zooming/dezooming (w.r.t. the origin).
▲}}</ref>
In combination,
▲ author = J. Bigun and T. Bigun and K. Nilsson|
▲ title = Recognition by symmetry derivatives and
▲ journal = IEEE trans. Pattern Analysis and Machine Intelligence|
▲ pages = 1590--1605|
▲ volume = 26|
▲ year = 2004|
<math>f(\xi,\eta)=g( \cos(\theta) \log(\sqrt{x^2+y^2})+\sin(\theta) \tan^{-1}(x,y))</math>
is invariant to a certain amount of rotation combined with scaling, where the amount is precised by the parameter <math>\theta</math>.
▲
<math> \begin{align}\cos(\theta) x+\sin(\theta) y= Constant \end{align}</math>
where the amount is precised by the parameter <math>\theta</math>. Evidently <math>\theta</math> here represents the direction of the line.
▲Image in the context of the GST means both an ordinary image and an image neighborhood therein (local image), the context determining. For example, a photograph as well as any neighborhood of it are images.
▲ The Generalized structure tensor can be used as an alternative to [[Hough Transform]] in [[image processing]] and [[computer vision]]. The main differences comprise:
▲*Negative voting is allowed
▲*With one template multiple patterns belonging to the same family can be detected, because not nonly negative but also Complex Voting is allowed.
== See also ==
Line 58:
*[[Corner detection]]
*[[Edge detection]]
*[[Affine shape adaptation]]
|