Content deleted Content added
Olexa Riznyk (talk | contribs) m →Physical and mathematical interpretation: Spelling/grammar/punctuation/typographical correction |
Olexa Riznyk (talk | contribs) Adding/removing wikilink(s) Spelling/grammar/punctuation/typographical correction |
||
Line 1:
{{FeatureDetectionCompVisNavbox}}
In [[image analysis]], the '''generalized structure tensor (GST)''' is an extension of the Cartesian [[structure tensor]] to [[curvilinear coordinates]].<ref name="bigun04pami3">{{cite journal |last1=Bigun |first1=J. |last2=Bigun |first2=T. |last3=Nilsson |first3=K. |title=Recognition by symmetry derivatives and the generalized structure tensor |journal=IEEE Transactions on Pattern Analysis and Machine Intelligence |date=December 2004 |volume=26 |issue=12 |pages=1590–1605 |doi=10.1109/TPAMI.2004.126|pmid=15573820 |url=http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-237 }}</ref> It is mainly used to detect and to represent the "direction" parameters of curves, just as the Cartesian structure tensor detects and represents the direction in Cartesian coordinates.
It is a widely known method in applications of image and video processing including computer vision, such as biometric identification by fingerprints,<ref name=fronthaler08tip>{{cite journal
Line 21:
|url=http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-1358
|citeseerx=10.1.1.160.6312
}}</ref>
title=Improvement in cytoarchitectonic mapping by combining electrodynamic modeling with local orientation in high-resolution images of the cerebral cortex|
journal=Microsc. Res. Tech.|
volume= 74|issue=3 |
year=2010|
pages= 225–243|
Line 31:
<ref name=Schmitt2>{{cite journal|author1=O. Schmitt |author2=M. Pakura |author3=T. Aach |author4=L. Homke |author5=M. Bohme |author6=S. Bock |author7=S. Preusse |
title=Analysis of nerve fibers and their distribution in histologic sections of the human brain|
journal=Microsc. Res. Tech.|
volume= 63|issue=4 |
year=2004|
pages= 220–243|doi=10.1002/jemt.20033 |pmid=14988920 |url=https://zenodo.org/record/3447120 }}</ref>
==GST in 2D and locally orthogonal
Let the term image represent a function
<math>f(\xi(x,y),\eta(x,y))</math>
where <math>x,y </math> are real variables and
<math>\xi,\eta </math>, and <math>f</math>, are real valued functions.
GST represents the direction along which the image <math>f</math> can undergo an infinitesimal translation with minimal ([[total least squares]]) error, along the "lines" fulfilling the following conditions:
1. The "lines" are ordinary lines in the curvilinear coordinate basis <math>\xi,\eta</math>
Line 47:
: <math> \cos(\theta) \xi(x,y)+\sin(\theta) \eta(x,y)= \text{constant} </math>
which are curves in Cartesian coordinates as depicted by the equation
2. The functions <math>\xi(x,y), \eta(x,y)</math> constitute a harmonic pair, i.e. they fulfill [[Cauchy–Riemann equations]],
Line 60:
Accordingly, such curvilinear coordinates <math>\xi,\eta</math> are locally orthogonal.
Then
: <math> GST=(\lambda_{max}-\lambda_{min})
\int w(\xi,\eta)\left[
\begin{array}{c}
\frac{\partial f}{\partial \xi} \\
Line 70:
[\frac{\partial f}{\partial \xi}, \frac{\partial f}{\partial \eta}] d\xi d\eta +\lambda_{min} I </math>
where <math> 0\le \lambda_{min}\le \lambda_{max}</math> are
:<math>
\begin{array}{c}
Line 77:
\end{array}
</math>
where <math>z=x+iy</math>.<ref>{{cite journal |last1=Bigun |first1=Josef |title=Pattern Recognition in Images by Symmetries and Coordinate Transformations |journal=Computer Vision and Image Understanding |date=December 1997 |volume=68 |issue=3 |pages=290–307 |doi=10.1006/cviu.1997.0556}}</ref>
Examples of analytic functions include
Thereby,
==Complex version of GST==
As there is a complex version of the ordinary [
:<math>
\begin{array}{c}
Line 89:
\kappa_{11} =\lambda_1+\lambda_2&=&|w|*|h*f|^2\\
\end{array}
</math>
which is
:<math>w=(x \pm iy)^n\exp(-(x^2+y^2)/(2\sigma^2))\propto(D_x \pm iD_y)^n\exp(-(x^2+y^2)/(2\sigma^2))
</math>,
a so called symmetry derivative of a Gaussian. Thus, the orientation wise variation of the pattern to be looked for is directly incorporated into the neighborhood defining function, and the detection occurs in the space of the (ordinary) structure tensor.
==Basic concept for its use in image processing and computer vision ==
Efficient detection of <math>\theta</math> in images is possible
<math>\lambda_{max}+\lambda_{min}</math> can be used as a quality (confidence, certainty) measure for the angle estimation.
Logarithmic spirals, including circles,
Generalized structure tensor can be used as an alternative to [[Hough transform]] in [[image processing]] and [[computer vision]] to detect patterns whose local orientations can be modelled, for example junction points. The main differences comprise:
Line 109:
==Physical and mathematical interpretation==
The curvilinear coordinates of GST can explain physical processes applied to images. A well known pair of
If an image <math>f</math> consists in iso-curves that can be explained by only <math>\xi</math> i.e. its iso-curves consist in circles
Zooming (comprising unzooming) operation is modeled similarly.
In combination,
Line 121:
is invariant to a certain amount of rotation combined with scaling, where the amount is precised by the parameter <math>\theta</math>.
Analogously, the Cartesian [[structure tensor]] is
: <math> \cos(\theta) x+\sin(\theta) y= \text{constant} </math>
where the amount is specified by the parameter
Generally, the estimated <math>\theta</math> represents the
With every curvilinear coordinate basis pair, there is thus a pair of
==Miscellaneous==
|