Reassignment method: Difference between revisions

Content deleted Content added
Rescuing 1 sources and tagging 0 as dead.) #IABot (v2.0.9.2) (Whoop whoop pull up - 11071
added refimprove tag
Line 1:
{{refimprove}}
The '''method of reassignment''' is a technique for
The '''method of reassignment''' is a technique for sharpening a [[time-frequency representation]] by mapping the data to time-frequency coordinates that are nearer to the true [[Support (mathematics)|region of support]] of the analyzed signal. The method has been independently introduced by several parties under various names, including ''method of reassignment'', ''remapping'', ''time-frequency reassignment'', and ''modified moving-window method''.<ref name="hainsworth">{{Cite thesis |type=PhD |chapter=Chapter 3: Reassignment methods |title=Techniques for the Automated Analysis of Musical Audio |last=Hainsworth |first=Stephen |year=2003 |publisher=University of Cambridge |citeseerx=10.1.1.5.9579 }}</ref> In
sharpening a [[time-frequency representation]] by mapping
the case of the [[spectrogram]] or the [[short-time Fourier transform]], the method of reassignment sharpens blurry time-frequency data by relocating the data according to local estimates of instantaneous frequency and group delay. This mapping to reassigned time-frequency coordinates is very precise for signals that are separable in time and frequency with respect to the analysis window.
the data to time-frequency coordinates that are nearer to
the true [[Support (mathematics)|region of support]] of the
analyzed signal. The method has been independently
introduced by several parties under various names, including
''method of reassignment'', ''remapping'', ''time-frequency reassignment'',
and ''modified moving-window method''.<ref name="hainsworth">{{Cite thesis |type=PhD |chapter=Chapter 3: Reassignment methods |title=Techniques for the Automated Analysis of Musical Audio |last=Hainsworth |first=Stephen |year=2003 |publisher=University of Cambridge |citeseerx=10.1.1.5.9579 }}</ref> In
the case of the [[spectrogram]] or the [[short-time Fourier transform]],
the method of reassignment sharpens blurry
time-frequency data by relocating the data according to
local estimates of instantaneous frequency and group delay.
This mapping to reassigned time-frequency coordinates is
very precise for signals that are separable in time and
frequency with respect to the analysis window.
 
== Introduction ==
 
[[Image:Reassigned spectrogral surface of bass pluck.png|thumb|400px|
Reassigned spectral surface for the onset of an acoustic bass tone having a sharp pluck and a fundamental frequency of approximately 73.4&nbsp;Hz. Sharp spectral ridges representing the harmonics are evident, as is the abrupt onset of the tone. The spectrogram was computed using a 65.7 ms Kaiser window with a shaping parameter of 12.]]
Reassigned spectral surface for the onset of an acoustic bass tone
having a sharp pluck and a fundamental frequency of approximately 73.4&nbsp;Hz.
Sharp spectral ridges representing the harmonics are evident, as is the
abrupt onset of the tone.
The spectrogram was computed using a 65.7 ms Kaiser window with a shaping
parameter of 12.]]
 
Many signals of interest have a distribution of energy that varies in time and frequency. For example, any sound signal having a beginning or an end has an energy distribution that varies in time, and most sounds exhibit considerable variation in both time and frequency over their duration. Time-frequency representations are commonly used to analyze or characterize such signals. They map the one-dimensional time-___domain signal into a two-dimensional function of time and frequency. A time-frequency representation describes the variation of spectral energy distribution over time, much as a musical score describes the variation of musical pitch over time.
Many signals of interest have a distribution of energy that
varies in time and frequency. For example, any sound signal
having a beginning or an end has an energy distribution that
varies in time, and most sounds exhibit considerable
variation in both time and frequency over their duration.
Time-frequency representations are commonly used to analyze
or characterize such signals. They map the one-dimensional
time-___domain signal into a two-dimensional function of time
and frequency. A time-frequency representation describes the
variation of spectral energy distribution over time, much as
a musical score describes the variation of musical pitch
over time.
 
In audio signal analysis, the spectrogram is the most commonly used time-frequency representation, probably because it is well understood, and immune to so-called "cross-terms" that sometimes make other time-frequency representations difficult to interpret. But the windowing operation required in spectrogram computation introduces an unsavory tradeoff between time resolution and frequency resolution, so spectrograms provide a time-frequency representation that is blurred in time, in frequency, or in both dimensions. The method of time-frequency reassignment is a technique for refocussing time-frequency data in a blurred representation like the spectrogram by mapping the data to time-frequency coordinates that are nearer to the true region of support of the analyzed signal.
In audio signal analysis, the spectrogram is the most
commonly used time-frequency representation, probably
because it is well understood, and immune to so-called
"cross-terms" that sometimes make other time-frequency
representations difficult to interpret. But the windowing
operation required in spectrogram computation introduces an unsavory tradeoff between time resolution and frequency
resolution, so spectrograms provide a time-frequency
representation that is blurred in time, in frequency, or in
both dimensions. The method of time-frequency reassignment
is a technique for refocussing time-frequency data in a blurred representation like the spectrogram by mapping the
data to time-frequency coordinates that are nearer to the
true region of support of the analyzed signal.
 
== The spectrogram as a time-frequency representation ==
{{main|Spectrogram}}
 
One of the best-known time-frequency representations is the spectrogram, defined as the squared magnitude of the short-time Fourier transform. Though the short-time phase spectrum is known to contain important temporal information about the signal, this information is difficult to interpret, so typically, only the short-time magnitude spectrum is considered in short-time spectral analysis.
One of the best-known time-frequency representations is the
spectrogram, defined as the squared magnitude of the
short-time Fourier transform. Though the short-time phase
spectrum is known to contain important temporal information
about the signal, this information is difficult to
interpret, so typically, only the short-time magnitude
spectrum is considered in short-time spectral analysis.
 
As a time-frequency representation, the spectrogram has relatively poor resolution. Time and frequency resolution are governed by the choice of analysis window and greater concentration in one ___domain is accompanied by greater smearing in the other.
relatively poor resolution. Time and frequency resolution
are governed by the choice of analysis window and greater
concentration in one ___domain is accompanied by greater
smearing in the other.
 
A time-frequency representation having improved resolution, relative to the spectrogram, is the [[Wigner&ndash;Ville distribution]], which may be interpreted as a short-time Fourier transform with a window function that is perfectly matched to the signal. The Wigner&ndash;Ville distribution is highly concentrated in time and frequency, but it is also highly nonlinear and non-local. Consequently, this
A time-frequency representation having improved resolution,
distribution is very sensitive to noise, and generates cross-components that often mask the components of interest, making it difficult to extract useful information concerning the distribution of energy in multi-component signals.
relative to the spectrogram, is the [[Wigner&ndash;Ville distribution]],
which may be interpreted as a short-time
Fourier transform with a window function that is perfectly
matched to the signal. The Wigner&ndash;Ville distribution is
highly concentrated in time and frequency, but it is also
highly nonlinear and non-local. Consequently, this
distribution is very sensitive to noise, and generates
cross-components that often mask the components of interest,
making it difficult to extract useful information concerning
the distribution of energy in multi-component signals.
 
[[Cohen's class distribution function|Cohen's class]] of bilinear time-frequency representations is a class of "smoothed" Wigner&ndash;Ville distributions, employing a smoothing kernel that can reduce sensitivity of the distribution to noise and suppresses cross-components, at the expense of smearing the distribution in time and frequency. This smearing causes the distribution to be non-zero in regions where the true Wigner&ndash;Ville distribution shows no energy.
[[Cohen's class distribution function|Cohen's class]] of
bilinear time-frequency representations is a class of
"smoothed" Wigner&ndash;Ville distributions, employing a smoothing
kernel that can reduce sensitivity of the distribution to
noise and suppresses cross-components, at the expense of
smearing the distribution in time and frequency. This
smearing causes the distribution to be non-zero in regions
where the true Wigner&ndash;Ville distribution shows no energy.
 
The spectrogram is a member of Cohen's class. It is a smoothed Wigner&ndash;Ville distribution with the smoothing kernel equal to the Wigner&ndash;Ville distribution of the analysis window. The method of reassignment smooths the Wigner&ndash;Ville distribution, but then refocuses the distribution back to the true regions of support of the signal components. The method has been shown to reduce time and frequency smearing of any member of Cohen's class
equal to the Wigner&ndash;Ville distribution of the analysis
window. The method of reassignment smooths the Wigner&ndash;Ville
distribution, but then refocuses the distribution back to
the true regions of support of the signal components. The
method has been shown to reduce time and frequency smearing
of any member of Cohen's class
<ref name = "improving">
{{cite journal |author1=F. Auger |author2=P. Flandrin |name-list-style=amp |date=May 1995 |title=Improving the readability of time-frequency and time-scale representations by the reassignment method |journal=IEEE Transactions on Signal Processing |volume=43 |issue=5 |pages=1068–1089 |doi=10.1109/78.382394 |bibcode=1995ITSP...43.1068A |citeseerx=10.1.1.646.794 }}