Content deleted Content added
Submitting using AfC-submit-wizard |
Citation bot (talk | contribs) Alter: pages. Add: pmc, arxiv, bibcode, pmid, doi, url, s2cid. Formatted dashes. | Use this bot. Report bugs. | Suggested by Eastmain | Category:AfC pending submissions by age/1 day ago | #UCB_Category 19/85 |
||
Line 18:
=Mean and variance=
For a probability measure <math>\nu \in \mathcal{F}</math>, consider a [[stochastic process|random process]] <math>\mathfrak{F}</math> such that <math>\nu \sim \mathfrak{F}</math>. One way to define mean and variance of <math>\nu</math> is to introduce the [[Fréchet mean]] and the Fréchet variance. With respect to the metric <math>d</math> on <math>\mathcal{F}</math>, the ''Fréchet mean'' <math>\mu_\oplus</math>, also known as the [[barycenter]], and the ''Fréchet variance'' <math>V_\oplus</math> are defined as<ref>{{Cite journal|last1=Fréchet|first1=M.|date=1948|title=Les éléments aléatoires de nature quelconque dans un espace distancié|journal=Annales de l'Institut Henri Poincaré|volume=10|issue=4|pages=
<math display="block">\begin{align}
\mu_\oplus &= \operatorname{argmin}_{\mu \in \mathcal{F}} \mathbb{E}[d^2(\nu, \mu)], \\
Line 24:
\end{align}</math>
A widely used example is the Wasserstein-Fréchet mean, or simply the ''Wasserstein mean'', which is the Fréchet mean with the <math>L^2</math>-Wasserstein metric <math>d_{W_2}</math>.<ref>{{Cite journal|last1=Agueh|first1=A.|last2=Carlier|first2=G.|date=2011|title=Barycenters in the {Wasserstein} space|journal=SIAM Journal on Mathematical Analysis|volume=43|issue=2|pages=
<math display="block">\begin{align}
\mu_\oplus^* &= \operatorname{argmin}_{\mu \in \mathcal{W}_2} \mathbb{E} \left[ \int_0^1 (Q_\nu (s) - Q_\mu (s))^2 ds \right], \\
Line 36:
==Functional principal component analysis==
[[Functional principal component analysis|Functional principal component analysis(FPCA)]] can be directly applied to the probability density functions.<ref>{{Cite journal|last1=Kneip|first1=A.|last2=Utikal|first2=K.J.|date=2001|title=Inference for density families using functional principal component analysis|journal=Journal of the American Statistical Association|volume=96|issue=454|pages=
By the Karhunen-Loève theorem, <math>
Line 47:
==Transformation FPCA==
Assume the probability density functions <math>f</math> exist, and let <math>\mathcal{F}_f</math> be the space of density functions.
Transformation approaches introduce a continuous and invertible transformation <math>\Psi: \mathcal{F}_f \to \mathbb{H}</math>, where <math>\mathbb{H}</math> is a [[Hilbert space]] of functions. For instance, the log quantile density transformation or the centered log ratio transformation are popular choices.<ref>{{Cite journal|last1=Petersen|first1=A.|last2=Müller|first2=H.-G.|date=2016|title=Functional data analysis for density functions by transformation to a Hilbert space|journal=Annals of Statistics|volume=44|issue=1|pages=
For <math>f \in \mathcal{F}_f</math>, let <math>Y = \Psi(f)</math>, the transformed functional variable. The mean function <math>\mu_Y(t) = \mathbb{E}\left[Y(t)\right]</math> and the covariance function <math>G_Y(s,t) = \operatorname{Cov}(Y(s), Y(t))</math> are defined accordingly, and let <math>\{\lambda_j, \phi_j\}_{j=1}^\infty</math> be the eigenpairs of <math>G_Y(s,t)</math>. The Karhunen-Loève decomposition gives
<math>Y(t) = \mu_Y(t) + \sum_{j=1}^\infty \xi_j \phi_j(t)</math>, where <math>\xi_j = \int_D [Y(t) - \mu_Y(t)] \phi_j(t) dt</math>. Then, the <math>j</math>th transformation mode of variation is defined as <ref>{{Cite journal|last1=Petersen|first1=A.|last2=Müller|first2=H.-G.|date=2016|title=Functional data analysis for density functions by transformation to a Hilbert space|journal=Annals of Statistics|volume=44|issue=1|pages=
<math>
g_{j}^{TF}(t, \alpha) = \Psi^{-1} \left( \mu_Y + \alpha \sqrt{\lambda_j}\phi_j \right)(t), \quad t \in D, \; \alpha \in [-A, A].
Line 60:
Let <math>Y</math> be the projected density onto the tangent space, <math>Y = \log_{\mu_\oplus}(f)</math>.
In Log FPCA, FPCA is performed to <math>Y</math> and then projected back to <math>\mathcal{F}</math> using the exponential map.<ref>{{Cite journal|last1=Fletcher|first1=T.F.|last2=Lu|first2=C.|last3=Pizer|first3=S.M.|last4=Joshi|first4=S.|date=2004|title=Principal geodesic analysis for the study of nonlinear statistics of shape|journal=IEEE Transactions on Medical Imaging|volume=23|issue=8|pages=
<math>g_j^{Log}(t, \alpha) = \exp_{f_\oplus} \left( \mu_{f_\oplus} + \alpha \sqrt{\lambda_j} \phi_j \right)(t), \quad t \in D, \; \alpha \in [-A, A].</math>
Line 68:
</math>
Let the reference measure <math>\nu_0</math> be the Wasserstein mean <math>\mu_\oplus</math>.
Then, a ''principal geodesic subspace (PGS)'' of dimension <math>k</math> with respect to <math>\mu_\oplus</math> is a set <math>G_k = \operatorname{argmin}_{G \in \text{CG}_{\nu_\oplus, k}(\mathcal{W}_2)} K_{W_2}(G)</math>. <ref name="gpca1">{{Cite journal|last1=Bigot|first1=J.|last2=Gouet|first2=R.|last3=Klein|first3=T.|last4=López|first4=A.|date=2017|title=Geodesic PCA in the Wasserstein space by convex PCA|journal=Annales de l'institut Henri Poincare (B) Probability and Statistics|volume=53|issue=1|pages=
Note that the tangent space <math>T_{\mu_\oplus}</math> is a subspace of <math>L^2_{\mu_\oplus}</math>, the Hilbert space of <math>{\mu_\oplus}</math>-square-integrable functions. Obtaining the PGS is equivalent to performing PCA in <math>L^2_{\mu_\oplus}</math> under constraints to lie in the convex and closed subset.<ref name="gpca2"/> Therefore, a simple approximation of the Wasserstein Geodesic PCA is the Log FPCA by relaxing the geodesicity constraint, while alternative techniques are suggested.<ref name="gpca1"/><ref name="gpca2"/>
Line 75:
==Fréchet regression==
Fréchet regression is a generalization of regression with responses taking values in a metric space and Euclidean predictors.<ref name="freg">{{Cite journal|last1=Petersen|first1=A.|last2=Müller|first2=H.-G.|date=2019|title=Fréchet regression for random objects with Euclidean predictors|journal=Annals of Statistics|volume=47|issue=2|pages=
{{NumBlk|::|<math display="block">\begin{align}
m_\oplus (x) &= \operatorname{argmin}_{\omega \in \mathcal{F}} \mathbb{E}\left[ s_G(X,x) d_{W_2}^2(\nu,\omega) \right], \\
Line 98:
<math display="block">
\Gamma g(t) = \langle \beta(\cdot, t),g \rangle_{\omega{\oplus}}, \; t \in D, \; g \in T_{\omega{\oplus}}, \; \beta:D^2 \to \R.</math>
Estimation of the regression operator is based on empirical estimators obtained from samples.<ref>{{Cite journal|last1=Chen|first1=Y.|last2=Lin|first2=Z.|last3=Müller|first3=H.-G.|date=2023|title=Wasserstein regression|journal=Journal of the American Statistical Association|volume=118|issue=542|pages=
Also, the Fisher-Rao metric <math>d_{FR}</math> can be used in a similar fashion.<ref name="review"/><ref name="dai2022">{{Cite journal|last1=Dai|first1=X.|date=2022|title=Statistical inference on the Hilbert sphere with application to random densities|journal=Electronic Journal of Statistics|volume=16|issue=1|pages=
=Hypothesis testing=
Line 105:
==Wasserstein <math>F</math>-test==
Wasserstein <math>F</math>-test has been proposed to test for the effects of the predictors in the Fréchet regression framework with the Wasserstein metric.<ref name="ftest">{{Cite journal|last1=Petersen|first1=A.|last2=Liu|first2=X.|last3=Divani|first3=A.A.|date=2021|title=Wasserstein F-tests and confidence bands for the Fréchet regression of density response curves|journal=Annals of Statistics|volume=49|issue=1|pages=
Let <math>F</math>, <math>Q</math>, <math>f</math>,
Line 146:
[[Autoregressive model|Autoregressive (AR) models]] for distributional time series are constructed by defining [[Stationary process|stationarity]] and utilizing the notion of difference between distributions using <math>d_{W_2}</math> and <math>d_{FR}</math>.
In Wasserstein autoregressive model (WAR), consider a stationary density time series <math>f_t</math> with Wasserstein mean <math>f_\oplus</math>.<ref>{{Cite journal|last1=Zhang|first1=C.|last2=Kokoszka|first2=P.|last3=Petersen|first3=A.|date=2022|title=Wasserstein autoregressive models for density time series|journal=Journal of Time Series Analysis|volume=43|issue=1|pages=
<math display="block">
T_t - \text{id} = \beta (T_{t-1} - \text{id} ) + \epsilon_t.
</math>
On the other hand, the spherical autoregressive model (SAR) considers the Fisher-Rao metric.<ref>{{Cite journal|last1=Zhu|first1=C.|last2=Müller|first2=H.-G.|date=2023|title=Spherical autoregressive models, with application to distributional and compositional time series|journal=Journal of Econometrics|doi=10.1016/j.jeconom.2022.12.008 }}</ref> Following the settings of [[##Tests for the intrinsic mean]], let <math>x_t \in \mathcal{X}</math> with Fréchet mean <math>\mu_x</math>. Let <math>\theta = \arccos(\langle x_t, \mu_x \rangle )</math>, which is the geodesic distance between <math>x_t</math> and <math>\mu_x</math>. Define a rotation operator <math>Q_{x_t, \mu_x}</math> that rotates <math>x_t</math> to <math>\mu_x</math>. The spherical difference between <math>x_t</math> and <math>\mu_x</math> is represented as <math>R_t = x_t \ominus \mu_x = \theta Q_{x_t, \mu_x}</math>. Assume that <math>R_t</math> is a stationary sequence with the Fréchet mean <math>\mu_R</math>, then <math>SAR(1)</math> is defined as
<math display="block">
R_t - \mu_R = \beta (R_{t-1} - \mu_R) + \epsilon_t,
</math>
where <math>\mu_R = \mathbb{E}R_t</math> and mean zero random i.i.d innovations <math>\epsilon_t</math>. An alternative model, the differenced based spherical autoregressive (DSAR) model is defined with <math>R_t = x_{t+1} \ominus x_t</math>, with natural extensions to order <math>p</math>. A similar extension to the Wasserstein space was introduced.<ref>{{Cite journal|last1=Zhu|first1=C.|last2=Müller|first2=H.-G.|date=2023|title=Autoregressive optimal transport models|journal=Journal of the Royal Statistical Society Series B: Statistical Methodology|volume=85|issue=3|pages=1012–1033|doi=10.1093/jrsssb/qkad051 |pmid=37521164 |pmc=10376456 }}</ref>
=References=
|