Revision as of 05:51, 31 December 2023 edit Dedhert.Jr (talk \| contribs) Extended confirmed users 11,356 edits →Notation: display block ← Previous edit		Revision as of 05:20, 1 January 2024 edit undo OAbot (talk \| contribs) Bots 643,717 edits m Open access bot: doi updated in citation with #oabot. Next edit →
Line 43: === Transformation FPCA === Assume the probability density functions <math>f</math> exist, and let <math>\mathcal{F}_f</math> be the space of density functions. Transformation approaches introduce a continuous and invertible transformation <math>\Psi: \mathcal{F}_f \to \mathbb{H}</math>, where <math>\mathbb{H}</math> is a [[Hilbert space]] of functions. For instance, the log quantile density transformation or the centered log ratio transformation are popular choices.<ref>{{Cite journal\|last1=Petersen\|first1=A.\|last2=Müller\|first2=H.-G.\|date=2016\|title=Functional data analysis for density functions by transformation to a Hilbert space\|journal=Annals of Statistics\|volume=44\|issue=1\|pages=183–218\|doi=10.1214/15-AOS1363\|doi-access=free}}</ref><ref>{{Cite journal\|last1=van den Boogaart\|first1=K.G.\|last2=Egozcue\|first2=J.J.\|last3=Pawlowsky-Glahn\|first3=V.\|date=2014\|title=Bayes Hilbert spaces\|journal=Australian and New Zealand Journal of Statistics\|volume=56\|issue=2\|pages=171–194\|doi=10.1111/anzs.12074\|s2cid=120612578 }}</ref> For <math>f \in \mathcal{F}_f</math>, let <math>Y = \Psi(f)</math>, the transformed functional variable. The mean function <math>\mu_Y(t) = \mathbb{E}\left[Y(t)\right]</math> and the covariance function <math>G_Y(s,t) = \operatorname{Cov}(Y(s), Y(t))</math> are defined accordingly, and let <math>\{\lambda_j, \phi_j\}_{j=1}^\infty</math> be the eigenpairs of <math>G_Y(s,t)</math>. The Karhunen-Loève decomposition gives <math>Y(t) = \mu_Y(t) + \sum_{j=1}^\infty \xi_j \phi_j(t)</math>, where <math>\xi_j = \int_D [Y(t) - \mu_Y(t)] \phi_j(t) dt</math>. Then, the <math>j</math>th transformation mode of variation is defined as<ref>{{Cite journal\|last1=Petersen\|first1=A.\|last2=Müller\|first2=H.-G.\|date=2016\|title=Functional data analysis for density functions by transformation to a Hilbert space\|journal=Annals of Statistics\|volume=44\|issue=1\|pages=183–218\|doi=10.1214/15-AOS1363\|doi-access=free}}</ref> <math> g_{j}^{TF}(t, \alpha) = \Psi^{-1} \left( \mu_Y + \alpha \sqrt{\lambda_j}\phi_j \right)(t), \quad t \in D, \; \alpha \in [-A, A]. Line 69: == Distributional regression == === Fréchet regression === Fréchet regression is a generalization of regression with responses taking values in a metric space and Euclidean predictors.<ref name="freg">{{Cite journal\|last1=Petersen\|first1=A.\|last2=Müller\|first2=H.-G.\|date=2019\|title=Fréchet regression for random objects with Euclidean predictors\|journal=Annals of Statistics\|volume=47\|issue=2\|pages=691–719\|doi=10.1214/17-AOS1624 \|doi-access=free}}</ref><ref name="review">{{Cite journal\|last1=Petersen\|first1=A.\|last2=Zhang\|first2=C.\|last3=Kokoszka\|first3=P.\|date=2022\|title=Modeling probability density functions as data objects\|journal=Econometrics and Statistics\|volume=21\|pages=159–178\|doi=10.1016/j.ecosta.2021.04.004 \|s2cid=236589040 }}</ref> Using the Wasserstein metric <math>d_{W_2}</math>, Fréchet regression models can be applied to distributional objects. The global Wasserstein-Fréchet regression model is defined as {{NumBlk\|::\|<math display="block">\begin{align} m_\oplus (x) &= \operatorname{argmin}_{\omega \in \mathcal{F}} \mathbb{E}\left[ s_G(X,x) d_{W_2}^2(\nu,\omega) \right], \\ Line 93: \Gamma g(t) = \langle \beta(\cdot, t),g \rangle_{\omega{\oplus}}, \; t \in D, \; g \in T_{\omega{\oplus}}, \; \beta:D^2 \to \R.</math> Estimation of the regression operator is based on empirical estimators obtained from samples.<ref>{{Cite journal\|last1=Chen\|first1=Y.\|last2=Lin\|first2=Z.\|last3=Müller\|first3=H.-G.\|date=2023\|title=Wasserstein regression\|journal=Journal of the American Statistical Association\|volume=118\|issue=542\|pages=869–882\|doi=10.1080/01621459.2021.1956937 \|s2cid=219721275 }}</ref> Also, the Fisher-Rao metric <math>d_{FR}</math> can be used in a similar fashion.<ref name="review"/><ref name="dai2022">{{Cite journal\|last1=Dai\|first1=X.\|date=2022\|title=Statistical inference on the Hilbert sphere with application to random densities\|journal=Electronic Journal of Statistics\|volume=16\|issue=1\|pages=700–736\|doi=10.1214/21-EJS1942 \|doi-access=free}}</ref> == Hypothesis testing == Line 144: </math> On the other hand, the spherical autoregressive model (SAR) considers the Fisher-Rao metric.<ref>{{Cite journal\|last1=Zhu\|first1=C.\|last2=Müller\|first2=H.-G.\|date=2023\|title=Spherical autoregressive models, with application to distributional and compositional time series\|journal=Journal of Econometrics\|doi=10.1016/j.jeconom.2022.12.008 \|doi-access=free}}</ref> Following the settings of [[##Tests for the intrinsic mean]], let <math>x_t \in \mathcal{X}</math> with Fréchet mean <math>\mu_x</math>. Let <math>\theta = \arccos(\langle x_t, \mu_x \rangle )</math>, which is the geodesic distance between <math>x_t</math> and <math>\mu_x</math>. Define a rotation operator <math>Q_{x_t, \mu_x}</math> that rotates <math>x_t</math> to <math>\mu_x</math>. The spherical difference between <math>x_t</math> and <math>\mu_x</math> is represented as <math>R_t = x_t \ominus \mu_x = \theta Q_{x_t, \mu_x}</math>. Assume that <math>R_t</math> is a stationary sequence with the Fréchet mean <math>\mu_R</math>, then <math>SAR(1)</math> is defined as <math display="block"> R_t - \mu_R = \beta (R_{t-1} - \mu_R) + \epsilon_t,

Distributional data analysis: Difference between revisions