Revision as of 21:02, 10 March 2017 edit Ms.chen (talk \| contribs) 174 edits No edit summary ← Previous edit		Revision as of 21:26, 10 March 2017 edit undo Ms.chen (talk \| contribs) 174 edits No edit summary Tag: Visual edit: Switched Next edit →
Line 7: == Functional linear models (FLMs) == Functional linear models (FLMs) are an extension of [[Linear regression\|traditional multivariate linear models]] with scalar response <math>Y\in\mathbb{R}</math> and scalar covariates <math>X\in\mathbb{R}^p</math>, which can be written as <math display="block">Y = \beta_0 + \langle X,\beta\rangle + \epsilon,</math>, where <math>\langle\cdot,\cdot\rangle</math> denotes the [[Inner product space\|inner product]] in [[Euclidean space\|Euclidean space]], <math>\beta_0\in\mathbb{R}</math> and <math>\beta\in\mathbb{R}^p</math> denote the regression coefficients, and <math>\epsilon</math> is a random error with [[Expected value\|mean]] zero and [[Variance\|variance]] finite. FLMs can be divided into three types based on responses and covariates. Line 17: where in implementation the infinite sum is replaced by a finite sum truncated at <math>K</math> <math display="block">Y = \beta_0 + \sum_{k=1}^K \beta_k x_k +\epsilon</math> where <math>K\in\mathbb{N}</math> is finite<~~sup~~ref idname=~~"cite_ref-Wang_1-0" class="reference"~~wang:16>~~<a href="#cite_note-~~Wang~~-1">[1]</a>~~, Chiou and Müller (2016). Functional data analysis. ''Annual Review of Statistics and Its Application''. '''3''':257–295.</~~sup~~ref>.<br /> Adding multiple functional and scalar covariates, the FLR can be extended as <math display="block">Y = \langle\mathbf{Z},\alpha\rangle + \sum_{j=1}^p \int_{\mathcal{T}_j} X_j^c(t) \beta_j(t) dt + \epsilon</math> where <math>\mathbf{Z}=(Z_1,\cdots,Z_q)^T</math> with <math>Z_1=1</math> is a vector of scalar covariates, <math>\alpha=(\alpha_1,\cdots,\alpha_q)^T</math> is a vector of coefficients corresponding to <math>\mathbf{Z}</math>, <math>\langle\cdot,\cdot\rangle</math> denotes the inner product in Euclidean space, <math>X^c_1,\cdots,X^c_p</math> are multiple centered functional covariates given by <math>X_j^c(\cdot) = X_j(\cdot) - \mathbb{E}(X_j(\cdot))</math>, and <math>\mathcal{T}_j</math> is the interval <math>X_j(\cdot)</math> is defined on. However, due to the parametric component <math>\alpha</math>, the estimation of this model is different from that of the FLR. A possible approach to estimating <math>\alpha</math> is through [[Generalized estimating equation\|generalized estimating equation]] with the nonparametric part <math> \sum_{j=1}^p \int_{\mathcal{T}_j} X_j^c(t) \beta_j(t) dt</math> replaced by its estimate for a given <math>\alpha</math>.<~~sup~~ref>Hu, ~~id="cite_ref~~Wang and Carroll (2004). Profile-~~Hu_2-0"~~kernel ~~class="reference"><a~~versus ~~href="#cite_note-Hu-~~backfitting in the partially linear models for longitudinal/clustered data. ''Biometrika''. '''91''' (2~~">[2]~~): 251–262.</~~a></sup~~ref>. Once <math>\alpha</math> is estimated, one can apply any suitable consistent method to <math>Y-\langle\mathbf{Z}, \hat\alpha\rangle</math> to estimate <math>\beta_j</math>s<~~sup~~ref idname=~~"cite_ref-Wang_1-1" class="reference"><a href="#cite_note-Wang-1">[1]</a><~~wang:16/~~sup~~>.<br /> === Functional linear models with functional response === For a function <math>Y(\cdot)</math> on <math>\mathcal{T}_Y</math> and a functional covariate <math>X(\cdot)</math> on <math>\mathcal{T}_X</math>, two primary models have been considered<~~sup~~ref idname=~~"cite_ref-Wang_1-2" class="reference"><a href="#cite_note-Wang-1">[1]</a><~~wang:16/~~sup~~><sup id="cite_ref-Ramsay_3-0" class="reference"><a href="#cite_note-Ramsay-3">[3]</a></sup>. One functional linear model regressing <math>Y(\cdot)</math> on <math>X(\cdot)</math> is given by <math display="block">Y(s) = \beta_0(s) + \int_{\mathcal{T}_X} \beta(s,t) X^c(t)dt + \epsilon(s)</math> where <math>s\in\mathcal{T}_Y</math>, <math>t\in\mathcal{T}_X</math>, <math>X^c(\cdot) = X(\cdot) - \mathbb{E}(X(\cdot))</math> is still the centered functional covariate, <math>\beta_0(\cdot)</math> and <math>\beta(\cdot,\cdot)</math> are coefficient functions, and <math>\epsilon(\cdot)</math> is usually assumed to be a Gaussian process with mean zero. In this case, at any given time <math>s\in\mathcal{T}_Y</math>, the value of <math>Y</math>, i.e. <math>Y(s)</math>, depends on the entire trajectory of <math>X</math>. This model, for any given time <math>s</math>, is an extension of the traditional multivariate linear regression model by simply replacing the inner product in Euclidean space by that in <math>L^2</math> space. Thus, estimation of this model can be given by analogy to multivariate linear regression <math display="block">r_{XY} = R_{XX}\beta, \text{ for } \beta\in L^2(\mathcal{T}_X\times\mathcal{T}_X)</math> where <math>r_{XY}(s,t) = \text{cov}(X(s),Y(t))</math>, <math>R_{XX}: L^2\times L^2 \rightarrow L^2\times L^2</math> is defined as <math>(R_{XX}\beta)(s,t) = \int r_{XX}(s,w)\beta(w,t)dw</math> with <math>r_{XX}(s,t) = \text{cov}(X(s),X(t))</math>. Furthermore, regularization is needed because <math>R_{XX}</math> is a compact operator and its inverse is not bounded<~~sup~~ref idname=~~"cite_ref-Wang_1-3" class="reference"><a href="#cite_note-Wang-1">[1]</a><~~wang:16/~~sup~~>.<br /> In particular, taking <math>X(\cdot)</math> as a constant function gives a special case of this model <math display="block">Y(s) = \sum_{j=1}^p X_j \beta_j(s) + \epsilon(s)</math> Line 35: Assuming that <math>\mathcal{T}_X = \mathcal{T}_Y := \mathcal{T}</math>, another model called varying-coefficient model is of the form <math display="block">Y(s) = \alpha_0(s) + \alpha(s)X(s)+\epsilon(s)</math> Note that this model assumes the value of <math>Y</math> at time <math>s</math>, i.e. <math>Y(s)</math>, only depends on that of <math>X</math> at the same time, <math>X(s)</math>, and thus is a concurrent regression model. A possible way to estimate <math>\alpha</math> is a two-step procedure: (i) For any <math>s\in\mathcal{T}</math> fixed, an estimate of <math>\alpha(s)</math> can be computed by applying [[Ordinary least squares\|ordinary least squares]] to a neighborhood of <math>s</math>. Let the corresponding estimate be denoted by <math>\tilde\alpha(s)</math>. (ii) The final estimate <math>\hat\alpha</math> is then obtained by smoothing <math>\tilde\alpha(s)</math> with respect to <math>s</math><~~sup~~ref idname=~~"cite_ref-Wang_1-4" class="reference"><a href="#cite_note-Wang-1">[1]</a><~~wang:16/~~sup~~>. == Functional nonlinear models == Line 54: A functional additive model can be given by replacing the linear function of <math>x_k</math> by a general smooth function <math>f_k</math> <math display="block">\mathbb{E}(Y\|X)=\mathbb{E}(Y) + \sum_{k=1}^\infty f_k(x_k)</math> where <math>f_k</math> satisfies <math>\mathbb{E}(f_k(x_k))=0</math> for <math>k\in\mathbb{N}</math><~~sup~~ref idname=~~"cite_ref-Wang_1-5" class="reference"><a href="#cite_note-Wang-1">[1]</a><~~wang:16/~~sup~~>. == Extensions == Line 69: * [[Stochastic processes\|Stochastic processes]] * [[Lp space\|Lp space]] == References ==▼ <references/> == Further reading == * Morris (2015). Functional regression. ''Annual Review of Statistics and Its Application''. '''2''':321–359. [[Digital object identifier\|doi]]:[http://doi.org/10.1146/annurev-statistics-010814-020413 10.1146/annurev-statistics-010814-020413]. ▲== References == ~~<ref>Wang, Chiou and Müller (2016). Functional data analysis. ''Annual Review of Statistics and Its Application''. '''3''':257–295.</ref>~~

Functional regression: Difference between revisions