This sandbox is in the article namespace. Either move this page into your userspace, or remove the {{User sandbox}} template. Functional regression is a version of the regression analysis when responses or covariates include functional data. One the one hand, functional regression models can be classified into four types depending on whether the response or covariates are functional or scalar: (i) scalar response with functional covariates, (ii) functional response with scalar covariates, (iii) functional response with functional covariates, and (iv) scalar or functional response with functional and scalar covariates. On the other hand, functional regression models can be linear, partially linear, or nonlinear. In particular, functional polynomial models, functional single and multiple single models and functional additive models are three special cases of functional nonlinear models.
Functional linear models (FLMs)
Functional linear models (FLMs) are an extension of linear regression with scalar response and scalar covariates , which can be written as where denotes the inner product in Euclidean space, and denote the regression coefficients, and is a random error with mean zero and finite variance. FLMs can be divided into three types based on responses and covariates.
Functional linear models with scalar response
Functional linear models with scalar response (also known as functional linear regression (FLR)) can are obtained by replacing the scalar covariates and the coefficient vector in the traditional multivariate linear model by a centered functional covariate and a coefficient function for , respectively,
1 |
where here denotes the inner product in space. One approach to estimating and is to expand the covariate and the coefficient function in the same functional basis, such as B-spline basis or the eigenfunctions in the Karhunen–Loève expansion. Suppose is an orthonormal basis of space. Expanding and in this basis, , , model (1) becomes
where in implementation the infinite sum is replaced by a finite sum truncated at
where is finite[1].
Adding multiple functional and scalar covariates, the FLR can be extended as
where with is a vector of scalar covariates, is a vector of coefficients corresponding to , denotes the inner product in Euclidean space, are multiple centered functional covariates given by , and is the interval is defined on. However, due to the parametric component , the estimation of this model is different from that of the FLR. A possible approach to estimating is through generalized estimating equation with the nonparametric part replaced by its estimate for a given [2]. Once is estimated, one can apply any suitable consistent method to to estimate s[1].
Functional linear models with functional response
For a function on and a functional covariate on , two primary models have been considered[1][3]. One functional linear model regressing on is given by
where , , is still the centered functional covariate, and are coefficient functions, and is usually assumed to be a Gaussian process with mean zero. In this case, at any given time , the value of , i.e. , depends on the entire trajectory of . This model, for any given time , is an extension of the traditional multivariate linear regression model by simply replacing the inner product in Euclidean space by that in space. Thus, estimation of this model can be given by analogy to multivariate linear regression
where , is defined as with . Furthermore, regularization is needed because is a compact operator and its inverse is not bounded[1].
In particular, taking as a constant function gives a special case of this model
which is a FLM with functional response and scalar covariates.
Concurrent models
Assuming that , another model called varying-coefficient model is of the form Note that this model assumes the value of at time , i.e. , only depends on that of at the same time, , and thus is a concurrent regression model. A possible way to estimate is a two-step procedure: (i) For any fixed, an estimate of can be computed by applying ordinary least squares to a neighborhood of . Let the corresponding estimate be denoted by . (ii) The final estimate is then obtained by smoothing with respect to [1].
Functional nonlinear models
Functional polynomial models
Functional polynomial models is an extension of the FLMs, analogous to extending multivariate linear models to polynomial ones. For a scalar response and a functional covariate defined on an interval , the simplest example of functional polynomial models is functional quadratic regression[4] where is the centered functional covariate, is a scalar coefficient, and are coefficient functions defined on and respectively, and is a random error with mean zero and variance finite. By analogy to FLMs, estimation of functional polynomial models can be obtained through expanding both the centered covariate and the coefficient functions and on an orthonormal basis. Then the model can be equivalently written as multivariate polynomial regression and thus the corresponding estimation is straightforward.
Functional single and multiple index models
A functional multiple index model is given by Taking yields a functional single index model. However, this model is problematic due to curse of dimensionality. In other words, with and relatively small sample sizes, this model often leads to high variability of the estimator[5]. Alternatively, a preferable -component functional multiple index model can be formed as
Functional additive models
Given an expansion of a functional covariate on an orthonormal basis : , a functional linear model with scalar response as stated before can be written as A functional additive model can be given by replacing the linear function of by a general smooth function where satisfies for [1].
Extensions
A direct extension of functional linear models with scalar response is to add a link function to create a generalized functional linear model (GFLM) by analogy to extending linear regression to generalized linear regression where is a pre-specific link function.
See also
References
- ^ a b c d e f Wang, Chiou and Müller (2016). "Functional data analysis". Annual Review of Statistics and Its Application. 3:257–295. doi:10.1146/annurev-statistics-041715-033624
- ^ Hu, Wang and Carroll (2004). "Profile-kernel versus backfitting in the partially linear models for longitudinal/clustered data". Biometrika. 91 (2): 251–262. doi:10.1093/biomet/91.2.251
- ^ Ramsay and Silverman (2005). Functional data analysis, 2nd ed., New York : Springer, ISBN 0-387-40080-X
- ^ Yao and Müller (2010). "Functional quadratic regression". Biometrika. 97 (1):49–64. http://www.jstor.org/stable/27798896
- ^ Chen, Hall and Müller (2011). "Single and multiple index functional regression models with nonparametric link". The Annals of Statistics. 39 (3):1720–1747. http://www.jstor.org/stable/23033613
Further reading
- Morris (2015). Functional regression. Annual Review of Statistics and Its Application. 2:321–359. doi:10.1146/annurev-statistics-010814-020413.