Linear model

{{Short description|Type of statistical model}}
{{Distinguish|linear model of innovation}}
 
In [[statistics]], the term '''linear model''' refers to any model which assumes [[linearity]] in the system. The most common occurrence is in connection with regression models and the term is often taken as synonymous with [[linear regression]] model. However, the term is also used in [[time series analysis]] with a different meaning. In each case, the designation "linear" is used to identify a subclass of models for which substantial reduction in the complexity of the related [[statistical theory]] is possible.
 
==Linear regression models==
{{main|Linear regression}}
 
For the regression case, the [[statistical model]] is as follows. Given a (random) sample <math> (Y_i, X_{i1}, \ldots, X_{ip}), \, i = 1, \ldots, n </math> the relation between the observations <math>Y_i</math> and the [[independent variables]] <math>X_{ij}</math> is formulated as
 
:<math>Y_i = \beta_0 + \beta_1 \phi_1(X_{i1}) + \cdots + \beta_p \phi_p(X_{ip}) + \varepsilon_i \qquad i = 1, \ldots, n </math>
 
where <math> \phi_1, \ldots, \phi_p </math> may be [[Nonlinear system|nonlinear]] functions. In the above, the quantities <math>\varepsilon_i</math> are [[random variable]]s representing errors in the relationship. The "linear" part of the designation relates to the appearance of the [[regression coefficient]]s, <math>\beta_j</math>, in a linear way in the above relationship. Alternatively, one may say that the predicted values corresponding to the above model, namely
:<math>\hat{Y}_i = \beta_0 + \beta_1 \phi_1(X_{i1}) + \cdots + \beta_p \phi_p(X_{ip}) \qquad (i = 1, \ldots, n), </math>
are linear functions of the <math>\beta_j</math>.
 
Given that estimation is undertaken on the basis of a [[least squares]] analysis, estimates of the unknown parameters <math>\beta_j</math> are determined by minimising a sum of squares function
:<math>S = \sum_{i = 1}^n \varepsilon_i^2 = \sum_{i = 1}^n \left(Y_i - \beta_0 - \beta_1 \phi_1(X_{i1}) - \cdots - \beta_p \phi_p(X_{ip})\right)^2 .</math>
From this, it can readily be seen that the "linear" aspect of the model means the following:
:*the function to be minimised is a quadratic function of the <math>\beta_j</math> for which minimisation is a relatively simple problem;
:*the derivatives of the function are linear functions of the <math>\beta_j</math> making it easy to find the minimising values;
:*the minimising values <math>\beta_j</math> are linear functions of the observations <math>Y_i</math>;
:*the minimising values <math>\beta_j</math> are linear functions of the random errors <math>\varepsilon_i</math> which makes it relatively easy to determine the statistical properties of the estimated values of <math>\beta_j</math>.
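The points above can be sketched numerically. The following is a minimal NumPy illustration (the basis functions <math>\phi_1 = \log</math> and <math>\phi_2 = \sqrt{\cdot}</math>, the coefficient values, and the sample size are all illustrative choices, not taken from the article): even though the <math>\phi_j</math> are nonlinear, estimating the <math>\beta_j</math> is an ordinary linear least-squares problem.

```python
import numpy as np

# Illustrative sketch: fit Y = b0 + b1*phi1(X1) + b2*phi2(X2) + error
# by least squares. phi1 and phi2 are nonlinear, but the model is
# linear in the coefficients b0, b1, b2.
rng = np.random.default_rng(0)
n = 200
X1 = rng.uniform(0.1, 3.0, n)
X2 = rng.uniform(0.1, 3.0, n)
phi1, phi2 = np.log, np.sqrt                # nonlinear basis functions (illustrative)
true_beta = np.array([1.0, 2.0, -0.5])      # b0, b1, b2 (illustrative values)
Y = true_beta[0] + true_beta[1] * phi1(X1) + true_beta[2] * phi2(X2) \
    + rng.normal(0.0, 0.1, n)               # small random errors

# Design matrix: a constant column plus the transformed regressors.
A = np.column_stack([np.ones(n), phi1(X1), phi2(X2)])

# Minimising S(beta) = ||Y - A @ beta||^2 is a quadratic problem whose
# solution is linear in the observations Y.
beta_hat, *_ = np.linalg.lstsq(A, Y, rcond=None)
```

Because the objective is quadratic in the coefficients, the solver recovers estimates close to the values used to generate the data; no iterative nonlinear optimisation is needed.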
 
==Time series models==
 
An example of a linear time series model is an [[autoregressive moving average model]]. Here the model for values <math>X_t</math> in a time series can be written in the form
 
:<math> X_t = c + \varepsilon_t + \sum_{i=1}^p \phi_i X_{t-i} + \sum_{i=1}^q \theta_i \varepsilon_{t-i}, </math>
 
where again the quantities <math>\varepsilon_t</math> are random variables representing [[Innovation (signal processing)|innovations]] which are new random effects that appear at a certain time but also affect values of <math>X</math> at later times. In this instance the use of the term "linear model" refers to the structure of the above relationship in representing <math>X_t</math> as a linear function of past values of the same time series and of current and past values of the innovations.<ref>Priestley, M.B. (1988) ''Non-linear and Non-stationary time series analysis'', Academic Press. {{ISBN|0-12-564911-8}}</ref> This particular aspect of the structure means that it is relatively simple to derive relations for the mean and [[covariance]] properties of the time series. Note that here the "linear" part of the term "linear model" is not referring to the coefficients <math>\phi_i</math> and <math>\theta_i</math>, as it would be in the case of a regression model, which looks structurally similar.
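The point about mean and covariance properties can be checked by simulation. The sketch below (an ARMA(1,1) process with illustrative coefficients, not values from the article) uses the fact that, for a stationary process with <math>|\phi| < 1</math>, taking expectations of the linear recursion gives <math>\operatorname{E}[X_t] = c / (1 - \phi)</math>; a relation that is easy to derive precisely because <math>X_t</math> is a linear function of past values and innovations.

```python
import numpy as np

# Illustrative sketch: simulate the ARMA(1,1) process
#   X_t = c + eps_t + phi * X_{t-1} + theta * eps_{t-1}
# with illustrative coefficients (|phi| < 1 so the process is stationary).
rng = np.random.default_rng(1)
c, phi, theta = 0.5, 0.6, 0.3
T = 10000
eps = rng.normal(0.0, 1.0, T)   # innovations
X = np.zeros(T)
for t in range(1, T):
    X[t] = c + eps[t] + phi * X[t - 1] + theta * eps[t - 1]

# Taking expectations of the linear recursion: E[X] = c + phi * E[X],
# so the theoretical mean is c / (1 - phi).
print(X.mean(), c / (1 - phi))
```

The sample mean of a long simulated path should lie close to the theoretical value <math>c / (1 - \phi)</math>, illustrating how linearity makes such moment calculations tractable.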
 
==Other uses in statistics==
 
There are some other instances where "nonlinear model" is used to contrast with a linearly structured model, although the term "linear model" is not usually applied. One example of this is [[nonlinear dimensionality reduction]].
 
==See also==
* [[General linear model]]
* [[Generalized linear model]]
* [[Linear predictor function]]
* [[Linear system]]
* [[Linear regression]]
* [[Statistical model]]
 
==References==
{{Reflist}}
 
{{Statistics}}
{{Authority control}}
 
[[Category:Curve fitting]]
[[Category:Regression models]]
[[Category:Regression analysis]]
[[Category:Statistical terminology]]
 
[[ar:نموذج الانحدار الخطي]]
[[fr:Modèle linéaire]]
[[es:Modelo lineal]]
[[pt:Modelo linear]]