Linear probability model: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 22:11, 3 December 2023 edit Thatsme314 (talk \| contribs) Extended confirmed users 7,865 edits m →Latent-variable formulation: rm space ← Previous edit		Latest revision as of 20:08, 22 May 2025 edit undo Living Echo (talk \| contribs) 83 edits Link suggestions feature: 3 links added. Tags: Visual edit Newcomer task Suggested: add links
(2 intermediate revisions by 2 users not shown)
Line 1: {{Short description\|Statistics model}} In [[statistics]], a '''linear probability model''' (LPM) is a special case of a [[binary regression]] model. Here the [[dependent and independent variables\|dependent variable]] for each observation takes values which are either 0 or 1. The probability of observing a 0 or 1 in any one case is treated as depending on one or more [[dependent and independent variables\|explanatory variables]]. For the "linear probability model", this relationship is a particularly simple one, and allows the model to be fitted by [[linear regression]]. Line 13 ⟶ 14: ==Latent-variable formulation== More formally, the LPM can arise from a latent-variable formulation (usually to be found in the [[econometrics]] literature<ref name=Amemiya>{{cite journal \|last=Amemiya \|first=Takeshi \|year=1981 \|title=Qualitative Response Models: A Survey\|journal=Journal of Economic Literature \|volume =19 \|number =4 \|pages=1483–1536 }}</ref>), as follows: assume the following regression model with a latent (unobservable) dependent variable: : <math>y^* = b_0+ \mathbf x'\mathbf b + \varepsilon,\;\; \varepsilon\mid \mathbf x\sim U(-a,a).</math> The critical assumption here is that the error term of this regression is a symmetric around zero [[Continuous uniform distribution\|uniform]] [[random variable]], and hence, of mean zero. The cumulative distribution function of <math>\varepsilon</math> here is <math>F_{\varepsilon\|\mathbf x}(\varepsilon\mid \mathbf x) = \frac {\varepsilon + a}{2a}.</math> Define the indicator variable <math> y = 1</math> if <math> y^* >0</math>, and zero otherwise, and consider the conditional probability Line 38 ⟶ 39: :<math>\beta_0 = \frac {b_0+a}{2a},\;\; \beta=\frac{\mathbf b}{2a}.</math> This method is a general device to obtain a conditional probability model of a binary variable: if we assume that the distribution of the error term is ~~Logistic~~logistic, we obtain the [[logit model]], while if we assume that it is the ~~Normal~~normal, we obtain the [[probit model]] and, if we assume that it is the logarithm of a [[Weibull distribution]], the [[Generalized linear model\|complementary log-log model]]. == See also ==