Logistic regression is a statistical regression model for binary dependent variables. It can be considered as a generalized linear model that utilizes the logit as its link function, and has binomially distributed errors.
The model takes the form
- ,
- ,
where
- .
The logarithm of the odds (probability divided by one minus the probability) of the outcome is modelled as a linear function of the explanatory variables, to . This can be written equivalently as
The interpretation of the parameter estimates is as a multiplicative effect on the odds ratio. In the case of a dichotomous explanatory variable, for instance sex, (the antilog of ) is the estimate of the odds-ratio of having the outcome for, say, males compared with females.
The parameters are usually estimated by maximum likelihood.
Extensions of the model exist to cope with multi-category dependent variables and ordinal dependent variables.
See also
References
- Agresti, Alan: Categorical Data Analysis. New York: Wiley, 1990.
- Amemiya, T., 1985, Advanced Econometrics, Harvard University Press.
- Hosmer, D. W. and S. Lemeshow: Applied logistic regression. New York; Chichester, Wiley, 2000.