Logistic regression is a statistical regression model for binary dependent variables. It can be considered as a generalized linear model that utilizes the logit as its link function, and has binomially distributed errors.
The model takes the form
i, = 1, ..., n, where
The logarithm of the odds (probability divided by one minus the probability) of the outcome is modelled as a linear function of the explanatory variables, X1 to Xk. This can be written equivalently as
The interpretation of the parameter estimates is as a multiplicative effect on the odds ratio. In the case of a dichotomous explanatory variable, for instance sex, (the antilog of ) is the estimate of the odds-ratio of having the outcome for, say, males compared with females.
The parameters α β1, ..., βk are usually estimated by maximum likelihood.
Extensions of the model exist to cope with multi-category dependent variables and ordinal dependent variables.
See also
References
- Agresti, Alan: Categorical Data Analysis. New York: Wiley, 1990.
- Amemiya, T., 1985, Advanced Econometrics, Harvard University Press.
- Hosmer, D. W. and S. Lemeshow: Applied logistic regression. New York; Chichester, Wiley, 2000.