Logistic regression

This is an old revision of this page, as edited by Michael Hardy (talk | contribs) at 03:09, 26 March 2006 (I'm putting the "x"s in lower-case because they are typically treated as non-random.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Logistic regression is a statistical regression model for binary dependent variables. It can be considered as a generalized linear model that utilizes the logit as its link function, and has binomially distributed errors.

The model takes the form

where

The logarithm of the odds (probability divided by one minus the probability) of the outcome is modelled as a linear function of the explanatory variables, X1 to Xk. This can be written equivalently as

The interpretation of the parameter estimates is as an additive effect on the log of the odds. In the case of a dichotomous explanatory variable, for instance sex, (the antilog of ) is the estimate of the odds-ratio of having the outcome for, say, males compared with females.

Extensions of the model exist to cope with multi-category dependent variables and ordinal dependent variables.

See also

References

  • Agresti, Alan: Categorical Data Analysis. New York: Wiley, 1990.
  • Amemiya, T., 1985, Advanced Econometrics, Harvard University Press.
  • Hosmer, D. W. and S. Lemeshow: Applied logistic regression. New York; Chichester, Wiley, 2000.