Logistic regression

This is an old revision of this page, as edited by Neoforma (talk | contribs) at 21:31, 13 July 2006 (Had to remove comma from i=1,...n, because it wasn't rendering at all.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Logistic regression is a statistical regression model for binary dependent variables. It can be considered as a generalized linear model that utilizes the logit as its link function, and has binomially distributed errors.

The model takes the form

,
,

where

.

The logarithm of the odds (probability divided by one minus the probability) of the outcome is modelled as a linear function of the explanatory variables, to . This can be written equivalently as

The interpretation of the parameter estimates is as a multiplicative effect on the odds ratio. In the case of a dichotomous explanatory variable, for instance sex, (the antilog of ) is the estimate of the odds-ratio of having the outcome for, say, males compared with females.

The parameters are usually estimated by maximum likelihood.

Extensions of the model exist to cope with multi-category dependent variables and ordinal dependent variables.

See also

References

  • Agresti, Alan: Categorical Data Analysis. New York: Wiley, 1990.
  • Amemiya, T., 1985, Advanced Econometrics, Harvard University Press.
  • Hosmer, D. W. and S. Lemeshow: Applied logistic regression. New York; Chichester, Wiley, 2000.