Logistic regression

This is an old revision of this page, as edited by 192.87.16.130 (talk) at 13:36, 6 June 2006 (See also). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Logistic regression is a statistical regression model for binary dependent variables. It can be considered as a generalized linear model that utilizes the logit as its link function, and has binomially distributed errors.

The model takes the form

i, = 1, ..., n, where

The logarithm of the odds (probability divided by one minus the probability) of the outcome is modelled as a linear function of the explanatory variables, X1 to Xk. This can be written equivalently as

The interpretation of the parameter estimates is as a multiplicative effect on the odds ratio. In the case of a dichotomous explanatory variable, for instance sex, (the antilog of ) is the estimate of the odds-ratio of having the outcome for, say, males compared with females.

The parameters α β1, ..., βk are usually estimated by maximum likelihood.

Extensions of the model exist to cope with multi-category dependent variables and ordinal dependent variables.

See also

this can used for habitate modelling.

References

  • Agresti, Alan: Categorical Data Analysis. New York: Wiley, 1990.
  • Amemiya, T., 1985, Advanced Econometrics, Harvard University Press.
  • Hosmer, D. W. and S. Lemeshow: Applied logistic regression. New York; Chichester, Wiley, 2000.