Chi-square automatic interaction detection: Difference between revisions

Content deleted Content added
Delmonde (talk | contribs)
No edit summary
No edit summary
Line 1:
'''CHAID''' is a type of [[Decision_tree_learning|decision tree]] technique, based upon adjusted significance testing ([[Bonferroni testing]]). The technique was developed in [[South Africa]] and was published in 1980 by Gordon V. Kass, who had completed a PhD thesis on this topic. CHAID can be used for prediction (in a similar fashion to [[regression analysis]], this version of CHAID being originally known as XAID) as well as classification, and for detection of interaction between variables. CHAID stands for '''CH'''i-squared '''A'''utomatic '''I'''nteraction '''D'''etector, based upon a formal extension of the US AID (Automatic Interaction Detector) and THAID (THeta Automatic Interaction Detector) procedures of the 1960's and 70's, which in turn were extended versionsextensions of anearlier research, including algorithmthat developedperformed in the UK in the 1950's.
 
In practice, CHAID is often used in the context of [[direct marketing]] to select groups of consumers and predict how their responses to some variables affect other variables, although other early applications were in the field of medical and psychiatric research.
Line 19:
==References==
* W.A. Belson. Matching and prediction on the principle of biological classification. Applied Statistics, Vol. 8 (1959), pp. 65-75.
* J.A. Morgan & J.N. Sonquist. Problems in the analysis of drug data and a proposal. Journal of the American Statistical Association, Vol. 58 (1963), pp. 415-434.
* L.I. Press, M.S. Rogers & G.H. Shure. An interactive techniquen for the analysis of multivariate data. Behavioral Science, Vol. 14 (1969), pp. 364-370.
* G. V. Kass. An Exploratory Technique for Investigating Large Quantities of Categorical Data. Applied Statistics, Vol. 29, No. 2 (1980), pp. 119-127.
* D.M. Hawkins & G.V. Kass. Automatic Interaction Detection. In D.M. Hawkins (ed), Topics in Applied Multivariate Analysis. Cambridge University Press, Cambridge, 1982, pp. 269-302.
* T.M. Hooton, R.W. Haley, D.K. Culver, J.W. White, W.B. Morgan & R.J. Carroll. The Joint Associations of Multiple Risk Factors with the Occurrence of Nosocomial Infections. American Journal of Medicine, Vol. 70, (1981), pp. 960-970.
* S. Brink & D.J. Van Schalkwyk. Serum ferritin and mean corpuscular volume as predictors of bone marrow iron stores. South African Medical Journal, Vol. 61, (1982), pp. 432-434.
* D.P. McKenzie, P.D. McGorry, C.S. Wallace, L.H. Low, D.L. Copolov & B.S. Singh. Constructing a Minimal Diagnostic Decision Tree. Methods of Information in Medicine, Vol. 32 (1993), pp. 161-166.
* J. Magidson. The CHAID approach to segmentation modeling: chi-squared automatic interaction detetction. In R.P. Bagozzi (ed), Advanced Methods of Marketing Research. Blackwell, Oxford, 1994, pp. 118-159.
* D.M. Hawkins, S.S. Young & A. Rosinko. Analysis of a large structure-activity dataset using recursive partitioning. Quantitative Structure-Activity Relationships, Vol. 16, (1997), pp. 296-302.