Talk:Determining the number of clusters in a data set
Latest comment: 13 years ago by Johnmark54 in topic Not that common problem
![]() | Statistics Unassessed | |||||||||
|
Additional updates coming
A colleague will be adding details to the "Elbow method" and "Information criteria" subsections shortly. -JohnMeier (talk) 15:10, 9 April 2009 (UTC)
Not that common problem
There are lots of alternative algorithms that do not require the specification of k beforehand. This is mostly a problem of k-means, k-medoids and the EM-algorithm. Pretty much none of the more recent algorithms has this parameter. --Chire2 (talk) 14:13, 7 May 2010 (UTC)
- Any examples for such algorithms? thanks. Talgalili (talk) 12:36, 20 June 2010 (UTC)
A well known, early example is the AutoClass algorithm, by Cheeseman et al. 1988, which applied the Bayes Information Criterion to selecting k. Johnmark54 (talk) 15:26, 5 October 2011 (UTC)