Revision as of 16:37, 7 July 2025 by MrOllie: clean out citespam, again
Revision as of 17:07, 7 July 2025 by AnomieBOT: Rescuing orphaned refs (":5" from rev 1299220194)
Line 18:
== Cluster Linkage ==
To decide which clusters should be combined (for agglomerative), or where a cluster should be split (for divisive), a measure of dissimilarity between sets of observations is required. In most methods of hierarchical clustering, this is achieved by use of an appropriate [[distance]] ''d'', such as the Euclidean distance, between ''single'' observations of the data set, and a linkage criterion, which specifies the dissimilarity of ''sets'' as a function of the pairwise distances of observations in the sets. The choice of metric as well as linkage can have a major impact on the result of the clustering: the lower-level metric determines which objects are most [[similarity measure|similar]], whereas the linkage criterion influences the shape of the clusters.<ref name=":5">{{Cite journal |last=Wani |first=Aasim Ayaz |date=2024-08-29 |title=Comprehensive analysis of clustering algorithms: exploring limitations and innovative solutions |journal=PeerJ Computer Science |language=en |volume=10 |pages=e2286 |doi=10.7717/peerj-cs.2286 |issn=2376-5992 |pmc=11419652 |pmid=39314716 |doi-access=free}}</ref> For example, complete-linkage tends to produce more spherical clusters than single-linkage.
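
As an illustration of how a linkage criterion can be built from pairwise distances, the following minimal sketch computes the single-linkage and complete-linkage dissimilarity between two small sets of points under the Euclidean distance. The function names and the sample sets <code>A</code> and <code>B</code> are illustrative assumptions and are not taken from any particular library.

```python
import math
from itertools import product


def euclidean(p, q):
    """Euclidean distance d between two single observations."""
    return math.dist(p, q)


def pairwise_distances(set_a, set_b, d=euclidean):
    """All distances d(a, b) with a taken from set_a and b from set_b."""
    return [d(a, b) for a, b in product(set_a, set_b)]


def single_linkage(set_a, set_b, d=euclidean):
    """Dissimilarity of two sets: the smallest pairwise distance."""
    return min(pairwise_distances(set_a, set_b, d))


def complete_linkage(set_a, set_b, d=euclidean):
    """Dissimilarity of two sets: the largest pairwise distance."""
    return max(pairwise_distances(set_a, set_b, d))


# Hypothetical sample data: two clusters of points on a line.
A = [(0.0, 0.0), (1.0, 0.0)]
B = [(4.0, 0.0), (6.0, 0.0)]

print(single_linkage(A, B))    # closest pair: (1, 0) and (4, 0)
print(complete_linkage(A, B))  # farthest pair: (0, 0) and (6, 0)
```

Both criteria use the same underlying metric; only the way the pairwise distances are summarised differs, which is why changing the linkage can change which clusters are merged even when the metric is fixed.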

Hierarchical clustering: Difference between revisions