Nearest centroid classifier: Difference between revisions

Content deleted Content added
m Some formatting.
Clarified lead about Rocchio.
Line 1:
[[Image:Rocchioclassgraph.jpg|thumb|right|250px|Rocchio Classification]]
 
In [[machine learning]], a '''nearest centroid classifier''' or '''nearest prototype classifier''' is a [[statistical classification|classification model]] that assigns to observations the label of the class of training samples whose [[mean]] ([[centroid]]) is closest to the observation. When applied to [[text classification]] using [[vector space model|word vectors]] containing [[tf*idf]] vectorsweights to represent documents, the nearest centroid classifier is known as the '''Rocchio classifier''' because of its similarity to the [[Rocchio algorithm]] for [[relevance feedback]].<ref>{{cite book
| last1 = Manning
| first1 = Christopher