Data stream clustering: Difference between revisions

Content deleted Content added
Bfoteini (talk | contribs)
Bfoteini (talk | contribs)
Line 7:
 
== Definition ==
The problem of data stream clustering is defined as:
 
'''Input:''' a sequence of ''n'' points in metric space and an integer ''k''.<br />
'''Output:''' ''k'' centers in the set of the ''n'' points so as to minimize the sum of distances from data points to their closest cluster centers.
<br />
This is the streaming version of the k-median problem.
 
== Algorithms ==