m Typo fixing, etc., typo(s) fixed: , → , using AWB
Line 1:
{{Multiple issues|
{{disputed|date=March 2014}}
{{
}}
'''Document clustering''' (or '''text clustering''') is the application of [[cluster analysis]] to textual documents. It has applications in automatic document organization, [[topic (linguistics)|topic]] extraction and fast [[information retrieval]] or filtering.
Line 20 ⟶ 23:
* Clustering divides the results of a search for "cell" into groups like "biology," "battery," and "prison."
* [http://FirstGov.gov FirstGov.gov], the official Web portal for the U.S. government, uses document clustering to automatically organize its search results into categories. For example, a search for “immigration” returns, alongside the list of results, categories such as “Immigration Reform”, “Citizenship and Immigration Services”, “Employment”, and “Department of Homeland Security”.
Line 29 ⟶ 31:
* Claudio Carpineto, Stanislaw Osiński, Giovanni Romano, Dawid Weiss. A survey of Web clustering engines. ACM Computing Surveys (CSUR), Volume 41, Issue 3 (July 2009), Article No. 17, ISSN 0360-0300.
* http://semanticquery.com/archive/semanticsearchart/researchBest.html - comparison of several popular clustering algorithms, with data and software to reproduce the results.
* Tanmay Basu, C.A. Murthy
==See also==
*[[Cluster Analysis]]
*[[Fuzzy clustering]]