Content deleted Content added
this has next to nothing to do with text classification. |
m Open access bot: doi added to citation with #oabot. |
||
(8 intermediate revisions by 8 users not shown) | |||
Line 1:
{{short description|Generalization of the vector space model used in information retrieval}}
{{Confusing|date=January 2010}}
The '''Generalized vector space model''' is a generalization of the [[vector space model]] used in [[information retrieval]]. Wong ''et al.''<ref name="wong">{{
==Definitions==
GVSM introduces
For a document ''d<sub>k</sub>'' and a query ''q'' the similarity function now becomes:
Line 20 ⟶ 21:
# compute frequency co-occurrence statistics from large corpora
Recently Tsatsaronis<ref>{{
They measure semantic relatedness (''SR'') using a thesaurus (''O'') like [[WordNet]]. It considers the path length, captured by compactness (''SCM''), and the path depth, captured by semantic path elaboration (''SPE'').
Line 28 ⟶ 29:
where ''s<sub>i</sub>'' and ''s<sub>j</sub>'' are senses of terms ''t<sub>i</sub>'' and ''t<sub>j</sub>'' respectively, maximizing <math>SCM \cdot SPE</math>.
Building also on the first approach, Waitelonis et al.<ref>{{citation | title=Linked Data enabled Generalized Vector Space Model to improve document retrieval | url=http://ceur-ws.org/Vol-1581/paper4.pdf | last1= Waitelonis | first1=Jörg | first2=Claudia | last2=Exeler | last3=Sack| first3=Harald | publisher=ISWC 2015, CEUR-WS 1581 |date=2015-09-11}}</ref> have computed semantic relatedness from [[Linked data|Linked Open Data]] resources including [[DBpedia]] as well as the [[YAGO (database)|YAGO taxonomy]].
Thereby they exploits taxonomic relationships among semantic entities in documents and queries after [[Entity linking|named entity linking]].
== References ==
|