Content deleted Content added
Line 8:
* like HITS, the algorithm assigns two scores to each web page: a hub score and an authority score. An authority is a page which significantly more relevant to a given topic than other pages whereas a hub is a page which contains many links to authorities;
* like HITS, SALSA also works on a ''focused subgraph'' which is topic-dependent. This focused subgraph is obtained by first finding a set of pages most relevant to a given topic (e.g. take the top-n pages returned by a text-based search algorithm) and then augmenting this set with web pages that link directly to it or page that are linked directly from it. Because of this selection process, the hub and authority scores are topic-dependent;
* like PageRank, the algorithm computes the scores by simulating a random walk through a [[Markov chain]] that represents the graph of web pages. SALSA however works with two different Markov chains: a chain of hubs and a chain of authorities. This is a departure from HITS's notions of hubs and authorities based on
== Properties ==
|