Revision as of 14:11, 29 July 2005 edit ISee (talk \| contribs) 11 edits m →Rabin-Karp and multiple pattern search: Code-Layout ← Previous edit		Revision as of 20:37, 30 July 2005 edit undo Edemaine (talk \| contribs) 496 edits →Rabin-Karp and multiple pattern search: improve wording, correct runtimes, remove "worst case" claim Next edit →
Line 57: Rabin-Karp is inferior for single pattern searching to [[Knuth-Morris-Pratt algorithm]], [[Boyer-Moore string searching algorithm]] and other faster single pattern [[string searching algorithm]]s because of its slow worst case behavior. However, Rabin-Karp is an algorithm of choice for multiple pattern search. That is, if we want to find any of a large number, say ''k'', fixed length patterns in a text, we can create a simple variant of Rabin-Karp that uses a [[hash table]] or any other [[set data structure]] to check whether the hash of a given string belongs to a set of hash values of patterns we are looking for: '''function''' RabinKarpSet(''string'' s[1..n], ''set'' of ''string'' subs, m) { Line 74: Here we assume all the substrings have a fixed length ''m'', but this assumption can be eliminated. We simply compare the current hash value against the hash values of all the substrings simultaneously using a quick lookup in our set data structure, and then verify any match we find against all substrings with that hash value. Other algorithms can search for a single pattern in ~~time order~~ O(''n'') time, and hence they ~~will~~can be used to search for ''k'' patterns in ~~time order~~ O(''n*'' ''k'') time. ~~The~~In contrast, the variant Rabin-Karp above ~~will~~can ~~still~~find ~~work~~all in''k'' ~~time~~patterns ~~order~~in O(''n''+''k'') intime ~~the best and average~~in ~~case~~expectation, because a hash table ~~allows to check~~checks whether ~~or not~~a substring hash equals any of the pattern hashes in ~~time order of~~ O(1~~). We can also ensure O(''mn''log ''k''~~) time ~~in the worst case, where ''m'' is the length of the longest of the ''k'' strings, by storing the hashes in a [[self-balancing binary search tree]] instead of a hash table~~. ==References==

Rabin–Karp algorithm: Difference between revisions