Revision as of 21:56, 5 March 2023 by Justinroche03 (minor edit: Fixed a few minor spelling and grammar mistakes), compared with revision as of 07:18, 16 March 2023 by 2a02:908:4f5:69a0:1e2:93cf:8c5d:abc6 (→Applications: important clarification on alphabet size)
    return (s, r)
</syntaxhighlight>

Finding the substring pattern <math>P</math> of length <math>m</math> in the string <math>S</math> of length <math>n</math> takes <math>\mathcal{O}(m \log n)</math> time, because the binary search performs <math>\mathcal{O}(\log n)</math> suffix comparisons and a single comparison may need to examine up to <math>m</math> characters. {{harvtxt|Manber|Myers|1990}} describe how this bound can be improved to <math>\mathcal{O}(m + \log n)</math> time using [[LCP array|LCP]] information. The idea is that a pattern comparison does not need to re-compare characters that are already known to be part of the longest common prefix of the pattern and the current search interval. {{harvtxt|Abouelhoda|Kurtz|Ohlebusch|2004}} improve the bound even further and achieve a search time of <math>\mathcal{O}(m)</math> for constant alphabet size, matching the bound known from [[suffix tree]]s.

Suffix sorting algorithms can be used to compute the [[Burrows–Wheeler transform|Burrows–Wheeler transform (BWT)]]. The BWT requires sorting all cyclic permutations of a string. If this string ends in a special end-of-string character that is lexicographically smaller than all other characters (e.g., $), then the order of the sorted rotated BWT matrix corresponds to the order of suffixes in a suffix array. The BWT can therefore be computed in linear time by first constructing a suffix array of the text and then deducing the BWT string: <math>BWT[i] = S[A[i]-1]</math>, where the index is taken cyclically, so that the suffix starting at the first position of <math>S</math> contributes the last character of <math>S</math>.
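The relation between the suffix array and the BWT can be illustrated with a minimal sketch. It assumes 0-based indexing and a text that ends in a sentinel <code>$</code> smaller than every other character. The function names <code>suffix_array</code>, <code>bwt_from_suffix_array</code> and <code>bwt_by_rotations</code> are hypothetical helpers chosen for this example, and the suffix array is built by naive sorting rather than by a linear-time construction algorithm.

```python
def suffix_array(s: str) -> list[int]:
    """Naive construction by sorting all suffixes (illustration only, not linear time)."""
    return sorted(range(len(s)), key=lambda i: s[i:])


def bwt_from_suffix_array(s: str, sa: list[int]) -> str:
    """Apply BWT[i] = S[A[i] - 1] with 0-based indices.

    For A[i] == 0 the index -1 wraps around to the last character of s,
    which Python's negative indexing handles directly.
    """
    return "".join(s[i - 1] for i in sa)


def bwt_by_rotations(s: str) -> str:
    """Reference definition: sort all cyclic rotations and read the last column."""
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    return "".join(rot[-1] for rot in rotations)


text = "banana$"  # '$' sorts before all letters in ASCII
sa = suffix_array(text)
bwt = bwt_from_suffix_array(text, sa)
print(sa)   # [6, 5, 3, 1, 0, 4, 2]
print(bwt)  # annb$aa
```

Given an already computed suffix array, the deduction step is a single pass over it, so the total running time is dominated by the suffix array construction. The rotation-based function is included only as a cross-check of the definition.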

Suffix array: Difference between revisions