Document Type
Conference Proceeding
Publication Date
12-1-2012
Abstract
We study the position restricted substring searching (PRSS) problem, where the task is to index a text T[0..n-1] of n characters over an alphabet Σ of size σ in order to answer the following query: given a pattern P (of length p) and two indices ℓ and r, report all occ_{ℓ,r} occurrences of P in T[ℓ..r]. Known indexes take O(n log n) bits or O(n log^{1+ε} n) bits of space, and answer this query in O(p + log n + occ_{ℓ,r} log n) time or in optimal O(p + occ_{ℓ,r}) time respectively, where ε is any positive constant. The main drawback of these indexes is their space requirement of Ω(n log n) bits, which can be much more than the optimal n log σ bits needed to store the text T. This paper addresses an open question asked by Mäkinen and Navarro [LATIN, 2006]: whether it is possible to design a succinct index answering PRSS queries efficiently. We first study the hardness of this problem and prove the following result: a succinct (or a compact) index cannot answer PRSS queries efficiently in the pointer machine model, and also not in the RAM model unless bounds on the well-researched orthogonal range query problem improve. However, for the special case of sufficiently long query patterns, that is, for p = Ω(log^{2+ε} n), we derive an |CSA_f| + |CSA_r| + o(n) bits index with optimal query time, where |CSA_f| and |CSA_r| are the space (in bits) of the compressed suffix arrays (with O(p) time for pattern search) of T and T← (the reverse of T) respectively. The space can be reduced further to |CSA_f| + o(n) bits, with a resulting query time of O(p + occ_{ℓ,r} + log^{3+ε} n). For the general case, where there is no restriction on pattern length, we obtain an O((1/ε^3) n log σ) bits index with O(p + occ_{ℓ,r} + n^ε) query time. We use suffix sampling techniques to achieve these space-efficient indexes. © 2012 Elsevier B.V. All rights reserved.
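To make the query in the abstract concrete, the following is a minimal sketch of a PRSS query answered with a plain, uncompressed suffix array followed by a position filter. It is a baseline for illustration only and is not one of the indexes from the paper: it uses Θ(n log n) bits and spends time on every occurrence of P in T, not just those inside T[ℓ..r]. The function names `build_suffix_array` and `prss_query` are hypothetical, and the sketch assumes an occurrence counts only if it lies entirely inside T[ℓ..r].

```python
def build_suffix_array(text):
    # Naive construction by sorting suffixes; adequate for small examples only.
    return sorted(range(len(text)), key=lambda i: text[i:])


def prss_query(text, sa, pattern, l, r):
    """Return sorted start positions i with text[i:i+p] == pattern
    and l <= i and i + p - 1 <= r (assumed meaning of "in T[l..r]")."""
    p = len(pattern)

    # Lower bound: first suffix whose length-p prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + p] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose length-p prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + p] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    end = lo

    # sa[start:end] holds every occurrence of the pattern in the text;
    # keep only those that fit inside the position window.
    return sorted(i for i in sa[start:end] if l <= i and i + p - 1 <= r)


text = "abracadabra"
sa = build_suffix_array(text)
print(prss_query(text, sa, "abra", 0, 10))
print(prss_query(text, sa, "abra", 1, 10))
```

The final filtering step is where this baseline loses: its cost grows with the total number of occurrences of P in T rather than with occ_{ℓ,r}, which is the gap the range-query-based and sampling-based indexes discussed in the abstract aim to close.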
Publication Source (Journal or Book title)
Journal of Discrete Algorithms
First Page
109
Last Page
114
Recommended Citation
Hon, W., Shah, R., Thankachan, S., & Vitter, J. (2012). On position restricted substring searching in succinct space. Journal of Discrete Algorithms, 17, 109-114. https://doi.org/10.1016/j.jda.2012.09.002