Succinct index for dynamic dictionary matching
Document Type
Conference Proceeding
Publication Date
12-1-2009
Abstract
In this paper we revisit the dynamic dictionary matching problem, which asks for an index for a set of patterns P 1, P 2, ⋯, P k that can support the following query and update operations efficiently. Given a query text T, we want to find all the occurrences of of these patterns; furthermore, as the set of patterns may change over time, we also want to insert or delete a pattern. The major contribution of this paper is the first succinct index for dynamic dictionary matching. Prior to our work, the most compact index is given by Chan et al. (2007), which is based on the compressed suffix arrays (Grossi and Vitter (2005) and Sadakane (2003)) and the FM-index (Ferragina and Manzini (2005)), and it requires O(n σ) bits where n is the total length of patterns and σ is the alphabet size. We develop a dynamic succinct index using a different (and simpler) paradigm based on suffix sampling. The new index not only improves the space complexity to (1+o(1))n logσ+O(klogn) bits, but also the time complexity of the query and update operations. Specifically, the query and update operations respectively take O(|T|logn+occ) and O(|P|logσ+logn) times, where occ is the number of occurrences. © 2009 Springer-Verlag Berlin Heidelberg.
Publication Source (Journal or Book title)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
First Page
1034
Last Page
1043
Recommended Citation
Hon, W., Lam, T., Shah, R., Tam, S., & Vitter, J. (2009). Succinct index for dynamic dictionary matching. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 5878 LNCS, 1034-1043. https://doi.org/10.1007/978-3-642-10631-6_104