Faculty Publications

Compression, indexing, and retrieval for massive string data

Wing Kai Hon, National Tsing Hua University
Rahul Shah, Louisiana State University
Jeffrey Scott Vitter, Texas A&M University

Document Type

Conference Proceeding

Publication Date

12-1-2010

Abstract

The field of compressed data structures seeks to achieve fast search time, but using a compressed representation, ideally requiring less space than that occupied by the original input data. The challenge is to construct a compressed representation that provides the same functionality and speed as traditional data structures. In this invited presentation, we discuss some breakthroughs in compressed data structures over the course of the last decade that have significantly reduced the space requirements for fast text and document indexing. One interesting consequence is that, for the first time, we can construct data structures for text indexing that are competitive in time and space with the well-known technique of inverted indexes, but that provide more general search capabilities. Several challenges remain, and we focus in this presentation on two in particular: building I/O-efficient search structures when the input data are so massive that external memory must be used, and incorporating notions of relevance in the reporting of query answers. © Springer-Verlag Berlin Heidelberg 2010.

Publication Source (Journal or Book title)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

First Page

260

Last Page

274

Recommended Citation

Hon, W., Shah, R., & Vitter, J. (2010). Compression, indexing, and retrieval for massive string data. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 6129 LNCS, 260-274. https://doi.org/10.1007/978-3-642-13509-5_24

This document is currently not available here.

COinS

Faculty Publications

Compression, indexing, and retrieval for massive string data

Document Type

Publication Date

Abstract

Publication Source (Journal or Book title)

First Page

Last Page

Recommended Citation

Search

Browse

Author Corner

SPONSORED BY

Faculty Publications

Compression, indexing, and retrieval for massive string data

Authors

Document Type

Publication Date

Abstract

Publication Source (Journal or Book title)

First Page

Last Page

Recommended Citation

Share

Search

Browse

Author Corner

SPONSORED BY