Faculty Publications

MapReduce based parallel suffix tree construction for human genome

Umesh Chandra Satish, LSU College of Engineering
Praveenkumar Kondikoppa, LSU College of Engineering
Seung Jong Park, LSU College of Engineering
Manish Patil, LSU College of Engineering
Rahul Shah, LSU College of Engineering

Document Type

Conference Proceeding

Publication Date

1-1-2014

Abstract

Genome indexing is the basis for many bioinformatics applications. Read mapping(sequence alignment) is one such application where the goal is to align millions of short reads against reference genome. Several tools are available for read mapping which rely on different indexing techniques to expedite the alignment process. However, many of these contemporary alignment programs are sequential, memory intensive and cannot be easily scaled for larger genomes. Suffix tree is one of the most widely used data structures for indexing strings (genomes). Building a scalable suffix-tree based tool is particularly challenging due to the difficulties involved in parallel construction of the suffix tree. Several suffix tree construction techniques have been proposed till date with focus on space-time tradeoff. Most of these existing works address the construction issue for uniprocessor and cannot be easily extended to utilize modern multi-processor systems. In this paper we investigate and propose a MapReduce based parallel construction of suffix tree. We demonstrate the performance of the algorithm over commodity cluster using up to 32 nodes each having 8GB of primary memory.

Publication Source (Journal or Book title)

Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS

First Page

664

Last Page

670

Recommended Citation

Satish, U., Kondikoppa, P., Park, S., Patil, M., & Shah, R. (2014). MapReduce based parallel suffix tree construction for human genome. Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS, 2015-April, 664-670. https://doi.org/10.1109/PADSW.2014.7097867

This document is currently not available here.

COinS

Faculty Publications

MapReduce based parallel suffix tree construction for human genome

Document Type

Publication Date

Abstract

Publication Source (Journal or Book title)

First Page

Last Page

Recommended Citation

Search

Browse

Author Corner

SPONSORED BY

Faculty Publications

MapReduce based parallel suffix tree construction for human genome

Authors

Document Type

Publication Date

Abstract

Publication Source (Journal or Book title)

First Page

Last Page

Recommended Citation

Share

Search

Browse

Author Corner

SPONSORED BY