Back to Search Start Over

The longest common substring problem.

Authors :
CROCHEMORE, MAXIME
ILIOPOULOS, COSTAS S.
LANGIU, ALESSIO
MIGNOSI, FILIPPO
Source :
Mathematical Structures in Computer Science; Feb2017, Vol. 27 Issue 2, p277-295, 19p
Publication Year :
2017

Abstract

Given a set $\mathcal{D}$ of q documents, the Longest Common Substring (LCS) problem asks, for any integer 2 ⩽ k ⩽ q, the longest substring that appears in k documents. LCS is a well-studied problem having a wide range of applications in Bioinformatics: from microarrays to DNA sequences alignments and analysis. This problem has been solved by Hui (2000International Journal of Computer Science and Engineering15 73–76) by using a famous constant-time solution to the Lowest Common Ancestor (LCA) problem in trees coupled with the use of suffix trees.In this article, we present a simple method for solving the LCS problem by using suffix trees (STs) and classical union-find data structures. In turn, we show how this simple algorithm can be adapted in order to work with other space efficient data structures such as the enhanced suffix arrays (ESA) and the compressed suffix tree. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09601295
Volume :
27
Issue :
2
Database :
Complementary Index
Journal :
Mathematical Structures in Computer Science
Publication Type :
Academic Journal
Accession number :
120328511
Full Text :
https://doi.org/10.1017/S0960129515000110