Back to Search
Start Over
Approximate String Matching over Ziv—Lempel Compressed Text
- Source :
- Combinatorial Pattern Matching ISBN: 9783540676331, CPM
- Publication Year :
- 2000
- Publisher :
- Springer Berlin Heidelberg, 2000.
-
Abstract
- We present a solution to the problem of performing approximate pattern matching on compressed text. The format we choose is the Ziv-Lempel family, specifically the LZ78 and LZW variants. Given a text of length u compressed into length n, and a pattern of length m, we report all the R occurrences of the pattern in the text allowing up to k insertions, deletions and substitutions, in O(mkn+R) time. The existence problem needs O(mkn) time. We also show that the algorithm can be adapted to run in O(k2n+min(mkn,m2(mσ)k) + R) average time, where σ is the alphabet size. The experimental results show a speedup over the basic approach for moderate m and small k.
- Subjects :
- Speedup
Approximation algorithm
0102 computer and information sciences
02 engineering and technology
String searching algorithm
Approximate string matching
01 natural sciences
Combinatorics
Combinatorial analysis
010201 computation theory & mathematics
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Pattern matching
Alphabet
Mathematics
Data compression
Subjects
Details
- ISBN :
- 978-3-540-67633-1
- ISBNs :
- 9783540676331
- Database :
- OpenAIRE
- Journal :
- Combinatorial Pattern Matching ISBN: 9783540676331, CPM
- Accession number :
- edsair.doi...........239cb57acedd2b92e92f97f04a3ae108