Start Over

An Efficient Search Algorithm for Finding Genomic-Range Overlaps Based on the Maximum Range Length.

Authors :: Seok HS
Song T
Kong SW
Hwang KB
Source :: IEEE/ACM transactions on computational biology and bioinformatics [IEEE/ACM Trans Comput Biol Bioinform] 2015 Jul-Aug; Vol. 12 (4), pp. 778-84.
Publication Year :: 2015
Abstract: Efficient search algorithms for finding genomic-range overlaps are essential for various bioinformatics applications. A majority of fast algorithms for searching the overlaps between a query range (e.g., a genomic variant) and a set of N reference ranges (e.g., exons) has time complexity of O(k + logN), where kdenotes a term related to the length and location of the reference ranges. Here, we present a simple but efficient algorithm that reduces k, based on the maximum reference range length. Specifically, for a given query range and the maximum reference range length, the proposed method divides the reference range set into three subsets: always, potentially, and never overlapping. Therefore, search effort can be reduced by excluding never overlapping subset. We demonstrate that the running time of the proposed algorithm is proportional to potentially overlapping subset size, that is proportional to the maximum reference range length if all the other conditions are the same. Moreover, an implementation of our algorithm was 13.8 to 30.0 percent faster than one of the fastest range search methods available when tested on various genomic-range data sets. The proposed algorithm has been incorporated into a disease-linked variant prioritization pipeline for WGS (http://gnome.tchlab.org) and its implementation is available at http://ml.ssu.ac.kr/gSearch.

Subjects :: Computer Simulation
Algorithms
Genomics methods
Sequence Analysis, DNA methods

Details

Language :: English
ISSN :: 1557-9964
Volume :: 12
Issue :: 4
Database :: MEDLINE
Journal :: IEEE/ACM transactions on computational biology and bioinformatics
Publication Type :: Academic Journal
Accession number :: 26357316
Full Text :: https://doi.org/10.1109/TCBB.2014.2369042

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

An Efficient Search Algorithm for Finding Genomic-Range Overlaps Based on the Maximum Range Length.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

An Efficient Search Algorithm for Finding Genomic-Range Overlaps Based on the Maximum Range Length.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources