Back to Search
Start Over
An encoding-based dual distance tree high-dimensional index
- Source :
- Science in China Series F: Information Sciences. 51:1401-1414
- Publication Year :
- 2008
- Publisher :
- Springer Science and Business Media LLC, 2008.
-
Abstract
- The paper proposes a novel symmetrical encoding-based index structure, which is called EDD-tree (for encoding-based dual distance tree), to support fast k-nearest neighbor (k-NN) search in high-dimensional spaces. In the EDD-tree, all data points are first grouped into clusters by a k-means clustering algorithm. Then the uniform ID number of each data point is obtained by a dual-distance-driven encoding scheme, in which each cluster sphere is partitioned twice according to the dual distances of start-and centroid-distance. Finally, the uniform ID number and the centroid-distance of each data point are combined to get a uniform index key, the latter is then indexed through a partition-based B+-tree. Thus, given a query point, its k-NN search in high-dimensional spaces can be transformed into search in a single dimensional space with the aid of the EDD-tree index. Extensive performance studies are conducted to evaluate the effectiveness and efficiency of our proposed scheme, and the results demonstrate that this method outperforms the state-of-the-art high-dimensional search techniques such as the X-tree, VA-file, iDistance and NB-tree, especially when the query radius is not very large.
Details
- ISSN :
- 18622836 and 10092757
- Volume :
- 51
- Database :
- OpenAIRE
- Journal :
- Science in China Series F: Information Sciences
- Accession number :
- edsair.doi...........3fe833dda5654c3bb3a56ab96a1f5708
- Full Text :
- https://doi.org/10.1007/s11432-008-0104-3