Back to Search Start Over

Fast Error-Bounded Distance Distribution Computation.

Authors :
Zhang, Jiahao
Yiu, Man Lung
Tang, Bo
Li, Qing
Source :
IEEE Transactions on Knowledge & Data Engineering; Nov2022, Vol. 34 Issue 11, p5364-5377, 14p
Publication Year :
2022

Abstract

In this work we study the distance distribution computation problem. It has been widely used in many real-world applications, e.g., human genome clustering, cosmological model analysis, and parameter tuning. The straightforward solution for the exact distance distribution computation problem is unacceptably slow due to (i) massive data size, and (ii) expensive distance computation. In this paper, we propose a novel method to compute approximate distance distributions with error bound guarantees. Furthermore, our method is generic to different distance measures. We conduct extensive experimental studies on three widely used distance measures with real-world datasets. The experimental results demonstrate that our proposed method outperforms the sampling-based solution (without error guarantees) by up to three orders of magnitude. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
HUMAN genome
TIME series analysis

Details

Language :
English
ISSN :
10414347
Volume :
34
Issue :
11
Database :
Complementary Index
Journal :
IEEE Transactions on Knowledge & Data Engineering
Publication Type :
Academic Journal
Accession number :
160692050
Full Text :
https://doi.org/10.1109/TKDE.2021.3058241