1. DATA DEDUPLICATION.
- Author
-
Budanaev, Ivan A.
- Subjects
- *
ELECTRONIC data processing , *HAMMING distance - Abstract
Each day an abundance of new data is generated. With that comes the necessity of a data deduplication process within each data project. This need arises due to multiple reasons, some stronger than others: storage efficiency, data linkage, information representation, etc. This article describes the application of distance functions in the process of data deduplication which covers the aforementioned uses cases. [ABSTRACT FROM AUTHOR]
- Published
- 2021