File similarity evaluation scheme for multimedia data using partial hash information

Authors :
Young Woong Ko
Sung-Bong Jang
Su-Jin Oh
Byung-Kwan Kim
Source :
Multimedia Tools and Applications. 76:19649-19663
Publication Year :
2016
Publisher :
Springer Science and Business Media LLC, 2016.

Abstract

File similarity is a numerical indicator of how much duplicated data exists in target files. This information can be used to reduce storage requirements through data deduplication, and it can also be exploited in digital forensics to find malicious software. However, measuring the similarity between files can incur high overhead in terms of processing time and disk storage capacity. For this reason, in this paper we propose a novel file similarity evaluation algorithm called PHISA (Partial Hash Information String Algorithm). To evaluate the performance of the proposed system, we compare PHISA with well-known file similarity tools. The evaluation results show that PHISA reduces processing time and increases the accuracy of similarity evaluation.

Details

ISSN :
1573-7721 and 1380-7501
Volume :
76
Database :
OpenAIRE
Journal :
Multimedia Tools and Applications
Accession number :
edsair.doi...........6e96c657937b80677897784233c2e74f
Full Text :
https://doi.org/10.1007/s11042-016-3373-7