Back to Search Start Over

Tabulation-Based 5-Independent Hashing with Applications to Linear Probing and Second Moment Estimation

Authors :
Mikkel Thorup
Yin Zhang
Source :
SIAM Journal on Computing. 41:293-331
Publication Year :
2012
Publisher :
Society for Industrial & Applied Mathematics (SIAM), 2012.

Abstract

In the framework of Wegman and Carter, a $k$-independent hash function maps any $k$ keys independently. It is known that 5-independent hashing provides good expected performance in applications such as linear probing and second moment estimation for data streams. The classic $5$-independent hash function evaluates a degree 4 polynomial over a prime field containing the key domain $[n]=\{0,\ldots,n-1\}$. Here we present an efficient 5-independent hash function that uses no multiplications. Instead, for any parameter $c$, we make $2c-1$ lookups in tables of size $O(n^{1/c})$. In experiments on different computers, our scheme gained factors of 1.8 to 10 in speed over the polynomial method. We also conducted experiments on the performance of hash functions inside the above applications. In particular, we give realistic examples of inputs that make the most popular 2-independent hash function perform quite poorly. This illustrates the advantage of using schemes with provably good expected performance for all inputs.

Details

ISSN :
10957111 and 00975397
Volume :
41
Database :
OpenAIRE
Journal :
SIAM Journal on Computing
Accession number :
edsair.doi...........c9eba7e1f846fb1c948776896419e515
Full Text :
https://doi.org/10.1137/100800774