Jenkins hash function explained

The Jenkins hash functions are a family of non-cryptographic hash functions for multi-byte keys designed by Bob Jenkins. The first one was formally published in 1997.

The hash functions

one_at_a_time

Jenkins's one_at_a_time hash is adapted here from a WWW page by Bob Jenkins,[1] which is an expanded version of his Dr. Dobb's article.[2] It was originally created to fulfill certain requirements described by Colin Plumb, a cryptographer, but was ultimately not put to use.

uint32_t jenkins_one_at_a_time_hash(const uint8_t* key, size_t length)

Sample hash values for the one_at_a_time hash function:

one_at_a_time("a", 1) 0xca2e9442
one_at_a_time("The quick brown fox jumps over the lazy dog", 43) 0x519e91f5
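Only the signature of the function appears above. The following is a minimal sketch of a body for it, using the add, shift and xor steps commonly given for one_at_a_time. The assertion on the arguments is an added defensive check and is not part of the hash itself.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

uint32_t jenkins_one_at_a_time_hash(const uint8_t *key, size_t length) {
    /* Defensive check only (an addition for this sketch). */
    assert(key != NULL || length == 0);

    uint32_t hash = 0;
    for (size_t i = 0; i < length; i++) {
        hash += key[i];        /* fold in one key byte            */
        hash += hash << 10;    /* spread it toward the high bits  */
        hash ^= hash >> 6;     /* and back toward the low bits    */
    }
    /* Final mixing so that the last bytes also affect every output bit. */
    hash += hash << 3;
    hash ^= hash >> 11;
    hash += hash << 15;
    return hash;
}
```

All arithmetic is on unsigned 32-bit values, so overflow wraps modulo 2^32. Calling the function on the two sample keys should reproduce the sample values listed above.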

The avalanche behavior of this hash can be shown as a grid of colored squares.

Each of the 24 rows corresponds to a single bit in the 3-byte input key, and each of the 32 columns corresponds to a bit in the output hash. Colors are chosen by how well the input key bit affects the given output hash bit: a green square indicates good mixing behavior, a yellow square weak mixing behavior, and red would indicate no mixing. Only a few bits in the last byte of the input key are weakly mixed to a minority of bits in the output hash.
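A grid of this kind can be approximated by flipping each bit of a 3-byte key in turn and counting how often each output bit changes. The sketch below does that for a fixed number of pseudo-random keys. The names oaat, xorshift32 and avalanche_counts, the seed, and the choice of generator are all assumptions made for illustration. The thresholds that separate green, yellow and red in the original figure are not given here, so the code reports raw counts only.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define KEY_BITS  24   /* 3-byte input key */
#define HASH_BITS 32   /* 32-bit output    */

/* one_at_a_time, repeated here so the sketch is self-contained. */
static uint32_t oaat(const uint8_t *key, size_t length) {
    uint32_t hash = 0;
    for (size_t i = 0; i < length; i++) {
        hash += key[i];
        hash += hash << 10;
        hash ^= hash >> 6;
    }
    hash += hash << 3;
    hash ^= hash >> 11;
    hash += hash << 15;
    return hash;
}

/* Small xorshift generator (an arbitrary choice) so runs are repeatable. */
static uint32_t xorshift32(uint32_t *state) {
    uint32_t x = *state;
    x ^= x << 13;
    x ^= x >> 17;
    x ^= x << 5;
    return *state = x;
}

/* counts[in][out] = number of trials in which flipping input bit `in`
 * changed output bit `out`. */
void avalanche_counts(uint32_t counts[KEY_BITS][HASH_BITS], uint32_t trials) {
    assert(trials > 0);
    uint32_t rng = 0x12345678u;   /* fixed, nonzero seed */
    memset(counts, 0, sizeof(counts[0]) * KEY_BITS);

    for (uint32_t t = 0; t < trials; t++) {
        uint32_t r = xorshift32(&rng);
        uint8_t key[3] = { (uint8_t)r, (uint8_t)(r >> 8), (uint8_t)(r >> 16) };
        uint32_t base = oaat(key, 3);

        for (int in = 0; in < KEY_BITS; in++) {
            uint8_t flipped[3] = { key[0], key[1], key[2] };
            flipped[in / 8] ^= (uint8_t)(1u << (in % 8));
            uint32_t diff = base ^ oaat(flipped, 3);
            for (int out = 0; out < HASH_BITS; out++)
                counts[in][out] += (diff >> out) & 1u;
        }
    }
}
```

Dividing counts[in][out] by the number of trials gives a flip rate. A rate near 0.5 suggests that the input bit mixes well into that output bit, while a rate near 0 or 1 suggests weak or no mixing.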

Standard implementations of the Perl programming language prior to version 5.28 included Jenkins's one_at_a_time hash or a hardened variant of it, which was used by default.[3][4]

lookup2

The lookup2 function was an interim successor to one_at_a_time. It is the function referred to as "My Hash" in the 1997 Dr. Dobb's Journal article, though it has been made obsolete by functions that Jenkins released later. Applications of this hash function are found in the SPIN model checker,[5] the Netfilter connection tracking system of Linux,[6][7] and a program that solved the game of kalah.[8]

lookup3

The lookup3 function consumes input in 12-byte (96-bit) chunks.[9] It may be appropriate when speed is more important than simplicity. Any speed improvement is likely to matter only for large keys, however, and the added complexity can itself cost speed, for example by preventing an optimizing compiler from inlining the hash function.
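The 12-byte chunking can be sketched as follows. This is a byte-oriented approximation modeled on the hashlittle routine of lookup3.c, with the initial value and rotation constants written from memory of that file; it should be checked against the published source [9] before being relied on. The names lookup3_sketch, rot32 and load_le32 are hypothetical, and the assertion is an added defensive check.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Rotate a 32-bit word left by k bits (0 < k < 32). */
static uint32_t rot32(uint32_t x, int k) {
    return (x << k) | (x >> (32 - k));
}

/* Read four bytes as a little-endian 32-bit word. */
static uint32_t load_le32(const uint8_t *p) {
    return (uint32_t)p[0] | ((uint32_t)p[1] << 8) |
           ((uint32_t)p[2] << 16) | ((uint32_t)p[3] << 24);
}

uint32_t lookup3_sketch(const uint8_t *key, size_t length, uint32_t initval) {
    assert(key != NULL || length == 0);
    uint32_t a, b, c;
    a = b = c = 0xdeadbeefu + (uint32_t)length + initval;

    /* Consume all but the last 1..12 bytes, one 12-byte chunk at a time. */
    while (length > 12) {
        a += load_le32(key);
        b += load_le32(key + 4);
        c += load_le32(key + 8);
        /* Reversible mixing of the three 32-bit state words. */
        a -= c; a ^= rot32(c, 4);  c += b;
        b -= a; b ^= rot32(a, 6);  a += c;
        c -= b; c ^= rot32(b, 8);  b += a;
        a -= c; a ^= rot32(c, 16); c += b;
        b -= a; b ^= rot32(a, 19); a += c;
        c -= b; c ^= rot32(b, 4);  b += a;
        key += 12;
        length -= 12;
    }

    if (length == 0)
        return c;              /* empty key: no final mixing */

    /* Add the remaining 1..12 bytes into a, b, c in little-endian order. */
    for (size_t i = 0; i < length; i++) {
        uint32_t byte = (uint32_t)key[i] << (8 * (i % 4));
        if (i < 4)       a += byte;
        else if (i < 8)  b += byte;
        else             c += byte;
    }

    /* Final mixing of the three words into c. */
    c ^= b; c -= rot32(b, 14);
    a ^= c; a -= rot32(c, 11);
    b ^= a; b -= rot32(a, 25);
    c ^= b; c -= rot32(b, 16);
    a ^= c; a -= rot32(c, 4);
    b ^= a; b -= rot32(a, 14);
    c ^= b; c -= rot32(b, 24);
    return c;
}
```

Reading the key byte by byte avoids the alignment and endianness special cases of the original source, at some cost in speed. Compared with one_at_a_time, the loop body runs once per 12 bytes rather than once per byte, which is where any advantage on large keys would come from.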

The lookup3 function was incorporated into Hierarchical Data Format 5 as a checksum for internal data structures based on its relative strength and speed in comparison to CRC32 and Fletcher32.[10]

SpookyHash

In 2011 Jenkins released a new 128-bit hash function called SpookyHash.[11] SpookyHash is significantly faster than lookup3.

Example for V2 (little-endian x64):

The short method, used for keys shorter than 192 bytes (here, 43 bytes):

Hash128("The quick brown fox jumps over the lazy dog") 2b12e846aa0693c71d367e742407341b

The standard method, used for keys longer than 191 bytes (here, 219 bytes):

Hash128("The quick brown fox jumps over the lazy dog The quick brown fox jumps over the lazy dog The quick brown fox jumps over the lazy dog The quick brown fox jumps over the lazy dog The quick brown fox jumps over the lazy dog") f1b71c6ac5af39e7b69363a60dd29c49

Notes and References

  1. Jenkins, Bob (November 3, 2013). "A hash function for hash Table lookup". Retrieved February 9, 2018.
  2. Jenkins, Bob (September 1997). "Hash functions". Dr. Dobb's Journal.
  3. "RFC: perlfeaturedelta". http://www.perlmonks.org/?node_id=381061
  4. "perl: hv_func.h". http://perl5.git.perl.org/perl.git/blob/HEAD:/hv_func.h
  5. Dillinger, Peter C.; Manolios, Panagiotis (2004). "Fast and accurate bitstate verification for SPIN". Proc. 11th International SPIN Workshop. pp. 57–75. CiteSeerX 10.1.1.4.6765.
  6. Neira Ayuso, Pablo (2006). "Netfilter’s connection tracking system". 31 (3).
  7. Bar-Yosef, Noa; Wool, Avishai (2007). "Remote algorithmic complexity attacks against randomized hash tables". Proc. International Conference on Security and Cryptography (SECRYPT). pp. 117–124.
  8. Irving, Geoffrey; Donkers, Jeroen; Uiterwijk, Jos. "Solving kalah".
  9. Jenkins, Bob. "lookup3.c source code". April 16, 2009.
  10. Koziol, Quincey. "[svn-r12661] Description: · HDFGroup/hdf5@d3a12e1". July 18, 2023.
  11. Jenkins, Bob. "SpookyHash: a 128-bit noncryptographic hash". Jan 29, 2012.