Hamming space explained

In statistics and coding theory, a Hamming space is usually the set of all

2N

binary strings of length N, where different binary strings are considered to be adjacent when they differ only in one position. The total distance between any two binary strings is then the total number of positions at which the corresponding bits are different, called the Hamming distance. Hamming spaces are named after American mathematician Richard Hamming, who introduced the concept in 1950.[1] They are used in the theory of coding signals and transmission.

More generally, a Hamming space can be defined over any alphabet (set) Q as the set of words of a fixed length N with letters from Q.[2] If Q is a finite field, then a Hamming space over Q is an N-dimensional vector space over Q. In the typical, binary case, the field is thus GF(2) (also denoted by Z2).[3]

In coding theory, if Q has q elements, then any subset C (usually assumed of cardinality at least two) of the N-dimensional Hamming space over Q is called a q-ary code of length N; the elements of C are called codewords.[3] [2] In the case where C is a linear subspace of its Hamming space, it is called a linear code.[3] A typical example of linear code is the Hamming code. Codes defined via a Hamming space necessarily have the same length for every codeword, so they are called block codes when it is necessary to distinguish them from variable-length codes that are defined by unique factorization on a monoid.

The Hamming distance endows a Hamming space with a metric, which is essential in defining basic notions of coding theory such as error detecting and error correcting codes.[3]

Hamming spaces over non-field alphabets have also been considered, especially over finite rings (most notably over Z4) giving rise to modules instead of vector spaces and ring-linear codes (identified with submodules) instead of linear codes. The typical metric used in this case the Lee distance. There exist a Gray isometry between

2m
Z
2
(i.e. GF(22m)) with the Hamming distance and
m
Z
4
(also denoted as GR(4,m)) with the Lee distance.[4] [5] [6]

Notes and References

  1. Hamming. R. W.. April 1950. Error detecting and error correcting codes. The Bell System Technical Journal. 29. 2. 147–160. 10.1002/j.1538-7305.1950.tb00463.x. 61141773 . 0005-8580. https://ghostarchive.org/archive/20221009/https://calhoun.nps.edu/bitstream/10945/46756/1/Hamming_1982.pdf . free . 10945/46756 . free . 2022-10-09 . live.
  2. Cohen et al., Covering Codes, p. 15
  3. Book: Derek J.S. Robinson. An Introduction to Abstract Algebra. 2003. Walter de Gruyter. 978-3-11-019816-4. 254–255.
  4. Book: Massimiliano Sala . Teo Mora . Ludovic Perret . Shojiro Sakata . Carlo Traverso. Gröbner Bases, Coding, and Cryptography. 2009. Springer Science & Business Media. 978-3-540-93806-4. An Introduction to Ring-Linear Coding Theory. Marcus Greferath.
  5. Web site: Kerdock and Preparata codes - Encyclopedia of Mathematics.
  6. Book: J.H. van Lint. Introduction to Coding Theory. registration. 1999. Springer. 978-3-540-64133-9. 3rd. Chapter 8: Codes over

    Z4

    .