Infomax Explained

Infomax, or the principle of maximum information preservation, is an optimization principle for artificial neural networks and other information processing systems. It prescribes that a function that maps a set of input values

x

to a set of output values

z(x)

should be chosen or learned so as to maximize the average Shannon mutual information between

x

and

z(x)

, subject to a set of specified constraints and/or noise processes. Infomax algorithms are learning algorithms that perform this optimization process. The principle was described by Linsker in 1988.[1] The objective function is called the InfoMax objective.

As the InfoMax objective is difficult to compute exactly, a related notion uses two models giving two outputs

z1(x),z2(x)

, and maximizes the mutual information between these. This contrastive InfoMax objective is a lower bound to the InfoMax objective.[2]

Infomax, in its zero-noise limit, is related to the principle of redundancy reduction proposed for biological sensory processing by Horace Barlow in 1961,[3] and applied quantitatively to retinal processing by Atick and Redlich.[4]

Applications

(Becker and Hinton, 1992) showed that the contrastive InfoMax objective allows a neural network to learn to identify surfaces in random dot stereograms (in one dimension).

One of the applications of infomax has been to an independent component analysis algorithm that finds independent signals by maximizing entropy. Infomax-based ICA was described by (Bell and Sejnowski, 1995),[5] and (Nadal and Parga, 1995).[6]

See also

References

Notes and References

  1. 10.1109/2.36 . Linsker R . Self-organization in a perceptual network . IEEE Computer . 21 . 3 . 105–17 . 1988 . 1527671 .
  2. Becker . Suzanna . Hinton . Geoffrey E. . January 1992 . Self-organizing neural network that discovers surfaces in random-dot stereograms . Nature . en . 355 . 6356 . 161–163 . 10.1038/355161a0 . 1476-4687.
  3. Book: Barlow, H. . Possible principles underlying the transformations of sensory messages . Rosenblith, W. . Sensory Communication . MIT Press . Cambridge MA . 1961 . 217–234 .
  4. 10.1162/neco.1992.4.2.196 . Atick JJ, Redlich AN . What does the retina know about natural scenes? . Neural Computation . 4 . 196–210 . 1992 . 2 . 17515861 .
  5. Bell AJ, Sejnowski TJ . November 1995 . An information-maximization approach to blind separation and blind deconvolution . Neural Comput . 7 . 6 . 1129–59 . 10.1.1.36.6605 . 10.1162/neco.1995.7.6.1129 . 7584893 . 1701422.
  6. Book: Nadal J.P., Parga N. . Sensory coding: information maximization and redundancy reduction . Neural Information Processing . G.. Burdet. P.. Combe. O.. Parodi . World Scientific Series in Mathematical Biology and Medicine . Singapore . World Scientific . 7 . 164–171 . 1999 .