Competitive learning explained

Competitive learning is a form of unsupervised learning in artificial neural networks, in which nodes compete for the right to respond to a subset of the input data.[1] [2] A variant of Hebbian learning, competitive learning works by increasing the specialization of each node in the network. It is well suited to finding clusters within data.

Models and algorithms based on the principle of competitive learning include vector quantization and self-organizing maps (Kohonen maps).

Principle

There are three basic elements to a competitive learning rule:[3] [4]

Accordingly, the individual neurons of the network learn to specialize on ensembles of similar patterns and in so doing become 'feature detectors' for different classes of input patterns.

The fact that competitive networks recode sets of correlated inputs to one of a few output neurons essentially removes the redundancy in representation which is an essential part of processing in biological sensory systems.[5] [6]

Architecture and implementation

Competitive Learning is usually implemented with Neural Networks that contain a hidden layer which is commonly known as “competitive layer”.[7] Every competitive neuron is described by a vector of weights

{w

}_i = \left(\right)^T,i = 1,..,M and calculates the similarity measure between the input data

{x

}^n = \left(\right)^T \in \mathbb^d and the weight vector

{w

}_i .

For every input vector, the competitive neurons “compete” with each other to see which one of them is the most similar to that particular input vector. The winner neuron m sets its output

om=1

and all the other competitive neurons set their output

oi=0,i=1,..,M,i\nem

.

Usually, in order to measure similarity the inverse of the Euclidean distance is used:

\left\|{{x

} - _i } \right\| between the input vector

{x

}^n and the weight vector

{w

}_i.

Example algorithm

Here is a simple competitive learning algorithm to find three clusters within some input data.

1. (Set-up.) Let a set of sensors all feed into three different nodes, so that every node is connected to every sensor. Let the weights that each node gives to its sensors be set randomly between 0.0 and 1.0. Let the output of each node be the sum of all its sensors, each sensor's signal strength being multiplied by its weight.

2. When the net is shown an input, the node with the highest output is deemed the winner. The input is classified as being within the cluster corresponding to that node.

3. The winner updates each of its weights, moving weight from the connections that gave it weaker signals to the connections that gave it stronger signals.

Thus, as more data are received, each node converges on the centre of the cluster that it has come to represent and activates more strongly for inputs in this cluster and more weakly for inputs in other clusters.

See also

Further information and software

Notes and References

  1. Book: Rumelhart, David . David Rumelhart . David Zipser . James L. McClelland . Parallel Distributed Processing, Vol. 1 . MIT Press . 1986 . 151–193 . registration . etal.
  2. Grossberg . Stephen . 1987-01-01 . Competitive learning: From interactive activation to adaptive resonance . Cognitive Science . 11 . 1 . 23–63 . 10.1016/S0364-0213(87)80025-3 . 0364-0213. free .
  3. Rumelhart, David E., and David Zipser. "Feature discovery by competitive learning." Cognitive science 9.1 (1985): 75-112.
  4. Haykin, Simon, "Neural Network. A comprehensive foundation." Neural Networks 2.2004 (2004).
  5. Barlow, Horace B. "Unsupervised learning." Neural computation 1.3 (1989): 295-311.
  6. Edmund T.. Rolls, and Gustavo Deco. Computational neuroscience of vision. Oxford: Oxford university press, 2002.
  7. Web site: Implementation of Competitive Learning Networks for WEKA . Salatas, John . 24 August 2011. ICT Research Blog. 28 January 2012.