Permutation Explained

In mathematics, a permutation of a set can mean one of two different things:

An example of the first meaning is the six permutations (orderings) of the set : written as tuples, they are (1, 2, 3), (1, 3, 2), (2, 1, 3), (2, 3, 1), (3, 1, 2), and (3, 2, 1). Anagrams of a word whose letters are all different are also permutations: the letters are already ordered in the original word, and the anagram reorders them. The study of permutations of finite sets is an important topic in combinatorics and group theory.

Permutations are used in almost every branch of mathematics and in many other fields of science. In computer science, they are used for analyzing sorting algorithms; in quantum physics, for describing states of particles; and in biology, for describing RNA sequences.

The number of permutations of distinct objects is  factorial, usually written as, which means the product of all positive integers less than or equal to .

According to the second meaning, a permutation of a set is defined as a bijection from to itself. That is, it is a function from to for which every element occurs exactly once as an image value. Such a function

\sigma:S\toS

is equivalent to the rearrangement of the elements of in which each element i is replaced by the corresponding

\sigma(i)

. For example, the permutation (3, 1, 2) is described by the function

\sigma

defined as

\sigma(1)=3,\sigma(2)=1,\sigma(3)=2

.

The collection of all permutations of a set form a group called the symmetric group of the set. The group operation is the composition of functions (performing one rearrangement after the other), which results in another function (rearrangement). The properties of permutations do not depend on the nature of the elements being permuted, only on their number, so one often considers the standard set

S=\{1,2,\ldots,n\}

.

In elementary combinatorics, the -permutations, or partial permutations, are the ordered arrangements of distinct elements selected from a set. When is equal to the size of the set, these are the permutations in the previous sense.

History

Permutation-like objects called hexagrams were used in China in the I Ching (Pinyin: Yi Jing) as early as 1000 BC.

In Greece, Plutarch wrote that Xenocrates of Chalcedon (396–314 BC) discovered the number of different syllables possible in the Greek language. This would have been the first attempt on record to solve a difficult problem in permutations and combinations.[1]

Al-Khalil (717–786), an Arab mathematician and cryptographer, wrote the Book of Cryptographic Messages. It contains the first use of permutations and combinations, to list all possible Arabic words with and without vowels.[2]

The rule to determine the number of permutations of n objects was known in Indian culture around 1150 AD. The Lilavati by the Indian mathematician Bhāskara II contains a passage that translates as follows:

The product of multiplication of the arithmetical series beginning and increasing by unity and continued to the number of places, will be the variations of number with specific figures.[3]

In 1677, Fabian Stedman described factorials when explaining the number of permutations of bells in change ringing. Starting from two bells: "first, two must be admitted to be varied in two ways", which he illustrates by showing 1 2 and 2 1. He then explains that with three bells there are "three times two figures to be produced out of three" which again is illustrated. His explanation involves "cast away 3, and 1.2 will remain; cast away 2, and 1.3 will remain; cast away 1, and 2.3 will remain". He then moves on to four bells and repeats the casting away argument showing that there will be four different sets of three. Effectively, this is a recursive process. He continues with five bells using the "casting away" method and tabulates the resulting 120 combinations. At this point he gives up and remarks:

Now the nature of these methods is such, that the changes on one number comprehends the changes on all lesser numbers, ... insomuch that a compleat Peal of changes on one number seemeth to be formed by uniting of the compleat Peals on all lesser numbers into one entire body;
Stedman widens the consideration of permutations; he goes on to consider the number of permutations of the letters of the alphabet and of horses from a stable of 20.

A first case in which seemingly unrelated mathematical questions were studied with the help of permutations occurred around 1770, when Joseph Louis Lagrange, in the study of polynomial equations, observed that properties of the permutations of the roots of an equation are related to the possibilities to solve it. This line of work ultimately resulted, through the work of Évariste Galois, in Galois theory, which gives a complete description of what is possible and impossible with respect to solving polynomial equations (in one unknown) by radicals. In modern mathematics, there are many similar situations in which understanding a problem requires studying certain permutations related to it.

The study of permutations as substitutions on n elements led to the notion of group as algebraic structure, through the works of Cauchy (1815 memoir).

Permutations played an important role in the cryptanalysis of the Enigma machine, a cipher device used by Nazi Germany during World War II. In particular, one important property of permutations, namely, that two permutations are conjugate exactly when they have the same cycle type, was used by cryptologist Marian Rejewski to break the German Enigma cipher in turn of years 1932-1933.[4] [5]

Definition

In mathematics texts it is customary to denote permutations using lowercase Greek letters. Commonly, either

\alpha,\beta,\gamma

or

\sigma,\tau,\rho,\pi

are used.[6]

A permutation can be defined as a bijection (an invertible mapping, a one-to-one and onto function) from a set to itself:

\sigma:S\stackrel{\sim}{\longrightarrow}S.

The identity permutation is defined by

\sigma(x)=x

for all elements

x\inS

, and can be denoted by the number

1

, by

id=idS

, or by a single 1-cycle (x).

Sn

, where the group operation is composition of functions. Thus for two permutations

\sigma

and

\tau

in the group

Sn

, their product

\pi=\sigma\tau

is defined by:

\pi(i)=\sigma(\tau(i)).

Composition is usually written without a dot or other sign. In general, composition of two permutations is not commutative:

\tau\sigma\sigma\tau.

As a bijection from a set to itself, a permutation is a function that performs a rearrangement of a set, termed an active permutation or substitution. An older viewpoint sees a permutation as an ordered arrangement or list of all the elements of S, called a passive permutation. According to this definition, all permutations in are passive. This meaning is subtly distinct from how passive (i.e. alias) is used in Active and passive transformation and elsewhere,[7] [8] which would consider all permutations open to passive interpretation (regardless of whether they are in one-line notation, two-line notation, etc.).

A permutation

\sigma

can be decomposed into one or more disjoint cycles which are the orbits of the cyclic group

\langle\sigma\rangle=\{1,\sigma,\sigma2,\ldots\}

acting on the set S. A cycle is found by repeatedly applying the permutation to an element:

x,\sigma(x),\sigma(\sigma(x)),\ldots,\sigmak-1(x)

, where we assume

\sigmak(x)=x

. A cycle consisting of k elements is called a k-cycle. (See below.)

A fixed point of a permutation

\sigma

is an element x which is taken to itself, that is

\sigma(x)=x

, forming a 1-cycle

(x)

. A permutation with no fixed points is called a derangement. A permutation exchanging two elements (a single 2-cycle) and leaving the others fixed is called a transposition.

Notations

Several notations are widely used to represent permutations conveniently. Cycle notation is a popular choice, as it is compact and shows the permutation's structure clearly. This article will use cycle notation unless otherwise specified.

Two-line notation

Cauchy's two-line notation[9] lists the elements of S in the first row, and the image of each element below it in the second row. For example, the permutation of S = given by the function

\sigma(1)=2,  \sigma(2)=6,  \sigma(3)=5,  \sigma(4)=4,  \sigma(5)=3,  \sigma(6)=1

can be written as

\sigma=\begin{pmatrix} 1&2&3&4&5&6\\ 2&6&5&4&3&1 \end{pmatrix}.

The elements of S may appear in any order in the first row, so this permutation could also be written:

\sigma=\begin{pmatrix} 2&3&4&5&6&1\\ 6&5&4&3&1&2 \end{pmatrix} =\begin{pmatrix} 6&5&4&3&2&1\\ 1&3&4&5&6&2\end{pmatrix}.

One-line notation

If there is a "natural" order for the elements of S, say

x1,x2,\ldots,xn

, then one uses this for the first row of the two-line notation:

\sigma=\begin{pmatrix} x1&x2&x3&&xn\\ \sigma(x1)&\sigma(x2)&\sigma(x3)&&\sigma(xn) \end{pmatrix}.

Under this assumption, one may omit the first row and write the permutation in one-line notation as

\sigma=\sigma(x1)\sigma(x2)\sigma(x3)\sigma(xn)

,that is, as an ordered arrangement of the elements of S. Care must be taken to distinguish one-line notation from the cycle notation described below: a common usage is to omit parentheses or other enclosing marks for one-line notation, while using parentheses for cycle notation. The one-line notation is also called the word representation.

The example above would then be:

\sigma=\begin{pmatrix} 1&2&3&4&5&6\\ 2&6&5&4&3&1 \end{pmatrix}=265431.

(It is typical to use commas to separate these entries only if some have two or more digits.)

This compact form is common in elementary combinatorics and computer science. It is especially useful in applications where the permutations are to be compared as larger or smaller using lexicographic order.

Cycle notation

Cycle notation describes the effect of repeatedly applying the permutation on the elements of the set S, with an orbit being called a cycle. The permutation is written as a list of cycles; since distinct cycles involve disjoint sets of elements, this is referred to as "decomposition into disjoint cycles".

To write down the permutation

\sigma

in cycle notation, one proceeds as follows:
  1. Write an opening bracket followed by an arbitrary element x of

S

:

(x

  1. Trace the orbit of x, writing down the values under successive applications of

\sigma

:

(x,\sigma(x),\sigma(\sigma(x)),\ldots

  1. Repeat until the value returns to x, and close the parenthesis without repeating x:

(x\sigma(x)\sigma(\sigma(x))\ldots)

  1. Continue with an element y of S which was not yet written, and repeat the above process:

(x\sigma(x)\sigma(\sigma(x))\ldots)(y\ldots)

  1. Repeat until all elements of S are written in cycles.

Also, it is common to omit 1-cycles, since these can be inferred: for any element x in S not appearing in any cycle, one implicitly assumes

\sigma(x)=x

.

Following the convention of omitting 1-cycles, one may interpret an individual cycle as a permutation which fixes all the elements not in the cycle (a cyclic permutation having only one cycle of length greater than 1). Then the list of disjoint cycles can be seen as the composition of these cyclic permutations. For example, the one-line permutation

\sigma=265431

can be written in cycle notation as:

\sigma=(126)(35)(4)=(126)(35).

This may be seen as the composition

\sigma=\kappa1\kappa2

of cyclic permutations:

\kappa1=(126)=(126)(3)(4)(5),\kappa2=(35)=(35)(1)(2)(6).

While permutations in general do not commute, disjoint cycles do; for example:

\sigma=(126)(35)=(35)(126).

Also, each cycle can be rewritten from a different starting point; for example,

\sigma=(126)(35)=(261)(53).

Thus one may write the disjoint cycles of a given permutation in many different ways.

A convenient feature of cycle notation is that inverting the permutation is given by reversing the order of the elements in each cycle. For example,

\sigma-1=\left(\vphantom{A2}(126)(35)\right)-1=(621)(53).

Canonical cycle notation

In some combinatorial contexts it is useful to fix a certain order for the elements in the cycles and of the (disjoint) cycles themselves. Miklós Bóna calls the following ordering choices the canonical cycle notation:

For example, (513)(6)(827)(94) is a permutation of

S=\{1,2,\ldots,9\}

in canonical cycle notation.[10]

Richard Stanley calls this the "standard representation" of a permutation,[11] and Martin Aigner uses "standard form".[12] Sergey Kitaev also uses the "standard form" terminology, but reverses both choices; that is, each cycle lists its minimal element first, and the cycles are sorted in decreasing order of their minimal elements.[13]

Composition of permutations

There are two ways to denote the composition of two permutations. In the most common notation,

\sigma\tau

is the function that maps any element x to

\sigma(\tau(x))

. The rightmost permutation is applied to the argument first,[14] because the argument is written to the right of the function.

A different rule for multiplying permutations comes from writing the argument to the left of the function, so that the leftmost permutation acts first.[15] [16] [17] In this notation, the permutation is often written as an exponent, so σ acting on x is written xσ; then the product is defined by

x\sigma\tau=(x\sigma)\tau

. This article uses the first definition, where the rightmost permutation is applied first.

The function composition operation satisfies the axioms of a group. It is associative, meaning

(\rho\sigma)\tau=\rho(\sigma\tau)

, and products of more than two permutations are usually written without parentheses. The composition operation also has an identity element (the identity permutation

id

), and each permutation

\sigma

has an inverse

\sigma-1

(its inverse function) with

\sigma-1\sigma=\sigma\sigma-1=id

.

Other uses of the term permutation

The concept of a permutation as an ordered arrangement admits several generalizations that have been called permutations, especially in older literature.

k-permutations of n

In older literature and elementary textbooks, a k-permutation of n (sometimes called a partial permutation, sequence without repetition, variation, or arrangement) means an ordered arrangement (list) of a k-element subset of an n-set.[18] [19] The number of such k-permutations (k-arrangements) of

n

is denoted variously by such symbols as
n
P
k
,

nPk

,
nP
k
,

Pn,k

,

P(n,k)

, or
k
A
n
, computed by the formula:[20]

P(n,k)=\underbrace{n(n-1)(n-2)(n-k+1)}kfactors

,

which is 0 when, and otherwise is equal to

n!
(n-k)!

.

The product is well defined without the assumption that

n

is a non-negative integer, and is of importance outside combinatorics as well; it is known as the Pochhammer symbol

(n)k

or as the

k

-th falling factorial power

n\underline

:

P(n,k)={n}Pk=(n)k=n\underline{k

} .
This usage of the term permutation is closely associated with the term combination to mean a subset. A k-combination of a set S is a k-element subset of S: the elements of a combination are not ordered. Ordering the k-combinations of S in all possible ways produces the k-permutations of S. The number of k-combinations of an n-set, C(n,k), is therefore related to the number of k-permutations of n by:

C(n,k)=

P(n,k)
P(k,k)

=

n\underline{k
} = \frac.

These numbers are also known as binomial coefficients, usually denoted

\tbinom{n}{k}

:

C(n,k)={n}Ck=\binom{n}{k}.

Permutations with repetition

Ordered arrangements of k elements of a set S, where repetition is allowed, are called k-tuples. They have sometimes been referred to as permutations with repetition, although they are not permutations in the usual sense. They are also called words or strings over the alphabet S. If the set S has n elements, the number of k-tuples over S is

nk.

Permutations of multisets

If M is a finite multiset, then a multiset permutation is an ordered arrangement of elements of M in which each element appears a number of times equal exactly to its multiplicity in M. An anagram of a word having some repeated letters is an example of a multiset permutation. If the multiplicities of the elements of M (taken in some order) are

m1

,

m2

, ...,

ml

and their sum (that is, the size of M) is n, then the number of multiset permutations of M is given by the multinomial coefficient,

{n\choosem1,m2,\ldots,ml}=

n!
m1!m2!ml!

=

l{m
\left(\sum
i
\right)!}{\prod
l{m
i!}}.

For example, the number of distinct anagrams of the word MISSISSIPPI is:

11!
1!4!4!2!

=34650

.

A k-permutation of a multiset M is a sequence of k elements of M in which each element appears a number of times less than or equal to its multiplicity in M (an element's repetition number).

Circular permutations

Permutations, when considered as arrangements, are sometimes referred to as linearly ordered arrangements. If, however, the objects are arranged in a circular manner this distinguished ordering is weakened: there is no "first element" in the arrangement, as any element can be considered as the start. An arrangement of distinct objects in a circular manner is called a circular permutation. These can be formally defined as equivalence classes of ordinary permutations of these objects, for the equivalence relation generated by moving the final element of the linear arrangement to its front.

Two circular permutations are equivalent if one can be rotated into the other. The following four circular permutations on four letters are considered to be the same.

     1           4           2           3
   4   3       2   1       3   4       1   2
     2           3           1           4

The circular arrangements are to be read counter-clockwise, so the following two are not equivalent since no rotation can bring one to the other.

     1          1
   4   3      3   4
     2          2

There are (n – 1)! circular permutations of a set with n elements.

Properties

The number of permutations of distinct objects is !.

The number of -permutations with disjoint cycles is the signless Stirling number of the first kind, denoted

c(n,k)

or

[\begin{smallmatrix}n\k\end{smallmatrix}]

.

Cycle type

The cycles (including the fixed points) of a permutation

\sigma

of a set with elements partition that set; so the lengths of these cycles form an integer partition of, which is called the cycle type (or sometimes cycle structure or cycle shape) of

\sigma

. There is a "1" in the cycle type for every fixed point of

\sigma

, a "2" for every transposition, and so on. The cycle type of

\beta=(125)(34)(68)(7)

is

(3,2,2,1).

This may also be written in a more compact form as .More precisely, the general form is

\alpha1
[1
\alpha2
2

...m

\alphan
n

]

, where

\alpha1,\ldots,\alphan

are the numbers of cycles of respective length. The number of permutations of a given cycle type is
n!
\alpha1
1
\alpha2
2
...m
\alphan
n
\alpha1!\alpha2!...m\alphan!
.

p(n)

.

Polya's cycle index polynomial is a generating function which counts permutations by their cycle type.

Conjugating permutations

In general, composing permutations written in cycle notation follows no easily described pattern – the cycles of the composition can be different from those being composed. However the cycle type is preserved in the special case of conjugating a permutation

\sigma

by another permutation

\pi

, which means forming the product

\pi\sigma\pi-1

. Here,

\pi\sigma\pi-1

is the conjugate of

\sigma

by

\pi

and its cycle notation can be obtained by taking the cycle notation for

\sigma

and applying

\pi

to all the entries in it. It follows that two permutations are conjugate exactly when they have the same cycle type.

Order of a permutation

The order of a permutation

\sigma

is the smallest positive integer m so that

\sigmam=id

. It is the least common multiple of the lengths of its cycles. For example, the order of

\sigma=(152)(34)

is

lcm(3,2)=6

.

Parity of a permutation

See main article: Parity of a permutation.

Every permutation of a finite set can be expressed as the product of transpositions.Although many such expressions for a given permutation may exist, either they all contain an even number of transpositions or they all contain an odd number of transpositions. Thus all permutations can be classified as even or odd depending on this number.

This result can be extended so as to assign a sign, written

\operatorname{sgn}\sigma

, to each permutation.

\operatorname{sgn}\sigma=+1

if

\sigma

is even and

\operatorname{sgn}\sigma=-1

if

\sigma

is odd. Then for two permutations

\sigma

and

\pi

\operatorname{sgn}(\sigma\pi)=\operatorname{sgn}\sigma\operatorname{sgn}\pi.

It follows that

\operatorname{sgn}\left(\sigma\sigma-1\right)=+1.

The sign of a permutation is equal to the determinant of its permutation matrix (below).

Matrix representation

See main article: Permutation matrix.

A permutation matrix is an n × n matrix that has exactly one entry 1 in each column and in each row, and all other entries are 0. There are several ways to assign a permutation matrix to a permutation of . One natural approach is to define

L\sigma

to be the linear transformation of

Rn

which permutes the standard basis

\{e1,\ldots,en\}

by

L\sigma(ej)=e\sigma(j)

, and define

M\sigma

to be its matrix. That is,

M\sigma

has its jth column equal to the n × 1 column vector

e\sigma(j)

: its (i, j) entry is to 1 if i = σ(j), and 0 otherwise. Since composition of linear mappings is described by matrix multiplication, it follows that this construction is compatible with composition of permutations:

M\sigmaM\tau=M\sigma\tau

.
For example, the one-line permutations

\sigma=213,\tau=231

have product

\sigma\tau=132

, and the corresponding matrices are:M_ M_ = \begin 0&1&0\\1&0&0\\0&0&1\end\begin 0&0&1\\1&0&0\\0&1&0\end = \begin 1&0&0\\0&0&1\\0&1&0\end = M_.

It is also common in the literature to find the inverse convention, where a permutation σ is associated to the matrix

P\sigma=(M\sigma)-1=(M\sigma)T

whose (i, j) entry is 1 if j = σ(i) and is 0 otherwise. In this convention, permutation matrices multiply in the opposite order from permutations, that is,

P\sigmaP\tau=P\tau\sigma

. In this correspondence, permutation matrices act on the right side of the standard

1 x n

row vectors

({\bf

T
e}
i)
:

({\bf

T
e}
i)

P\sigma=({\bfe}\sigma(i))T

.

The Cayley table on the right shows these matrices for permutations of 3 elements.

Permutations of totally ordered sets

In some applications, the elements of the set being permuted will be compared with each other. This requires that the set S has a total order so that any two elements can be compared. The set with the usual ≤ relation is the most frequently used set in these applications.

A number of properties of a permutation are directly related to the total ordering of S, considering the permutation written in one-line notation as a sequence

\sigma=\sigma(1)\sigma(2)\sigma(n)

.

Ascents, descents, runs, exceedances, records

An ascent of a permutation σ of n is any position i < n where the following value is bigger than the current one. That is, i is an ascent if

\sigma(i)<\sigma(i{+}1)

. For example, the permutation 3452167 has ascents (at positions) 1, 2, 5, and 6.

Similarly, a descent is a position i < n with

\sigma(i)>\sigma(i{+}1)

, so every i with

1\leqi<n

is either an ascent or a descent.

An ascending run of a permutation is a nonempty increasing contiguous subsequence that cannot be extended at either end; it corresponds to a maximal sequence of successive ascents (the latter may be empty: between two successive descents there is still an ascending run of length 1). By contrast an increasing subsequence of a permutation is not necessarily contiguous: it is an increasing sequence obtained by omitting some of the values of the one-line notation.For example, the permutation 2453167 has the ascending runs 245, 3, and 167, while it has an increasing subsequence 2367.

If a permutation has k − 1 descents, then it must be the union of k ascending runs.

style\left\langle{n\atopk}\right\rangle

; this is also the number of permutations of n with k descents. Some authors however define the Eulerian number

style\left\langle{n\atopk}\right\rangle

as the number of permutations with k ascending runs, which corresponds to descents.

An exceedance of a permutation σ1σ2...σn is an index j such that . If the inequality is not strict (that is,), then j is called a weak exceedance. The number of n-permutations with k exceedances coincides with the number of n-permutations with k descents.

A record or left-to-right maximum of a permutation σ is an element i such that σ(j) < σ(i) for all j < i.

Foata's transition lemma

Foata's fundamental bijection transforms a permutation

\sigma

with a given canonical cycle form into the permutation

f(\sigma)=\hat\sigma

whose one-line notation has the same sequence of elements with parentheses removed. For example:

\sigma=(513)(6)(827)(94) =\begin{pmatrix} 1&2&3&4&5&6&7&8&9\\ 3&7&5&9&1&6&8&2&4 \end{pmatrix},

\hat\sigma=513682794 =\begin{pmatrix} 1&2&3&4&5&6&7&8&9\\ 5&1&3&6&8&2&7&9&4 \end{pmatrix}.

Here the first element in each canonical cycle of

\sigma

becomes a record (left-to-right maximum) of

\hat\sigma

. Given

\hat\sigma

, one may find its records and insert parentheses to construct the inverse transformation

\sigma=f-1(\hat\sigma)

. Underlining the records in the above example:

\hat\sigma=\underline{5}13\underline{6}\underline{8}27\underline{9}4

, which allows the reconstruction of the cycles of

\sigma

.

The following table shows

\hat\sigma

and

\sigma

for the six permutations of S =, with the bold text on each side showing the notation used in the bijection: one-line notation for

\hat\sigma

and canonical cycle notation for

\sigma

.

\hat\sigma=f(\sigma)

\sigma=f-1(\hat\sigma)

123=(1)(2)(3)

123=(1)(2)(3)

132=(1)(32)

132=(1)(32)

213=(21)(3)

213=(21)(3)

231=(312)

321=(2)(31)

312=(321)

231=(312)

321=(2)(31)

312=(321)

As a first corollary, the number of n-permutations with exactly k records is equal to the number of n-permutations with exactly k cycles: this last number is the signless Stirling number of the first kind,

c(n,k)

. Furthermore, Foata's mapping takes an n-permutation with k weak exceedances to an n-permutation with ascents. For example, (2)(31) = 321 has k = 2 weak exceedances (at index 1 and 2), whereas has ascent (at index 1; that is, from 2 to 3).

Inversions

See main article: Inversion (discrete mathematics).

An inversion of a permutation σ is a pair of positions where the entries of a permutation are in the opposite order:

i<j

and

\sigma(i)>\sigma(j)

. Thus a descent is an inversion at two adjacent positions. For example, has (i, j) = (1, 3), (2, 3), and (4, 5), where (σ(i), σ(j)) = (2, 1), (3, 1), and (5, 4).

Sometimes an inversion is defined as the pair of values (σ(i), σ(j)); this makes no difference for the number of inversions, and the reverse pair (σ(j), σ(i)) is an inversion in the above sense for the inverse permutation σ−1.

The number of inversions is an important measure for the degree to which the entries of a permutation are out of order; it is the same for σ and for σ−1. To bring a permutation with k inversions into order (that is, transform it into the identity permutation), by successively applying (right-multiplication by) adjacent transpositions, is always possible and requires a sequence of k such operations. Moreover, any reasonable choice for the adjacent transpositions will work: it suffices to choose at each step a transposition of i and where i is a descent of the permutation as modified so far (so that the transposition will remove this particular descent, although it might create other descents). This is so because applying such a transposition reduces the number of inversions by 1; as long as this number is not zero, the permutation is not the identity, so it has at least one descent. Bubble sort and insertion sort can be interpreted as particular instances of this procedure to put a sequence into order. Incidentally this procedure proves that any permutation σ can be written as a product of adjacent transpositions; for this one may simply reverse any sequence of such transpositions that transforms σ into the identity. In fact, by enumerating all sequences of adjacent transpositions that would transform σ into the identity, one obtains (after reversal) a complete list of all expressions of minimal length writing σ as a product of adjacent transpositions.

The number of permutations of n with k inversions is expressed by a Mahonian number. This is the coefficient of

qk

in the expansion of the product

[n]_q! = \prod_^n\sum_^q^i = 1 \left(1 + q\right)\left(1 + q + q^2\right) \cdots \left(1 + q + q^2 + \cdots + q^\right),

The notation

[n]q!

denotes the q-factorial. This expansion commonly appears in the study of necklaces.

Let

\sigma\inSn,i,j\in\{1,2,...,n\}

such that

i<j

and

\sigma(i)>\sigma(j)

.In this case, say the weight of the inversion

(i,j)

is

\sigma(i)-\sigma(j)

.Kobayashi (2011) proved the enumeration formula \sum_(\sigma(i)-\sigma(j)) = |\

where

\le

denotes Bruhat order in the symmetric groups. This graded partial order often appears in the context of Coxeter groups.

Permutations in computing

Numbering permutations

One way to represent permutations of n things is by an integer N with 0 ≤ N < n!, provided convenient methods are given to convert between the number and the representation of a permutation as an ordered arrangement (sequence). This gives the most compact representation of arbitrary permutations, and in computing is particularly attractive when n is small enough that N can be held in a machine word; for 32-bit words this means n ≤ 12, and for 64-bit words this means n ≤ 20. The conversion can be done via the intermediate form of a sequence of numbers dn, dn−1, ..., d2, d1, where di is a non-negative integer less than i (one may omit d1, as it is always 0, but its presence makes the subsequent conversion to a permutation easier to describe). The first step then is to simply express N in the factorial number system, which is just a particular mixed radix representation, where, for numbers less than n!, the bases (place values or multiplication factors) for successive digits are,, ..., 2!, 1!. The second step interprets this sequence as a Lehmer code or (almost equivalently) as an inversion table.

Rothe diagram for

\sigma=(6,3,8,1,4,9,7,2,5)

123456789Lehmer code
1× × × × × d9 = 5
2× × d8 = 2
3× × × × × d7 = 5
4d6 = 0
5× d5 = 1
6× × × d4 = 3
7× × d3 = 2
8d2 = 0
9d1 = 0
Inversion table3 6 1 2 4 0 2 0 0
In the Lehmer code for a permutation σ, the number dn represents the choice made for the first term σ1, the number dn−1 represents the choice made for the second termσ2 among the remaining elements of the set, and so forth. More precisely, each dn+1−i gives the number of remaining elements strictly less than the term σi. Since those remaining elements are bound to turn up as some later term σj, the digit dn+1−i counts the inversions (i,j) involving i as smaller index (the number of values j for which i < j and σi > σj). The inversion table for σ is quite similar, but here dn+1−k counts the number of inversions (i,j) where k = σj occurs as the smaller of the two values appearing in inverted order.

Both encodings can be visualized by an n by n Rothe diagram (named after Heinrich August Rothe) in which dots at (i,σi) mark the entries of the permutation, and a cross at (i,σj) marks the inversion (i,j); by the definition of inversions a cross appears in any square that comes both before the dot (j,σj) in its column, and before the dot (i,σi) in its row. The Lehmer code lists the numbers of crosses in successive rows, while the inversion table lists the numbers of crosses in successive columns; it is just the Lehmer code for the inverse permutation, and vice versa.

To effectively convert a Lehmer code dn, dn−1, ..., d2, d1 into a permutation of an ordered set S, one can start with a list of the elements of S in increasing order, and for i increasing from 1 to n set σi to the element in the list that is preceded by dn+1−i other ones, and remove that element from the list. To convert an inversion table dn, dn−1, ..., d2, d1 into the corresponding permutation, one can traverse the numbers from d1 to dn while inserting the elements of S from largest to smallest into an initially empty sequence; at the step using the number d from the inversion table, the element from S inserted into the sequence at the point where it is preceded by d elements already present. Alternatively one could process the numbers from the inversion table and the elements of S both in the opposite order, starting with a row of n empty slots, and at each step place the element from S into the empty slot that is preceded by d other empty slots.

Converting successive natural numbers to the factorial number system produces those sequences in lexicographic order (as is the case with any mixed radix number system), and further converting them to permutations preserves the lexicographic ordering, provided the Lehmer code interpretation is used (using inversion tables, one gets a different ordering, where one starts by comparing permutations by the place of their entries 1 rather than by the value of their first entries). The sum of the numbers in the factorial number system representation gives the number of inversions of the permutation, and the parity of that sum gives the signature of the permutation. Moreover, the positions of the zeroes in the inversion table give the values of left-to-right maxima of the permutation (in the example 6, 8, 9) while the positions of the zeroes in the Lehmer code are the positions of the right-to-left minima (in the example positions the 4, 8, 9 of the values 1, 2, 5); this allows computing the distribution of such extrema among all permutations. A permutation with Lehmer code dn, dn−1, ..., d2, d1 has an ascent if and only if .

Algorithms to generate permutations

In computing it may be required to generate permutations of a given sequence of values. The methods best adapted to do this depend on whether one wants some randomly chosen permutations, or all permutations, and in the latter case if a specific ordering is required. Another question is whether possible equality among entries in the given sequence is to be taken into account; if so, one should only generate distinct multiset permutations of the sequence.

An obvious way to generate permutations of n is to generate values for the Lehmer code (possibly using the factorial number system representation of integers up to n!), and convert those into the corresponding permutations. However, the latter step, while straightforward, is hard to implement efficiently, because it requires n operations each of selection from a sequence and deletion from it, at an arbitrary position; of the obvious representations of the sequence as an array or a linked list, both require (for different reasons) about n2/4 operations to perform the conversion. With n likely to be rather small (especially if generation of all permutations is needed) that is not too much of a problem, but it turns out that both for random and for systematic generation there are simple alternatives that do considerably better. For this reason it does not seem useful, although certainly possible, to employ a special data structure that would allow performing the conversion from Lehmer code to permutation in O(n log n) time.

Random generation of permutations

See main article: Fisher–Yates shuffle. For generating random permutations of a given sequence of n values, it makes no difference whether one applies a randomly selected permutation of n to the sequence, or chooses a random element from the set of distinct (multiset) permutations of the sequence. This is because, even though in case of repeated values there can be many distinct permutations of n that result in the same permuted sequence, the number of such permutations is the same for each possible result. Unlike for systematic generation, which becomes unfeasible for large n due to the growth of the number n!, there is no reason to assume that n will be small for random generation.

The basic idea to generate a random permutation is to generate at random one of the n! sequences of integers d1,d2,...,dn satisfying (since d1 is always zero it may be omitted) and to convert it to a permutation through a bijective correspondence. For the latter correspondence one could interpret the (reverse) sequence as a Lehmer code, and this gives a generation method first published in 1938 by Ronald Fisher and Frank Yates.[21] While at the time computer implementation was not an issue, this method suffers from the difficulty sketched above to convert from Lehmer code to permutation efficiently. This can be remedied by using a different bijective correspondence: after using di to select an element among i remaining elements of the sequence (for decreasing values of i), rather than removing the element and compacting the sequence by shifting down further elements one place, one swaps the element with the final remaining element. Thus the elements remaining for selection form a consecutive range at each point in time, even though they may not occur in the same order as they did in the original sequence. The mapping from sequence of integers to permutations is somewhat complicated, but it can be seen to produce each permutation in exactly one way, by an immediate induction. When the selected element happens to be the final remaining element, the swap operation can be omitted. This does not occur sufficiently often to warrant testing for the condition, but the final element must be included among the candidates of the selection, to guarantee that all permutations can be generated.

The resulting algorithm for generating a random permutation of ''a''[0], ''a''[1], ..., ''a''[''n'' − 1] can be described as follows in pseudocode:

for i from n downto 2 do di ← random element of swap a[''d<sub>i</sub>''] and a[''i'' − 1]

This can be combined with the initialization of the array ''a''[''i''] = ''i'' as follows

for i from 0 to n−1 do di+1 ← random element of a[''i''] ← a[''d''<sub>''i''+1</sub>] a[''d''<sub>''i''+1</sub>] ← i

If di+1 = i, the first assignment will copy an uninitialized value, but the second will overwrite it with the correct value i.

However, Fisher-Yates is not the fastest algorithm for generating a permutation, because Fisher-Yates is essentially a sequential algorithm and "divide and conquer" procedures can achieve the same result in parallel.[22]

Generation in lexicographic order

There are many ways to systematically generate all permutations of a given sequence.[23] One classic, simple, and flexible algorithm is based upon finding the next permutation in lexicographic ordering, if it exists. It can handle repeated values, for which case it generates each distinct multiset permutation once. Even for ordinary permutations it is significantly more efficient than generating values for the Lehmer code in lexicographic order (possibly using the factorial number system) and converting those to permutations. It begins by sorting the sequence in (weakly) increasing order (which gives its lexicographically minimal permutation), and then repeats advancing to the next permutation as long as one is found. The method goes back to Narayana Pandita in 14th century India, and has been rediscovered frequently.

The following algorithm generates the next permutation lexicographically after a given permutation. It changes the given permutation in-place.

  1. Find the largest index k such that . If no such index exists, the permutation is the last permutation.
  2. Find the largest index l greater than k such that .
  3. Swap the value of a[''k''] with that of a[''l''].
  4. Reverse the sequence from a[''k'' + 1] up to and including the final element a[''n''].

For example, given the sequence [1, 2, 3, 4] (which is in increasing order), and given that the index is zero-based, the steps are as follows:

  1. Index k = 2, because 3 is placed at an index that satisfies condition of being the largest index that is still less than a[''k'' + 1] which is 4.
  2. Index l = 3, because 4 is the only value in the sequence that is greater than 3 in order to satisfy the condition a[''k''] < a[''l''].
  3. The values of a[2] and a[3] are swapped to form the new sequence [1, 2, 4, 3].
  4. The sequence after k-index a[2] to the final element is reversed. Because only one value lies after this index (the 3), the sequence remains unchanged in this instance. Thus the lexicographic successor of the initial state is permuted: [1, 2, 4, 3].

Following this algorithm, the next lexicographic permutation will be [1, 3, 2, 4], and the 24th permutation will be [4, 3, 2, 1] at which point a[''k''] < a[''k'' + 1] does not exist, indicating that this is the last permutation.

This method uses about 3 comparisons and 1.5 swaps per permutation, amortized over the whole sequence, not counting the initial sort.[24]

Generation with minimal changes

See main article: Steinhaus–Johnson–Trotter algorithm and Heap's algorithm. An alternative to the above algorithm, the Steinhaus–Johnson–Trotter algorithm, generates an ordering on all the permutations of a given sequence with the property that any two consecutive permutations in its output differ by swapping two adjacent values. This ordering on the permutations was known to 17th-century English bell ringers, among whom it was known as "plain changes". One advantage of this method is that the small amount of change from one permutation to the next allows the method to be implemented in constant time per permutation. The same can also easily generate the subset of even permutations, again in constant time per permutation, by skipping every other output permutation.

An alternative to Steinhaus–Johnson–Trotter is Heap's algorithm,[25] said by Robert Sedgewick in 1977 to be the fastest algorithm of generating permutations in applications.[23]

The following figure shows the output of all three aforementioned algorithms for generating all permutations of length

n=4

, and of six additional algorithms described in the literature.
  1. Lexicographic ordering;
  2. Steinhaus–Johnson–Trotter algorithm;
  3. Heap's algorithm;
  4. Ehrlich's star-transposition algorithm: in each step, the first entry of the permutation is exchanged with a later entry;
  5. Zaks' prefix reversal algorithm:[26] in each step, a prefix of the current permutation is reversed to obtain the next permutation;
  6. Sawada-Williams' algorithm:[27] each permutation differs from the previous one either by a cyclic left-shift by one position, or an exchange of the first two entries;
  7. Corbett's algorithm:[28] each permutation differs from the previous one by a cyclic left-shift of some prefix by one position;
  8. Single-track ordering:[29] each column is a cyclic shift of the other columns;
  9. Single-track Gray code:[29] each column is a cyclic shift of the other columns, plus any two consecutive permutations differ only in one or two transpositions.
  10. Nested swaps generating algorithm in steps connected to the nested subgroups

Sk\subsetSk+1

. Each permutation is obtained from the previous by a transposition multiplication to the left. Algorithm is connected to the Factorial_number_system of the index.

Generation of permutations in nested swap steps

Explicit sequence of swaps (transpositions, 2-cycles

(pq)

), is described here, each swap applied (on the left) to the previous chain providing a new permutation, such that all the permutations can be retrieved, each only once.[30] This counting/generating procedure has an additional structure (call it nested), as it is given in steps: after completely retrieving

Sk-1

, continue retrieving

Sk\backslashSk-1

by cosets

Sk-1\taui

of

Sk-1

in

Sk

, by appropriately choosing the coset representatives

\taui

to be described below. Note that, since each

Sm

is sequentially generated, there is a last element

λm\inSm

. So, after generating

Sk-1

by swaps, the next permutation in

Sk\backslashSk-1

has to be

\tau1=(p1k)λk-1

for some

1\leqp1<k

. Then all swaps that generated

Sk-1

are repeated, generating the whole coset

Sk-1\tau1

, reaching the last permutation in that coset

λk-1\tau1

; the next swap has to move the permutation to representative of another coset

\tau2=(p2k)λk-1\tau1

.

Continuing the same way, one gets coset representatives

\tauj=(pjk)λk-1λk-1(pik)λk-1 … λk-1(p1k)λk-1

for the cosets of

Sk-1

in

Sk

; the ordered set

(p1,\ldots,pk-1)

(

0\leqpi<k

) is called the set of coset beginnings. Two of these representatives are in the same coset if and only if

\tauj(\tau

-1
i)

=(pjk)λk-1(pj-1k)λk-1λk-1(pi+1k)=\varkappaij\inSk-1

, that is,

\varkappaij(k)=k

. Concluding, permutations

\taui\inSk-Sk-1

are all representatives of distinct cosets if and only if for any

k>j>i\geq1

,

(λk-1)j-ipipj

(no repeat condition). In particular, for all generated permutations to be distinct it is not necessary for the

pi

values to be distinct. In the process, one gets that

λkk-1(pk-1k)λk-1(pk-2k)λk-1 … λk-1(p1k)λk-1

and this provides the recursion procedure.

EXAMPLES: obviously, for

λ2

one has

λ2=(12)

; to build

λ3

there are only two possibilities for the coset beginnings satisfying the no repeat condition; the choice

p1=p2=1

leads to

λ3=λ2(13)λ2(13)λ2=(13)

. To continue generating

S4

one needs appropriate coset beginnings (satisfying the no repeat condition): there is a convenient choice:

p1=1,p2=2,p3=3

, leading to

λ4=(13)(1234)(13)=(1432)

. Then, to build

λ5

a convenient choice for the coset beginnings (satisfying the no repeat condition) is

p1=p2=p3=p4=1

, leading to

λ5=(15)

.

From examples above one can inductively go to higher

k

in a similar way, choosing coset beginnings of

Sk

in

Sk+1

, as follows: for

k

even choosing all coset beginnings equal to 1 and for

k

odd choosing coset beginnings equal to

(1,2,...,k)

. With such choices the "last" permutation is

λk=(1k)

for

k

odd and

λk=(1k-)(12 … k)(1k-)

for

k

even (

k-=k-1

). Using these explicit formulae one can easily compute the permutation of certain index in the counting/generation steps with minimum computation. For this, writing the index in factorial base is useful. For example, the permutation for index

699=5(5!)+4(4!)+1(2!)+1(1!)

is:

\sigma2(13)λ2(15)λ4(15)λ4(15)λ4(15)λ4(56)λ5(46)λ5(36)λ5(26)λ5(16)λ5=

λ2(13)λ2((15)λ

-1
5)
-1
λ
6=(23)(14325)

(15)(15)(123456)(15)=

(23)(15234)(123456)(15)

, yelding finally,

\sigma=(1653)(24)

.

Because multiplying by swap permutation takes short computing time and every new generated permutation requires only one such swap multiplication, this generation procedure is quite efficient. Moreover as there is a simple formula, having the last permutation in each

Sk

can save even more time to go directly to a permutation with certain index in fewer steps than expected as it can be done in blocks of subgroups rather than swap by swap.

Applications

Permutations are used in the interleaver component of the error detection and correction algorithms, such as turbo codes, for example 3GPP Long Term Evolution mobile telecommunication standard uses these ideas (see 3GPP technical specification 36.212[31]).Such applications raise the question of fast generation of permutations satisfying certain desirable properties. One of the methods is based on the permutation polynomials. Also as a base for optimal hashing in Unique Permutation Hashing.[32]

See also

Bibliography

Further reading

Notes and References

  1. Book: Heath, Thomas Little . A History of Greek Mathematics . 1981 . Dover Publications . 0-486-24073-8 . New York . 7703465.
  2. Broemeling. Lyle D.. An Account of Early Statistical Inference in Arab Cryptology. The American Statistician. 1 November 2011. 65. 4. 255–257. 10.1198/tas.2011.10191. 123537702.
  3. N. L. . Biggs . The Roots of Combinatorics . Historia Math. . 6 . 1979 . 2 . 109–136 . 10.1016/0315-0860(79)90074-0 .
  4. Rejewski . Marian . 1980 . An application of the theory of permutations in breaking the Enigma cipher . Applicationes Mathematicae . 16 . 4 . 543–559 . 10.4064/am-16-4-543-559 . 1233-7234. free .
  5. Web site: Cash . David . 2019 . CMSC 28400 Introduction to Cryptography Autumn 2019 - Notes #2: Permutations and Enigma .
  6. Book: Scheinerman . Edward A. . March 5, 2012 . Chapter 5: Functions . Mathematics: A Discrete Introduction . https://books.google.com/books?id=DZBHGD2sEYwC&pg=PA188 . live . 3rd . Cengage Learning . 188 . 978-0840049421 . https://web.archive.org/web/20200205212843/https://books.google.com/books?id=DZBHGD2sEYwC&pg=PA188 . February 5, 2020 . February 5, 2020 . It is customary to use lowercase Greek letters (especially π, σ, and τ) to stand for permutations..
  7. Book: Conway . John H. . Burgiel . Heidi . Goodman-Strauss . Chaim . 2008 . The Symmetries of Things . A K Peters . 179 . A permutation---say, of the names of a number of people---can be thought of as moving either the names or the people. The alias viewpoint regards the permutation as assigning a new name or alias to each person (from the Latin alias = otherwise). Alternatively, from the alibi viewoint we move the people to the places corresponding to their new names (from the Latin alibi = in another place.) .
  8. Web site: Permutation notation - Wikiversity . 2024-08-04 . en.wikiversity.org . en.
  9. Cauchy . A. L. . Mémoire Sur le Nombre des Valeurs qu'une Fonction peut acquérir, lorsqu'on y permute de toutes les manières possibles les quantités qu'elle renferme . Journal de l'École polytechnique . January 1815 . 10 . 1–28 . Memoir on the number of values which a function can acquire when one permutes within it, in all possible ways, the variables which it contains . French. See p. 4.
  10. [The book has a typo/error here, as it gives (45) instead of (54).]
  11. Book: Stanley, Richard P. . Enumerative Combinatorics: Volume I, Second Edition . Cambridge University Press . 2012 . 978-1-107-01542-5 . 30, Prop 1.3.1.
  12. Book: Aigner, Martin. A Course in Enumeration. limited. 2007. Springer GTM 238. 978-3-540-39035-0. 24–25.
  13. Book: Kitaev, Sergey . Patterns in Permutations and Words. 2011. Springer Science & Business Media. 978-3-642-17333-2. 119.
  14. Book: Biggs . Norman L. . White . A. T.. 1979. Cambridge University Press. Permutation groups and combinatorial structures. 978-0-521-22287-7.
  15. Book: Dixon . John D. . Permutation Groups . Mortimer . Brian . Springer . 1996 . 978-0-387-94599-6 . registration.
  16. Book: Cameron . Peter J. . Permutation groups . Cambridge University Press . 1999 . 978-0-521-65302-2 . registration.
  17. Jerrum . M. . 1986 . A compact representation of permutation groups . J. Algorithms . 7 . 60–78 . 10.1016/0196-6774(86)90038-6 . 18896625 . 1.
  18. Web site: Combinations and Permutations . 2020-09-10 . www.mathsisfun.com.
  19. Web site: Weisstein . Eric W. . Permutation . 2020-09-10 . mathworld.wolfram.com . en.
  20. Book: Charalambides, Ch A.. Enumerative Combinatorics. CRC Press. 2002. 978-1-58488-290-9. 42.
  21. Book: Fisher, R.A. . Yates, F. . Statistical tables for biological, agricultural and medical research. 1938. 3rd. 1948. 26–27. Oliver & Boyd. London. 14222135.
  22. News: Bacher, A. . Bodini, O.. Hwang, H.K.. Tsai, T.H. . Generating Random Permutations by Coin Tossing: Classical Algorithms, New Analysis, and Modern Implementation.. ACM Trans. Algorithms 13(2): 24:1–24:43. 2017. 24–43.
  23. Sedgewick. R. Permutation generation methods. Computing Surveys. 1977. 9. 2. 137–164. https://web.archive.org/web/20080221185652/http://www.math.uiowa.edu/~goodman/22m150.dir/2007/Permutation%20Generation%20Methods.pdf . 2008-02-21 . live. 10.1145/356689.356692. 12139332.
  24. Web site: std::next_permutation. 31 March 2018. cppreference.com. 4 December 2017.
  25. Heap. B. R.. Permutations by Interchanges. The Computer Journal. 1963. 6. 3. 293–298. 10.1093/comjnl/6.3.293. free.
  26. Zaks. S.. A new algorithm for generation of permutations. BIT Numerical Mathematics. 1984. 24. 2. 196–204. 10.1007/BF01937486. 30234652.
  27. A Hamilton path for the sigma-tau problem . Sawada . Joe . Williams . Aaron . 2018 . Society for Industrial and Applied Mathematics (SIAM) . Proceedings of the 29th Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2018 . 568–575 . New Orleans, Louisiana . 10.1137/1.9781611975031.37. free .
  28. Corbett. P. F.. Rotator graphs: An efficient topology for point-to-point multiprocessor networks. IEEE Transactions on Parallel and Distributed Systems. 1992. 3. 5. 622–626. 10.1109/71.159045.
  29. Book: Arndt . Jörg. Matters Computational. Ideas, Algorithms, Source Code. 2011. Springer. 10.1007/978-3-642-14764-7. 978-3-642-14763-0.
  30. Book: Popp, O.T. . Quickly Handling Big Permutations. 2002. priv. comm..
  31. Web site: 3GPP TS 36.212.
  32. Shlomi . Dolev . Limor . Lahiani . Yinnon . Haviv . Unique permutation hashing . Theoretical Computer Science . 475 . 2013 . 59–65 . 10.1016/j.tcs.2012.12.047 . free .