Hausdorff maximal principle explained

In mathematics, the Hausdorff maximal principle is an alternate and earlier formulation of Zorn's lemma proved by Felix Hausdorff in 1914 (Moore 1982:168). It states that in any partially ordered set, every totally ordered subset is contained in a maximal totally ordered subset, where "maximal" is with respect to set inclusion.

In a partially ordered set, a totally ordered subset is also called a chain. Thus, the maximal principle says every chain in the set extends to a maximal chain.

The Hausdorff maximal principle is one of many statements equivalent to the axiom of choice over ZF (Zermelo–Fraenkel set theory without the axiom of choice). The principle is also called the Hausdorff maximality theorem or the Kuratowski lemma (Kelley 1955:33).

Statement

P

, every chain

C0

(i.e., a totally ordered subset) is contained in a maximal chain

C

(i.e., a chain that is not contained in a strictly larger chain in

P

). In general, there may be several maximal chains containing a given chain.

An equivalent form of the Hausdorff maximal principle is that in every partially ordered set, there exists a maximal chain. (Note if the set is empty, the empty subset is a maximal chain.)

This form follows from the original form since the empty set is a chain. Conversely, to deduce the original form from this form, consider the set

P'

of all chains in

P

containing a given chain

C0

in

P

. Then

P'

is partially ordered by set inclusion. Thus, by the maximal principle in the above form,

P'

contains a maximal chain

C'

. Let

C

be the union of

C'

, which is a chain in

P

since a union of a totally ordered set of chains is a chain. Since

C

contains

C0

, it is an element of

P'

. Also, since any chain containing

C

is contained in

C

as

C

is a union,

C

is in fact a maximal element of

P'

; i.e., a maximal chain in

P

.

The proof that the Hausdorff maximal principle is equivalent to Zorn's lemma is somehow similar to this proof. Indeed, first assume Zorn's lemma. Since a union of a totally ordered set of chains is a chain, the hypothesis of Zorn's lemma (every chain has an upper bound) is satisfied for

P'

and thus

P'

contains a maximal element or a maximal chain in

P

.

Conversely, if the maximal principle holds, then

P

contains a maximal chain

C

. By the hypothesis of Zorn's lemma,

C

has an upper bound

x

in

P

. If

y\gex

, then

\widetilde{C}=C\cup\{y\}

is a chain containing

C

and so by maximality,

\widetilde{C}=C

; i.e.,

y\inC

and so

y=x

.

\square

Examples

If A is any collection of sets, the relation "is a proper subset of" is a strict partial order on A. Suppose that A is the collection of all circular regions (interiors of circles) in the plane. One maximal totally ordered sub-collection of A consists of all circular regions with centers at the origin. Another maximal totally ordered sub-collection consists of all circular regions bounded by circles tangent from the right to the y-axis at the origin.

If (x0, y0) and (x1, y1) are two points of the plane

R2

, define (x0, y0) < (x1, y1) if y0 = y1 and x0 < x1. This is a partial ordering of

R2

under which two points are comparable only if they lie on the same horizontal line. The maximal totally ordered sets are horizontal lines in

R2

.

Application

H

contains a maximal orthonormal subset

A

as follows. (This fact can be stated as saying that

H\simeq\ell2(A)

as Hilbert spaces.)

Let

P

be the set of all orthonormal subsets of the given Hilbert space

H

, which is partially ordered by set inclusion. It is nonempty as it contains the empty set and thus by the maximal principle, it contains a maximal chain

Q

. Let

A

be the union of

Q

. We shall show it is a maximal orthonormal subset. First, if

S,T

are in

Q

, then either

S\subsetT

or

T\subsetS

. That is, any given two distinct elements in

A

are contained in some

S

in

Q

and so they are orthogonal to each other (and of course,

A

is a subset of the unit sphere in

H

). Second, if

B\supsetneqA

for some

B

in

P

, then

B

cannot be in

Q

and so

Q\cup\{B\}

is a chain strictly larger than

Q

, a contradiction.

\square

For the purpose of comparison, here is a proof of the same fact by Zorn's lemma. As above, let

P

be the set of all orthonormal subsets of

H

. If

Q

is a chain in

P

, then the union of

Q

is also orthonormal by the same argument as above and so is an upper bound of

Q

. Thus, by Zorn's lemma,

P

contains a maximal element

A

. (So, the difference is that the maximal principle gives a maximal chain while Zorn's lemma gives a maximal element directly.)

Proof 1

The idea of the proof is essentially due to Zermelo and is to prove the following weak form of Zorn's lemma, from the axiom of choice.

Let

F

be a nonempty set of subsets of some fixed set, ordered by set inclusion, such that (1) the union of each totally ordered subset of

F

is in

F

and (2) each subset of a set in

F

is in

F

. Then

F

has a maximal element.

(Zorn's lemma itself also follows from this weak form.) The maximal principle follows from the above since the set of all chains in

P

satisfies the above conditions.

By the axiom of choice, we have a function

f:ak{P}(P)-\{\emptyset\}\toP

such that

f(S)\inS

for the power set

ak{P}(P)

of

P

.

For each

C\inF

, let

C*

be the set of all

x\inP-C

such that

C\cup\{x\}

is in

F

. If

C*=\emptyset

, then let

\widetilde{C}=C

. Otherwise, let

\widetilde{C}=C\cup\{f(C*)\}.

Note

C

is a maximal element if and only if

\widetilde{C}=C

. Thus, we are done if we can find a

C

such that

\widetilde{C}=C

.

Fix a

C0

in

F

. We call a subset

T\subsetF

a tower (over

C0

)
if

C0

is in

T

.
  1. The union of each totally ordered subset

T'\subsetT

is in

T

, where "totally ordered" is with respect to set inclusion.
  1. For each

C

in

T

,

\widetilde{C}

is in

T

.

There exists at least one tower; indeed, the set of all sets in

F

containing

C0

is a tower. Let

T0

be the intersection of all towers, which is again a tower.

Now, we shall show

T0

is totally ordered. We say a set

C

is comparable in

T0

if for each

A

in

T0

, either

A\subsetC

or

C\subsetA

. Let

\Gamma

be the set of all sets in

T0

that are comparable in

T0

. We claim

\Gamma

is a tower. The conditions 1. and 2. are straightforward to check. For 3., let

C

in

\Gamma

be given and then let

U

be the set of all

A

in

T0

such that either

A\subsetC

or

\widetilde{C}\subsetA

.

We claim

U

is a tower. The conditions 1. and 2. are again straightforward to check. For 3., let

A

be in

U

. If

A\subsetC

, then since

C

is comparable in

T0

, either

\widetilde{A}\subsetC

or

C\subset\widetilde{A}

. In the first case,

\widetilde{A}

is in

U

. In the second case, we have

A\subsetC\subset\widetilde{A}

, which implies either

A=C

or

C=\widetilde{A}

. (This is the moment we needed to collapse a set to an element by the axiom of choice to define

\widetilde{A}

.) Either way, we have

\widetilde{A}

is in

U

. Similarly, if

C\subsetA

, we see

\widetilde{A}

is in

U

. Hence,

U

is a tower. Now, since

U\subsetT0

and

T0

is the intersection of all towers,

U=T0

, which implies

\widetilde{C}

is comparable in

T0

; i.e., is in

\Gamma

. This completes the proof of the claim that

\Gamma

is a tower.

Finally, since

\Gamma

is a tower contained in

T0

, we have

T0=\Gamma

, which means

T0

is totally ordered.

Let

C

be the union of

T0

. By 2.,

C

is in

T0

and then by 3.,

\widetildeC

is in

T0

. Since

C

is the union of

T0

,

\widetildeC\subsetC

and thus

\widetildeC=C

.

\square

Proof 2

The Bourbaki–Witt theorem can also be used to prove the Hausdorff maximal principle:

References

. Walter Rudin . Real and Complex Analysis (International Series in Pure and Applied Mathematics) . McGraw-Hill . 1986 . 978-0-07-054234-1.