Intuitionistic type theory explained

Intuitionistic type theory (also known as constructive type theory, or Martin-Löf type theory (MLTT)) is a type theory and an alternative foundation of mathematics. Intuitionistic type theory was created by Per Martin-Löf, a Swedish mathematician and philosopher, who first published it in 1972. There are multiple versions of the type theory: Martin-Löf proposed both intensional and extensional variants of the theory, and early impredicative versions, shown to be inconsistent by Girard's paradox, gave way to predicative versions. However, all versions keep the core design of constructive logic using dependent types.

Design

Martin-Löf designed the type theory on the principles of mathematical constructivism. Constructivism requires any existence proof to contain a "witness". So, any proof of "there exists a prime greater than 1000" must identify a specific number that is both prime and greater than 1000. Intuitionistic type theory accomplished this design goal by internalizing the BHK interpretation. A useful consequence is that proofs become mathematical objects that can be examined, compared, and manipulated.

Intuitionistic type theory's type constructors were built to follow a one-to-one correspondence with logical connectives. For example, the logical connective called implication (

A \implies B

) corresponds to the type of a function (

A \to B

). This correspondence is called the Curry–Howard isomorphism. Prior type theories had also followed this isomorphism, but Martin-Löf's was the first to extend it to predicate logic by introducing dependent types.
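
The correspondence between implication and function types can be sketched in a proof assistant. The snippet below uses Lean 4 purely as an illustration; Lean is based on a related dependent type theory rather than on Martin-Löf's own formulation, and the names `A`, `B`, `C`, `f`, `g` are arbitrary.

```lean
-- Modus ponens: a proof of A → B applied to a proof of A yields a proof of B.
-- Logically this is an inference; computationally it is function application.
example (A B : Prop) (f : A → B) (a : A) : B := f a

-- Transitivity of implication is function composition.
example (A B C : Prop) (f : A → B) (g : B → C) : A → C :=
  fun a => g (f a)
```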

Type theory

Intuitionistic type theory has three finite types, which are then composed using five different type constructors. Unlike set theories, type theories are not built on top of a logic like Frege's. So, each feature of the type theory does double duty as a feature of both math and logic.

If you are unfamiliar with type theory and know set theory, a quick summary is: Types contain terms just like sets contain elements. Terms belong to one and only one type. Terms like

2+2

and

2 ⋅ 2

compute ("reduce") down to canonical terms like 4. For more, see the article on type theory.

0 type, 1 type and 2 type

There are three finite types: The 0 type contains 0 terms. The 1 type contains 1 canonical term. And the 2 type contains 2 canonical terms.

Because the 0 type contains 0 terms, it is also called the empty type. It is used to represent anything that cannot exist. It is also written

\bot

and represents anything unprovable. (That is, a proof of it cannot exist.) As a result, negation is defined as a function to it:

\neg A := A \to \bot

.
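
As a small illustration of negation as a function into the empty type, the following Lean 4 sketch relies on the fact that Lean defines `¬A` as `A → False`, where `False` plays the role of ⊥ for propositions. Lean is used here only as an example system.

```lean
-- `¬A` unfolds to `A → False`, so the two are equal by reflexivity.
example (A : Prop) : (¬A) = (A → False) := rfl

-- A proof of A together with a proof of ¬A yields a proof of False:
-- the negation is simply applied, as a function, to the proof of A.
example (A : Prop) (a : A) (na : ¬A) : False := na a
```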

Likewise, the 1 type contains 1 canonical term and represents existence. It also is called the unit type.

Finally, the 2 type contains 2 canonical terms. It represents a definite choice between two values. It is used for Boolean values but not propositions.

Propositions are instead represented by particular types. For instance, a true proposition can be represented by the 1 type, while a false proposition can be represented by the 0 type. But we cannot assert that these are the only propositions, i.e. the law of excluded middle does not hold for propositions in intuitionistic type theory.
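
The three finite types have direct counterparts in most proof assistants. The Lean 4 sketch below assumes Lean's built-in `Empty`, `Unit`, and `Bool` as stand-ins for the 0, 1, and 2 types; the helper names `fromEmpty` and `choose` are made up for this example.

```lean
-- The 0 type: `Empty` has no constructors, so a function out of it
-- needs no cases at all.
def fromEmpty (A : Type) : Empty → A :=
  fun e => nomatch e

-- The 1 type: `Unit` has exactly one canonical term, written `()`.
example : Unit := ()

-- The 2 type: `Bool` has two canonical terms, and a function out of it
-- is a definite choice between two values.
def choose (A : Type) (x y : A) : Bool → A
  | true  => x
  | false => y

example : choose Nat 1 2 true = 1 := rfl
```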

Σ type constructor

Σ-types contain ordered pairs. As with typical ordered pair (or 2-tuple) types, a Σ-type can describe the Cartesian product,

A \times B

, of two other types,

A

and

B

. Logically, such an ordered pair would hold a proof of

A

and a proof of

B

, so one may see such a type written as

A \wedge B

.

Σ-types are more powerful than typical ordered pair types because of dependent typing. In the ordered pair, the type of the second term can depend on the value of the first term. For example, the first term of the pair might be a natural number and the second term's type might be a sequence of reals of length equal to the first term. Such a type would be written:

\sum_{n : \mathbb{N}} \operatorname{Vec}(\mathbb{R}, n)

Using set-theory terminology, this is similar to an indexed disjoint union of sets. In the case of the usual Cartesian product, the type of the second term does not depend on the value of the first term. Thus the type describing the Cartesian product

\mathbb{N} \times \mathbb{R}

is written:

\sum_{n : \mathbb{N}} \mathbb{R}

Note that the type of the second term,

\mathbb{R}

, does not depend on the value of the first term,

n

.

Σ-types can be used to build up longer dependently-typed tuples used in mathematics and the records or structs used in most programming languages. An example of a dependently-typed 3-tuple is two integers and a proof that the first integer is smaller than the second integer, described by the type:

\sum_{m : \mathbb{Z}} {\sum_{n : \mathbb{Z}} ((m < n) = \text{True})}

Dependent typing allows Σ-types to serve the role of existential quantifier. The statement "there exists an

n

of type

\mathbb{N}

, such that

P(n)

is proven" becomes the type of ordered pairs where the first item is the value

n

of type

\mathbb{N}

and the second item is a proof of

P(n)

. Notice that the type of the second item (proofs of

P(n)

) depends on the value in the first part of the ordered pair (

n

). Its type would be:

\sum_{n : \mathbb{N}} P(n)
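
The uses of Σ-types described in this section can be sketched in Lean 4. This is an illustration under several assumptions: `Vec` is a hypothetical length-indexed vector type defined just for the example, natural numbers stand in for the reals (which are not available without extra libraries), and Lean's `Sigma`, `PSigma`, and subtype constructions play the role of the Σ-type.

```lean
-- Hypothetical length-indexed vector type, standing in for Vec(R, n).
inductive Vec (α : Type) : Nat → Type where
  | nil  : Vec α 0
  | cons {n : Nat} : α → Vec α n → Vec α (n + 1)

-- A dependent pair: the type of the second component depends on the
-- value of the first component.
def lengthAndVec : (n : Nat) × Vec Nat n :=
  ⟨2, Vec.cons 10 (Vec.cons 20 Vec.nil)⟩

-- The ordinary product is the special case in which the second type
-- ignores the first value.
def plainPair : Sigma (fun (_ : Nat) => Bool) := ⟨3, true⟩

-- A dependently typed 3-tuple: two integers and a proof that the first
-- is smaller than the second.
def orderedInts : (m : Int) ×' (n : Int) ×' (m < n) :=
  ⟨1, 2, by decide⟩

-- Existential quantification: a witness packaged with a proof about it.
def bigNumber : { n : Nat // n > 1000 } := ⟨1001, by decide⟩

-- The witness can be extracted from the pair again.
example : bigNumber.val = 1001 := rfl
```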

Π type constructor

Π-types contain functions. As with typical function types, they consist of an input type and an output type. They are more powerful than typical function types, however, in that the return type can depend on the input value. Functions in type theory differ from functions in set theory. In set theory, you look up the argument's value in a set of ordered pairs. In type theory, the argument is substituted into a term and then computation ("reduction") is applied to the term.

As an example, the type of a function that, given a natural number

n

, returns a vector containing

n

real numbers is written:

\prod_{n : \mathbb{N}} \operatorname{Vec}(\mathbb{R}, n)

When the output type does not depend on the input value, the function type is often simply written with a

\to

. Thus,

\mathbb{N} \to \mathbb{R}

is the type of functions from natural numbers to real numbers. Such Π-types correspond to logical implication. The logical proposition

A \implies B

corresponds to the type

A \to B

, containing functions that take proofs-of-A and return proofs-of-B. This type could be written more consistently as:

\prod_{a : A} B

Π-types are also used in logic for universal quantification. The statement "for every

n

of type

\mathbb{N}

,

P(n)

is proven" becomes a function from

n

of type

\mathbb{N}

to proofs of

P(n)

. Thus, given the value for

n

the function generates a proof that

P(\cdot)

holds for that value. The type would be

\prod_{n : \mathbb{N}} P(n)
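
A minimal Lean 4 sketch of the three roles of Π-types discussed here: dependent functions, ordinary functions, and universal quantification. The `Vec` type and the names `zeros`, `double`, and `add_zero_right` are assumptions introduced for the example, and natural numbers again stand in for the reals.

```lean
-- Hypothetical length-indexed vector type, standing in for Vec(R, n).
inductive Vec (α : Type) : Nat → Type where
  | nil  : Vec α 0
  | cons {n : Nat} : α → Vec α n → Vec α (n + 1)

-- A dependent function: the return type `Vec Nat n` depends on the
-- input value `n`.
def zeros : (n : Nat) → Vec Nat n
  | 0     => Vec.nil
  | n + 1 => Vec.cons 0 (zeros n)

-- When the output type does not depend on the input, the Π-type is
-- written with a plain arrow.
def double : Nat → Nat := fun n => 2 * n

-- Universal quantification: a function that, given any `n`, returns a
-- proof of the proposition about that `n`.
theorem add_zero_right : ∀ n : Nat, n + 0 = n :=
  fun _ => rfl
```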

= type constructor

=-types are created from two terms. Given two terms like

2+2

and

2 ⋅ 2

, you can create a new type

2+2=2 ⋅ 2

. The terms of that new type represent proofs that the pair reduce to the same canonical term. Thus, since both

2+2

and

2 ⋅ 2

compute to the canonical term

4

, there will be a term of the type

2+2=2 ⋅ 2

. In intuitionistic type theory, there is a single way to introduce =-types and that is by reflexivity:

\operatorname{refl} : \prod_{a : A} (a = a).

It is possible to create =-types such as

1=2

where the terms do not reduce to the same canonical term, but you will be unable to create terms of that new type. In fact, if you were able to create a term of

1=2

, you could create a term of

\bot

. Putting that into a function would generate a function of type

(1 = 2) \to \bot

. Since

\ldots\to\bot

is how intuitionistic type theory defines negation, you would have

\neg(1=2)

or, finally,

1 \neq 2

.
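
The behaviour of =-types can be tried out in Lean 4, used here only as an illustration. In the sketch below, `rfl` is Lean's name for the reflexivity term, and `one_neq_two_example` is a name invented for this example.

```lean
-- Both sides compute to the canonical term 4, so reflexivity suffices.
example : 2 + 2 = 2 * 2 := rfl

-- Reflexivity is available for every term of every type.
example (A : Type) (a : A) : a = a := rfl

-- `1 = 2` is a perfectly good type, but it has no terms: a function from
-- it into the empty proposition exists, which is exactly its negation.
theorem one_neq_two_example : ¬ ((1 : Nat) = 2) := by decide

-- Any hypothetical term of `1 = 2` would therefore yield a term of False.
example (h : (1 : Nat) = 2) : False := one_neq_two_example h

-- Since `a ≠ b` abbreviates `¬ (a = b)`, the same term proves 1 ≠ 2.
example : (1 : Nat) ≠ 2 := one_neq_two_example
```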

Equality of proofs is an area of active research in proof theory and has led to the development of homotopy type theory and other type theories.

Inductive types

Inductive types allow the creation of complex, self-referential types. For example, a linked list of natural numbers is either an empty list or a pair of a natural number and another linked list. Inductive types can be used to define unbounded mathematical structures like trees, graphs, etc. In fact, the natural numbers type may be defined as an inductive type, either being

0

or the successor of another natural number.

Inductive types define new constants, such as zero

0 : \mathbb{N}

and the successor function

S : \mathbb{N} \to \mathbb{N}

. Since

S

does not have a definition and cannot be evaluated using substitution, terms like

S0

and

SSS0

become the canonical terms of the natural numbers.

Proofs on inductive types are made possible by induction. Each new inductive type comes with its own inductive rule. To prove a predicate

P(\cdot)

for every natural number, you use the following rule:

{\operatorname{\mathbb{N}-elim}} : P(0) \to \left(\prod_{n : \mathbb{N}} P(n) \to P(S(n))\right) \to \prod_{n : \mathbb{N}} P(n)

Inductive types in intuitionistic type theory are defined in terms of W-types, the type of well-founded trees. Later work in type theory generated coinductive types, induction-recursion, and induction-induction for working on types with more obscure kinds of self-referentiality. Higher inductive types allow equality to be defined between terms.
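
The natural numbers and their induction rule can be sketched in Lean 4 as follows. The type `N` and the theorem name `N.ind` are assumptions made for this example; Lean generates its own general eliminator `N.rec` automatically, and `N.ind` restates it for predicates in the shape of the rule given above.

```lean
-- Natural numbers as an inductive type: zero, or the successor of a number.
inductive N where
  | zero : N
  | succ : N → N

-- The eliminator that Lean derives from the declaration.
#check @N.rec

-- The induction rule written as a dependent function: a proof for zero
-- and a step from n to its successor give a proof for every n.
theorem N.ind {P : N → Prop} (base : P N.zero)
    (step : ∀ n, P n → P (N.succ n)) : ∀ n, P n
  | N.zero   => base
  | N.succ n => step n (N.ind base step n)

-- Usage: a (trivial) property proved for all n by the rule.
example : ∀ n : N, n = n :=
  N.ind (P := fun n => n = n) rfl (fun _ _ => rfl)
```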

Universe types

The universe types allow proofs to be written about all the types created with the other type constructors. Every term in the universe type

\mathcal{U}_0

can be mapped to a type created with any combination of

0,1,2,\Sigma,\Pi,=,

and the inductive type constructor. However, to avoid paradoxes, there is no term in

\mathcal{U}_n

that maps to

\mathcal{U}_n

for any

n \in \mathbb{N}

.[1]

To write proofs about all "the small types" and

\mathcal{U}_0

, you must use

\mathcal{U}_1

, which does contain a term for

\mathcal{U}_0

, but not for itself

\mathcal{U}_1

. Similarly, for

\mathcal{U}_2

. There is a predicative hierarchy of universes, so to quantify a proof over any fixed constant

k

universes, you can use

\mathcal{U}_{k+1}

.
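
A universe hierarchy of this kind can be inspected in Lean 4, whose `Type`, `Type 1`, `Type 2`, ... are used below as an analogue of the universes described above. This is only an illustration: Lean's universes differ in detail from Martin-Löf's (for example, Lean also has an impredicative universe of propositions).

```lean
-- A small type is a term of the first universe.
#check (Nat : Type)

-- The first universe is itself a term of the next one, never of itself.
#check (Type : Type 1)
#check (Type 1 : Type 2)

-- A statement that quantifies over all small types lives one level up.
#check ((A : Type) → A → A : Type 1)

-- The same pattern holds at every level u.
universe u
example : Type (u + 1) := Type u
```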

Universe types are a tricky feature of type theories. Martin-Löf's original type theory had to be changed to account for Girard's paradox. Later research covered topics such as "super universes", "Mahlo universes", and impredicative universes.

Judgements

The formal definition of intuitionistic type theory is written using judgements. For example, in the statement "if

A

is a type and

B

is a type then

\textstyle \sum_{a : A} B

is a type" there are judgements of "is a type", "and", and "if ... then ...". The expression

\textstyle \sum_{a : A} B

is not a judgement; it is the type being defined.

This second level of the type theory can be confusing, particularly where it comes to equality. There is a judgement of term equality, which might say

4=2+2

. It is a statement that two terms reduce to the same canonical term. There is also a judgement of type equality, say that

A=B

, which means every element of

A

is an element of the type

B

and vice versa. At the type level, there is a type

4=2+2

and it contains terms if there is a proof that

4

and

2+2

reduce to the same value. (Terms of this type are generated using the term-equality judgement.) Lastly, there is an English-language level of equality, because we use the word "four" and symbol "

4

" to refer to the canonical term

SSSS0

. Synonyms like these are called "definitionally equal" by Martin-Löf.

The description of judgements below is based on the discussion in Nordström, Petersson, and Smith.

The formal theory works with types and objects.

A type is declared by:

A\ \mathsf{Type}

An object exists and is in a type if:

a : A

Objects can be equal

a=b

and types can be equal

A=B

A type that depends on an object from another type is declared

(x : A)B

and removed by substitution

B[x/a]

, replacing the variable

x

with the object

a

in

B

. An object that depends on an object from another type can be introduced in two ways. If the object is "abstracted", then it is written

[x]b

and removed by substitution

b[x/a]

, replacing the variable

x

with the object

a

in

b

. The object-depending-on-object can also be declared as a constant as part of a recursive type. An example of a recursive type is:

0 : \mathbb{N}

S : \mathbb{N} \to \mathbb{N}

Here,

S

is a constant object-depending-on-object. It is not associated with an abstraction. Constants like

S

can be removed by defining equality. Here the relationship with addition is defined using equality and using pattern matching to handle the recursive aspect of

S

:

\begin{align} \operatorname{add} &: (\mathbb{N} \times \mathbb{N}) \to \mathbb{N} \\ \operatorname{add}(0, b) &= b \\ \operatorname{add}(S(a), b) &= S(\operatorname{add}(a, b)) \end{align}

S

is manipulated as an opaque constant: it has no internal structure for substitution.
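
The definition of addition by pattern matching on the constant S can be sketched in Lean 4. The type `N` and function `add` below are illustrative names, and the function is written in curried form (taking its two arguments one after the other) rather than on a pair, which is an assumption made to keep the recursion structural.

```lean
-- The recursive type with its two constants.
inductive N where
  | zero : N
  | succ : N → N

-- Addition defined by equations that pattern-match on the first argument,
-- mirroring add(0, b) = b and add(S(a), b) = S(add(a, b)).
def add : N → N → N
  | N.zero,   b => b
  | N.succ a, b => N.succ (add a b)

-- 1 + 1 computes to the canonical term S(S(0)).
example : add (N.succ N.zero) (N.succ N.zero) = N.succ (N.succ N.zero) := rfl
```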

So, objects and types and these relations are used to express formulae in the theory. The following styles of judgements are used to create new objects, types and relations from existing ones:

\Gamma \vdash \sigma\ \mathsf{Type}

σ is a well-formed type in the context Γ.

\Gamma \vdash t : \sigma

t is a well-formed term of type σ in context Γ.

\Gamma\vdash\sigma\equiv\tau

σ and τ are equal types in context Γ.

\Gamma \vdash t \equiv u : \sigma

t and u are judgmentally equal terms of type σ in context Γ.

\vdash \Gamma\ \mathsf{Context}

Γ is a well-formed context of typing assumptions.
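
These judgement forms have rough counterparts in what a type checker verifies. The Lean 4 sketch below is an informal analogy only: Lean does not expose judgements as such, and the pairing of each line with a judgement form is an interpretation made for this example.

```lean
-- "σ is a well-formed type": the checker accepts `Nat → Nat` as a type.
#check (Nat → Nat : Type)

-- "t is a well-formed term of type σ in context Γ", with Γ = (x : Nat).
example (x : Nat) : Nat := x + 1

-- "σ and τ are equal types": a term of one type is accepted at the other
-- because the two types are convertible.
example (x : (fun (A : Type) => A) Nat) : Nat := x

-- "t and u are judgmentally equal terms": `x + 0` reduces to `x`, so
-- reflexivity is accepted without further proof.
example (x : Nat) : x + 0 = x := rfl
```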

By convention, there is a type that represents all other types. It is called

\mathcal{U}

(or

\operatorname{Set}

). Since

\mathcal{U}

is a type, the members of it are objects. There is a dependent type

\operatorname{El}

that maps each object to its corresponding type. In most texts

\operatorname{El}

is never written.
From the context of the statement, a reader can almost always tell whether

A

refers to a type, or whether it refers to the object in

\mathcal{U}

that corresponds to the type.

This is the complete foundation of the theory. Everything else is derived.

To implement logic, each proposition is given its own type. The objects in those types represent the different possible ways to prove the proposition. If there is no proof for the proposition, then the type has no objects in it. Operators like "and" and "or" that work on propositions introduce new types and new objects. So

A \times B

is a type that depends on the type

A

and the type

B

. The objects in that dependent type are defined to exist for every pair of objects in

A

and

B

. If

A

or

B

has no proof and is an empty type, then the new type representing

A \times B

is also empty.
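
The reading of "and" and "or" as type constructors can be illustrated in Lean 4, taking Lean's product and sum types as stand-ins for the types described above. The variable names are arbitrary.

```lean
-- A proof of "A and B" is a pair of a proof of A and a proof of B.
example (A B : Type) (a : A) (b : B) : A × B := (a, b)

-- If A is empty, then A × B is empty too: any pair would contain a term
-- of A, which cannot exist.
example (A B : Type) (noA : A → Empty) : A × B → Empty :=
  fun p => noA p.1

-- A proof of "A or B" is a proof of one side, tagged with which side it is.
example (A B : Type) (a : A) : Sum A B := Sum.inl a
```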

This can be done for other types (booleans, natural numbers, etc.) and their operators.

Categorical models of type theory

Using the language of category theory, R. A. G. Seely introduced the notion of a locally cartesian closed category (LCCC) as the basic model of type theory. This has been refined by Hofmann and Dybjer to Categories with Families or Categories with Attributes based on earlier work by Cartmell.[2]

A category with families is a category C of contexts (in which the objects are contexts, and the context morphisms are substitutions), together with a functor T : C^op → Fam(Set).

Fam(Set) is the category of families of sets, in which objects are pairs (A, B) of an "index set" A and a function B : X → A, and morphisms are pairs of functions f : A → A' and g : X → X' such that B' ∘ g = f ∘ B; in other words, g maps the fiber B_a into the fiber B'_{f(a)}.

The functor T assigns to a context G a set Ty(G) of types, and for each A : Ty(G), a set Tm(G, A) of terms. The axioms for a functor require that these play harmoniously with substitution. Substitution is usually written in the form Af or af, where A is a type in Ty(G) and a is a term in Tm(G, A), and f is a substitution from D to G. Here Af : Ty(D) and af : Tm(D, Af).

The category C must contain a terminal object (the empty context), and a final object for a form of product called comprehension, or context extension, in which the right element is a type in the context of the left element. If G is a context, and A : Ty(G), then there should be an object (G, A) final among contexts D with mappings p : D → G, q : Tm(D, Ap).

A logical framework, such as Martin-Löf's, takes the form of closure conditions on the context-dependent sets of types and terms: that there should be a type called Set, and for each set a type, that the types should be closed under forms of dependent sum and product, and so forth.

A theory such as that of predicative set theory expresses closure conditions on the types of sets and their elements: that they should be closed under operations that reflect dependent sum and product, and under various forms of inductive definition.

Extensional versus intensional

A fundamental distinction is extensional vs intensional type theory. In extensional type theory, definitional (i.e., computational) equality is not distinguished from propositional equality, which requires proof. As a consequence, type checking becomes undecidable in extensional type theory because programs in the theory might not terminate. For example, such a theory allows one to give a type to the Y-combinator; a detailed example of this can be found in Nordström, Petersson, and Smith, Programming in Martin-Löf's Type Theory.[3] However, this does not prevent extensional type theory from being a basis for a practical tool; for example, Nuprl is based on extensional type theory.

In contrast, in intensional type theory type checking is decidable, but the representation of standard mathematical concepts is somewhat more cumbersome, since intensional reasoning requires using setoids or similar constructions. There are many common mathematical objects that are hard to work with or cannot be represented without this, for example, integer numbers, rational numbers, and real numbers. Integers and rational numbers can be represented without setoids, but this representation is difficult to work with. Cauchy real numbers cannot be represented without this.[4]
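
The gap between definitional and propositional equality in an intensional theory can be seen in a small Lean 4 sketch. It assumes Lean's built-in addition on `Nat`, which is defined by recursion on its second argument; `zero_add_example` is a name chosen for this example.

```lean
-- `n + 0` reduces to `n` by the defining equations of addition, so the
-- equality holds definitionally and reflexivity is accepted.
example (n : Nat) : n + 0 = n := rfl

-- `0 + n` does not reduce when `n` is a variable, so reflexivity alone is
-- not accepted here. The equality is still provable, but only
-- propositionally, by induction on `n`.
theorem zero_add_example (n : Nat) : 0 + n = n := by
  induction n with
  | zero => rfl
  | succ k ih => rw [Nat.add_succ, ih]
```

In an extensional theory the second equation could be used by the type checker as freely as the first, which is the source of the undecidability mentioned above.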

Homotopy type theory works on resolving this problem. It allows one to define higher inductive types, which not only define first-order constructors (values or points), but higher-order constructors, i.e. equalities between elements (paths), equalities between equalities (homotopies), ad infinitum.

Implementations of type theory

Different forms of type theory have been implemented as the formal systems underlying a number of proof assistants. While many are based on Per Martin-Löf's ideas, many have added features, more axioms, or a different philosophical background. For instance, the Nuprl system is based on computational type theory[5] and Coq is based on the calculus of (co)inductive constructions. Dependent types also feature in the design of programming languages such as ATS, Cayenne, Epigram, Agda,[6] and Idris.[7]

Martin-Löf type theories

Per Martin-Löf constructed several type theories that were published at various times, some of them much later than when the preprints with their description became accessible to specialists (among others Jean-Yves Girard and Giovanni Sambin). The list below attempts to list all the theories that have been described in a printed form and to sketch the key features that distinguished them from each other. All of these theories had dependent products, dependent sums, disjoint unions, finite types and natural numbers. All the theories had the same reduction rules that did not include η-reduction either for dependent products or for dependent sums, except for MLTT79 where the η-reduction for dependent products is added.

MLTT71 was the first type theory created by Per Martin-Löf. It appeared in a preprint in 1971. It had one universe, but this universe had a name in itself, i.e., it was a type theory with, as it is called today, "Type in Type". Jean-Yves Girard has shown that this system was inconsistent, and the preprint was never published.

MLTT72 was presented in a 1972 preprint that has now been published.[8] That theory had one universe V and no identity types (=-types). The universe was "predicative" in the sense that the dependent product of a family of objects from V over an object that was not in V such as, for example, V itself, was not assumed to be in V. The universe was à la Russell's Principia Mathematica, i.e., one would write directly "T∈V" and "t∈T" (Martin-Löf uses the sign "∈" instead of modern ":") without an added constructor such as "El".

MLTT73 was the first definition of a type theory that Per Martin-Löf published (it was presented at the Logic Colloquium '73 and published in 1975[9]). There are identity types, which he describes as "propositions", but since no real distinction between propositions and the rest of the types is introduced, the meaning of this is unclear. There is what later acquires the name of J-eliminator, but as yet without a name (see pp. 94–95). There is in this theory an infinite sequence of universes V0, ..., Vn, ... . The universes are predicative, à la Russell and non-cumulative. In fact, Corollary 3.10 on p. 115 says that if A∈Vm and B∈Vn are such that A and B are convertible then m = n. This means, for example, that it would be difficult to formulate the univalence axiom in this theory: there are contractible types in each of the Vi, but it is unclear how to declare them to be equal since there are no identity types connecting Vi and Vj for i ≠ j.

MLTT79 was presented in 1979 and published in 1982.[10] In this paper, Martin-Löf introduced the four basic types of judgement for the dependent type theory that has since become fundamental in the study of the meta-theory of such systems. He also introduced contexts as a separate concept in it (see p. 161). There are identity types with the J-eliminator (which already appeared in MLTT73 but did not have this name there) but also with the rule that makes the theory "extensional" (p. 169). There are W-types. There is an infinite sequence of predicative universes that are cumulative.

Bibliopolis: there is a discussion of a type theory in the Bibliopolis book from 1984,[11] but it is somewhat open-ended and does not seem to represent a particular set of choices and so there is no specific type theory associated with it.

Notes and References

  1. Bertot, Yves; Castéran, Pierre (2004). Interactive theorem proving and program development: Coq'Art: the calculus of inductive constructions. Texts in theoretical computer science. Berlin Heidelberg: Springer. ISBN 978-3-540-20854-9.
  2. Clairambault, Pierre; Dybjer, Peter (2014). "The biequivalence of locally cartesian closed categories and Martin-Löf type theories". Mathematical Structures in Computer Science. 24 (6). doi:10.1017/S0960129513000881. ISSN 0960-1295. arXiv:1112.3456. S2CID 416274.
  3. Nordström, Bengt; Petersson, Kent; Smith, Jan M. (1990). Programming in Martin-Löf's Type Theory. Oxford University Press. p. 90.
  4. Altenkirch, Thorsten; Anberrée, Thomas; Li, Nuo. "Definable Quotients in Type Theory". Archived 2024-04-19 at https://web.archive.org/web/20240419114156/http://www.cs.nott.ac.uk/~psztxa/publ/defquotients.pdf.
  5. Allen, S.F.; Bickford, M.; Constable, R.L.; Eaton, R.; Kreitz, C.; Lorigo, L.; Moran, E. (2006). "Innovations in computational type theory using Nuprl". Journal of Applied Logic. 4 (4): 428–469. doi:10.1016/j.jal.2005.10.005.
  6. Norell, Ulf (2009). "Dependently typed programming in Agda". Proceedings of the 4th international workshop on Types in language design and implementation. TLDI '09. New York, NY, USA: ACM. pp. 1–2. doi:10.1145/1481861.1481862. ISBN 9781605584201. CiteSeerX 10.1.1.163.7149. S2CID 1777213.
  7. Brady, Edwin (2013). "Idris, a general-purpose dependently typed programming language: Design and implementation". Journal of Functional Programming. 23 (5): 552–593. doi:10.1017/S095679681300018X. ISSN 0956-7968. S2CID 19895964.
  8. Martin-Löf, Per (1998). An intuitionistic theory of types, Twenty-five years of constructive type theory (Venice, 1995). Oxford Logic Guides. Vol. 36. New York: Oxford University Press. pp. 127–172.
  9. Martin-Löf, Per (1975). "An intuitionistic theory of types: predicative part". Logic Colloquium '73 (Bristol, 1973). Studies in Logic and the Foundations of Mathematics. Vol. 80. Amsterdam: North-Holland. pp. 73–118.
  10. Martin-Löf, Per (1982). "Constructive mathematics and computer programming". Logic, methodology and philosophy of science, VI (Hannover, 1979). Studies in Logic and the Foundations of Mathematics. Vol. 104. Amsterdam: North-Holland. pp. 153–175.
  11. Martin-Löf, Per (1984). Intuitionistic type theory, Studies in Proof Theory (lecture notes by Giovanni Sambin). Vol. 1. Bibliopolis. pp. iv, 91.