Constraint Handling Rules (CHR) | |
Paradigms: | Constraint logic, declarative |
Designer: | Thom Frühwirth |
Influenced By: | Prolog |
Constraint Handling Rules (CHR) is a declarative, rule-based programming language, introduced in 1991 by Thom Frühwirth at the time with European Computer-Industry Research Centre (ECRC) in Munich, Germany.[1] [2] Originally intended for constraint programming, CHR finds applications in grammar induction,[3] type systems,[4] abductive reasoning, multi-agent systems, natural language processing, compilation, scheduling, spatial-temporal reasoning, testing, and verification.
A CHR program, sometimes called a constraint handler, is a set of rules that maintain a constraint store, a multi-set of logical formulas. Execution of rules may add or remove formulas from the store, thus changing the state of the program. The order in which rules "fire" on a given constraint store is non-deterministic,[5] according to its abstract semantics and deterministic (top-down rule application), according to its refined semantics.[6]
Although CHR is Turing complete,[7] it is not commonly used as a programming language in its own right. Rather, it is used to extend a host language with constraints. Prolog is by far the most popular host language and CHR is included in several Prolog implementations, including SICStus and SWI-Prolog, although CHR implementations also exist for Haskell,[8] Java, C,[9] SQL,[10] and JavaScript.[11] In contrast to Prolog, CHR rules are multi-headed and are executed in a committed-choice manner using a forward chaining algorithm.
The concrete syntax of CHR programs depends on the host language, and in fact programs embed statements in the host language that are executed to handle some rules. The host language supplies a data structure for representing terms, including logical variables. Terms represent constraints, which can be thought of as "facts" about the program's problem domain. Traditionally, Prolog is used as the host language, so its data structures and variables are used. The rest of this section uses a neutral, mathematical notation that is common in the CHR literature.
A CHR program, then, consists of rules that manipulate a multi-set of these terms, called the constraint store. Rules come in three types:[5]
h1,...,hn\Longleftrightarrowg1,...,gm|b1,...,bo
h1,...,hn
g1,...,gm
b1,...,bo
h1,...,hn\Longrightarrowg1,...,gm|b1,...,bo
h1,...,h\ell\backslashh\ell+1,...,hn\Longleftrightarrowg1,...,gm|b1,...,bo
\ell
\backslash
n-\ell
Since simpagation rules subsume simplification and propagation, all CHR rules follow the format
Hk\backslashHr\LongleftrightarrowG|B
where each of
Hk,Hr,G,B
Hk,Hr
B
G
Hk,Hr
The host language must also define built-in constraints over terms. The guards in rules are built-in constraints, so they effectively execute host language code. The built-in constraint theory must include at least true
(the constraint that always holds), fail
(the constraint that never holds, and is used to signal failure) and equality of terms, i.e., unification.[7] When the host language does not support these features, they must be implemented along with CHR.[9]
Execution of a CHR program starts with an initial constraint store. The program then proceeds by matching rules against the store and applying them, until either no more rules match (success) or the fail
constraint is derived. In the former case, the constraint store can be read off by a host language program to look for facts of interest. Matching is defined as "one-way unification": it binds variables only on one side of the equation. Pattern matching can be easily implemented when as unification when the host language supports it.[9]
The following CHR program, in Prolog syntax, contains four rules that implement a solver for a less-or-equal constraint. The rules are labeled for convenience (labels are optional in CHR).
All three rules are implicitly universally quantified (upper-cased identifiers are variables in Prolog syntax). The idempotence rule is a tautology from the logical viewpoint, but has a purpose in the second reading of the program.
The second way to read the above is as a computer program for maintaining a constraint store, a collection of facts (constraints) about objects. The constraint store is not part of this program, but must be supplied separately. The rules express the following rules of computation:
Given the query
A leq B, B leq C, C leq A
the following transformations may occur:
Current constraints | Rule applicable to constraints | Conclusion from rule application | |
---|---|---|---|
A leq B, B leq C, C leq A | transitivity | A leq C | |
A leq B, B leq C, C leq A, A leq C | antisymmetry | A = C | |
A leq B, B leq A, A = C | antisymmetry | A = B | |
A = B, A = C | none |
The transitivity rule adds A leq C
. Then, by applying the antisymmetry rule, A leq C
and C leq A
are removed and replaced by A = C
. Now the antisymmetry rule becomes applicable on the first two constraints of the original query. Now all CHR constraints are eliminated, so no further rules can be applied, and the answer A = B, A = C
is returned: CHR has correctly inferred that all three variables must refer to the same object.
To decide which rule should "fire" on a given constraint store, a CHR implementation must use some pattern matching algorithm. Candidate algorithms include RETE and TREAT,[12] but most implementation use a lazy algorithm called LEAPS.[13]
The original specification of CHR's semantics was entirely non-deterministic, but the so-called "refined operation semantics" of Duck et al. removed much of the non-determinism so that application writers can rely on the order of execution for performance and correctness of their programs.[5] [14]
Most applications of CHRs require that the rewriting process be confluent; otherwise the results of searching for a satisfying assignment will be nondeterministic and unpredictable. Establishing confluence is usually done by way of the following three properties: