Dangling else explained

The dangling else is a problem in programming of parser generators in which an optional else clause in an if–then(–else) statement results in nested conditionals being ambiguous. Formally, the reference context-free grammar of the language is ambiguous, meaning there is more than one correct parse tree.

In many programming languages one may write conditionally executed code in two forms: the if-then form, and the if-then-else form – the else clause is optional: if a then s if b then s1 else s2

This gives rise to an ambiguity in interpretation when there are nested statements, specifically whenever an if-then form appears as s1 in an if-then-else form:

if a then if b then s else s2In this example, s is unambiguously executed when a is true and b is true, but one may interpret s2 as being executed when a is false (thus attaching the else to the first if) or when a is true and b is false (thus attaching the else to the second if). In other words, one may see the previous statement as either of the following expressions: if a then (if b then s) else s2 if a then (if b then s else s2)

The dangling else problem dates to ALGOL 60,[1] and has been resolved in various ways in subsequent languages. In LR parsers, the dangling else is the archetypal example of a shift-reduce conflict.

Avoiding ambiguity while keeping the syntax

This is a problem that often comes up in compiler construction, especially scannerless parsing. The convention when dealing with the dangling else is to attach the else to the nearby if statement,[2] allowing for unambiguous context-free grammars, in particular. Programming languages like Pascal,[3] C[4] and Java[5] follow this convention, so there is no ambiguity in the semantics of the language, though the use of a parser generator may lead to ambiguous grammars. In these cases alternative grouping is accomplished by explicit blocks, such as begin...end in Pascal[6] and {...} in C.

Depending on the compiler construction approach, one may take different corrective actions to avoid ambiguity:

Avoiding ambiguity by changing the syntax

The problem can also be solved by making explicit the link between an else and its if, within the syntax. This usually helps avoid human errors.[7]

Possible solutions are:

Examples

Concrete examples follow.

C

In C, the grammar reads, in part:

 statement = ...
    | selection-statement

 selection-statement = ...
    | IF (expression) statement
    | IF (expression) statement ELSE statement

Thus, without further rules, the statementif (a) if (b) s; else s2;could ambiguously be parsed as if it were either:if (a)or:if (a)else s2;The C standard clarifies that an else block is associated with the nearest if. Therefore, the first tree is chosen.

Avoiding the conflict in LR parsers

The above example could be rewritten in the following way to remove the ambiguity :

statement: open_statement
         | closed_statement
         ;

open_statement: IF '(' expression ')' statement
              | IF '(' expression ')' closed_statement ELSE open_statement
              ;

closed_statement: non_if_statement
                | IF '(' expression ')' closed_statement ELSE closed_statement
                ;

non_if_statement: ...
                ;

Any other statement-related grammar rules may also have to be duplicated in this way if they may directly or indirectly end with a statement or selection-statement non-terminal.

However, we give grammar that includes both of if and while statements.

statement: open_statement
         | closed_statement
         ;

open_statement: IF '(' expression ')' statement
              | IF '(' expression ')' closed_statement ELSE open_statement
              | WHILE '(' expression ')' open_statement
              ;

closed_statement: simple_statement
                | IF '(' expression ')' closed_statement ELSE closed_statement
                | WHILE '(' expression ')' closed_statement
                ;

simple_statement: ...
                ;

Finally, we give the grammar that forbids ambiguous IF statements.

statement: open_statement
         | closed_statement
         ;

open_statement: IF '(' expression ')' statement
              | IF '(' expression ')' closed_statement ELSE open_statement
              | WHILE '(' expression ')' open_statement
              ;

closed_statement: simple_statement
                | IF '(' expression ')' closed_statement ELSE closed_statement
                | WHILE '(' expression ')' closed_statement
                ;

simple_statement: ...
                ;

With this grammar the statement if (a) if (b) c else d can only be parsed one way, because the other interpretation (if (a) {if (b) c} else d) is produced as

statement
open_statement
IF '(' expression ')' closed_statement ELSE open_statement
'if' '(' 'a' ')' closed_statement 'else' 'd'
and then the parsing fails trying to match closed_statement to "if (b) c". An attempt with closed_statement fails in the same way. The other parse, if (a) {if (b) c else d}) succeeds:
statement
open_statement
IF '(' expression ')' statement
IF '(' expression ')' closed_statement
IF '(' a ')' (IF '(' expression ')' closed_statement ELSE closed_statement)
IF '(' a ')' (IF '(' b ')' c ELSE 'd')

See also

Notes and References

  1. Abrahams . P. W. . A final solution to the Dangling else of ALGOL 60 and related languages . 10.1145/365813.365821 . Communications of the ACM . 9 . 9 . 679–682 . 1966 . 6777841 . free .
  2. Book: Bison 3.7.6 . https://www.gnu.org/software/bison/manual/html_node/Shift_002fReduce.html#Shift_002fReduce. 5.2 Shift/Reduce Conflicts . . 2021-08-07.
  3. ISO 7185:1990 (Pascal) 6.8.3.4: An if-statement without an else-part shall not be immediately followed by the token else.
  4. http://www.open-std.org/jtc1/sc22/wg14/www/standards ISO 9899
  5. Web site: The Java Language Specification, Java SE 9 Edition, 14.5. Statements. jls-14.5.
  6. Book: Dale . Nell B. . Weems . Chip . Introduction to Pascal and Structured Design. Dangling Else . 160–161 . November 1996 . Jones & Bartlett Learning . 9780763703974.
  7. https://code.google.com/p/copute/issues/detail?id=21#c2 Ambiguity of dangling else: non-context-free grammars are semantically opaque
  8. 4.5.1 Conditional Statements — Syntax in P. Nauer (ed.), Revised Report on the Algorithmic Language ALGOL 60, CACM 6,1, 1963 pp. 1-17
  9. https://code.google.com/p/copute/issues/detail?id=21#c1 Ambiguity of dangling else: require braces when else follows if