In computer programming, a common challenge facing systems programmers is that before an operation can be performed, a number of conditions must first be checked to confirm that the operation can be successfully performed. For example, before data can be written to a file, it must be confirmed that 1) the program has the file open for writing; 2) the program has the necessary permissions to write the data; 3) the data to be written is available; 4) the data to be written is of a valid size. A failure at any of these steps means that the write operation cannot be completed and an error should be returned to the calling program.
There are several ways that these multiple required tests can be written in the source code. One way is to check each condition in turn and if a condition fails, return from the subroutine at that point, indicating an error condition exists. This style of coding has the disadvantage that the subroutine returns from multiple (possibly many) points and some coding standards discourage having multiple return points.
Another way to is to check each condition and if the condition succeeds, enter a deeper block of code that checks the next condition and so on. The deepest enclosing block of code is only reached if all of the precondition tests are successful. This style of coding has the disadvantage that the indentation level increases with every test performed. If many tests are required, the enclosed blocks of code can march off the page to the right margin. This typographical effect is referred to as the pyramid of doom.
For example, the pyramid of doom is commonly seen when checking for null pointers or handling callbacks.[1] Two examples of the term are related to a particular programming style in early versions of JavaScript,[2] and the nesting of if statements that occurs in object-oriented programming languages when one of the objects may be a null pointer.[3] [4]
Most modern object-oriented programming languages use a coding style known as dot notation that allows multiple method calls to be written in a single line of code, each call separated by a period. For instance:
This code contains four different instructions; it first looks in the collection of windows for a window with the name "Main", then looks in that window's views collection for the 5th subview within it, then calls the size
method to return a structure with the view's dimensions, and finally calls the width
method on that structure to produce a result that is assigned to a variable name theWidth
.
The problem with this approach is that the code assumes that all of these values exist. While it is reasonable to expect that a window will have a size and that size will have a width, it is not at all reasonable to assume that a window named "Main" will exist, nor that it has five subviews. If either of those assumptions is wrong, the corresponding call will result in a null pointer error.
To avoid this error, the programmer has to check every method call to ensure it returns a value. A safer version of the same code would be:
If the programmer wishes to use that value based on whether or not it exists and is valid, the functional code inside the if
statements is all pushed to the right, making it difficult to read longer lines. This often leads to attempts to "flatten" the code:
Or alternatively:
This sort of programming construct is very common and a number of programming languages have added some sort of syntactic sugar to address this. For instance, Apple's Swift added the concept of optional chaining in if statements[5] while Microsoft's C# 6.0 and Visual Basic 14 added the null-conditional operators ?.
and ?[
for member access and indexing, respectively.[6] [7] [8] JavaScript added support for the optional chaining operator in 2020.[9] The basic idea is to allow a string of method calls to immediately return null if any of its members is null, so for instance:theWidth
if either "Main" or the fifth subview is missing, or complete the statement and return the width if they are both valid. There are many times where the programmer wants to take different actions in these two cases, so Swift adds another form of syntactic sugar for this role, the if let
statement, also known as "optional binding":
Pyramid of Doom can usually be resolved in any language by simply breaking up the code into multiple nested functions (or other groupings). For instance, instead of:
You can break up the functionality like this:
CC
main
Similarly, data structures can be broken up by levels, when similar pyramids occur.
Not only is the Pyramid of Doom solved, but it's better practice to not have large, complicated functions; smaller ones are easier for a programer to write without erring, easier to read, and easier to verify.Choosing function names for each of these levels will also help the author clarify to readers what is done where.Typically, each level doesn't need many connections to levels that are far away, so separating them out is easy.If there are such connections, the author can re-think their design to something more reliable, because this is a fertile source of bugs.