The dead-end elimination algorithm (DEE) is a method for minimizing a function over a discrete set of independent variables. The basic idea is to identify "dead ends", i.e., combinations of variables that are not necessary to define a global minimum because there is always a way of replacing such combination by a better or equivalent one. Then we can refrain from searching such combinations further. Hence, dead-end elimination is a mirror image of dynamic programming, in which "good" combinations are identified and explored further.
Although the method itself is general, it has been developed and applied mainly to the problems of predicting and designing the structures of proteins (and in this wise was cited in the Scientific Background to the 2024 Nobel Prize in Chemistry).[1] It closely related to the notion of dominance in optimization also known as substitutability in a Constraint Satisfaction Problem. The original description and proof of the dead-end elimination theorem can be found in .
An effective DEE implementation requires four pieces of information:
Note that the criteria can easily be reversed to identify the maximum of a given function as well.
Dead-end elimination has been used effectively to predict the structure of side chains on a given protein backbone structure by minimizing an energy function
E
In the following discussion, let
N
rk
kth |
ETOT=\sumkEk(rk)+\sumkEkl(rk,rl)
Where
Ek(rk)
rk
Ekl(rk,rl)
rk,rj
Also note that
Ekk
A | |
(r | |
k |
,
A | |
r | |
k |
)
If a particular rotamer
A | |
r | |
k |
k
B | |
r | |
k |
Ek
A | |
(r | |
k |
)+
N | |
\sum | |
l=1 |
minXEkl
A | |
(r | |
k |
,
X | |
r | |
l |
)>Ek
B | |
(r | |
k |
)+
N | |
\sum | |
l=1 |
maxXEkl
B | |
(r | |
k |
,
X | |
r | |
l |
)
where
minXEkl
A | |
(r | |
k |
,
X | |
r | |
l |
)
A | |
r | |
k |
k
l
maxXEkl
B | |
(r | |
k |
,
X | |
r | |
l |
)
B | |
r | |
k |
k
l
The pairs criterion is more difficult to describe and to implement, but it adds significant eliminating power. For brevity, we define the shorthand variable
AB | |
U | |
kl |
A
B
k
l
AB | |
U | |
kl |
\stackrel{def
A given pair of rotamers
A
B
k
l
C
D
AB | |
U | |
kl |
+
N | |
\sum | |
i=1 |
minX\left(Eki
A | |
(r | |
k |
,
X | |
r | |
i |
)+Elj
B | |
(r | |
l |
,
X | |
r | |
j |
)\right)>
CD | |
U | |
kl |
+
N | |
\sum | |
i=1 |
maxX\left(Eki
C | |
(r | |
k |
,
X | |
r | |
i |
)+Elj
D | |
(r | |
l |
,
X | |
r | |
j |
)\right)
where
A ≠ C
B ≠ D
k ≠ l
For large
N
N
p
p
Np
rk
rl
p
p x p
N2p2
The above two criteria are normally applied iteratively until convergence, defined as the point at which no more rotamers or pairs can be eliminated. Since this is normally a reduction in the sample space by many orders of magnitude, simple enumeration will suffice to determine the minimum within this pared-down set.
Given this model, it is clear that the DEE algorithm is guaranteed to find the optimal solution; that is, it is a global optimization process. The single-rotamer search scales quadratically in time with total number of rotamers. The pair search scales cubically and is the slowest part of the algorithm (aside from energy calculations). This is a dramatic improvement over the brute-force enumeration which scales as
O(pN)
A large-scale benchmark of DEE compared with alternative methods of protein structure prediction and design finds that DEE reliably converges to the optimal solution for protein lengths for which it runs in a reasonable amount of time. It significantly outperforms the alternatives under consideration, which involved techniques derived from mean field theory, genetic algorithms, and the Monte Carlo method. However, the other algorithms are appreciably faster than DEE and thus can be applied to larger and more complex problems; their relative accuracy can be extrapolated from a comparison to the DEE solution within the scope of problems accessible to DEE.
See main article: Protein design. The preceding discussion implicitly assumed that the rotamers
rk
k
More powerful and more general criteria have been introduced that improve both the efficiency and the eliminating power of the method for both prediction and design applications. One example is a refinement of the singles elimination criterion known as the Goldstein criterion, which arises from fairly straightforward algebraic manipulation before applying the minimization:
Ek
A | |
(r | |
k |
)-Ek
B | |
(r | |
k |
)+
N | |
\sum | |
l=1 |
minX\left(Ekl
A | |
(r | |
k |
,
X | |
r | |
l |
)-Ekl
B | |
(r | |
k |
,
X | |
r | |
l |
)\right)>0
Thus rotamer
A | |
r | |
k |
rk
A | |
r | |
k |
A | |
r | |
k |
An extended discussion of elaborate DEE criteria and a benchmark of their relative performance can be found in .