In mathematics, the bisection method is a root-finding method that applies to any continuous function for which one knows two values with opposite signs. The method consists of repeatedly bisecting the interval defined by these values and then selecting the subinterval in which the function changes sign, and therefore must contain a root. It is a very simple and robust method, but it is also relatively slow. Because of this, it is often used to obtain a rough approximation to a solution which is then used as a starting point for more rapidly converging methods. The method is also called the interval halving method,[1] the binary search method, or the dichotomy method.[2]
For polynomials, more elaborate methods exist for testing the existence of a root in an interval (Descartes' rule of signs, Sturm's theorem, Budan's theorem). They allow extending the bisection method into efficient algorithms for finding all real roots of a polynomial; see Real-root isolation.
The method is applicable for numerically solving the equation f(x) = 0 for the real variable x, where f is a continuous function defined on an interval [''a'', ''b''] and where f(a) and f(b) have opposite signs. In this case a and b are said to bracket a root since, by the intermediate value theorem, the continuous function f must have at least one root in the interval (a, b).
At each step the method divides the interval in two parts/halves by computing the midpoint c = (a+b) / 2 of the interval and the value of the function f(c) at that point. If c itself is a root then the process has succeeded and stops. Otherwise, there are now only two possibilities: either f(a) and f(c) have opposite signs and bracket a root, or f(c) and f(b) have opposite signs and bracket a root.[3] The method selects the subinterval that is guaranteed to be a bracket as the new interval to be used in the next step. In this way an interval that contains a zero of f is reduced in width by 50% at each step. The process is continued until the interval is sufficiently small.
Explicitly, if f(c)=0 then c may be taken as the solution and the process stops. Otherwise, if f(a) and f(c) have opposite signs, then the method sets c as the new value for b, and if f(b) and f(c) have opposite signs then the method sets c as the new a. In both cases, the new f(a) and f(b) have opposite signs, so the method is applicable to this smaller interval.[4]
The input for the method is a continuous function f, an interval [''a'', ''b''], and the function values f(a) and f(b). The function values are of opposite sign (there is at least one zero crossing within the interval). Each iteration performs these steps:
When implementing the method on a computer, there can be problems with finite precision, so there are often additional convergence tests or limits to the number of iterations. Although f is continuous, finite precision may preclude a function value ever being zero. For example, consider ; there is no floating-point value approximating that gives exactly zero. Additionally, the difference between a and b is limited by the floating point precision; i.e., as the difference between a and b decreases, at some point the midpoint of will be numerically identical to (within floating point precision of) either a or b.
The method may be written in pseudocode as follows:[5] input: Function f, endpoint values a, b, tolerance TOL, maximum iterations NMAX conditions: a < b, either f(a) < 0 and f(b) > 0 or f(a) > 0 and f(b) < 0 output: value which differs from a root of f(x) = 0 by less than TOL N ← 1 while N ≤ NMAX do // limit iterations to prevent infinite loop c ← (a + b)/2 // new midpoint if f(c) = 0 or (b – a)/2 < TOL then // solution found Output(c) Stop end if N ← N + 1 // increment step counter if sign(f(c)) = sign(f(a)) then a ← c else b ← c // new interval end while Output("Method failed.") // max number of steps exceeded
Suppose that the bisection method is used to find a root of the polynomial
f(x)=x3-x-2.
a
b
f(a)
f(b)
a=1
b=2
f(1)=(1)3-(1)-2=-2
f(2)=(2)3-(2)-2=+4.
In the first iteration, the end points of the interval which brackets the root are
a1=1
b1=2
c1=
2+1 | |
2 |
=1.5
f(c1)=(1.5)3-(1.5)-2=-0.125
f(c1)
a=1
a=1.5
f(a)
f(b)
a
b
Iteration | an | bn | cn | f(cn) | |
---|---|---|---|---|---|
1 | 1 | 2 | 1.5 | −0.125 | |
2 | 1.5 | 2 | 1.75 | 1.6093750 | |
3 | 1.5 | 1.75 | 1.625 | 0.6660156 | |
4 | 1.5 | 1.625 | 1.5625 | 0.2521973 | |
5 | 1.5 | 1.5625 | 1.5312500 | 0.0591125 | |
6 | 1.5 | 1.5312500 | 1.5156250 | −0.0340538 | |
7 | 1.5156250 | 1.5312500 | 1.5234375 | 0.0122504 | |
8 | 1.5156250 | 1.5234375 | 1.5195313 | −0.0109712 | |
9 | 1.5195313 | 1.5234375 | 1.5214844 | 0.0006222 | |
10 | 1.5195313 | 1.5214844 | 1.5205078 | −0.0051789 | |
11 | 1.5205078 | 1.5214844 | 1.5209961 | −0.0022794 | |
12 | 1.5209961 | 1.5214844 | 1.5212402 | −0.0008289 | |
13 | 1.5212402 | 1.5214844 | 1.5213623 | −0.0001034 | |
14 | 1.5213623 | 1.5214844 | 1.5214233 | 0.0002594 | |
15 | 1.5213623 | 1.5214233 | 1.5213928 | 0.0000780 |
After 13 iterations, it becomes apparent that there is a convergence to about 1.521: a root for the polynomial.
The method is guaranteed to converge to a root of f if f is a continuous function on the interval [''a'', ''b''] and f(a) and f(b) have opposite signs. The absolute error is halved at each step so the method converges linearly. Specifically, if c1 = is the midpoint of the initial interval, and cn is the midpoint of the interval in the nth step, then the difference between cn and a solution c is bounded by[6]
|c | ||||
|
.
n\len1/2\equiv
\left\lceillog | ||||
|
\right)\right\rceil,
\epsilon0=|b-a|
\epsilon\le\epsilon0.
\epsilon
However, despite the bisection method being optimal with respect to worst case performance under absolute error criteria it is sub-optimal with respect to average performance under standard assumptions [9] [10] as well as asymptotic performance.[11] Popular alternatives to the bisection method, such as the secant method, Ridders' method or Brent's method (amongst others), typically perform better since they trade-off worst case performance to achieve higher orders of convergence to the root. And, a strict improvement to the bisection method can be achieved with a higher order of convergence without trading-off worst case performance with the ITP Method.[12]
The bisection method has been generalized to multi-dimensional functions. Such methods are called generalized bisection methods.[13] [14]
Some of these methods are based on computing the topological degree, which for a bounded region
\Omega\subseteqRn
f:Rn → Rn
\deg(f,\Omega):=
\sum | |
y\inf-1(0) |
sgn\det(Df(y))
Df(y)
0=(0,0,...,0)T
sgn(x)=\begin{cases} 1,&x>0\\ 0,&x=0\\ -1,&x<0\\ \end{cases}
\deg(f,\Omega) ≠ 0
\Omega
The characteristic bisection method uses only the signs of a function in different points. Lef f be a function from Rd to Rd, for some integer d ≥ 2. A characteristic polyhedron[17] (also called an admissible polygon)[18] of f is a polytope in Rd, having 2d vertices, such that in each vertex v, the combination of signs of f(v) is unique and the topological degree of f on its interior is not zero (a necessary criterion to ensure the existence of a root).[19] For example, for d=2, a characteristic polyhedron of f is a quadrilateral with vertices (say) A,B,C,D, such that:
A proper edge of a characteristic polygon is a edge between a pair of vertices, such that the sign vector differs by only a single sign. In the above example, the proper edges of the characteristic quadrilateral are AB, AC, BD and CD. A diagonal is a pair of vertices, such that the sign vector differs by all d signs. In the above example, the diagonals are AD and BC.
At each iteration, the algorithm picks a proper edge of the polyhedron (say, AB), and computes the signs of f in its mid-point (say, M). Then it proceeds as follows:
Suppose the diameter (= length of longest proper edge) of the original characteristic polyhedron is . Then, at least
log2(D/\varepsilon)