ITP method explained

In numerical analysis, the ITP method, short for Interpolate Truncate and Project, is the first root-finding algorithm that achieves the superlinear convergence of the secant method^[1] while retaining the optimal^[2] worst-case performance of the bisection method.^[3] It is also the first method with guaranteed average performance strictly better than the bisection method under any continuous distribution. In practice it performs better than traditional interpolation and hybrid based strategies (Brent's Method, Ridders, Illinois), since it not only converges super-linearly over well behaved functions but also guarantees fast performance under ill-behaved functions where interpolations fail.

The ITP method follows the same structure of standard bracketing strategies that keeps track of upper and lower bounds for the location of the root; but it also keeps track of the region where worst-case performance is kept upper-bounded. As a bracketing strategy, in each iteration the ITP queries the value of the function on one point and discards the part of the interval between two points where the function value shares the same sign. The queried point is calculated with three steps: it interpolates finding the regula falsi estimate, then it perturbes/truncates the estimate (similar to) and then projects the perturbed estimate onto an interval in the neighbourhood of the bisection midpoint. The neighbourhood around the bisection point is calculated in each iteration in order to guarantee minmax optimality (Theorem 2.1 of). The method depends on three hyper-parameters

\kappa_1\in(0,infty),\kappa₂\in\left[1,1+\phi\right)

and

n_{0\in[0,infty)}

where

\phi

is the golden ratio

\tfrac{1}{2}(1+\sqrt{5})

: the first two control the size of the truncation and the third is a slack variable that controls the size of the interval for the projection step.

Root finding problem

Given a continuous function

defined from

[a,b]

such that

f(a)f(b)\leq0

, where at the cost of one query one can access the values of

f(x)

on any given

. And, given a pre-specified target precision

\epsilon>0

, a root-finding algorithm is designed to solve the following problem with the least amount of queries as possible:

Problem Definition: Find

\hat{x}

such that
|\hat{x}-x^*|\leq\epsilon

, where
x^*

satisfies
f(x^*)=0

.

This problem is very common in numerical analysis, computer science and engineering; and, root-finding algorithms are the standard approach to solve it. Often, the root-finding procedure is called by more complex parent algorithms within a larger context, and, for this reason solving root problems efficiently is of extreme importance since an inefficient approach might come at a high computational cost when the larger context is taken into account. This is what the ITP method attempts to do by simultaneously exploiting interpolation guarantees as well as minmax optimal guarantees of the bisection method that terminates in at most

n_1/2\equiv\lceillog_2((b_0-a_{0)/2\epsilon)\rceil}

iterations when initiated on an interval

[a_0,b_0]

The method

Given

\kappa_1\in(0,infty),\kappa₂\in\left[1,1+\phi\right)

n_1/2\equiv\lceillog_2((b_0-a_{0)/2\epsilon)\rceil}

and

n_{0\in[0,infty)}

where

\phi

is the golden ratio

\tfrac{1}{2}(1+\sqrt{5})

, in each iteration

j=0,1,2...

the ITP method calculates the point

x_ITP

following three steps:

[Interpolation Step] Calculate the bisection and the regula falsi points:

x_1/2\equiv

	a+b
	2

and

x_f\equiv

	bf(a)-af(b)
	f(a)-f(b)

;

[Truncation Step] Perturb the estimator towards the center:

x_t\equivx_f+\sigma\delta

where

\sigma\equivsign(x_1/2-x_f)

and

\delta\equiv

	\kappa₂
min\{\kappa
	1\|b-a\|

,|x_1/2-x_f|\}

;

[Projection Step] Project the estimator to minmax interval:

x_ITP\equivx_1/2-\sigma\rho_k

where

\rho_k\equivmin\left\{\epsilon

	n_1/2+n_0-j
2

	b-a
	2

,|x_t-x_1/2|\right\}

.The value of the function

f(x_ITP)

on this point is queried, and the interval is then reduced to bracket the root by keeping the sub-interval with function values of opposite sign on each end.

The algorithm

The following algorithm (written in pseudocode) assumes the initial values of

y_a

and

y_b

are given and satisfy

y_a<0<y_b

where

y_a\equivf(a)

and

y_b\equivf(b)

; and, it returns an estimate

\hat{x}

that satisfies

|\hat{x}-x^*|\leq\epsilon

in at most

n_1/2+n₀

function evaluations. Input:
a,b,\epsilon,\kappa_1,\kappa_2,n_0,f

Preprocessing: n_1/2=\lceillog_{2\tfrac{b-a}{2\epsilon}\rceil}

, n_max=n_1/2+n₀

, and j=0

; While (
b-a>2\epsilon

) Calculating Parameters: x_1/2=\tfrac{a+b}{2}

, r=\epsilon

n_max-j
2

-(b-a)/2

, \delta=

\kappa₂
\kappa
1(b-a)

; Interpolation: x_f=\tfrac{y_ba-y_ab}{y_b-y_a}

; Truncation: \sigma=sign(x_1/2-x_f)

; If \delta\leq|x_1/2-x_f|

then x_t=x_f+\sigma\delta

, Else x_t=x_1/2

; Projection: If |x_t-x_1/2|\leqr

then x_ITP=x_t

, Else x_ITP=x_1/2-\sigmar

; Updating Interval: y_ITP=f(x_ITP)

; If y_ITP>0

then b=x_ITP

and y_b=y_ITP

, Elseif y_ITP<0

then a=x_ITP

and y_a=y_ITP

, Else a=x_ITP

and b=x_ITP

; j=j+1

; Output:
\hat{x}=\tfrac{a+b}{2}

Example: Finding the root of a polynomial

Suppose that the ITP method is used to find a root of the polynomial

f(x)=x³-x-2.

Using

\epsilon=0.0005,\kappa₁=0.1,\kappa₂=2

and

n₀=1

we find that:

Iteration	a_n	b_n	c_n	f(c_n)
1	1	2	1.43333333333333	-0.488629629629630
2	1.43333333333333	2	1.52713145056966	0.0343383329048983
3	1.43333333333333	1.52713145056966	1.52009281150978	style="text-align: right;"	-0.00764147709265051
4	1.52009281150978	1.52713145056966	1.52137899116052	style="text-align: right;"	-4.25363464540141e-06
5	1.52137899116052	1.52713145056966	1.52138301273268	1.96497878177659e-05
6	1.52137899116052	1.52138301273268	← Stopping Criteria Satisfied

This example can be compared to . The ITP method required less than half the number of iterations than the bisection to obtain a more precise estimate of the root with no cost on the minmax guarantees. Other methods might also attain a similar speed of convergence (such as Ridders, Brent etc.) but without the minmax guarantees given by the ITP method.

Analysis

The main advantage of the ITP method is that it is guaranteed to require no more iterations than the bisection method when

n₀=0

. And so its average performance is guaranteed to be better than the bisection method even when interpolation fails. Furthermore, if interpolations do not fail (smooth functions), then it is guaranteed to enjoy the high order of convergence as interpolation based methods.

Worst case performance

Because the ITP method projects the estimator onto the minmax interval with a

n₀

slack, it will require at most

n_1/2+n₀

iterations (Theorem 2.1 of). This is minmax optimal like the bisection method when

n₀

is chosen to be

n₀=0

Average performance

Because it does not take more than

n_1/2+n₀

iterations, the average number of iterations will always be less than that of the bisection method for any distribution considered when

n₀=0

(Corollary 2.2 of).

Asymptotic performance

If the function

f(x)

is twice differentiable and the root

x^*

is simple, then the intervals produced by the ITP method converges to 0 with an order of convergence of

\sqrt{\kappa_2}

n₀ ≠ 0

or if

n₀=0

and

(b-a)/\epsilon

is not a power of 2 with the term

\tfrac{\epsilon

	n_1/2
2

} not too close to zero (Theorem 2.3 of).

Software

The itp contributed package in R.

External links

An Improved Bisection Method, by Kudos

Notes and References

Book: Argyros. I. K.. Hernández-Verón. M. A.. Rubio. M. J.. On the Convergence of Secant-Like Methods. 2019. Current Trends in Mathematical Analysis and Its Interdisciplinary Applications. http://springer.nl.go.kr/chapter/10.1007/978-3-030-15242-0_5. en. 141–183. 10.1007/978-3-030-15242-0_5. 978-3-030-15241-3. 202156085.
Sikorski. K.. 1982-02-01. Bisection is optimal. Numerische Mathematik. en. 40. 1. 111–117. 10.1007/BF01459080. 119952605. 0945-3245.
Oliveira. I. F. D.. Takahashi. R. H. C.. 2020-12-06. An Enhancement of the Bisection Method Average Performance Preserving Minmax Optimality. ACM Transactions on Mathematical Software. 47. 1. 5:1–5:24. 10.1145/3423597. 230586635 . 0098-3500.

ITP method explained

Root finding problem

The method

The algorithm

Example: Finding the root of a polynomial

Analysis

Worst case performance

Average performance

Asymptotic performance

Software

See also

External links

Notes and References