Method of averaging explained

In mathematics, more specifically in dynamical systems, the method of averaging (also called averaging theory) exploits systems containing time-scales separation: a fast oscillation versus a slow drift. It suggests that we perform an averaging over a given amount of time in order to iron out the fast oscillations and observe the qualitative behavior from the resulting dynamics. The approximated solution holds under finite time inversely proportional to the parameter denoting the slow time scale. It turns out to be a customary problem where there exists the trade off between how good is the approximated solution balanced by how much time it holds to be close to the original solution.

More precisely, the system has the following form $\dot = \varepsilon f(x,t, \varepsilon), \quad 0 \leq \varepsilon \ll 1$ of a phase space variable

The fast oscillation is given by

versus a slow drift of

•

. The averaging method yields an autonomous dynamical system

\dot= \varepsilon \int _^f(y, s, 0)~ds =: \varepsilon (y)

which approximates the solution curves of

•

inside a connected and compact region of the phase space and over time of

1/\varepsilon

Under the validity of this averaging technique, the asymptotic behavior of the original system is captured by the dynamical equation for

. In this way, qualitative methods for autonomous dynamical systems may be employed to analyze the equilibria and more complex structures, such as slow manifold and invariant manifolds, as well as their stability in the phase space of the averaged system.

In addition, in a physical application it might be reasonable or natural to replace a mathematical model, which is given in the form of the differential equation for

•

, with the corresponding averaged system

•

, in order to use the averaged system to make a prediction and then test the prediction against the results of a physical experiment.^[1]

The averaging method has a long history, which is deeply rooted in perturbation problems that arose in celestial mechanics (see, for example in ^[2]).

First example

Consider a perturbed logistic growth $\dot = \varepsilon (x (1 - x) + \sin) \quad \quad x \in \R, \quad 0 \leq \varepsilon \ll 1,$ and the averaged equation $\dot = \varepsilon y (1 - y) \qquad y \in \R.$ The purpose of the method of averaging is to tell us the qualitative behavior of the vector field when we average it over a period of time. It guarantees that the solution

y(t)

approximates

x(t)

for times

t=l{O}(1/\varepsilon).

Exceptionally: in this example the approximation is even better, it is valid for all times. We present it in a section below.

Definitions

We assume the vector field

f:\Rⁿ x \R x \R\to\Rⁿ

to be of differentiability class

C^r

with

r\geq2

(or even we will only say smooth), which we will denote

f\inC^r(\Rⁿ x \R x \R^+;\Rⁿ⁾

. We expand this time-dependent vector field in a Taylor series (in powers of

\varepsilon

) with remainder

f^[k(x,t,\varepsilon)

. We introduce the following notation:

f(x, t, \varepsilon) = f^0(x, t) + \varepsilon f^(x, t) + \dots + \varepsilon^k f^(x, t) + \varepsilon^ f^(x, t, \varepsilon),

where

f^j=

	f^(j)(x,t,0)
	j!

is the

-th derivative with

0\leqj\leqk

. As we are concerned with averaging problems, in general

f^0(x,t)

is zero, so it turns out that we will be interested in vector fields given by

f(x, t, \varepsilon) = \varepsilon f^(x, t, \varepsilon) = \varepsilon f^(x, t) + \varepsilon^ f^(x, t, \varepsilon).

Besides, we define the following initial value problem to be in the standard form:

\dot = \varepsilon f^(x, t) + \varepsilon^ f^(x, t, \varepsilon), \qquad x(0, \varepsilon) =: x_0 \in D \subseteq \R^n, \quad 0 \leq \varepsilon \ll 1.

Theorem: averaging in the periodic case

Consider for every

D\subset\Rⁿ

connected and bounded and every

\varepsilon₀>0

there exist

L>0

and

\varepsilon\leq\varepsilon₀

such that the original system (a non-autonomous dynamical system) given by

\dot = \varepsilon f^(x, t) + \varepsilon^ f^(x, t, \varepsilon), \qquad x_0 \in D \subseteq \R^n, \quad 0 \leq \varepsilon \ll 1,

has solution

x(t,\varepsilon)

, where

f¹\inC^r(D x \R;\Rⁿ⁾

is periodic with period

and

f^[2]\inC^r(D x \R x \R^+;\Rⁿ⁾

both with

r\geq2

bounded on bounded sets. Then there exists a constant

c>0

such that the solution

y(t,\varepsilon)

of the averaged system (autonomous dynamical system) is

\dot= \varepsilon \int _^f^1(y, s)~ds =: \varepsilon^1(y), \quad y(0, \varepsilon) = x_0

\|x(t, \varepsilon) - y(t, \varepsilon)\| < c \varepsilon

for

0\leq\varepsilon\leq\varepsilon₀

and

0\leqt\leqL/\varepsilon

Remarks

There are two approximations in this what is called first approximation estimate: reduction to the average of the vector field and negligence of

l{O}(\varepsilon²⁾

terms.

Uniformity with respect to the initial condition

x₀

: if we vary

x₀

this affects the estimation of

and

. The proof and discussion of this can be found in J. Murdock's book.^[3]

Reduction of regularity: there is a more general form of this theorem which requires only

f¹

to be Lipschitz and

f^[2]

continuous. It is a more recent proof and can be seen in Sanders et al.. The theorem statement presented here is due to the proof framework proposed by Krylov-Bogoliubov which is based on an introduction of a near-identity transformation. The advantage of this method is the extension to more general settings such as infinite-dimensional systems - partial differential equation or delay differential equations.

J. Hale presents generalizations to almost periodic vector-fields.^[4]

Strategy of the proof

Krylov-Bogoliubov realized that the slow dynamics of the system determines the leading order of the asymptotic solution.

In order to proof it, they proposed a near-identity transformation, which turned out to be a change of coordinates with its own time-scale transforming the original system to the averaged one.

Sketch of the proof

Determination of a near-identity transformation: the smooth mapping

y\mapstoU(y,t,\varepsilon)=y+\varepsilonu^[1](y,t,\varepsilon)

where

u^[1]

is assumed to be regular enough and

periodic. The proposed change of coordinates is given by

x=U(y,t,\varepsilon)

Choose an appropriate

u^[1]

solving the homological equation of the averaging theory:

	\partialu^[1]
	\partialt

=f¹(y,t)-\bar{f}^1(y)

Change of coordinates carries the original system to

•

=\varepsilon\bar{f}^1(y)+\varepsilon²

	[2]
f
	*

(y,t,\varepsilon).

Estimation of error due to truncation and comparison to the original variable.

Non-autonomous class of systems: more examples

Along the history of the averaging technique, there is class of system extensively studied which give us meaningful examples we will discuss below. The class of system is given by: $\ddot + z = \varepsilon g(z, \dot, t), \qquad z \in \R,\quad z(0) = z_0 ~\mathrm~ \dot(0) = v_0,$ where

is smooth. This system is similar to a linear system with a small nonlinear perturbation given by

\begin{bmatrix}0\ \varepsilon~g(z,

•

,t)\end{bmatrix}

$\begin\dot &= z_2, & z_1(0) &= z_0 \\\dot &= -z_1 + \varepsilon g(z_1, z_2, t), & z_2(0) &= v_0,\end$ differing from the standard form. Hence there is a necessity to perform a transformation to make it in the standard form explicitly. We are able to change coordinates using variation of constants method. We look at the unperturbed system, i.e.

\varepsilon=0

, given by

\begin \dot \\ \dot \end= \begin 0 & 1 \\ -1 & 0 \end \begin z_1 \\ z_2 \end= A \begin z_1 \\ z_2 \end

\Phi(t)=e^A

corresponding to a rotation. Then the time-dependent change of coordinates is

z(t)=\Phi(t)x

where

is the coordinates respective to the standard form.

If we take the time derivative in both sides and invert the fundamental matrix we obtain $\dot = \varepsilon e^ \begin 0 \\ ~\tilde g(x, \dot, t) \end ~\text~ \tilde g(x, \dot, t)=g(\cos(t)x(t)+\sin(t)\dot x(t), -\sin(t)x(t)+\cos(t)\dot x(t), t).$

Remarks

The same can be done to time-dependent linear parts. Although the fundamental solution may be non-trivial to write down explicitly, the procedure is similar. See Sanders et al. for further details.
If the eigenvalues of

are not all purely imaginary this is called hyperbolicity condition. For this occasion, the perturbation equation may present some serious problems even whether

is bounded, since the solution grows exponentially fast. However, qualitatively, we may be able to know the asymptotic solution, such as Hartman-Grobman results and more.

Occasionally, polar coordinates may yield standard forms that are simpler to analyze. Consider

z₁=r\sin(t-\phi)~and~z₂=r\cos(t-\phi)

, which determines the initial condition

(r(0),\phi(0))

and the system

\begin \dot \\ \dot \end = \varepsilon\begin\cos(t - \phi) g(r \sin(t - \phi), r \cos(t - \phi), t) \\\frac \sin(t - \phi) g(r \sin(t - \phi), r \cos(t - \phi), t)\end.

g\inC¹

we may apply averaging so long as a neighborhood of the origin is excluded (since the polar coordinates fail):

\begin\bar_1^1(r) & = & \displaystyle \frac \int_0^ \cos(s - \phi) g(r \sin (s - \phi), r \cos(s - \phi), s) ds \\[4pt]\bar_2^1(r) & = & \displaystyle \frac \int_0^ \sin(s - \phi) g(r \sin (s - \phi), r \cos(s - \phi), s) ds,\end

where the averaged system is

\begin\dot = \varepsilon \bar_1^1 (\bar) \\\dot = \varepsilon \bar_2^1 (\bar).\end

Example: Misleading averaging results

The method contains some assumptions and restrictions. These limitations play important role when we average the original equation which is not into the standard form, and we can discuss counterexample of it. The following example in order to discourage this hurried averaging: $\ddot + 4 \varepsilon \cos^2 \dot + z = 0, \qquad z(0) = 0,\quad \dot(0) = 1,$ where we put

g(z,

•

,t)=-4\cos^2(t)

•

following the previous notation.

This systems corresponds to a damped harmonic oscillator where the damping term oscillates between

and

4\varepsilon

. Averaging the friction term over one cycle of

2\pi

yields the equation:

\ddot + 2 \varepsilon \dot + \bar = 0, \qquad \bar(0) = 0, \quad \dot(0) = 1.

The solution is

\bar(t) = \frac e^ \sin.

which the convergence rate to the origin is

\varepsilon

. The averaged system obtained from the standard form yields:

\begin\dot = - \frac\varepsilon \bar(2 + \cos(2 \bar)), ~\bar(0) = 1 \\\dot = \frac\varepsilon \sin(2 \bar), ~\bar(0) = 0,\end

which in the rectangular coordinate shows explicitly that indeed the rate of convergence to the origin is

\frac \varepsilon

differing from the previous crude averaged system:

y(t) = e^ \sin

Example: Van der Pol Equation

Van der Pol was concerned with obtaining approximate solutions for equations of the type $\ddot + \varepsilon (1 - z^2) \dot + z = 0,$ where

g(z,

•

,t)=(1-z²⁾

•

following the previous notation. This system is often called the Van der Pol oscillator. Applying periodic averaging to this nonlinear oscillator provides qualitative knowledge of the phase space without solving the system explicitly.

The averaged system is $\begin\dot = \frac\varepsilon \bar(1 - \frac \bar^2) \\\dot = 0,\end$ and we can analyze the fixed points and their stability. There is an unstable fixed point at the origin and a stable limit cycle represented by

\bar{r}=2

The existence of such stable limit-cycle can be stated as a theorem.

Theorem (Existence of a periodic orbit)^[5] : If

p₀

is a hyperbolic fixed point of

\dot= \varepsilon ^1(y)

Then there exists

\varepsilon₀>0

such that for all

0<\varepsilon<\varepsilon₀

\dot = \varepsilon f^(x, t) + \varepsilon^ f^(x, t, \varepsilon)

has a unique hyperbolic periodic orbit

\gamma_\varepsilon(t)=p₀+l{O}(\varepsilon)

of the same stability type as

p₀

The proof can be found at Guckenheimer and Holmes, Sanders et al. and for the angle case in Chicone.

Example: Restricting the time interval

The average theorem assumes existence of a connected and bounded region

D\subset\Rⁿ

which affects the time interval

of the result validity. The following example points it out. Consider the

\ddot + z = 8 \varepsilon \cos \dot^2, ~ z(0) = 0,~ \dot(0) = 1,

where

g(z,

•

,t)=8

•

²\cos(t)

. The averaged system consists of

\begin\dot = 3 \varepsilon \bar^2\cos(\bar), ~\bar(0) = 1 \\\dot = -\varepsilon \bar \sin(\bar), ~\bar(0) = 0,\end

which under this initial condition indicates that the original solution behaves like

z(t) = \frac + \mathcal(\varepsilon),

where it holds on a bounded region over

0\leq\varepsilont\leqL<

	1
	3

Damped Pendulum

Consider a damped pendulum whose point of suspension is vibrated vertically by a small amplitude, high frequency signal (this is usually known as dithering). The equation of motion for such a pendulum is given by $m(l\ddot - ak\omega^2 \sin \omega t \sin \theta) = -mg \sin \theta - k(l\dot + a\omega \cos \omega t \sin \theta)$ where

a\sin\omegat

describes the motion of the suspension point,

describes the damping of the pendulum, and

\theta

is the angle made by the pendulum with the vertical.

The phase space form of this equation is given by $\begin\dot t &= 1 \\\dot\theta &= p \\\dot p &= \frac (mak\omega^2 \sin\omega t \sin \theta - mg\sin\theta - k(l p + a\omega \cos\omega t \sin \theta))\end$ where we have introduced the variable

and written the system as an autonomous, first-order system in

(t,\theta,p)

-space.

Suppose that the angular frequency of the vertical vibrations,

\omega

, is much greater than the natural frequency of the pendulum,

\sqrt

. Suppose also that the amplitude of the vertical vibrations,

, is much less than the length

of the pendulum. The pendulum's trajectory in phase space will trace out a spiral around a curve

, moving along

at the slow rate

\sqrt{g/l}

but moving around it at the fast rate

\omega

. The radius of the spiral around

will be small and proportional to

. The average behaviour of the trajectory, over a timescale much larger than

2\pi/\omega

, will be to follow the curve

Extension error estimates

Average technique for initial value problems has been treated up to now with an validity error estimates of order

1/\varepsilon

. However, there are circumstances where the estimates can be extended for further times, even the case for all times. Below we deal with a system containing an asymptotically stable fixed point. Such situation recapitulates what is illustrated in Figure 1.

Theorem (Eckhaus ^[6] /Sanchez-Palencia ^[7] ) Consider the initial value problem $\dot = \varepsilon f^(x, t), \qquad x_0 \in D \subseteq \R^n, \quad 0 \leq \varepsilon \ll 1.$ Suppose $\dot= \varepsilon \lim_\int _^f^1(y, s)~ds =: \varepsilon^1(y), \quad y(0, \varepsilon) = x_0$ exists and contains an asymptotically stable fixed point

y=0

in the linear approximation. Moreover,

\bar{f}¹

is continuously differentiable with respect to

and has a domain of attraction

D⁰\subsetD

. For any compact

K\subsetD⁰

and for all

x₀\inK

\|x(t) - y(t)\| = \mathcal(\delta(\varepsilon)), \quad 0 \leq t < \infty,

with

\delta(\varepsilon)=o(1)

in the general case and

l{O}(\varepsilon)

in the periodic case.

Notes and References

Book: Charles., Chicone, Carmen. Ordinary differential equations with applications. 2006. Springer. 9780387307695. 2nd. New York. 288193020.
Book: 2007. Averaging Methods in Nonlinear Dynamical Systems. 59. en-gb. 10.1007/978-0-387-48918-6. 978-0-387-48916-2. Sanders. Jan A.. Verhulst. Ferdinand. Murdock. James. Applied Mathematical Sciences .
Book: Murdock, James A.. Perturbations : theory and methods. 1999. Society for Industrial and Applied Mathematics. 978-0898714432. Philadelphia. 41612407.
Book: Hale, Jack K.. Ordinary differential equations. 1980. R.E. Krieger Pub. Co. 978-0898740110. 2nd. Huntington, N.Y.. 5170595.
Book: Guckenheimer. John. Holmes. Philip. Applied Mathematical Sciences . 1983. Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields. 42. en-gb. 10.1007/978-1-4612-1140-2. 0066-5452. 978-1-4612-7020-1.
Eckhaus. Wiktor. 1975-03-01. New approach to the asymptotic theory of nonlinear oscillations and wave-propagation. Journal of Mathematical Analysis and Applications. 49. 3. 575–611 . 10.1016/0022-247X(75)90200-0. 0022-247X. free.
Sanchez-Palencia. Enrique. 1976-01-01. Methode de centrage-estimation de l'erreur et comportement des trajectoires dans l'espace des phases. International Journal of Non-Linear Mechanics. 11. 4. 251–263 . 10.1016/0020-7462(76)90004-4. 1976IJNLM..11..251S. 0020-7462.