Multitaper Explained

In signal processing, multitaper analysis is a spectral density estimation technique developed by David J. Thomson.[1] It can estimate the power spectrum SX of a stationary ergodic finite-variance random process X, given a finite contiguous realization of X as data.

Motivation

The multitaper method overcomes some of the limitations of non-parametric Fourier analysis. When applying the Fourier transform to extract spectral information from a signal, we assume that each Fourier coefficient is a reliable representation of the amplitude and relative phase of the corresponding component frequency. This assumption, however, is not generally valid for empirical data. For instance, a single trial represents only one noisy realization of the underlying process of interest. A comparable situation arises in statistics when estimating measures of central tendency i.e., it is bad practice to estimate qualities of a population using individuals or very small samples. Likewise, a single sample of a process does not necessarily provide a reliable estimate of its spectral properties. Moreover, the naive power spectral density obtained from the signal's raw Fourier transform is a biased estimate of the true spectral content.

These problems are often overcome by averaging over many realizations of the same event after applying a taper to each trial. However, this method is unreliable with small data sets and undesirable when one does not wish to attenuate signal components that vary across trials. Furthermore, even when many trials are available the untapered periodogram is generally biased (with the exception of white noise) and the bias depends upon the length of each realization, not the number of realizations recorded. Applying a single taper reduces bias but at the cost of increased estimator variance due to attenuation of activity at the start and end of each recorded segment of the signal.

The multitaper method partially obviates these problems by obtaining multiple independent estimates from the same sample. Each data taper is multiplied element-wise by the signal to provide a windowed trial from which one estimates the power at each component frequency. As each taper is pairwise orthogonal to all other tapers, the window functions are uncorrelated with one another. The final spectrum is obtained by averaging over all the tapered spectra thus recovering some of the information that is lost due to partial attenuation of the signal that results from applying individual tapers.

This method is especially useful when a small number of trials is available as it reduces the estimator variance beyond what is possible with single taper methods. Moreover, even when many trials are available the multitaper approach is useful as it permits more rigorous control of the trade-off between bias and variance than what is possible in the single taper case.

Thomson chose the Slepian functions[2] or discrete prolate spheroidal sequences as tapers since these vectors are mutually orthogonal and possess desirable spectral concentration properties (see the section on Slepian sequences). In practice, a weighted average is often used to compensate for increased energy loss at higher order tapers.[3]

Formulation

Consider a p-dimensional zero mean stationary stochastic process

X(t)={\lbrackX(1,t),X(2,t),...,X(p,t) \rbrack}T

Here T denotes the matrix transposition. In neurophysiology for example, p refers to the total number of channels andhence

X(t)

can represent simultaneous measurement ofelectrical activity of those p channels. Let the sampling intervalbetween observations be

\Deltat

, so that the Nyquist frequency is

fN=1/(2\Deltat)

.

The multitaper spectral estimator utilizes several different data tapers which are orthogonal to each other. The multitaper cross-spectral estimator between channel l and m is the average of K direct cross-spectral estimators between the same pair of channels (l and m) and hence takes the form

\hat{S}lm(f)=

1
K
K-1
\sum
k=0
lm
\hat{S}
k

(f).

Here,

lm
\hat{S}
k

(f)

(for

0\leqk\leqK-1

) is the kth direct cross spectral estimator between channel l and m and is given by
lm
\hat{S}
k

(f)=

1
N\Deltat

{\lbrack

l
J
k

(f)\rbrack}*{\lbrack

m
J
k

(f) \rbrack},

where

l(f)
J
k

=

N
\sum
t=1

ht,kX(l,t)e-i.

The Slepian sequences

The sequence

\lbraceht,k\rbrace

is the data taper for thekth direct cross-spectral estimator
lm
\hat{S}
k

(f)

and is chosen as follows:

2NW\Deltat

. The quantity 2W defines the resolution bandwidth for the spectral concentration problem and

W\in (0,fN)

. When l = m, we get the multitaper estimator for the auto-spectrum of the lth channel. In recent years, a dictionary based on modulated DPSS was proposed as an overcomplete alternative to DPSS.[5]

See also Window function:DPSS or Slepian window

Applications

Not limited to time series, the multitaper method is easily extensible to multiple Cartesian dimenions using custom Slepian functions,[6] and can be reformulated for spectral estimation on the sphere using Slepian functions constructed from spherical harmonics[7] for applications in geophysics and cosmology[8] [9] among others. An extensive treatment about the application of this method to analyze multi-trial, multi-channel data generated in neuroscience, biomedical engineering and elsewhere can be found here. This technique is currently used in the spectral analysis toolkit of Chronux.

See also

External links

Notes and References

  1. Thomson, D. J. (1982) Spectrum estimation and harmonic analysis. Proceedings of the IEEE, 70, 1055 - 1096
  2. Book: Simons . F. J. . Plattner . A. . Scalar and Vector Slepian Functions, Spherical Signal Estimation and Spectral Analysis . Handbook of Geomathematics . 2015 . 2563–2608 . 10.1007/978-3-642-54551-1_30. 978-3-642-54550-4 .
  3. Percival, D. B., and A. T. Walden. Spectral Analysis for Physical Applications: Multitaper and Conventional Univariate Techniques. Cambridge: Cambridge University Press, 1993.
  4. Slepian, D. (1978) "Prolate spheroidal wave functions, Fourier analysis, and uncertainty  - V: The discrete case." Bell System Technical Journal, 57, 1371 - 1430
  5. E. Sejdić, M. Luccini, S. Primak, K. Baddour, T. Willink, “Channel estimation using modulated discrete prolate spheroidal sequences based frames,” in Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, USA, March 31-April 04, 2008, pp. 2849-2852.
  6. Simons . F. J. . Wang . D. V. . Spatiospectral concentration in the Cartesian plane . GEM: International Journal on Geomathematics. 2011 . 2 . 1–36 . 10.1007/s13137-011-0016-z. .
  7. Simons . F. J. . Dahlen . F. A. . Wieczorek . M. A. . 10.1137/S0036144504445765 . Spatiospectral Concentration on a Sphere . SIAM Review . 48 . 3 . 504–536 . 2006 . math/0408424 . 2006SIAMR..48..504S .
  8. Wieczorek . M. A. . Simons . F. J. . 10.1007/s00041-006-6904-1 . Minimum-variance multitaper spectral estimation on the sphere . Journal of Fourier Analysis and Applications . 13 . 6 . 665 . 2007 .
  9. Dahlen . F. A. . Simons . F. J. . 10.1111/j.1365-246X.2008.03854.x . Spectral estimation on a sphere in geophysics and cosmology . Geophysical Journal International . 174 . 3 . 774 . 2008 . free . 0705.3083 . 2008GeoJI.174..774D .