In numerical analysis and functional analysis, a discrete wavelet transform (DWT) is any wavelet transform for which the wavelets are discretely sampled. As with other wavelet transforms, a key advantage it has over Fourier transforms is temporal resolution: it captures both frequency and location information (location in time).
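See main article: Haar wavelet. The first DWT was invented by the Hungarian mathematician Alfréd Haar. For an input represented by a list of 2^n numbers, the Haar wavelet transform may be considered to pair up input values, storing the difference and passing the sum. This process is repeated recursively, pairing up the sums to provide the next scale, which leads to 2^n − 1 differences and a final sum.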
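The pairing of sums and differences can be sketched in a few lines of Python. This is a minimal, unnormalised illustration rather than a reference implementation; the function name `haar_sums_and_differences` and the output ordering (final sum first, then differences from coarsest to finest) are assumptions made for the example.

```python
def haar_sums_and_differences(values):
    """Unnormalised Haar transform of a list whose length is a power of two.

    Returns [final_sum, coarsest differences..., finest differences...].
    """
    n = len(values)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    sums = list(values)
    details = []  # coarser levels are prepended as they are produced
    while len(sums) > 1:
        pairs = list(zip(sums[0::2], sums[1::2]))
        details = [a - b for a, b in pairs] + details  # store the differences
        sums = [a + b for a, b in pairs]               # pass the sums on
    return sums + details


print(haar_sums_and_differences([1, 0, 0, 0]))  # [1, 1, 1, 0]
```

For an input of length 2^n the result holds one final sum followed by 2^n − 1 differences.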
See main article: Daubechies wavelet. The most commonly used set of discrete wavelet transforms was formulated by the Belgian mathematician Ingrid Daubechies in 1988. This formulation is based on the use of recurrence relations to generate progressively finer discrete samplings of an implicit mother wavelet function; each resolution is twice that of the previous scale. In her seminal paper, Daubechies derives a family of wavelets, the first of which is the Haar wavelet. Interest in this field has exploded since then, and many variations of Daubechies' original wavelets were developed.[1] [2] [3]
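See main article: Complex wavelet transform. The dual-tree complex wavelet transform (ℂWT) is a relatively recent enhancement to the discrete wavelet transform (DWT), with important additional properties: it is nearly shift invariant and directionally selective in two and higher dimensions. It achieves this with a redundancy factor of only 2^d for d-dimensional signals, substantially lower than that of the undecimated DWT. The multidimensional (M-D) dual-tree ℂWT is nonseparable but is based on a computationally efficient, separable filter bank.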
Other forms of discrete wavelet transform include the Le Gall–Tabatabai (LGT) 5/3 wavelet developed by Didier Le Gall and Ali J. Tabatabai in 1988 (used in JPEG 2000 or JPEG XS),[5] [6] [7] the Binomial QMF developed by Ali Naci Akansu in 1990,[8] the set partitioning in hierarchical trees (SPIHT) algorithm developed by Amir Said with William A. Pearlman in 1996,[9] the non- or undecimated wavelet transform (where downsampling is omitted), and the Newland transform (where an orthonormal basis of wavelets is formed from appropriately constructed top-hat filters in frequency space). Wavelet packet transforms are also related to the discrete wavelet transform. Complex wavelet transform is another form.
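The Haar DWT illustrates the desirable properties of wavelets in general. First, it can be performed in O(n) operations; second, it captures not only a notion of the frequency content of the input, by examining it at different scales, but also temporal content, i.e. the times at which these frequencies occur. Combined, these two properties make the fast wavelet transform (FWT) an alternative to the conventional fast Fourier transform (FFT).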
Due to the rate-change operators in the filter bank, the discrete WT is not time-invariant but actually very sensitive to the alignment of the signal in time. To address the time-varying problem of wavelet transforms, Mallat and Zhong proposed a new algorithm for wavelet representation of a signal, which is invariant to time shifts.[10] According to this algorithm, which is called a TI-DWT, only the scale parameter is sampled along the dyadic sequence 2^j (j∈Z) and the wavelet transform is calculated for each point in time.[11] [12]
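The sensitivity to alignment can be seen with a small sketch. It uses the finest-level unnormalised Haar details (differences of adjacent pairs) as a stand-in for a general decimated DWT; the helper name `haar_details` and the test signal are illustrative assumptions.

```python
def haar_details(x):
    """Finest-level unnormalised Haar details: differences of adjacent pairs."""
    return [a - b for a, b in zip(x[0::2], x[1::2])]


pulse = [0, 0, 1, 1, 0, 0, 0, 0]
shifted = [0] + pulse[:-1]  # the same pulse delayed by one sample

print(haar_details(pulse))    # [0, 0, 0, 0]
print(haar_details(shifted))  # [0, -1, 1, 0]
```

A one-sample delay moves the pulse edges across the pair boundaries, so the detail coefficients change from all zero to non-zero rather than simply shifting.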
The discrete wavelet transform has many applications in science, engineering, mathematics and computer science. Most notably, it is used for signal coding, to represent a discrete signal in a sparser, less redundant form, often as a preconditioning step for data compression. Practical applications can also be found in signal processing of accelerations for gait analysis,[13] [14] image processing,[15] [16] digital communications and many others.[17] [18] [19]
The discrete wavelet transform (discrete in scale and shift, and continuous in time) has been successfully implemented as an analog filter bank in biomedical signal processing for the design of low-power pacemakers, and also in ultra-wideband (UWB) wireless communications.[20]
Wavelets are often used to denoise two-dimensional signals, such as images. The following example provides three steps to remove unwanted white Gaussian noise from the noisy image shown. MATLAB was used to import and filter the image.
The first step is to choose a wavelet type and a level N of decomposition. In this case biorthogonal 3.5 wavelets were chosen with a level N of 10. Biorthogonal wavelets are commonly used in image processing to detect and filter white Gaussian noise,[21] due to their high contrast of neighboring pixel intensity values. Using these wavelets, a wavelet transformation is performed on the two-dimensional image.
Following the decomposition of the image file, the next step is to determine threshold values for each level from 1 to N. The Birgé-Massart strategy[22] is a fairly common method for selecting these thresholds. Using this process, individual thresholds are computed for the N = 10 levels. Applying these thresholds accounts for the majority of the actual filtering of the signal.
The final step is to reconstruct the image from the modified levels. This is accomplished using an inverse wavelet transform. The resulting image, with white Gaussian noise removed, is shown below the original image. When filtering any form of data it is important to quantify the signal-to-noise ratio (SNR) of the result. In this case, the SNR of the noisy image in comparison to the original was 30.4958%, and the SNR of the denoised image is 32.5525%. The resulting improvement from the wavelet filtering is an SNR gain of 2.0567%.[23]
Choosing other wavelets, levels, and thresholding strategies can result in different types of filtering. In this example, white Gaussian noise was chosen to be removed; with different thresholding, it could just as easily have been amplified.
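The three steps (decompose, threshold each level, reconstruct) can be sketched in plain Python. This is not the MATLAB procedure described above: it works on a one-dimensional signal, substitutes the orthonormal Haar wavelet for the biorthogonal 3.5 wavelet, and takes the per-level thresholds as given instead of deriving them with the Birgé-Massart strategy. All function names are hypothetical.

```python
import math


def haar_forward(signal, levels):
    """Orthonormal multi-level Haar DWT: (approximation, details per level)."""
    approx = list(signal)
    details = []  # details[0] is the finest level
    for _ in range(levels):
        if len(approx) % 2:
            raise ValueError("length must be divisible by 2**levels")
        pairs = list(zip(approx[0::2], approx[1::2]))
        details.append([(a - b) / math.sqrt(2) for a, b in pairs])
        approx = [(a + b) / math.sqrt(2) for a, b in pairs]
    return approx, details


def haar_inverse(approx, details):
    """Inverse of haar_forward."""
    approx = list(approx)
    for detail in reversed(details):
        rebuilt = []
        for s, d in zip(approx, detail):
            rebuilt += [(s + d) / math.sqrt(2), (s - d) / math.sqrt(2)]
        approx = rebuilt
    return approx


def soft_threshold(value, threshold):
    """Shrink a coefficient towards zero by `threshold`."""
    return math.copysign(max(abs(value) - threshold, 0.0), value)


def denoise(signal, levels, thresholds):
    """Step 1: decompose. Step 2: threshold each level. Step 3: reconstruct."""
    approx, details = haar_forward(signal, levels)
    shrunk = [[soft_threshold(d, t) for d in level]
              for level, t in zip(details, thresholds)]
    return haar_inverse(approx, shrunk)


noisy = [4.1, 3.9, 4.0, 4.2, 8.0, 7.9, 8.1, 8.0]
print(denoise(noisy, levels=2, thresholds=[0.3, 0.3]))
```

With all thresholds set to zero the signal is reconstructed unchanged, which is a convenient sanity check before tuning the thresholds.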
See also: Discrete Fourier transform.
To illustrate the differences and similarities between the discrete wavelet transform and the discrete Fourier transform, consider the DWT and DFT of the following sequence: (1,0,0,0), a unit impulse.
The DFT has orthogonal basis (DFT matrix):
\begin{bmatrix} 1&1&1&1\\ 1&-i&-1&i\\ 1&-1&1&-1\\ 1&i&-1&-i \end{bmatrix}
while the DWT with Haar wavelets for length 4 data has orthogonal basis in the rows of:
\begin{bmatrix} 1&1&1&1\\ 1&1&-1&-1\\ 1&-1&0&0\\ 0&0&1&-1 \end{bmatrix}
(To simplify notation, whole numbers are used, so the bases are orthogonal but not orthonormal.)
Preliminary observations include:
\begin{align} (1,0,0,0) &= \frac{1}{4}(1,1,1,1) + \frac{1}{4}(1,1,-1,-1) + \frac{1}{2}(1,-1,0,0) \qquad \text{Haar DWT}\\ (1,0,0,0) &= \frac{1}{4}(1,1,1,1) + \frac{1}{4}(1,i,-1,-i) + \frac{1}{4}(1,-1,1,-1) + \frac{1}{4}(1,-i,-1,i) \qquad \text{DFT} \end{align}
The DWT demonstrates the localization: the (1,1,1,1) term gives the average signal value, the (1,1,−1,−1) term places the signal in the left half of the domain, and the (1,−1,0,0) term places it in the left half of that left half. Truncating at any stage yields a downsampled version of the signal:
\begin{align} &\left(\frac{1}{4},\frac{1}{4},\frac{1}{4},\frac{1}{4}\right)\\ &\left(\frac{1}{2},\frac{1}{2},0,0\right) \qquad \text{2-term truncation}\\ &\left(1,0,0,0\right) \end{align}
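The DFT, by contrast, expresses the sequence as a superposition of waves of various frequencies, so truncating the series yields a low-pass filtered version of the signal:
\begin{align} &\left(\frac{1}{4},\frac{1}{4},\frac{1}{4},\frac{1}{4}\right)\\ &\left(\frac{3}{4},\frac{1}{4},-\frac{1}{4},\frac{1}{4}\right) \qquad \text{2-term truncation}\\ &\left(1,0,0,0\right) \end{align}
Notably, the middle (2-term) approximations differ. From the frequency-domain perspective the Fourier version is the better approximation, but from the time-domain perspective it has drawbacks: it exhibits undershoot (one value is negative although the original sequence is non-negative everywhere) and ringing (the right half is non-zero, unlike in the wavelet approximation). On the other hand, the Fourier approximation correctly shows a peak at the first point, and every point is within 1/4 of its correct value, though every point has some error. The wavelet approximation, by contrast, places the peak on the left half but not specifically at the first point; it is exactly correct for half of the values (reflecting location) and has an error of 1/2 for the other two.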
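The truncated expansions of the unit impulse can be checked numerically. The sketch below projects the impulse onto the rows of the two basis matrices given earlier and sums a subset of the projections. The helper `partial_sum` is a name chosen for this example, and the DFT "2-term" truncation is read here as keeping frequency 0 together with the conjugate pair at frequency 1, which is an interpretation rather than something the text states explicitly.

```python
# Rows of the Haar and DFT basis matrices for length-4 data.
HAAR = [(1, 1, 1, 1), (1, 1, -1, -1), (1, -1, 0, 0), (0, 0, 1, -1)]
DFT = [(1, 1, 1, 1), (1, -1j, -1, 1j), (1, -1, 1, -1), (1, 1j, -1, -1j)]


def partial_sum(x, basis):
    """Sum of the orthogonal projections of x onto the given basis vectors."""
    total = [0j] * len(x)
    for v in basis:
        inner = sum(a * complex(b).conjugate() for a, b in zip(x, v))
        norm = sum(abs(b) ** 2 for b in v)
        total = [t + inner / norm * b for t, b in zip(total, v)]
    return [t.real for t in total]


impulse = (1, 0, 0, 0)
haar_two_terms = partial_sum(impulse, HAAR[:2])
dft_low_pass = partial_sum(impulse, [DFT[0], DFT[1], DFT[3]])

print(haar_two_terms)  # [0.5, 0.5, 0.0, 0.0]
print(dft_low_pass)    # [0.75, 0.25, -0.25, 0.25]
```

The Haar truncation is exact on the right half and off by 1/2 on the left, while the Fourier truncation is off by 1/4 everywhere.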
This illustrates the kinds of trade-offs between these transforms, and how in some respects the DWT provides preferable behavior, particularly for the modeling of transients.
Watermarking using DCT-DWT alters the wavelet coefficients of the middle-frequency coefficient sets of a 5-level DWT-transformed host image, and then applies the DCT to the selected coefficient sets. Prasanalakshmi B proposed a method[24] that uses the HL frequency sub-band in the middle-frequency coefficient sets LHx and HLx of a 5-level discrete wavelet transform (DWT) transformed image. This algorithm chooses a coarser level of the DWT, for the sake of imperceptibility and robustness, and applies a 4×4 block-based DCT to it. Consequently, higher imperceptibility and robustness can be achieved. In addition, pre-filtering operations (sharpening and Laplacian of Gaussian (LoG) filtering) are applied before the watermark is extracted, which increases the difference between the watermark information and the host image.
The basic idea of the DWT for a two-dimensional image is described as follows: An image is first decomposed into four parts of high, middle, and low-frequency subcomponents (i.e., LL1, HL1, LH1, HH1) by critically subsampling horizontal and vertical channels using subcomponent filters.
The subcomponents HL1, LH1, and HH1 represent the finest scale wavelet coefficients. The subcomponent LL1 is decomposed and critically subsampled to obtain the following coarser-scaled wavelet components. This process is repeated several times, which is determined by the application at hand.
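One level of this two-dimensional decomposition can be sketched with the Haar filter pair (averages and half-differences), filtering along rows and then along columns. This is only an illustration under stated assumptions: the function names are hypothetical, the image is a list of equal-length rows with even dimensions, and the labelling convention (HL meaning high-pass horizontally and low-pass vertically) is one of several in use.

```python
def haar_pairs(values):
    """One Haar step: (averages, half-differences) of adjacent pairs."""
    pairs = list(zip(values[0::2], values[1::2]))
    low = [(a + b) / 2 for a, b in pairs]
    high = [(a - b) / 2 for a, b in pairs]
    return low, high


def transpose(matrix):
    return [list(column) for column in zip(*matrix)]


def haar_2d_level(image):
    """Split an image into the four sub-bands LL, HL, LH and HH."""
    # Filter and subsample along each row (horizontal direction).
    row_low, row_high = zip(*(haar_pairs(row) for row in image))

    def along_columns(half):
        # Filter and subsample along each column (vertical direction).
        low, high = zip(*(haar_pairs(col) for col in transpose(half)))
        return transpose(low), transpose(high)

    ll, lh = along_columns(row_low)
    hl, hh = along_columns(row_high)
    return {"LL": ll, "HL": hl, "LH": lh, "HH": hh}


bands = haar_2d_level([[1, 3, 1, 3]] * 4)
print(bands["LL"], bands["HL"])
```

Applying `haar_2d_level` again to the `LL` band gives the next, coarser level of the decomposition.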
High-frequency components are considered to embed the watermark since they contain edge information, and the human eye is less sensitive to edge changes. In watermarking algorithms, besides the watermark's invisibility, the primary concern is choosing the frequency components to embed the watermark to survive the possible attacks that the transmitted image may undergo. Transform domain techniques have the advantage of unique properties of alternate domains to address spatial domain limitations and have additional features.
The host image undergoes 5-level DWT watermarking. Embedding the watermark in the middle-level frequency sub-bands LLx gives a high degree of imperceptibility and robustness. Consequently, the LLx coefficient sets at level five are chosen to increase the robustness of the watermark against common watermarking attacks, especially added noise and blurring, with little to no additional impact on image quality. A block-based DCT is then performed on the selected DWT coefficient sets, and pseudorandom sequences are embedded in the middle frequencies. The watermark embedding procedure is as follows:
1. Read the cover image I, of size N×N.
2. The four non-overlapping multi-resolution coefficient sets LL1, HL1, LH1, and HH1 are obtained initially.
3. Decomposition is performed up to five levels, and the frequency subcomponents are obtained by computing the fifth-level DWT of the image I.
4. Divide the final four coefficient sets HH5, HL5, LH5 and LL5 into 4×4 blocks.
5. DCT is performed on each block in the chosen coefficient sets. These coefficient sets are chosen so that imperceptibility and robustness are weighed equally.
6. Scramble the fingerprint image to obtain the scrambled watermark WS(i, j).
7. Reformulate the scrambled watermark image into a vector of zeros and ones.
8. Two uncorrelated pseudorandom sequences are generated from the key obtained from the palm vein. The number of elements in each of the two pseudorandom sequences must equal the number of mid-band elements of the DCT-transformed DWT coefficient sets.
9. Embed the two pseudorandom sequences with a gain factor α in the DCT-transformed 4×4 blocks of the selected DWT coefficient sets of the host image. Instead of embedding in all coefficients of the DCT block, the sequences are applied only to the mid-band DCT coefficients. If X denotes the matrix of the mid-band coefficients of the DCT-transformed block, then X is updated to X' = X + α·PN0 when the watermark bit is 0, and to X' = X + α·PN1 when the watermark bit is 1. The inverse DCT (IDCT) is applied to each block after its mid-band coefficients have been modified to embed the watermark bits.
10. To produce the watermarked host image, perform the inverse DWT (IDWT) on the DWT-transformed image, including the modified coefficient sets.
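The additive update in step 9 can be sketched on a single 4×4 block of DCT coefficients. Everything specific here is an assumption made for illustration: the choice of mid-band positions `MID_BAND`, the ±1 alphabet of the sequences, and the use of Python's `random` module to derive a sequence from an integer key. The sketch does not check that two generated sequences are uncorrelated, and it does not perform the DWT, DCT or their inverses.

```python
import random

# Hypothetical mid-band positions (row, column) inside a 4x4 DCT block.
MID_BAND = [(0, 3), (1, 2), (2, 1), (3, 0)]


def pn_sequence(key, length):
    """Pseudorandom +/-1 sequence derived from an integer key."""
    rng = random.Random(key)
    return [rng.choice((-1, 1)) for _ in range(length)]


def embed_bit(dct_block, bit, pn0, pn1, alpha):
    """Return a copy of the block with X' = X + alpha * PN_bit on the mid-band."""
    pn = pn1 if bit else pn0
    marked = [row[:] for row in dct_block]
    for (r, c), p in zip(MID_BAND, pn):
        marked[r][c] += alpha * p
    return marked


block = [[0.0] * 4 for _ in range(4)]
pn0 = pn_sequence(1, len(MID_BAND))
pn1 = pn_sequence(2, len(MID_BAND))
print(embed_bit(block, 1, pn0, pn1, alpha=2.0))
```

Only the four mid-band entries change; the remaining coefficients of the block are copied through untouched.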
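The DWT of a signal x is calculated by passing it through a series of filters. First the samples are passed through a low-pass filter with impulse response g, resulting in a convolution of the two:
y[n] = (x * g)[n] = \sum\limits_{k=-\infty}^{\infty} x[k] g[n-k]
The signal is also decomposed simultaneously using a high-pass filter h. The outputs give the detail coefficients (from the high-pass filter) and the approximation coefficients (from the low-pass filter). The two filters are related to each other and are known as a quadrature mirror filter pair.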
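However, since half the frequencies of the signal have now been removed, half the samples can be discarded according to Nyquist's rule. The output of the low-pass filter g is then subsampled by 2 and further processed by passing it again through a new low-pass filter g and a high-pass filter h with half the cut-off frequency of the previous one, i.e.:
y_{\mathrm{low}}[n] = \sum\limits_{k=-\infty}^{\infty} x[k] g[2n-k]
y_{\mathrm{high}}[n] = \sum\limits_{k=-\infty}^{\infty} x[k] h[2n-k]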
This decomposition has halved the time resolution since only half of each filter output characterises the signal. However, each output has half the frequency band of the input, so the frequency resolution has been doubled.
With the subsampling operator \downarrow defined by
(y \downarrow k)[n] = y[kn]
the above summation can be written more concisely.
y_{\mathrm{low}} = (x * g) \downarrow 2
y_{\mathrm{high}} = (x * h) \downarrow 2
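The two formulas translate directly into code. The sketch below is a literal, unoptimised reading of them for finite sequences, assuming the signal is zero outside its support; `convolve`, `downsample` and `analysis_step` are names chosen for this example. Because it keeps the full convolution, the output carries one extra boundary sample compared with the N/2 coefficients of a periodised implementation.

```python
def convolve(x, f):
    """Full discrete convolution y[n] = sum_k x[k] f[n - k] of finite sequences."""
    y = [0.0] * (len(x) + len(f) - 1)
    for k, xk in enumerate(x):
        for m, fm in enumerate(f):
            y[k + m] += xk * fm
    return y


def downsample(y, k):
    """Subsampling operator: keeps y[0], y[k], y[2k], ..."""
    return y[::k]


def analysis_step(x, g, h):
    """One filter-bank level: y_low = (x * g) down 2, y_high = (x * h) down 2."""
    return downsample(convolve(x, g), 2), downsample(convolve(x, h), 2)


low, high = analysis_step([1, 2, 3, 4], g=[1, 1], h=[-1, 1])
print(low, high)  # [1.0, 5.0, 4.0] [-1.0, -1.0, 4.0]
```

The integer filters `[1, 1]` and `[-1, 1]` are unnormalised stand-ins for a low-pass and a high-pass filter, used here only to keep the arithmetic exact.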
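However, computing the complete convolution x * g and then downsampling would waste computation time, since half of the computed samples are discarded. The lifting scheme is an optimization in which these two computations are interleaved.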
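For the Haar case, the lifting idea can be sketched as a split into even and odd samples followed by a predict step and an update step. This is a minimal, unnormalised variant (pair averages and differences, differing from the orthonormal filters by scale factors); the function names are assumptions for the example.

```python
def haar_lifting_forward(x):
    """One Haar level via lifting: split, predict, update (len(x) must be even)."""
    even, odd = x[0::2], x[1::2]
    detail = [o - e for e, o in zip(even, odd)]         # predict odd from even
    approx = [e + d / 2 for e, d in zip(even, detail)]  # update: pair average
    return approx, detail


def haar_lifting_inverse(approx, detail):
    """Undo the update, then the predict step, and interleave the samples."""
    even = [a - d / 2 for a, d in zip(approx, detail)]
    odd = [d + e for e, d in zip(even, detail)]
    x = [0] * (2 * len(even))
    x[0::2], x[1::2] = even, odd
    return x


approx, detail = haar_lifting_forward([4, 6, 10, 12])
print(approx, detail)                         # [5.0, 11.0] [2, 2]
print(haar_lifting_inverse(approx, detail))   # [4.0, 6.0, 10.0, 12.0]
```

Each output sample is computed once from the samples it depends on, so no convolution output is computed only to be thrown away, and the inverse simply runs the steps backwards.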
This decomposition is repeated to further increase the frequency resolution: the approximation coefficients are again decomposed with high- and low-pass filters and then down-sampled. This is represented as a binary tree whose nodes represent sub-spaces with different time-frequency localisation. The tree is known as a filter bank.
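At each level of the filter bank, the signal is decomposed into low and high frequencies. Due to the decomposition process, the length of the input signal must be a multiple of 2^n, where n is the number of levels.
For example, for a signal with 32 samples, a frequency range of 0 to f_n, and 3 levels of decomposition, 4 output scales are produced: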
Level | Frequencies | Samples
---|---|---
3 | 0 to f_n/8 | 4
3 | f_n/8 to f_n/4 | 4
2 | f_n/4 to f_n/2 | 8
1 | f_n/2 to f_n | 16
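The layout in the table follows mechanically from the signal length and the number of levels. The sketch below reproduces it; `dwt_band_layout` is a hypothetical helper, and band edges are expressed as fractions of f_n.

```python
from fractions import Fraction


def dwt_band_layout(num_samples, levels):
    """Rows of (level, low edge, high edge, coefficient count); edges in units of f_n."""
    if num_samples % (2 ** levels):
        raise ValueError("signal length must be a multiple of 2**levels")
    # Approximation band left over at the deepest level.
    rows = [(levels, Fraction(0), Fraction(1, 2 ** levels),
             num_samples // 2 ** levels)]
    # Detail bands, from the deepest level up to level 1.
    for level in range(levels, 0, -1):
        rows.append((level, Fraction(1, 2 ** level), Fraction(1, 2 ** (level - 1)),
                     num_samples // 2 ** level))
    return rows


for level, lo, hi, count in dwt_band_layout(32, 3):
    print(level, lo, hi, count)
```

The coefficient counts always add up to the original signal length, here 4 + 4 + 8 + 16 = 32.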
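The filter bank implementation of wavelets can be interpreted as computing the wavelet coefficients of a discrete set of child wavelets for a given mother wavelet ψ(t). In the case of the discrete wavelet transform, the mother wavelet is shifted and scaled by powers of two:
\psi_{j,k}(t) = \frac{1}{\sqrt{2^j}} \psi\left(\frac{t - k 2^j}{2^j}\right)
where j is the scale parameter and k is the shift parameter, both of which are integers.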
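Recall that the wavelet coefficient γ of a signal x(t) is the projection of x(t) onto a wavelet, and let x(t) be a signal of length 2^N. In the case of a child wavelet in the discrete family above,
\gamma_{jk} = \int_{-\infty}^{\infty} x(t) \frac{1}{\sqrt{2^j}} \psi\left(\frac{t - k 2^j}{2^j}\right) dt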
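Now fix j at a particular scale, so that γ_jk is a function of k only. In light of the above equation, γ_jk can be viewed as a convolution of x(t) with a dilated, reflected, and normalized version of the mother wavelet,
h(t) = \frac{1}{\sqrt{2^j}} \psi\left(\frac{-t}{2^j}\right)
sampled at the points 1, 2^j, 2·2^j, ..., 2^N. But this is precisely what the detail coefficients give at level j of the discrete wavelet transform. Therefore, for an appropriate choice of h[n] and g[n], the detail coefficients of the filter bank correspond exactly to the wavelet coefficients of a discrete set of child wavelets for a given mother wavelet ψ(t).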
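As an example, consider the discrete Haar wavelet, whose mother wavelet is ψ = [1, −1]. Then the dilated, reflected, and normalized version of this wavelet is
h[n] = \frac{1}{\sqrt{2}} [-1, 1]
which is, indeed, the high-pass decomposition filter for the discrete Haar wavelet transform.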
The filterbank implementation of the Discrete Wavelet Transform takes only O(N) in certain cases, as compared to O(N log N) for the fast Fourier transform.
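Note that if g[n] and h[n] both have constant length (i.e. their length is independent of N), then x * h and x * g each take O(N) time. The wavelet filter bank performs each of these two O(N) convolutions, then splits the signal into two branches of size N/2. But it only recursively splits the upper branch, convolved with g[n] (in contrast to the FFT, which recursively splits both the upper and the lower branch). This leads to the following recurrence relation
T(N) = 2N + T\left(\frac{N}{2}\right)
which leads to an O(N) time for the entire operation, as can be shown by a geometric series expansion of the above relation.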
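The geometric series expansion can be written out as follows, as a sketch under the assumptions that N is a power of two and that T(1) is a constant:

```latex
\begin{align}
T(N) &= 2N + T\left(\frac{N}{2}\right) \\
     &= 2N + N + T\left(\frac{N}{4}\right) \\
     &= 2N\left(1 + \frac{1}{2} + \frac{1}{4} + \cdots + \frac{2}{N}\right) + T(1) \\
     &< 4N + T(1) = O(N)
\end{align}
```

The bracketed sum is a finite geometric series bounded by 2, which is why the total cost stays linear. If both branches were split, as in the FFT, the work per level would stay at order N over log N levels.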
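As an example, the discrete Haar wavelet transform runs in linear time, since in that case h[n] and g[n] have constant length 2:
h[n] = \left[\frac{-\sqrt{2}}{2}, \frac{\sqrt{2}}{2}\right], \qquad g[n] = \left[\frac{\sqrt{2}}{2}, \frac{\sqrt{2}}{2}\right]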
The locality of wavelets, coupled with the O(N) complexity, guarantees that the transform can be computed online (on a streaming basis). This property is in sharp contrast to FFT, which requires access to the entire signal at once. It also applies to the multi-scale transform and also to the multi-dimensional transforms (e.g., 2-D DWT).[25]
See also: Adam7 algorithm.
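The multiplicative (or geometric) discrete wavelet transform is a variant that applies to an observation model
{\bf y} = f{\bf X}
involving the interaction of a positive regular function f and a multiplicative independent positive noise X, with E[X] = 1. Let W denote a wavelet transform. Since
f{\bf X} = f + f({\bf X} - 1)
the standard (additive) discrete wavelet transform W^+ satisfies
{\cal W}^{+}{\bf y} = {\cal W}^{+}f + {\cal W}^{+}f({\bf X} - 1),
where the detail coefficients W^+ f(X − 1) cannot be considered sparse in general, owing to the contribution of f in the latter expression. In the multiplicative framework, the wavelet transform satisfies
{\cal W}^{\times}{\bf y} = \left({\cal W}^{\times}f\right) \times \left({\cal W}^{\times}{\bf X}\right).
This embedding of wavelets in a multiplicative algebra involves generalized multiplicative approximation and detail operators. For instance, in the case of Haar wavelets, up to the normalization coefficient α, the standard W^+ approximations (arithmetic means)
c_k = \alpha(y_k + y_{k-1})
and details (arithmetic differences)
d_k = \alpha(y_k - y_{k-1})
become, respectively, geometric-mean approximations
c_k^{\ast} = (y_k \times y_{k-1})^{\alpha}
and geometric differences (details)
d_k^{\ast} = \left(\frac{y_k}{y_{k-1}}\right)^{\alpha}
when using W^×.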
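The additive and multiplicative Haar steps can be placed side by side in a short sketch. The function names are hypothetical, the pairs are taken as disjoint neighbours (y_{k−1}, y_k), and α = 1/2 is chosen here so that the approximations are the arithmetic and geometric means; the text leaves the normalization open.

```python
def additive_haar_step(y, alpha=0.5):
    """c_k = alpha (y_k + y_{k-1}), d_k = alpha (y_k - y_{k-1})."""
    pairs = list(zip(y[0::2], y[1::2]))  # (y_{k-1}, y_k)
    return ([alpha * (b + a) for a, b in pairs],
            [alpha * (b - a) for a, b in pairs])


def multiplicative_haar_step(y, alpha=0.5):
    """c*_k = (y_k y_{k-1})^alpha, d*_k = (y_k / y_{k-1})^alpha; y must be positive."""
    pairs = list(zip(y[0::2], y[1::2]))
    return ([(b * a) ** alpha for a, b in pairs],
            [(b / a) ** alpha for a, b in pairs])


f = [2.0, 8.0, 4.0, 4.0]      # positive regular function
noise = [1.5, 0.5, 2.0, 0.5]  # positive multiplicative noise
y = [a * b for a, b in zip(f, noise)]
print(multiplicative_haar_step(y))
print(multiplicative_haar_step(f), multiplicative_haar_step(noise))
```

The coefficients of the product y are the element-wise products of the coefficients of f and of the noise, which is the factorisation stated above for W^×.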
In its simplest form, the DWT is remarkably easy to compute.
The Haar wavelet in Java:
This figure shows an example of applying the above code to compute the Haar wavelet coefficients on a sound waveform. This example highlights two key properties of the wavelet transform:
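1. Natural signals often have some degree of smoothness, which makes them sparse in the wavelet domain: there are far fewer significant components in the wavelet domain than in the time domain, and most of the significant components lie among the coarser coefficients. Hence, natural signals are compressible in the wavelet domain.
2. The wavelet transform is a multiresolution, bandpass representation of a signal, as can be seen directly from the filter bank definition of the discrete wavelet transform. For a signal of length 2^N, the coefficients in the range [2^(N−j), 2^(N−j+1)] represent a version of the original signal in the pass-band [π/2^j, π/2^(j−1)]. Ranges with larger j are coarser representations of the signal, while ranges with smaller j represent finer details.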