+ All Categories
Home > Documents > Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

Date post: 28-Jan-2017
Category:
Upload: lythien
View: 219 times
Download: 0 times
Share this document with a friend
15
1398 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 53, NO. 4, APRIL 2005 Complete-to-Overcomplete Discrete Wavelet Transforms: Theory and Applications Yiannis Andreopoulos, Student Member, IEEE, Adrian Munteanu, Geert Van der Auwera, Jan P. H. Cornelis, Member, IEEE, and Peter Schelkens, Member, IEEE Abstract—A new transform is proposed that derives the over- complete discrete wavelet transform (ODWT) subbands from the critically sampled DWT subbands (complete representation). This complete-to-overcomplete DWT (CODWT) has certain advantages in comparison to the conventional approach that performs the in- verse DWT to reconstruct the input signal, followed by the à-trous or the lowband shift algorithm. Specifically, the computation of the input signal is not required. As a result, the minimum number of downsampling operations is performed and the use of upsampling is avoided. The proposed CODWT computes the ODWT subbands by using a set of prediction-filter matrices and filtering-and-downsampling operators applied to the DWT. This formulation demonstrates a clear separation between the single-rate and multirate compo- nents of the transform. This can be especially significant when the CODWT is used in resource-constrained environments, such as resolution-scalable image and video codecs. To illustrate the applicability of the proposed transform in these emerging appli- cations, a new scheme for the transform-calculation is proposed, and existing coding techniques that benefit from its usage are surveyed. The analysis of the proposed CODWT in terms of arith- metic complexity and delay reveals significant gains as compared with the conventional approach. Index Terms—Complexity reduction, overcomplete discrete wavelet transforms, scalable image and video coding, shift invari- ance. I. INTRODUCTION R ECENTLY, wavelet-based image and video coding sys- tems that utilize the overcomplete discrete wavelet trans- form (ODWT) have been proposed in the literature [1]–[16]. The good coding performance of these techniques stems from wavelet-domain operations that require shift-invariance, such as in-band motion estimation and compensation [1]–[9], in-band motion compensated temporal filtering [10]–[15], and phase-based prediction of wavelet coefficients [16]. The ODWT is used for this purpose since it is a redundant version of the discrete wavelet transform (DWT) that attains shift-invariance [19], [23]. Manuscript received May 7, 2003; revised February 26, 2004. This work was supported in part by the Federal Office for Scientific, Technical, and Cul- tural Affairs (IAP Phase V—Mobile Multimedia) and by the European Com- munity under the IST Program (Mascot, IST-2000-26 467). Part of the work of P. Schelkens was supported by a post-doctoral fellowship with the Fund for Sci- entific Research—Flanders (FWO), Egmontstraat 5, B-1000 Brussels, Belgium. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Henrique Malvar. The authors are with the Department of Electronics and Information Processing (ETRO), Vrije Universiteit Brussel, B-1050 Brussels, Bel- gium (e-mail: [email protected]; [email protected]; gv- [email protected]; [email protected]; [email protected]). Digital Object Identifier 10.1109/TSP.2005.843707 Given the input signal, the classical construction of the ODWT is trivial by using for example the “à-trous” algorithm [19], [23]. However, in wavelet-based coding systems, the codec always processes the critically sampled (complete) DWT subbands. Hence, a complete-to-overcomplete DWT (CODWT) has to take place: First, the inverse DWT is performed in order to reconstruct the input signal, followed by the ODWT [1], [2]. Furthermore, in many image and video-coding systems, the critically sampled DWT subbands may be processed in a resolution-scalable manner (from coarse to fine resolution) [4]–[16]. In such environments, it is imperative that both ends of the system independently construct the identical ODWT in- formation at each resolution level in order to avoid drift effects [6], [7], [16]. Given the resolution levels that are available at both the encoder and decoder, a separate loop is used at each level for the CODWT in order to construct identical ODWT representations at both sides [5]–[9], [16]. This approach has several disadvantages. A direct method for the derivation of the ODWT subbands from the DWT is not provided. Instead, the ODWT construction is a cascade of the inverse DWT and the “à-trous” calculation. As a result, the recon- struction of the input signal is required. This causes significant calculation overhead and delay since the input signal has the highest sampling rate. A multirate calculation scheme is used in every case with a cascade of upsampling and downsampling op- erations. As a result, even for high-speed high-parallel implementations, the achievable percentage of hard- ware utilization is low since the filtering of every level has to be pipelined with the production of the results of the previous and the next level [24]. In this paper, we present a new theory for the CODWT, for- mulated for any arbitrary level of the transform, in which the ODWT of level is produced directly from the DWT of -levels [4], [17]. The reconstruction of the input signal is not required, and furthermore, no upsampling is performed. Our ini- tial findings reported in [4] and [17], as well as independent work reported recently by Li [18], present similar approaches for performing phase shifting in the wavelet domain. This paper generalizes these ideas to a transform that performs the direct construction of an arbitrary phase in the -level wavelet do- main (Proposition 2). In addition, we propose a direct solution for the (all-phase) CODWT (Proposition 3) and present an ef- ficient calculation scheme for the transform. Apart from the new theoretical aspects arising from the proposed transform, we demonstrate the significant practical benefits of the new ap- 1053-587X/$20.00 © 2005 IEEE
Transcript
Page 1: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

1398 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 53, NO. 4, APRIL 2005

Complete-to-Overcomplete Discrete WaveletTransforms: Theory and Applications

Yiannis Andreopoulos, Student Member, IEEE, Adrian Munteanu, Geert Van der Auwera,Jan P. H. Cornelis, Member, IEEE, and Peter Schelkens, Member, IEEE

Abstract—A new transform is proposed that derives the over-complete discrete wavelet transform (ODWT) subbands from thecritically sampled DWT subbands (complete representation). Thiscomplete-to-overcomplete DWT (CODWT) has certain advantagesin comparison to the conventional approach that performs the in-verse DWT to reconstruct the input signal, followed by the à-trousor the lowband shift algorithm. Specifically, the computation of theinput signal is not required. As a result, the minimum number ofdownsampling operations is performed and the use of upsamplingis avoided.

The proposed CODWT computes the ODWT subbands by usinga set of prediction-filter matrices and filtering-and-downsamplingoperators applied to the DWT. This formulation demonstrates aclear separation between the single-rate and multirate compo-nents of the transform. This can be especially significant whenthe CODWT is used in resource-constrained environments, suchas resolution-scalable image and video codecs. To illustrate theapplicability of the proposed transform in these emerging appli-cations, a new scheme for the transform-calculation is proposed,and existing coding techniques that benefit from its usage aresurveyed. The analysis of the proposed CODWT in terms of arith-metic complexity and delay reveals significant gains as comparedwith the conventional approach.

Index Terms—Complexity reduction, overcomplete discretewavelet transforms, scalable image and video coding, shift invari-ance.

I. INTRODUCTION

RECENTLY, wavelet-based image and video coding sys-tems that utilize the overcomplete discrete wavelet trans-

form (ODWT) have been proposed in the literature [1]–[16].The good coding performance of these techniques stemsfrom wavelet-domain operations that require shift-invariance,such as in-band motion estimation and compensation [1]–[9],in-band motion compensated temporal filtering [10]–[15], andphase-based prediction of wavelet coefficients [16]. The ODWTis used for this purpose since it is a redundant version of thediscrete wavelet transform (DWT) that attains shift-invariance[19], [23].

Manuscript received May 7, 2003; revised February 26, 2004. This workwas supported in part by the Federal Office for Scientific, Technical, and Cul-tural Affairs (IAP Phase V—Mobile Multimedia) and by the European Com-munity under the IST Program (Mascot, IST-2000-26 467). Part of the work ofP. Schelkens was supported by a post-doctoral fellowship with the Fund for Sci-entific Research—Flanders (FWO), Egmontstraat 5, B-1000 Brussels, Belgium.The associate editor coordinating the review of this manuscript and approvingit for publication was Dr. Henrique Malvar.

The authors are with the Department of Electronics and InformationProcessing (ETRO), Vrije Universiteit Brussel, B-1050 Brussels, Bel-gium (e-mail: [email protected]; [email protected]; [email protected]; [email protected]; [email protected]).

Digital Object Identifier 10.1109/TSP.2005.843707

Given the input signal, the classical construction of theODWT is trivial by using for example the “à-trous” algorithm[19], [23]. However, in wavelet-based coding systems, thecodec always processes the critically sampled (complete) DWTsubbands. Hence, a complete-to-overcomplete DWT (CODWT)has to take place: First, the inverse DWT is performed in orderto reconstruct the input signal, followed by the ODWT [1],[2]. Furthermore, in many image and video-coding systems,the critically sampled DWT subbands may be processed in aresolution-scalable manner (from coarse to fine resolution)[4]–[16]. In such environments, it is imperative that both endsof the system independently construct the identical ODWT in-formation at each resolution level in order to avoid drift effects[6], [7], [16]. Given the resolution levels that are available atboth the encoder and decoder, a separate loop is used at eachlevel for the CODWT in order to construct identical ODWTrepresentations at both sides [5]–[9], [16]. This approach hasseveral disadvantages.

• A direct method for the derivation of the ODWTsubbands from the DWT is not provided. Instead, theODWT construction is a cascade of the inverse DWTand the “à-trous” calculation. As a result, the recon-struction of the input signal is required. This causessignificant calculation overhead and delay since theinput signal has the highest sampling rate.

• A multirate calculation scheme is used in every casewith a cascade of upsampling and downsampling op-erations. As a result, even for high-speed high-parallelimplementations, the achievable percentage of hard-ware utilization is low since the filtering of every levelhas to be pipelined with the production of the resultsof the previous and the next level [24].

In this paper, we present a new theory for the CODWT, for-mulated for any arbitrary level of the transform, in whichthe ODWT of level is produced directly from the DWT of

-levels [4], [17]. The reconstruction of the input signal is notrequired, and furthermore, no upsampling is performed. Our ini-tial findings reported in [4] and [17], as well as independentwork reported recently by Li [18], present similar approachesfor performing phase shifting in the wavelet domain. This papergeneralizes these ideas to a transform that performs the directconstruction of an arbitrary phase in the -level wavelet do-main (Proposition 2). In addition, we propose a direct solutionfor the (all-phase) CODWT (Proposition 3) and present an ef-ficient calculation scheme for the transform. Apart from thenew theoretical aspects arising from the proposed transform,we demonstrate the significant practical benefits of the new ap-

1053-587X/$20.00 © 2005 IEEE

Page 2: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

ANDREOPOULOS et al.: COMPLETE-TO-OVERCOMPLETE DISCRETE WAVELET TRANSFORMS: THEORY AND APPLICATIONS 1399

proach under the application scenarios that require resolutionscalability. In such cases, the proposed CODWT is computedusing a single-rate calculation scheme while providing exactlythe same results as the conventional multi-rate approach.

The paper is structured as follows. Section II introduces thebasic notations and definitions and presents the CODWT underdifferent constraints. The CODWT formulation for the generalcase of level is given in Section III. In addition, several proper-ties of the proposed transform are discussed, and a novel schemefor the computation of the CODWT is presented. Section IVpresents a complexity study that identifies the implementationbenefits offered by the proposed transform. In order to demon-strate in more detail the applications that benefit from the pro-posed approach, some indicative results for scalable image andvideo coding based on CODWT are presented in Section V.

II. PROBLEM FORMULATION

A. Notations and Symbolism

In this paper, bold-faced capital and lower letters indicate ma-trices and vectors, respectively, while denotes the identity ma-trix. Calligraphic notation is reserved for operators (e.g., de-notes the polyphase separation). All signals and filters are con-sidered in the -domain as Laurent polynomials, and the letter

is reserved for this purpose. All the used indices are integers.For all matrices, vectors, signals, and filters, the superscripts de-note the decomposition level, except for superscript , whichdenotes transposition. Subscripts are used to enumerate signalsor filters in matrices and vectors. Additionally, they are used toindicate polyphase components; each case is identified from thecontext.

The polyphase separation of a signal or filter is givenby , with the commutative operationgiven by . For theDWT, the Type-I analysis polyphase matrices that produce theeven and odd polyphase components of the transform are de-noted as respectively; their definitions are

where are the lowpass and highpass analysisDWT filters, respectively. The corresponding Type-II synthesispolyphase matrices are , e.g.,

In order to simplify the expressions we always assume (withoutloss of generality) that the filters and are prop-erly shifted so that perfect reconstruction is achieved withzero delay, and [25, Sec. 3]. Using theNoble identity [25], the single-level filtering-and-downsam-pling operator can be written as , with

denoting the retained polyphase component and

, respectively. For decompositionlevels , this operator is

(1)

Definition 1: For a signal and an analysis filter-bank with , we defineas the -phase wavelet subbands of the ODWT of decom-position level [5], [18], [23], given by

,where are the low- and high-frequency subbands,respectively.

This definition coincides with the ODWT subbands calcu-lated by the cycle-spinning algorithm of [26]: The binary rep-resentation of is actually the binary map of the DWT definedin [26]. In total, the -level ODWT of a signal is given by

, for every , where

is a matrix operator, and . Forexample, for , we have . Thefollowing lemma establishes the operator recursively.

Lemma 1: The operator applied on signalsand , with ,

having a sampling rate of , satisfies

(2)

Proof: By expanding the right part of (2) using (1) and thedefinition of the operator, we reach the left part as follows:

(3)

B. Problem Description

In this section, we briefly present two methods for theCODWT under the image and video coding scenarios ofinterest [1]–[16]. The first method is based on the IDWTfollowed by the lowband shift (LBS) method [1], [23]. TheLBS can be seen as a specific implementation technique of the“à-trous” algorithm [19], where the different ODWT subbandsare produced and stored according to the retained polyphasecomponents. The second method represents our proposedapproach.

Fig. 1(a) shows an example of the 1-D ODWT for three de-composition levels starting from an input signal . Thisfigure facilitates the description of the LBS method [1], [2], [23].Initially, the input signal is decomposed in two subbandsets and by retaining separately the

Page 3: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

1400 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 53, NO. 4, APRIL 2005

Fig. 1. (a) Construction of the ODWT of three levels starting from the input signal X . A number of one-level discrete wavelet transforms are performed thatretain the even or odd samples of the nondecimated transform (L with i = 0; 1, respectively). (b) Level-by-level construction of the ODWT (example for level3) using the conventional multirate IDWT and LBS approach that perform a set of inverse and forward transforms. (c) Level-by-level construction of the ODWTfor all three levels using the proposed single-rate approach.

even and odd polyphase components of the nondecimated de-composition, respectively, or, equivalently, by performing twowavelet decompositions: one to the zero-shifted and one to theunit-shifted input signal, respectively [1], [2], [23]. Each of thelow-frequency subbands and is further analyzedin the same manner, while the high-frequency subbandsand are the outputs of the first decomposition level. Thisprocess is repeated successively, yielding the ODWT represen-tation from the input signal [see Fig. 1(a)] [1], [23]. Thesubbands and represent the criticallysampled (complete) DWT of three decomposition levels, whilethe subbands representthe calculated ODWT. Hence, for the CODWT based on thismethod, the signal has to be reconstructed by performingthe IDWT to the subbands , followed by the LBS.

Notice that the subbands shown in Fig. 1(a)stem from the classical ODWT decomposition scheme of [23],which is equivalent to the “à-trous” algorithm [19]. The differ-ence is that, at every level, the subbands of Fig. 1(a) must be in-

terleaved in order to produce the nondecimated ODWT obtainedwith the algorithm of [23]. As a result, any subband in theODWT of Fig. 1(a) is the th polyphase component of the non-decimated ODWT of level [23], [19].

In the two-dimensional (2-D) case, the 2-D ODWT can beconstructed in the same manner as in Fig. 1(a), by applying theLBS method on the input-subband rows and on the columns ofthe results [1], [23]. Hence, to facilitate the description, we focusin the following analysis on the 1-D case, with the extension in2-D following the row-column approach.

In a progressive resolution-refinement (resolution scal-able) framework, the additional constraint that the subbandcoding and decoding occurs in a bottom-up manner is imposed[4]–[16]: The coarsest-resolution subbands of the DWT areprocessed independently (subbands ) and forevery higher resolution level with , the subbandis additionally processed. In the case of hybrid (closed-loop)systems, these subbands are decoded at predefined base-qualitylevels. In this way, for each target resolution-level ,

Page 4: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

ANDREOPOULOS et al.: COMPLETE-TO-OVERCOMPLETE DISCRETE WAVELET TRANSFORMS: THEORY AND APPLICATIONS 1401

the encoder uses the same reference frames as the decoderwill be able to create at the client side, and no drift occursacross resolutions [5]–[9]. This can be seen as an extension ofthe base-layer concept used in quality-progressive closed-loopvideo coding [27]; although the creation of the referenceframes at the base-layer potentially lowers the performance ofthe subset of decoders that progressively process additionalquality layers (or resolution levels in our case), this guaranteesdrift-free operation [5], [7], [9], [27].

Under such a resolution-scalable coding framework, theCODWT based on the LBS method is readily adaptable to per-form a level-by-level construction of the ODWT representation(denoted by LL-LBS), starting from the subbands of the crit-ically sampled transform of each decoded level. This processis illustrated in Fig. 1(b). Starting from the DWT subbands

(coarsest resolution level), three single-level inversewavelet transforms are performed. Subsequently, from the re-constructed signal , all the subbandsare constructed by performing the single-level forward trans-forms shown in Fig. 1(b). It is important to notice that, sincein this case the subbands are not available,the reconstructed signal and the subbands areonly an approximation of and of the original ODWTof level 3, respectively, shown in Fig. 1(a). However, in thisresolution-scalable scenario, given the common informationavailable at both the encoder and decoder sides, this ODWTrepresentation is the best possible approximation for the currentresolution level [4]–[16]. Finally, if higher resolution versionsof the input data are required, the ODWT construction by theLL-LBS is repeated for the finer resolution levels ( or

, depending on the target resolution) [5]–[16].Notice that in coding applications, for every decomposition

level , the low-frequency subbands are partof the calculated ODWT only if level is the coarsest resolutionof the decomposition, i.e., if in the example of Fig. 1(c)[5], [7], [11], [17]. We define this construction that generates thesignals

...

as the full-overcomplete transform-production mode (FO-mode). In all the other cases [levels of Fig. 1(c)], thecritically sampled DWT consists of the subband , andhence, in coding applications, only the high-frequency ODWTsubbands need to be calculated [5]–[16];this case is defined as the high-frequency overcomplete trans-form-production mode (HFO-mode). The difference betweenthe FO and HFO modes is illustrated in Fig. 1(c).

Additionally, Fig. 1(c) presents the proposed alternative ap-proach to the LL-LBS method for the level-by-level CODWT

of levels 3, 2, and 1. In this approach, the CODWT uses a setof prediction filters [4], [17], which are denoted as

and are convolved with the subbandsto calculate the ODWT representation of each

level. Notice that for levels 2 and 1 of Fig. 1(c), the ODWTconstruction occurs in the HFO mode, i.e., only the high-fre-quency subbands of the ODWT are calculated. Additionally, thefigure illustrates that by using the prediction-filters, the over-complete representation of each level is “predicted,” as shownwith the dotted lines. As a result, no upsampling is performed,and no reconstruction of the spatial-domain signal is re-quired. The mathematical derivation of the prediction filters andthe proposed CODWT are presented next.

III. COMPLETE TO OVERCOMPLETE REPRESENTATIONS

In this section, we present a generic framework for the cal-culation of the ODWT subbands of decomposition level as afunction of the critically sampled wavelet decomposition (i.e.,the subbands , with ). From this for-malism, the level-by-level CODWT of level , which is espe-cially interesting for applications requiring progressive-refine-ment in resolution (e.g., [5]–[16]), can be found as a specialcase. A symmetry property for the prediction filters of everylevel is also proven. Based on this property, an efficient schemefor the CODWT calculation is presented.

A. Derivation of the ODWT Subbands of Decomposition LevelFrom the -Level DWT—The Prediction Filters

We start with by exemplifying the proposed CODWT forlevels (Section III-A1). Then, Section III-A2 formu-lates the CODWT for any arbitrary level .

1) Calculation of the ODWT Subbands of Levels and:

Definition 2: We define and as theprediction-filter matrices of levels and , respec-tively. They are given by (4) and (5), shown at the bottom of thepage, where and are the and polyphasecomponents of filter , respectively.

Proposition 1: The subbands of the ODWT of levelare given by

(6)

while subbands of the ODWT of levelare

(7)

(4)

(5)

Page 5: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

1402 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 53, NO. 4, APRIL 2005

Proof: The proof of (6) can be derived by performing aninverse transform to subbands followed by a forwardwavelet transform that retains the odd polyphase componentsof the nondecimated decomposition, as shown in Fig. 1(a). Intotal

where is the prediction-filter matrix given in (4).Applying an inverse wavelet transform to subbands

yields . Then, as shown inFig. 1(a), by performing a forward transform that retains theodd polyphase components of the nondecimated transform,we reach the subbands given in (7). Additionally,by using a part of (6), we derive the subband as

. The final result is reached byperforming a forward wavelet transform retaining the even orodd polyphase components for , respectively, and usingthe Noble identity that exchanges the filtering and downsam-pling order [25]. Hence

(8)

where is the prediction-filter matrices given in (5).Proposition 1 hints that in the general case of an arbitrary

level , the calculation of the ODWT subbands from the-level DWT involves a) the single-rate filtering of the DWT

subbands of level and b) the cascade applicationof filtering-and-downsampling operations to the high-fre-quency subbands of higher-resolution levels of the DWT

. This intuitive link is mathematicallyformulated next.

2) Calculation of the ODWT Subbands of Level : Thegeneralization of Proposition 1 and the corresponding defini-tions are given below for an arbitrary level .

Definition 3: We define as the prediction- filter ma-trices of level with , given by

(9)

where are prediction filters of level , the filtersubscripts are defined as, and given by (1).

Definition 4: We define as the diagonal matrix of pre-diction filters given by

Definition 5: We define withas the vector of high-frequency subbands of levels

of the DWT. It is given by

Proposition 2: The subbands of the ODWT of level, with , are given by

(10)

where denotes the ODWT-subband index at level (phase )and is written as , while are given as in Defini-tion 3, and . In the particular case of corre-sponding to and , we set and(zero operator) as well as for any indices or

. For this case, (10) becomes Proposition 1.Proof: The proof is inductive. Initially, it is noted that (10)

holds for levels and since it corresponds to Propo-sition 1. Subsequently, assume that (10) holds for a particular

. It is now proven that, given the subbands(DWT subbands of level ), if (10) holds for , itholds also for .

Let us start by performing an inverse DWT in order to calcu-late the subband in function of subbands

(11)

The derivation of subbands follows immediately byperforming a forward transform with . Hence, we derive(10) for the special case of with replaced by .For the remaining subbands of level , we can apply (10)(involving the filters of the matrix) since it is true forlevel . We can calculate any subband as

(12)

As shown in Fig. 1(b) for the example of , in order tocalculate the subbands (even-numbered subbands oflevel ), we need to perform a single-level forward transformin (12), retaining the even samples (“classical” DWT):

(13)

Based on Lemma 1, (11), and the Noble identity, the last equa-tion can be written as

(14)

After simplifications, (14) becomes equivalent to (10) with re-placed by and, hence, , while the ODWTsubband index of (14) is bounded by . In ad-dition, the resulting prediction- filter matrix of (14) is expressedas in (9) with and . Hence,

Page 6: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

ANDREOPOULOS et al.: COMPLETE-TO-OVERCOMPLETE DISCRETE WAVELET TRANSFORMS: THEORY AND APPLICATIONS 1403

Fig. 2. (a) Straightforward calculation of the CODWT based on Proposition 2. (b) Initialization of the calculation of A ;D .

(14) corresponds to (10) for the even-numbered subbands, withreplaced by .In order to calculate the subbands (odd-numbered

subbands of level ), we perform a forward DWT retainingthe odd samples, reaching, similarly as before, the following:

(15)

After simplifications, (15) becomes (10) with replaced byand, hence, , while the ODWT sub-

band index of (15) is bounded by .In addition, the resulting prediction-filter matrix of (15) is ex-pressed as in (9) with and .As a result, (15) corresponds to (10) for the odd-numbered sub-bands, with replaced by . One concludes that Proposition2 is true for the case of . This means, by induction,that it is true for every .

For each decomposition level of the DWT, Proposition 2given in (10) consists of a single-rate and a multirate calculationpart. The first consists of the convolution of the critically sam-pled subbands of level with the prediction filters of the matrix

, while the second consists of the convolu-tions of the vector of high-frequency DWT subbands of levels

with the diagonal matrix of predictionfilters, followed by the successive filtering-and-downsamplingwith the analysis filters. This result is summarized below.

Proposition 3: The complete-to-overcomplete discretewavelet transform of any level is

...

...(16)

with

... ...

......

. . ....

array of operators for every , and .

B. Properties of the Derived Formulation

A straightforward calculation of any given subband pairof Proposition 2 is given schematically in Fig. 2, which

also gives a pseudo-program describing the initialization pro-cedure used to select the appropriate filters involved in thecalculation of an arbitrary set of ODWT subbands of(10). The calculation of the multirate part of (10) shown inFig. 2 is based on the following derivation:

(17)

which comes from the property that the downsampling and sum-mation operations can be interchanged. Fig. 2 shows that theproposed approach involves the minimum number of downsam-pling operations and no upsampling is used.

In the following parts of this section, we present a symmetryproperty for the prediction filters of each decomposition levelas well as an efficient calculation scheme of the single-rate andmultirate parts of Proposition 3.

Page 7: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

1404 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 53, NO. 4, APRIL 2005

Fig. 3. Lattice structure that implements the two convolutions V (z) = F (z)I(z) and W (z) = z F (z�1)I(z) for a filter F with L taps using L multipliers.

1) Symmetry Properties of the Filters: Equation (9)provides some insight for the reduction of the necessary oper-ations for the production of all the different subbands .Below, we demonstrate a useful symmetry property.

Proposition 4: For the prediction-filter matrices ofany level , the following property holds:

(18)

with and defined as for (9).Proof: The proof is inductive. Hence, for

(19)

In Definition 1, we have assumed zero-delay perfect recon-struction with , which can be satisfied with

for biorthogonalpoint-symmetric filter-pairs,

for biorthogonal half-point symmetric filter pairs,and for orthogonal filter pairs. Basedon these equations, the proof of (19) is concluded for eachcase by deriving the symmetry relations between the Type-Ipolyphase components of the analysis filters and using them incombination with the definition of of (4).

Assuming that the proposition of (18) is true for the case oflevel , in the Type-I polyphase components of ,we can replace the terms by (18),yielding

(20)

where . If we further impose that is even, then wehave according to the definition of in (1). Combiningthe general definition of the prediction filters of (9) (which istrue for level ) and (19) and (20), we reach, after severalmanipulations, (18), where is replaced by . If we imposethat is odd, the equivalent result is reached in the same manner.We conclude that the proposition of (18) is true for every level, where .

A corollary of Proposition 4 is that (20) holds as well.2) Calculation of the Single-Rate Part of the Proposed

CODWT: Based on the derived symmetry property of (20),having the same input signal, the convolutions with filters

can be reused to produce the convolu-tions with the filters ,respectively. This is achieved by using a lattice structure (LS)such as the one depicted in Fig. 3. This is of great practical im-portance since the utilization of this symmetry in the prediction

filters of (9) leads to the reduction of the necessary arithmeticoperations for the proposed CODWT. As an example, basedon (20), if we expand the form of the prediction filters of (9)in the general expression of (16) for , with the setting

(single-rate calculation), we reach the followingresult:

(21)

The set of (21) shows that the single-rate calculation of theproposed CODWT is simplified to the convolutions with filters

and filters with the use of a setof lattice structures of Fig. 3. To demonstrate this in practice,based on (21), Fig. 4(a) shows the calculation of the ODWT sub-bands of level 3 under the single-rate construction for resolutionscalability. In the figure, a set of three processor elements (PEs)is used for the production of subbands . Thedefinition of a PE is given in Fig. 4(b); it contains four latticestructures such as the one shown in Fig. 3. In the general caseof decomposition level , the calculation structure of Fig. 4(a)uses PEs for the production of the subbands.

3) Calculation of the Multirate Part of the ProposedCODWT: Concerning the multirate part of the givenin (16), for each level , the calculation of the single-rate partis augmented by each of the terms

. The explicit form of these terms was given in (17); itconsists of the convolutions ,i.e., the elements of and followed by fil-tering-and-downsampling with the analysis filterbank. Fig. 5presents an efficient calculation structure for the multirate partof (16) for . In the figure, the recursive definition offilters given by (9) and the symmetry property of(20) are used for the reduction of the necessary multiplica-tions. Specifically, the lattice structure of Fig. 3 is used forthe production of the results of the convolutions with filters

. In general, the figure demonstrates that a recur-sive calculation is used: For each subband ,a new stage is inserted with filter and latticestructures (see the left side of the figure); also, lowpass

Page 8: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

ANDREOPOULOS et al.: COMPLETE-TO-OVERCOMPLETE DISCRETE WAVELET TRANSFORMS: THEORY AND APPLICATIONS 1405

Fig. 4. (a) Calculation of the single-rate part of (16). An example with k = 3 is given, where the ODWT is computed using subbandsA ;D , and three processingelements (PEs). (b) Design of a PE.

Fig. 5. Calculation of the multirate part of (16). An example with k = 3 is given.

filtering-and-downsampling operations are applied to subband. In addition, analysis filterbanks

are used for .

IV. COMPUTATIONAL BENEFITS OF THE PROPOSED CODWTIN APPLICATIONS

We evaluate the computational requirements of the proposedCODWT (Proposition 3) versus the conventional approachthat uses the IDWT and LBS algorithm for the production ofthe subbands. In addition to that, by focusing onpractical applications [4]–[17], we examine the computationalcomplexity and delay for the transform part of a system thatencodes/decodes using the CODWT in a resolution-scalableframework. In this paper, the computational complexity isformulated based on the number of required multiplications

and additions. Furthermore, for a coding system that utilizes theCODWT of levels, we define the delay for the transform partthat produces the ODWT of level as the total time(in processing cycles) that is required to produce the ODWT ofthat resolution level.

A. Computational Complexity for the Calculation of theSubbands

For a one-level critically sampled wavelet decomposition orreconstruction of an -sample signal, the computational com-plexity can be expressed as

DWT Cost Cost

(22)

Page 9: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

1406 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 53, NO. 4, APRIL 2005

where Cost and Cost express the implementationcomplexity of one multiplication and addition, respectively,and and are factors that denote the number of mul-tiplications and additions for each application of the filterbankto the input. For a decomposition where only the low- or thehigh-frequency subband is produced, these factors are replacedin (22) by and , which denote the computations(multiplications and additions) performed by one applica-tion of each filter . All these factors depend onthe implementation (e.g., convolution-based or lifting-based[25]). For example, by using the classical lifting factorizationproposed in [20] and [25] for the 9/7 filter-pair, we have

and , instead of andachieved with convolution.1

As explained in Section II-B, the conventional multirate ap-proach for the production of subbands consists ofthe inverse DWT followed by the LBS algorithm. For a -levelDWT decomposition, after a few straightforward summationsof terms DWT , the complexity of the con-struction in the FO mode (production of ) is

LBS Cost

Cost (23)

where LBS denotes the computational complexity of theconventional approach.

Concerning the proposed approach, as explained in Sec-tion III-B2, Fig. 4(a) demonstrates the calculation of thesingle-rate part of (16). For each processor element (PE) ofFig. 4(a), if the applied filters have nonzerocoefficients, respectively, multiplications and

additions are performed within the PE foreach set of four inputs; notice that this holds for , and asfor , there are no PEs. Using Fig. 4, the computationalcomplexity of the single-rate part of the proposed CODWT inFO construction mode can be deduced:

PF

Cost

Cost (24)

1Note that we will estimate the complexity of lifting factorizations withoutscaling factors in order to minimize the arithmetic complexity. In a codingsystem, the multiplications with these factors can be implicitly performed byembedding the scaling factors in the quantization formulas applied to eachsubband at each resolution level.

where PF is the computational complexity2 of the single-rate part of (16) for the production of , and is thenumber of nonzero coefficients of filter .

From the description of Section III-B3 and Fig. 5, the compu-tational complexity of the multirate part of (16) for the generalcase of decomposition levels can be deduced. Specifically, foreach subband

PF

Cost

Cost

(25)

where, for each subband , the multiplication and addition-related factors for the used lattice structures are

for , where . In addi-tion, is the number of the used lowpass filters.

Note that as shown in (16), the multirate calculation occursfor . For this case, the total complexity of the final resultof (16) is given by

PF PF PF Cost (26)

The last term of (26) corresponds to the complexity of the addi-tions between the single- and multirate part of the transform inorder to create the final result.

It is important to notice that the results of this section are ap-plicable also for the 2-D case of a input image sinceboth techniques can be applied in a separable manner along therows and columns of the input. Specifically, the 2-D implemen-tation of the proposed CODWT for any level consists of 1)applying the prediction-filters row-wise to the 2-D DWT sub-bands using the calculation scheme of Figs. 4(a) and 5, therebyproducing the ODWT subbands with phase(2-D phase [5], [9]); and 2) the column-wise filtering of the re-sulting subbands with the same 1-D scheme. Hence, to producethe ODWT subbands of level , the complexity associ-ated with the row-wise filtering is PF . In addition,the complexity of the column-wise filtering is PF . Inthe same manner, the separable application of the 1-D LBS ap-proach indicates that the complexity is LBS .As a result, the complexity reduction offered by the proposedapproach is PF LBS in both the 1-D and 2-Dcases.

2In (24), the number of multiplications for the prediction filters of level 1is reduced by taking advantage of the symmetry property of (19), which alsoholds for filter F (z) since F (z) = �F (z) from the definitions of (4).Furthermore, if one restricts the complexity analysis to biorthogonal filter pairs,it is straightforward to prove that F (z ) = F (z); F (z ) = z F (z)and use these symmetries to further reduce the multiplications reported in (24).

Page 10: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

ANDREOPOULOS et al.: COMPLETE-TO-OVERCOMPLETE DISCRETE WAVELET TRANSFORMS: THEORY AND APPLICATIONS 1407

TABLE IMULTIPLICATIONS AND ADDITIONS FOR THE CODWT OF VARIOUS LEVELS, NORMALIZED PER INPUT SAMPLE

For a numerical comparison between the two approaches, byusing (26) for the proposed CODWT and (23) for the conven-tional IDWT LBS approach, we identified that both methodshave comparable arithmetic complexity under a convolution-based implementation. This is shown in Table I, where numer-ical examples of the multiplication and addition requirementsare presented for the levels using the proposed andthe conventional approaches. Both cases have been normalizedby the number of input samples (pixels) . The use of liftingreduces the computational requirements for both approaches;however, we generally found that lifting factorizations tend tobenefit less the proposed approach since they are only used inthe analysis filters of the multirate part of the transform.

On the other hand, as shown in Table II, the proposedapproach exhibits a practical reduction in computation timesin comparison to the conventional method for “ANSI-C”implementations running on a personal computer with anIntel Pentium IV processor. Both convolution-based andlifting-based implementations have been tested. The experi-ments were carried out for an SD-resolution video sequence(720 480 pixels, three color channels, 100 frames) withthe 9/7 filter pair. The row-column implementation was usedfor both methods. Table II demonstrates that the proposedapproach runs, on average, 35.83% faster in comparison to theconventional approach when both employ a lifting-based imple-mentation. Profiling of the execution revealed that, although theproposed method performs (on average) more instruction-re-lated operations in the case of lifting-based implementations, asignificantly better utilization of the cache memory is achievedin comparison to the IDWT LBS approach. This is attributedto the fact that by using the proposed calculation scheme, theproposed CODWT utilizes the same input to produce a numberof intermediate results; for example, Fig. 4(a) demonstratesthat the single-rate part of all the subbands of level 3 is pro-duced by using . As a result, theproposed CODWT achieves a significantly higher locality inthe processing. This leads to the speedup in computation timesreported in Table II.

In resolution-scalable coding, the comparison between thelifting-based implementations of the proposed CODWT and

TABLE IIPERCENTAGE OF SPEEDUP IN EXECUTION TIME OF THE PROPOSED METHOD

VERSUS THE CONVENTIONAL APPROACH (9/7 FILTER-BANK) IN A

PENTIUM IV PROCESSOR

LL-LBS approaches is given in Table III. The case of a codingsystem that supports a maximum of decomposition(resolution) levels and decodes to any output resolution-level

is assumed. In this scenario, as explained in Sec-tion II-B, decoding a resolution level requires that the CODWTis performed for resolution levels . These resultsshow that in resolution-scalable coding, the proposed CODWTachieves notable complexity reductions in comparison toLL-LBS. Furthermore, we find that significant reductions in thecomputation time are experimentally observed. This is shownin Table IV, where we assume a resolution-scalable scenariowith a HDTV video sequence processed in four (dyadicallyreduced) resolutions, ranging from 1920 1088 pixelsto 240 136 pixels . The proposed CODWT exhibitsan average reduction of 78.45% in computation time for liftingimplementations.

An interesting observation stemming from Table III is thatthe computational complexity of both techniques scales up withthe number of decoded resolution levels: a result that is highlyuseful for complexity-scalable image and video coding.

The significant complexity reductions of Tables III and IVthat are achieved with the proposed approach in resolution scal-able scenarios can be attributed to the following two aspects.

• A single-rate calculation is used. Hence, the series ofinverse and forward wavelet transforms required by theconventional multi-rate approach is avoided.

• The scheme of Section III-B2 reduces the complexitybased on the filter symmetries of (20).

Page 11: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

1408 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 53, NO. 4, APRIL 2005

TABLE IIIMULTIPLICATION AND ADDITION BUDGET FOR THE CODWT IN A RESOLUTION-SCALABLE CODEC WITH FOUR DECOMPOSITION LEVELS. THREE REPRESENTATIVE

FILTERBANKS (OF VARYING COMPLEXITY) USED IN LOSSY IMAGE AND VIDEO CODING ARE PRESENTED

TABLE IVEXAMPLE OF SPEEDUP IN COMPUTATION TIMES FOR THE CODWT UNDER RESOLUTION-SCALABLE CONSTRUCTION; ONE HUNDRED FRAMES OF A HDTV COLOR

VIDEO SEQUENCE (1920� 1088 PIXELS) WERE USED WITH THE 9/7 FILTERBANK

As a result, the proposed approach becomes increasinglymore efficient in comparison to the conventional approach asthe resolution-level increases. Interestingly, this tendency isreversed at the coarsest resolution level where the computa-tional-reduction percentage offered by the proposed approachis usually decreased (see Table III and experimental resultsof Table IV). This phenomenon occurs since the ODWT ofthe coarsest resolution is constructed in FO mode (i.e., boththe low- and high-frequency ODWT subbands are created),whereas the ODWT of the lower resolution levels are con-structed in HFO-mode (i.e., only the high-frequency ODWTsubbands are created).

B. Delay for the Level-by-Level Calculation of theSubbands

Assuming the classical point-symmetric or periodic exten-sion for the signal edges, consider that the two methods areimplemented in a system where one application of a filter (orfilter-bank) on a set of input samples requires processingcycles for the LL-LBS method and processing cycles for the

prediction-filters method. To diminish the side effects of sched-uling algorithms for the multiple filtering operations, we assumethe case of high parallelism, where one filter per required con-volution is present (similar to the system of [24]). In this way,every filtering application initiates as soon as sufficient input ispresent [24]. Furthermore, to facilitate the description, the delayresulting from the storage or retrieval of intermediate results isnot taken into account.

Starting with the LL-LBS method, single-level inversetransforms and single-level forward transformsare performed for the production of the subbands of the ODWTof level ; see the example in Fig. 1(b) for . After aninitiation latency (which is denoted ), the firstcoefficients of the reconstructed input signal are produced.Then, all filter kernels of every level work in parallel to performthe inverse and forward transforms. This is equivalent to a cas-cade connection of the inverse and forward recursive pyramidalgorithm of Vishwanath [24].

The total time required for the completion of the calcula-tion of all subbands is determined by the delay for the produc-tion of the signal with the maximum length, since during this

Page 12: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

ANDREOPOULOS et al.: COMPLETE-TO-OVERCOMPLETE DISCRETE WAVELET TRANSFORMS: THEORY AND APPLICATIONS 1409

TABLE VEXAMPLES OF THE ODWT DELAY-REDUCTION GAINS OFFERED BY THE PROPOSED APPROACH (FO-MODE, 9/7 FILTER-PAIR) FOR VARIOUS DECOMPOSITION

LEVELS OF A 512-SAMPLE SIGNAL. THREE INDICATIVE CASES FOR THE RATIO u = a =a ARE USED

Fig. 6. Architecture of a predictive coder with wavelet-domain (in-band) ME/MC that uses the resolution-scalable CODWT.

process the highest number of consecutive filter applications oc-curs [24]. The signal with the maximum length produced by theLL-LBS is the reconstructed input signal . The production of

requires applications of filter (synthesis lowpassfilter). Additionally, after the production of all samples of ,the completion of the forward transforms at each level requiresan additional number of filtering operations that will cause anadditional number of processing cycles, which are denoted as

. Hence, the total delay of the LL-LBS systemfor the production of all the subbands of decomposition levelfor an -point input signal is

(27)

For the prediction-filters method, the application of filtersto can be initiated in parallel for the calcula-

tion of subbands . After this initiation, which in this caserequires processing cycles, the PEs that produce therest of the subbands of level (if ) can also be applied inparallel by reusing the coefficients of subbands , as seenin Fig. 4(a). As a result, the required delay for the completionof the process is

(28)

Equations (27) and (28) show that, in systems that can achievea high-degree of hardware parallelism, the delay of the proposedCODWT for resolution level is expected to be proportionalto , while the LL-LBS approach achieves a delayproportional to . Examples of the delay ratios be-tween the two approaches under this high-parallelism scenarioare given in Table V. Since it is difficult to quantify the actual

ratio between the factors and without measurementsfrom a realization of the two approaches in a custom hardwaredesign, we resort to reporting the delay-reduction gains offeredby the proposed approach under three assumptions:

, with , corresponding to a “pessimistic,”“average,” and “optimistic” case for the relative efficiency inhardware implementation of the proposed method versus theconventional approach.

For the 2-D processing of a image, as explained before,both methods can be implemented via two identical one-dimen-sional (1-D) systems used row-wise and column-wise. How-ever, instead of processing all the rows and consequently pro-cessing the columns, the filtering in each direction can be inter-leaved, and the column processing begins after an initiation la-tency so that enough coefficients exist column-wise for the mir-roring and for the initiation of the filter-applications required forevery method. Hence, the comparison of both methods in termsof delay for the 2-D CODWT follows the result of the 1-D case.

V. APPLICATIONS

A number of coding applications that require wavelet-domainoperations in the ODWT [1]–[16] can benefit from the pro-posed transform formulation. In Section IV, we focused on thelevel-by-level CODWT, which is applied in coding schemes thattarget progressive-refinement in resolution [4]–[16]. Below, webriefly discuss these scalable coding applications.

Motion-compensated wavelet video coders with in-band pre-diction have been proposed in [1]–[3] and [5]–[9]. There, theconventional closed-loop coding schemes are modified in orderto provide resolution scalability, as seen in the architecture ofFig. 6. The encoder performs motion estimation and compen-sation (ME/MC) in the wavelet domain (in-band) following

Page 13: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

1410 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 53, NO. 4, APRIL 2005

TABLE VIAVERAGE PSNR (IN DECIBELS, LUMINANCE CHANNEL) FOR TWO TYPICAL SEQUENCES IN CIF RESOLUTION (PROGRESSIVELY SCANNED, 352� 288 FRAMES/S).

FOR THE COMPARISON AT HALF RESOLUTION, THE NONSCALABLE MPEG-4 AND THE CONVENTIONAL SPATIAL-DOMAIN ME/MC CODER SEPARATELY

ENCODED THE LOW-FREQUENCY SUBBAND OF THE ONE-LEVEL DWT OF EACH FRAME

a level-by-level refinement of the compressed information.However, since the critically sampled wavelet decompositionis only periodically shift-invariant, the ME/MC procedures areperformed in the ODWT domain [1]–[3], which is constructedper resolution level by the CODWT module. In this way, theproduced bit-stream can be decoded at a variety of resolutionswithout drifting problems [5]–[8], [9], [13].

In Table VI, we present typical coding results obtained withthe resolution-scalable video coder of [9] equipped with half-pixel accurate in-band ME and employing a two-level decom-position with the 9/7 filter pair. The quantization andentropy coding is performed with the QuadTree-Limited (QT-L)coder of [28], which is an intraband embedded wavelet codingalgorithm combining quadtree coding and block-based codingof the significance maps [28]. Decoding to level (full-res-olution) and (half-resolution) is performed. For compar-ison purposes, Table VI presents the results obtained with twononscalable solutions: the equivalent coder that performs con-ventional spatial-domain ME/MC [1], [9] followed by DWT anderror-frame coding, the nonscalable DCT-based MPEG-4 coderusing half-pixel ME/MC, and the same block-size, search range,and prediction structure (I and P frames). It is important to no-tice that the proposed scalable system produces the two resolu-tion levels from the same compressed bit-stream. In contrast, thetwo nonscalable systems generate one bit-stream per resolutionand target bit-rate, with the low-resolution video produced usingthe uncoded low-frequency subbands of the one-level DWT asinput.

The results of Table VI demonstrate that the proposedsystem outperforms the equivalent system using spatial-domainME/MC and the nonscalable MPEG-4 coder, both producingeither of the two resolution levels. Although we present PSNRfor the luminance channel only, similar gains occur for thechrominance channels as well. In a video broadcasting sce-nario, the nonscalable codecs require, for example, 2.9 Mb/sto provide two resolutions of the “Stefan” sequence (seeTable VI), while the proposed system provides both resolutionswith superior PSNR quality at only 2 Mb/s.

Alternative systems for scalable video coding are theopen-loop video coding algorithms proposed in [10]–[15],employing wavelet-domain motion estimation and mo-tion-compensated temporal filtering (MCTF) [29], [30].

The conventional approaches for open-loop video coding applyMCTF on the input frames and perform DWT on the resultingmotion-compensated residual frames [29]. In contrast to thesetechniques, the in-band MCTF systems of [10]–[15] first de-compose spatially the video frames into multiple subbandsusing the DWT and then remove the temporal correlations foreach subband using MCTF in the wavelet domain. The residualsignal after the MCTF can be coded using any desired tex-ture-coding technique (DCT-based, wavelet-based, matchingpursuit, etc.). In this way, decoding to arbitrary resolution,quality, or frame rate is achieved from a single compressedbit stream. In these systems, for efficient wavelet-domain ME,the ODWT can be constructed from the critically sampledDWT of the reference frames, assuming resolution scalability.In [11]–[13], a comparison of the coding efficiency of spa-tial-domain and in-band MCTF employing the level-by-levelCODWT is performed under many different scenarios. Theresults demonstrated that the in-band MCTF system achievescomparable or superior coding efficiency in comparison tothe spatial-domain alternative for the vast majority of the tests[11]–[15].

Another application that uses the CODWT is the still-imagecoding system proposed in [16]. In that system, the level-by-level CODWT is used for the realization of a scheme that per-forms phase-based prediction of the subbands of each resolu-tion level. Hence, instead of simply coding, with an intra-bandcoder, the original subband content, a prediction of the subbandcoefficients of each resolution level is performed using the inter-polated ODWT of the previous-level subbands, which, in a res-olution-scalable framework, can be constructed independentlyat both the encoder and decoder sides using the CODWT. Sig-nificant coding gains are reported in comparison to the codecusing the same subband and entropy-coding techniques and noODWT-based prediction [16].

VI. CONCLUSION

A new framework is presented for the construction of theovercomplete DWT starting from the subbands of the criticallysampled decomposition. The proposed framework has inherentadvantages, in comparison to the conventional approach, since it

Page 14: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

ANDREOPOULOS et al.: COMPLETE-TO-OVERCOMPLETE DISCRETE WAVELET TRANSFORMS: THEORY AND APPLICATIONS 1411

consists of a direct transform from the complete to the overcom-plete DWT, using the minimum number of downsampling oper-ations and no upsampling. For image and video coding applica-tions that utilize a level-by-level construction of the ODWT, it isdemonstrated that the proposed CODWT has significant imple-mentation advantages over the conventional approach becauseit offers a) significant computation savings and b) a single-ratecalculation that can provide a scalable reduction in the trans-form-production delay. These features lead to inherent compu-tational scalability in comparison to the conventional approach.Finally, concrete application examples that use the proposedtransform have been presented.

REFERENCES

[1] H. W. Park and H. S. Kim, “Motion estimation using low-band-shiftmethod for wavelet-based moving-picture coding,” IEEE Trans. ImageProcess., vol. 9, no. 4, pp. 577–587, Apr. 2000.

[2] J. Skowronski, “Pel recursive motion estimation and compensationin subbands,” Signal Process.: Image Commun., vol. 14, no. 5, pp.389–396, Apr. 1999.

[3] R. Zaciu, C. Lamba, C. Burlacu, and G. Nicula, “Image compres-sion using an overcomplete discrete wavelet transform,” IEEE Trans.Consum. Electron., vol. 42, no. 3, pp. 800–807, Aug. 1996.

[4] G. Van der Auwera, A. Munteanu, P. Schelkens, and J. Cornelis,“Bottom-up motion compensated prediction in the wavelet domain forspatially scalable video,” Electron. Lett., vol. 38, no. 21, pp. 1251–1253,Oct. 2002.

[5] X. Li, L. Kerofski, and S. Lei, “All-phase motion compensated predic-tion in the wavelet domain for high performance video coding,” in Proc.IEEE Int. Conf. Image Process., vol. 3, Thessaloniki, Greece, Oct. 2001,pp. 538–541.

[6] X. Li and L. Kerofsky, “High performance resolution scalable videocoding via all-phase motion-compensated prediction of wavelet coef-ficients,” in Proc. SPIE Visual Commun. Image Process., San Jose, CA,Jan. 2002, pp. 1080–1090.

[7] C. Mayer, “Motion compensated in-band prediction for wavelet-basedspatially scalable video coding,” in Proc. IEEE Int. Conf. Acoust.,Speech, Signal Process., vol. 3, Hong Kong, Apr. 2003, pp. 73–76.

[8] C. Mayer and S. Albert, “Scalable video coding with in-band predic-tion,” in Proc. SPIE Elec. Imaging—Image Video Commun. Process.,vol. 5022, Santa Clara, CA, May 2003, pp. 973–983.

[9] Y. Andreopoulos, A. Munteanu, G. Van der Auwera, P. Schelkens, and J.Cornelis, “Scalable wavelet video-coding with in-band prediction—Im-plementation and experimental results,” in Proc. IEEE Int. Conf. ImageProcess., vol. 3, Rochester, NY, Sep. 2002, pp. 729–732.

[10] Y. Wang, S. Cui, and E. Fowler, “Fully scalable video coding usingredundant-wavelet multihypothesis and motion-compensated temporalfiltering,” in Proc. IEEE Int. Conf. Image Process., vol. 2, Barcelona,Spain, Sep. 2003, pp. 53–56.

[11] Y. Andreopoulos, M. van der Schaar, A. Munteanu, J. Barbarien, P.Schelkens, and J. Cornelis, “Fully scalable wavelet video coding usingin-band motion compensated temporal filtering,” in Proc. IEEE Int.Conf. Acoust., Speech, Signal Process., vol. 2, Hong Kong, Apr. 2003,pp. 795–798.

[12] , “Open-loop, in-band motion-compensated temporal filteringfor objective full-scalability in wavelet video coding,” in ISO/IECJTC1/SC29/WG11, m9026, MPEG, Shanghai, China, Oct. 2002.

[13] Y. Andreopoulos, A. Munteanu, J. Barbarien, M. Van der Schaar, J.Cornelis, and P. Schelkens, “In-band motion compensated temporalfiltering,” Signal Process.: Image Commun. Special Issue InterframeWavelet Video Coding, to be published.

[14] J. Ye and M. van der Schaar, “3-D lifting structure for sub-pixel ac-curacy motion compensated temporal filtering in overcomplete waveletdomain,” in ISO/IEC JTC1/SC29/WG11, m9554, MPEG. Pattaya, Thai-land, Mar. 2003.

[15] M. van der Schaar, J. Ye, Y. Andreopoulos, and A. Munteanu, “Fullyscalable 3-D overcomplete wavelet video coding using adaptive motioncompensated temporal filtering,” in ISO/IEC JTC1/SC29/WG11, m9037,MPEG, Shanghai, China, Oct. 2002.

[16] X. Li, “On exploiting geometric constraint of image wavelet coeffi-cients,” IEEE Trans. Image Process., vol. 12, no. 11, pp. 1378–1387,Nov. 2003.

[17] Y. Andreopoulos, A. Munteanu, G. Van der Auwera, P. Schelkens, and J.Cornelis, “A new method for complete-to-overcomplete discrete wavelettransforms,” in Proc. Int. Conf. Digit. Signal Process., Santorini, GR, Jul.2002, pp. 501–504.

[18] X. Li, “New results of phase shifting in the wavelet space,” IEEE SignalProcess. Lett., vol. 10, no. 7, pp. 193–195, July 2003.

[19] S. Mallat, A Wavelet Tour of Signal Process.. San Diego, CA: Aca-demic, 1998.

[20] “JPEG 2000 part I final draft international standard,” in ISO/IECJTC1/SC29/WG1, FCD 15 444-1, M. Boliek, C. Christopoulos, and E.Majani, Eds., Dec. 2000.

[21] “JPEG-2000 part II final committee draft,” in ISO/IECJTC1/SC29/WG1, FCD 15 444-2, M. Boliek, Ed., Dec. 2000.

[22] C. Brislawn, “Interim report on (part 2) core experiment CodEff07,7-tap/5-tap filter bank option,” in ISO/IEC JTC1/SC29/WG1, N1761,JPEG, Jul. 2000.

[23] H. Sari-Sarraf and D. Brzakovic, “A shift-invariant discrete wavelettransform,” IEEE Trans. Signal Process., vol. 45, no. 10, pp. 2621–2626,Oct. 1997.

[24] M. Vishwanath, “The recursive pyramid algorithm for the discretewavelet transform,” IEEE Trans. Signal Process., vol. 42, no. 3, pp.673–676, Mar. 1994.

[25] I. Daubechies and W. Sweldens, “Factoring wavelet transforms intolifting steps,” J. Fourier Anal. Appl., vol. 4, no. 3, pp. 247–269, Mar.1998.

[26] R. R. Coifman and D. L. Donoho, “Translation invariant de-noising,”in Wavelets and Statistics, Lecture Notes in Statistics 103, A. Anto-niadis and G. Oppenheim, Eds. New York, NY: Springer-Verlag, pp.125–150.

[27] H. M. Radha, M. van der Schaar, and Y. Chen, “The MPEG-4 fine-grained scalable video coding method for multimedia streaming overIP,” IEEE Trans. Multimedia, vol. 3, no. 1, pp. 53–68, Mar. 2001.

[28] P. Schelkens, A. Munteanu, J. Barbarien, M. Galca, X. Giro-Nieto, and J.Cornelis, “Wavelet coding of volumetric medical datasets,” IEEE Trans.Med. Imag. Special Issue on Wavelets in Medical Imaging, vol. 22, no.3, pp. 441–458, Mar. 2003.

[29] J. R. Ohm, “Three-dimensional subband coding with motion compensa-tion,” IEEE Trans. Image Process., vol. 3, no. 5, pp. 559–571, Sep. 1994.

[30] D. Turaga and M. van der Schaar, “Wavelet coding for video streamingusing new unconstrained motion compensated temporal filtering,” inProc. Int. Workshop Dig. Commun. Adv. Methods for Multimedia SignalProcess., Capri, Italy, Sep. 2002, pp. 41–48.

Yiannis Andreopoulos (S’01) was born in 1977in Aeghion, Greece. He received the electricalengineering diploma and the M.Sc. degree in signaland image processing systems from the University ofPatras, Patrtas, Greece. Since October 2000, he hasbeen working toward the Ph.D. degree at the VrijeUniversiteit Brussel (Free University of Brussels),Belgium.

His research interests are in the fields of trans-forms, fast algorithms, and video coding, specializingin combined algorithmic and implementation topics

for hardware and software systems. He contributes regularly to the ISO/IECJTC1/SC29/WG11 (MPEG) committee (Scalable Video Coding group).

Adrian Munteanu was born in Constanta, Romania,in 1970. He received the M.Sc. degree in electronicsand telecommunications from the “Politehnica”University of Bucharest, Bucharest, Romania, in1994, the M.Sc. degree in biomedical engineeringfrom the Technical University of Patras, Patras,Greece, in 1996, and the Ph.D. degree in appliedsciences from the Vrije Universiteit Brussel (VUB),Brussels, Belgium, in 2003.

Since October 1996, he has been a member ofthe Department of Electronics and Information

Processing (ETRO), VUB, and since July 2003, he has been a post-doctoralresearcher at ETRO. His research interests include scalable still image andvideo coding, multiresolution image analysis, image and video transmissionover networks, video segmentation and indexing, and statistical modeling ofimages. He is the author or co-author of more than 70 scientific publications,patent applications, and contributions to standards, and he has contributed totwo books in his areas of interest.

Page 15: Complete-to-Overcomplete Discrete Wavelet Transforms: Theory ...

1412 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 53, NO. 4, APRIL 2005

Geert Van der Auwera was born in Bonheiden, Bel-gium, in 1974. He received the electrotechnical civilengineering degree from Vrije Universiteit Brussel(VUB), Brussels, Belgium in 1997.

Presently, he is scientific advisor for IWT-Flan-ders, the Institute for the Promotion of Innovationby Science and Technology in Flanders, Belgium.In 2000, he joined IWT-Flanders after performingresearch funded by an IWT-Flanders grant at theElectronics and Information Processing Department(ETRO), VUB. His fields of interest are image and

video coding and image processing.Mr. Van der Auwera’s master’s thesis received the Barco and IBM prizes by

the Fund for Scientific Research—Flanders.

Jan P. H. Cornelis (M’80) was born in Wilrijk, Bel-gium, in 1950. He received the M.D. degree in 1973and the Ph.D. degree in 1980.

He is a professor in electronics, medical imaging,and digital image processing at the Vrije UniversiteitBrussel (VUB), Brussels, Belgium, where he isalso Vice-rector Research. He is director of theElectronics and Information Processing Department(ETRO), VUB, at the Faculty of Applied Sciencesand coordinates the research group on image pro-cessing and machine vision (IRIS). His current

research projects within the IRIS group include:various applications in medicalvision, remote sensing, mine and minefield detection, and design of algorithmsand computer architectures for image processing. His current research interestis mainly concentrated in the area of image and video compression.

Peter Schelkens (M’99) received the electrical engi-neering degree in applied physics in 1994, the Med-ical Physicist degree in 1995, and the Ph.D. degree inapplied sciences in 2001, all from the Vrije Univer-siteit Brussel (VUB), Brussels, Belgium.

Since October 1994, he has been a member of theIMEC-ETRO Laboratory at the VUB, where he had aposition as assistant to Prof. J. Cornelis. Since 2000,he has been leading a research team in the field ofimage and video compression and related multimediatechnologies. The team’s research focus is both on al-

gorithmic and implementation aspects. The team is very involved in the ISO/IECJTC1/SC29/WG1 (JPEG2000) and WG11 (MPEG) committees. He is also theBelgian head of delegation for the ISO/IEC JPEG standardization committee,and co-editor of part 10 of JPEG2000: “Extensions for Three-Dimensional andFloating Point Data.”


Recommended