ezw wave coder.pdf

Upload: faizal-haris

Post on 19-Oct-2015

41 views

Category:

Documents


0 download

DESCRIPTION

about ezw coder

TRANSCRIPT

  • A Multi-threshold Embedded Zerotree Wavelet Coder

    Eui-Sung Kang

    1

    , Toshihisa Tanaka

    2

    , Tae-Hyung Lee

    1

    , and Sung-Jea Ko

    1

    1

    Department of Electronics Engineering, Korea University

    Anam-dong, Sungbuk-ku, Seoul 136-701 Korea

    fmagasa, jeeni, [email protected]

    2

    Department of Electrical and Electronic Engineering, Tokyo Institute of Technology

    Ookayama, Meguro-ku, Tokyo 152-8552 Japan

    [email protected]

    Abstract| In this paper, a novel and simple embed-

    ded wavelet coder, called a multi-threshold embedded zerotree

    wavelet coder, is proposed. The well-known embedded ze-

    rotree wavelet (EZW) coder uses the successive approx-

    imation quantization (SAQ) process and zerotree struc-

    tures of wavelet coecients. The EZW coder scans itera-

    tively whole wavelet coecients during the SAQ process,

    which decreases the coding eciency considerably. In

    the proposed scheme, we scan only signicant subbands

    using the signicant test with the multi-threshold. The

    multi-threshold obtained from the coecient with the

    maximum magnitude in each subband is used to deter-

    mine whether a subband has signicant coecients or

    not. Since it is not necessary to encode the insignif-

    icant subbands having no signicant coecients during

    the SAQ process, we reduce signicantly the redundancy

    generated by scanning higher subbands. Experimental

    results show that the proposedmethod outperforms pop-

    ular image coders such as EZW and JPEG.

    I. Introduction

    Wavelet-based image coding is a promising technique

    to achieve ecient image coding due to being free from

    blocking artifacts that block-based image coders pro-

    duce [1], [2]. Moreover, this technique oers adapt-

    ability to achieve scalability for multimedia and vi-

    sual communications. Particularly, the embedded coder

    can achieve exact rate control, since it generates a sin-

    gle bitstream which can be truncated at any desired

    rate [3], [4]. The major dierence between the embed-

    ded coding and the traditional coding lies in the suc-

    cessive approximation quantization where wavelet coef-

    cients greater than a given threshold are assumed to

    have equal importance and should be transmitted be-

    fore coecients with smaller magnitudes. This SAQ

    scheme adopted by the embedded coder can be eec-

    tively applied to progressive image transmission. In the

    EZW coder [3], an initial threshold is applied to whole

    wavelet coecients. During the SAQ process, each coef-

    cient is classied into one of four symbols POS, NEG,

    ZTR, and IZ. These symbols are losslessly encoded us-

    ing adaptive arithmetic coding [5]. This process is con-

    tinued until stopping condition is met. Since the EZW

    This work was supported by the Korea Science and Engineering

    Foundation (KOSEF) under grant 985-0900-007-2.

    coder scans whole wavelet coecients with respect to

    the current threshold, some redundancy exists in the

    symbols generated when higher subbands are scanned.

    In this paper, we propose a novel wavelet coder, which

    can eciently reduce the redundancy produced when

    higher subbands are scanned. While the EZW coder

    scans whole wavelet coecients during the SAQ pro-

    cess, the proposed method scans only signicant sub-

    bands according to the results of signicant test using

    the multi-threshold. The multi-threshold is a set of coef-

    cients with the maximummagnitude in each subband.

    This multi-threshold is used to determine whether a

    subband have signicant coecients or not. Since it

    is not necessary to scan insignicant subbands having

    no signicant coecients, our approach is not only com-

    putationally ecient, but also obtains high coding ef-

    ciency. Furthermore, our proposed embedded coder

    can achieve the exact rate control and is utilized for

    progressive image transmission.

    This paper is organized as follows. In Section II, we

    review the embedded zerotree coding and successive ap-

    proximation quantization. In Section III, we demon-

    strate our novel approach, a multi-threshold embedded

    zerotree wavelet coder. In Section IV, we present exper-

    imental results for our coder, and compare them with

    other famous methods. Conclusions of this paper are

    given in Section V.

    II. Embedded wavelet image coding

    The Shapiro's EZW coder exploits the self-similarity

    of the wavelet transform image across dierent scales by

    using a tree structure. The typical wavelet tree struc-

    ture is dened recursively as shown in Fig. 1. We call

    a coecient at a coarse scale a parent. All coecients

    at the next ner scale at the same location and of sim-

    ilar orientation are children. All coecients at all ner

    scales at the same location and of similar orientation are

    descendants. Note that coecients in the highest sub-

    bands and LL subband can not have zerotree structure,

    since they have no children. All the rest coecients have

    four children. If a coecient is small in magnitude with

  • HL1

    HH1

    LH1

    HL

    HHLH

    LH

    LL HL

    HH

    2

    22

    3

    33

    3

    Fig. 1. Wavelet tree structure of wavelet transform.

    respect to a given threshold, then all of its descendants

    of the similar orientation in the same spatial location

    at all ner scales are likely to be small as well. The

    EZW scheme uses eectively the SAQ and zerotrees.

    The SAQ iteratively applies a sequence of thresholds

    T

    0

    ; : : : ; T

    N1

    to determine signicance of a coecient,

    where T

    i

    = T

    i1

    =2. The initial threshold T

    0

    is cho-

    sen so that jc

    j

    j < 2T

    0

    for all the wavelet coecients

    c

    j

    . A coecient is called signicant if its amplitude is

    greater than the threshold T

    i

    ; otherwise insignicant.

    When the parent is insignicant and its descendants

    are insignicant, the tree is called zerotree and the tree

    node is called zerotree root. A signicant map is de-

    ned as the bitplane indicating whether the coecient

    is signicant or not with respect to the current thresh-

    old. The signicant map is encoded using four symbols,

    POS, NEG, ZTR, and IZ as positive signicant, nega-

    tive signicant, zerotree root, and isolated zero, respec-

    tively, where isolated zero means an insignicant pixel

    but there exists one more signicant descendants. This

    is called dominant pass. Then, renement pass

    1

    is per-

    formed for the coecients which have been contained

    in the signicant map at earlier iterations. During the

    i

    th

    renement pass, binary symbols (0 or 1) correspond-

    ing to the i+ 1

    th

    most signicant bits of coecients are

    produced. Those symbols generated in both dominant

    pass and renement pass are encoded using the adaptive

    arithmetic coder [5]. The encoder halves the threshold

    and repeats another dominant and renement pass un-

    til a target bitrate is met. More details are explained

    in [3].

    III. The Proposed Approach

    As seen previously, if the current pixel is signicant,

    the EZW method needs to scan its children. And if

    1

    In Shapiro's paper, this pass is called subordinate pass, but

    we use more general term renement pass taking other similar

    algorithms into consideration.

    the current pixel is insignicant, this technique needs

    to identify whether there are any signicant descen-

    dants. Next, the proposed method using the multi-

    threshold and zerotrees is presented. When we perform

    a N -level wavelet decomposition, the multi-threshold,

    M = ft

    k

    ; k = 1; 2; : : : ; 3N + 1g is a set of the maxi-

    mum magnitudes of coecients in each subband. Let

    t

    3i2

    ; t

    3i1

    , and t

    3i

    , respectively, represent the maxi-

    mummagnitude in subbandsHH

    i

    ; LH

    i

    , andHL

    i

    where

    1 i N . And t

    3N+1

    corresponds to the maximum

    magnitude of coecients in the subband LL

    N

    . Fig. 2

    shows the multi-threshold , M = f10; 15; : : :; 50; 70g, of

    a 3-level decomposed image. Next, the initial threshold

    T

    0

    for the SAQ is selected such that

    T

    0

    = 2

    blog

    2

    Cc

    (1)

    where C is the largest magnitude of all the wavelet co-

    ecients and bxc represents the largest integer which is

    not greater than x. During each SAQ process, a binary

    image is generated by applying this threshold to whole

    wavelet coecients. In some subbands of the binary im-

    age, all the coecients are equal to zero especially in the

    rst several quantization processes. At the i

    th

    iteration,

    a subband which satises t

    k

    < T

    i

    does not need to be

    scanned. Here, we dene a signicant subband as a sub-

    band that satises t

    k

    T and an insignicant subband

    as a subband that satises t

    k

    < T . In Fig. 2, shaded

    subbands indicate signicant subbands with respect to

    the current threshold and the other subbands represent

    insignicant subbands. To determine which subband is

    signicant or insignicant with respect to the threshold

    in the decoder, the multi-threshold information should

    be transmitted to the decoder. To reduce overhead of

    the multi-threshold information, we transmit n

    k

    which

    satises

    n

    k

    = blog

    2

    t

    k

    c (2)

    instead of transmitting real coecient values.

    In our approach, the zerotree coding is performed

    like the EZW coder and compression algorithm with

    reversible embedded wavelets (CREW) [6]. However,

    we employ binary symbols, 0 for insignicant pixel and

    1 for signicant one. For a signicant coecient, one

    additional bit is needed to encode the sign of the co-

    ecient. For a insignicant coecient, one additional

    bit is also needed in order to indicate whether the co-

    ecient is zerotree root or not. Like the CREW, POS,

    NEG, IZ, and ZTR are replaced with 11, 10, 01, and

    00, respectively. In our coder, however, the conversion

    is performed to reduce the redundancy of symbols of the

    subbands, called the marginal subbands, which are adja-

    cent to insignicant subbands. Since all the coecients

  • t =101t =152

    t =183

    t =234

    t =306

    t =335

    t =7010 t

    =509

    t =437t

    =478

    t =183

    t =101t =152

    t =234

    t =306

    t =335

    t =7010 t

    =509

    t =478 t

    =437

    t =101t =152

    t =18

    t =234t =335

    t =306

    3

    t =7010 t

    =509

    t =478 t

    =437

    (a) T

    0

    = 2

    6

    (b) T

    1

    = 2

    5

    (c) T

    2

    = 2

    4

    Fig. 2. Examples of the multi-threshold and signicant subbands. Shaded regions indicate signicant subbands and thick lines

    indicate the marginal subbands .

    in the marginal subbands have no signicant descen-

    dants, we no longer need an additional bit to identify

    whether those coecients have signicant descendants

    or not. In the EZW and the CREW, if the current coef-

    cient is signicant, its descendants should be scanned

    and encoded. This fact implies that our method can of-

    fer an ecient scanning which reduces this redundancy.

    In addition, to encode coecients in the lowest sub-

    band, we use only one bit for each coecients: 0 for

    insignicant pixels and 1 for signicant ones, because

    all the coecients in that subband are identied as pos-

    itive.

    Binary symbols generated in each pass are encoded

    by adaptive arithmetic coding. Unlike the EZW, we

    use the QM-coder [7] which is more computationally

    ecient than the adaptive arithmetic coding [5] and

    Q-coder [8]. And it can eectively exploit the strong

    correlation of quantized wavelet coecients. To encode

    symbols eciently, the QM-coder employs context mod-

    eling [9], [10] which is the set of past sequence of sym-

    bols on which the probability of the current symbol is

    conditioned. For all quantized wavelet coecients as a

    sequence of binary symbols x

    1

    ; x

    2

    ; : : : ; x

    n

    , the minimum

    codelength l is given by

    l = log

    2

    n

    Y

    i=1

    p(x

    i

    jx

    i1

    ; x

    i2

    ; : : : ; x

    1

    ): (3)

    Here, to reduce the problem of estimating the sym-

    bol distributions , we should estimate S, which is a

    subsequence of x

    1

    ; x

    2

    ; : : : ; x

    n

    . During the dominant

    pass of the EZW, the signicance of the parent coef-

    cient and the previous one are used for context model-

    ing of the current symbol x

    i

    , that is S = fx

    i1

    ; x

    P

    g,

    where x

    P

    is the parent symbol. Context modeling

    for our coder is shown in Fig. 3: for a signicant

    bit, S = fx

    P

    ; x

    N

    ; x

    W

    ; x

    E

    ; x

    S

    g, and for a sign bit,

    x N

    x Ex W

    x S

    x P

    x i

    x N

    x Ex W

    x S

    x i

    (a) Signicant bit (b) Sign bit

    Fig. 3. Context modeling of the proposed coder. x

    i

    is the current

    pixel and x

    P

    is its parent.

    S = fx

    N

    ; x

    W

    ; x

    E

    ; x

    S

    g. Note that x

    N

    and x

    W

    are sig-

    nicance bits generated at the current threshold, and

    x

    E

    and x

    S

    are those which are obtained at the previous

    threshold. And context modeling for a renement bit

    consists of only the previous pixel, i.e. S = fx

    i1

    g.

    IV. Experimental Results

    Coding simulations were performed for the 512 512

    grayscale Lena image. We investigated the performance

    of our proposed approach using the six-scale wavelet

    decomposition with the 9/7 tap biorthogonal wavelet

    lter [1]. We utilized the QM-coder to encode the sym-

    bols obtained by the scanning process with the multi-

    threshold. We compare the performance of the pro-

    posed method with that of the EZW coder, the DCT-

    based embedded image coder [11], the ACTCQ (arith-

    metic and entropy constrained trellis quantization) [12],

    and the JPEG coder [7]. The EZW coder and the DCT-

    based embedded image coder are the embedded coders.

    Note that the coding scheme in [7], [12] are not the em-

    bedded coder. Fig. 4 shows the original image and the

    decoded images obtained by our proposed method. It is

    seen that the decoded images do not produce any block-

  • (a) Original image (b) at rate 0.5bpp (c) at rate 0.25bpp

    Fig. 4. Original and decoded 512 512 Lena images.

    ing artifacts and exhibit good visual quality. And Fig. 5

    shows the rate-distortion performance for the original

    image. At lower bit rates (below about 0.95bpp), our

    method appears to produce better results than all the

    candidates considered in our simulation. At higher bit

    rates (around 1bpp), only ACTCQ give slightly better

    performance than the proposed approach. This exper-

    imental result shows that our coder outperforms those

    popular image coders.

    V. Conclusions

    We have presented an image coding algorithm, a

    multi-threshold embedded zerotree wavelet coder. This

    approach utilizes eciently the multi-threshold and ze-

    rotree structures of wavelet coecients. The proposed

    method can reduce the redundancy produced when

    higher subbands are iteratively scanned during the SAQ

    process. We can also reduce symbol redundancy us-

    ing the multi-threshold and binary representation of the

    symbols generated by the SAQ process. Furthermore,

    since our proposed method produces the fully embedded

    bitstream like the other embedded coders, it can achieve

    the exact rate control and can be easily applied to pro-

    gressive image transmission. It was shown that the pro-

    posed coder outperforms well-known image coders such

    as the EZW coder, the JPEG, and the ACTCQ.

    References

    [1] M. Antonini, M. Barlaud, P. Mathieu, and I. Daubechies,

    \Imagecoding using wavelet transform," IEEE Trans. Image

    Processing, vol. 1, pp. 205{220, Apr. 1992.

    [2] J. Lu, V. R. Algazi, and R. R. Estes, Jr., \A comparative

    study of wavelet image coders,"Optical Engineering, vol. 35,

    Sept. 1996.

    [3] J. M. Shapiro, \Embedded image coding using zerotrees of

    wavelet coeceints," IEEE Trans. Signal Processing, vol. 41,

    pp. 3445{3462, Dec. 1993.

    [4] C. D. Creusere, \A new method of robust image compression

    based on the embedded zerotree wavelet algorithm," IEEE

    Trans. Image Processing, vol. 6, pp. 1436{1442, Oct. 1997.

    [5] I. H. Witten, R. M. Neal, and J. G. Cleary, \Arithmetic cod-

    ing for data compression,"Commun. ACM, vol. 30, pp. 520{

    540, June 1987.

    [6] A. Zandi, J. D. Allen, E. L. Schwarts, and M. Boliek,

    \CREW: Compression with reversible embedded wavelets,"

    in Proc. Data Compression Conference, (Snowbird, Utah),

    pp. 212{221, Mar. 1995.

    [7] W. B. Pennebakerand L. J. Mitchell, JPEG Still Image Data

    Compression Standard. New York: Van Nostrand Reinhold,

    1992.

    [8] W. B. Pennebaker, J. L. Mitchell, G. G. Langdon, and R. B.

    Arps, \An overview of the basic principles of the Q-coder

    adaptive binary arithmetic coder," IBM J. Res. Develop.,

    vol. 32, pp. 717{726, Nov. 1988.

    [9] V. R. Algazi and R. R. Estes, Jr., \Analysis based coding of

    image transform and subband coecients," in SPIE, Appli-

    cations of Digital Image Processing XVII, pp. 11{21, 1995.

    [10] C. Chrysas and A. Ortega, \Ecient context-based entropy

    coding for lossy wavelet image compression," in Proc. IEEE

    Data Compression Conf. '97, pp. 241{250, 1997.

    [11] Z. Xiong, O. Guleryuz, and M. T. Orchard, \A DCT-based

    embedded image coder," IEEE Signal Proc. Letters, vol. 3,

    pp. 289{290, Nov. 1996.

    [12] R. L. Joshi, V. J. Crump, and T. R. Fischer, \Image subband

    coding using arithmetic coded trellis coded quantization,"

    IEEE Trans. Circuits and Systems for Video Technology,

    vol. 5, pp. 515{523, Dec. 1995.

    20

    22

    24

    26

    28

    30

    32

    34

    36

    38

    40

    0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

    PSNR

    [dB]

    Rate[bpp]

    ProposedShapiro[3]

    JPEG[7]Xiong et al[11]Joshi et al[12]

    20

    22

    24

    26

    28

    30

    32

    34

    36

    38

    40

    0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

    PSNR

    [dB]

    Rate[bpp]

    ProposedShapiro[3]

    JPEG[7]Xiong et al[11]Joshi et al[12]

    Fig. 5. Comparison of coding performance.