SIViPDOI 10.1007/s11760-013-0583-6
ORIGINAL PAPER
Artifact reduction in JPEG2000 compressed images at low bit-rateusing mathematical morphology filtering
Layachi Bennacer · Badreddine Bouledjfane ·Amine Nait-Ali
Received: 29 August 2011 / Revised: 10 November 2013 / Accepted: 10 November 2013© Springer-Verlag London 2013
Abstract JPEG2000 is known as an efficient standard toencode images. However, at very low bit-rates, artifacts ordistortions can be observed in decoded images. In order toimprove the visual quality of decoded images and make themperceptually acceptable, we propose in this work a new pre-processing scheme. This scheme consists in preprocessingthe image to be encoded using a nonlinear filtering, consid-ered as a prior phase to JPEG 2000 compression. More specif-ically, the input image is decomposed into low- and high-frequency sub-images using morphological filtering. After-ward, each sub-image is compressed using JPEG2000, byassigning different bit-rates to each sub-image. To evaluatethe quality of the reconstructed image, two different met-rics have been used, namely (a) peak signal to noise ratio, toevaluate the visual quality of the low-frequency sub-image,and (b) structural similarity index measure, to evaluate thevisual quality of the high-frequency sub-image. Based on thereconstructed images, experimental results show that, at lowbit-rates, the proposed scheme provides better visual qual-ity compared to a direct use of JPEG2000 (excluding anypreprocessing).
L. Bennacer (B) · B. BouledjfaneLaboratoire d’Etude et de Recherche en Instrumentation etTélécommunications Avancées (LERICA), Université BadjiMokhtar, B.P. 12, 23000 Annaba, Algériee-mail: [email protected]
B. Bouledjfanee-mail: [email protected]
A. Nait-AliLaboratoire Images, Signaux et Systèmes Intelligents (LISSI, EA 3956),Université Paris-Est Créteil (UPEC), 61 Ave du Générale de Gaulle,94010 Créteil, Francee-mail: [email protected]
Keywords Morphological filter · JPEG2000 ·Low bit-rate compression · Image quality metrics ·PSNR · SSIM
List of symbols
JPEG2000 Joint Photographic Experts Group commit-tee in 2000
DWT Discrete wavelet transformDCT Discrete cosine transformIDWT Inverse discrete wavelet transformI (x, y) Image with spatial coordinate x and yS Structuring elementI ⊕ S Dilation of I by SI � S Erosion of I by SI ◦ S Morphological opening of I by SI•S Morphological closing of I by Sg, f g is the mask and f is the markerRg( f ) Reconstruction of g from fγ (rec)( f, g) Opening by reconstruction of g from fILow Decomposed image I at low frequencyIHigh Decomposed image I at high frequencyRate Compression ratioα Compression ratio of the low-frequency
sub-image (bit per pixel)β Compression ratio of the high-frequency
sub-image (bit per pixel)Ψ (ILow, α) Compression operator of ILow by αΨ (IHigh, β) Compression operator of IHigh by βCompLow Compressed low-frequency sub-imageCompHigh Compressed high-frequency sub-imageΨ−1(ILow, α) The inverse compression operator of ILow
by α
123
SIViP
Ψ−1(IHigh, β) The inverse compression operator of IHigh
by βbpp Bit per pixelMSE Mean square errorPSNR Peak signal to noise ratioSSIM Structural SIMilarityl() Luminance comparison functionc() Contrast comparison functions() Structure comparison functionμ f The average of fσ 2
f The variance of fσ f g The covariance between f and gL. The dynamic range of the pixel valuesB The bit depth used for noncompressed
image codingC1,C2 Two variables to stabilize the division with
weak denominatorPSNRNew Proposed image quality metrics
1 Introduction
Nowadays, standards are highly considered when deal-ing with information technology systems, especially fordata transmission and storage. The well-known Joint Pho-tographic Experts Group (JPEG) standard is intensivelyemployed in multimedia applications, and the JPEG2000 hasbeen proposed as an alternative. JPEG2000 standard allowsmany useful options. In particular, it uses wavelet transforminstead of discrete cosine transform (DCT), which makes thequality of the reconstructed image at a given bit-rate, muchbetter than the one obtained using JPEG. The JPEG2000standard supports lossy and lossless compression of single-component (e.g., grayscale) and multicomponent (e.g., color)images. Due to its excellent coding performance, JPEG2000is employed in numerous application fields, including imagearchiving, Internet and Web browsing, medical image storageand transmission, etc [1,2]. However, the major drawback ofstandard JPEG2000 is that at a low compression bit-rates,numerous artifacts can be observed such as ringing, blur,and false edges [3–5].
The aim of this paper is to present a compression schemerequiring a nonlinear filtering before using JPEG2000 encod-ing phase. More specifically, morphological filters are used todecompose any input image into both high and low frequen-cies, providing, hence, two sub-images. For each sub-image,JPEG2000 encoder is applied by assigning different bit-rates.In order to evaluate the quality of the reconstructed image,two metrics are used: (a) the common peak signal to noiseratio (PSNR) is employed to assess the visual quality of thelow-frequency sub-image, and (b) structural similarity indexmeasure (SSIM) is used to assess the high-frequency sub-image. In this work, it will be shown how one can improve
the quality of the reconstructed image by considering the pro-posed scheme. This paper is organized as follows: Sect. 2 pro-vides a brief description of the observed artifacts when usingdirectly JPEG2000 standard. Section 3 presents some basicmorphological operations and morphology-based filtering,commonly used in image processing. Section 4 describes theproposed new scheme, including the optimization process ofthe required parameters. Afterward, results are highlightedin Sect. 5. Finally, Sect. 6 provides some main conclusions,including some perspectives future work.
2 Artifacts in JPEG2000 standard
A generic JPEG2000 compression scheme is illustrated inFig. 1. In this scheme, three main phases are required, namelythe discrete wavelet transform (DWT), followed by a quan-tization phase, and finally, an entropic encoding phase. Theoutput data are called bitstream, which is required to recon-struct the image through an inverse discrete wavelet trans-form. More details can be obtained in [1].
As specified in [3–5], JPEG2000 may cause blur and ring-ing artifacts. In the reconstructed image, ringing artifactsare caused by the quantization or truncation of the high-frequency transform coefficients resulting from wavelet-based coding. Consequently, in the spatial domain, thiscauses ripples or oscillations around sharp edges or contours.This is also known as Gibbs phenomenon.
Wavelet TransformDWT
Quantization
Entropy coding
Inverse TransformIDWT
InverseQuantization
EntropyDecoding
Input image Reconstructed image
Bitstream
Fig. 1 JPEG2000 block diagrams
123
SIViP
Fig. 2 Illustration of ringing artifacts in JPEG2000 [6]. a Originalimage Flash, b JPEG2000-coded Flash at 0.07825 bpp
The problem of removing ringing artifacts has been con-sidered in [6,7]. For example, Fig. 2a, b shows an origi-nal “Flash” image and the corresponding JPEG2000 recon-structed image at a biterate: 0.07825 bpp.
The purpose of “removing ringing artifacts” is to makethe smooth regions as smooth as they appear in the originalimage, by preserving real edges, considered as useful infor-mation.
On the other hand, blur is due to the attenuation of thehigh spatial frequencies, which commonly occurs during fil-tering or visual data compression. The problem of removingblurring artifacts is considered in [7,8].
Blur and ringing artifacts are defined generally in the spa-tial domain. Both artifacts appear basically along edges orin textured areas. In order to reduce these artifacts, a pre-processing method is often used. Within this context, andby considering that the proposed method is based on mor-phological filtering (as a phase to improve the quality of thereconstructed image), some basis will be described in thenext section.
3 Morphological operators and morphology-basedfiltering
3.1 Morphological operators
Used in numerous applications, morphological operationsare efficient image processing tools employed for imageenhancement, segmentation, edge detection, and featureextraction [9,10]. When considering grayscale morphology,we define two basic operations, namely dilation and erosion.When dealing with two-dimensional gray-level images, mor-phological dilation (erosion) of an image I (x, y) is definedas:
Dilation : (I ⊕ S)(i, j) = min {I (x + i, j + y)/(x, y) ∈ S}(1)
Erosion : (I �S)(i, j) = min {I (x + i, j + y)/(x, y) ∈ S}(2)
In image processing applications, the set S mentioned aboveis usually called the structuring element representing a smallgeometric template (e.g. line segment, disk, and square).
3.2 Nonlinear filtering
Combining morphological dilation and morphological erosionsubsequently, result in two other operations known as morpho-logical opening (◦) and closing (•) defined as:
Opening : (I ◦ S)(i, j) = [(I � S)⊕ S] (3)
Closing : (I • S)(i, j) = [(I ⊕ S)�S] (4)
Morphological opening and closing are the most well-knownnonlinear filters used in image enhancement and noise reduction[11,12]. However, these filters present several drawbacks.
In general, if the undesirable features are eliminated, theremaining structures will be changed. Recently, filters by recon-struction are considered as powerful tools that allow the elimi-nation of undesirable features without affecting desirable ones[13]. Consequently, our preprocessing method includes a 2Dmorphological filter by reconstruction in the proposed compres-sion scheme.
3.3 Filtering by reconstruction
Reconstruction is a morphological transformation requiring twoimages and a structuring element (instead of a single image anda structuring element). The first image (marker) is the startingpoint for the transformation. The second image (mask) con-strains the transformation. The structuring element used definesconnectivity [14,15]. If g is the mask and f is the marker,grayscale morphological reconstruction of g from f is definedby the following iterative procedure:
Step 1: Initialize h1 to be the marker image fStep 2: Create the structuring element SStep 3: Repeat:Step 4: hk+1 = (hk ⊕ S) ∩ g = inf(hk ⊕ S, g)Step 5: until hk+1 = hk
Step 6 : Marker f must be a subset of g, that is f ⊆ g
The symbol ⊕ is used for the dilation operation, and ∩ standsfor the point-wise minimum between the dilated marker imageand the mask image. Filtering by reconstruction is the oldesttype of connected filter [16].
In the case of openings by reconstruction, the marker is theresult of an opening or erosion by some structuring element.Afterward, structures not removed entirely by the opening orerosion are reconstructed exactly, as shown in Fig. 3. The open-ing by reconstruction noted γ (rec)(g, f ) is defined as:
γ (rec)(g, f ) = δ(∞)(g, f )
= δ(1)(δ(1)(. . . δ(1)(g, f ) . . .) (5)
123
SIViP
Image source
Structuring element IHigh
ILowMorphological opening
by reconstruction
Fig. 3 Decomposition of the source image into its low and high-frequency sub-images
The transformation δ(∞)(g, f ) defines the geodesic unitary dila-tion. It is defined by the following expression: δ(1)(g, f ) =Min{δ1(g), f }. The operator δn denotes the morphological dila-tion with a structuring element of n size, and f is the marker. Itis defined as the eroded form of the original image g (mask)with the structuring element S. As with the structuring ele-ment in dilation and erosion operations, the characteristics ofthe marker image determine the processing performed in mor-phological reconstruction. We denote opening by reconstructionas:
Rg( f ) = Rg(g�S) (6)
The opening by reconstruction permits to discard all the imagecomponents that are smaller than a specific size limit (given bythe size of the structuring element) and restore the contours ofthe elements that are not totally erased by the selection.
4 A scheme to improvement of JPEG 2000
The purpose of this section is to show how one can improvethe performances of JPEG2000 (lossy mode) by includingin the scheme, a prior preprocessing phase. The main ideaconsists in distributing the total compression ratio, betweenthe low- and high-frequency sub-images. This phase requiresa preliminary decomposition stage and uses different codersalong with different assigned bit-rates [12]. By consideringthe fact that the information can be distributed in low andhigh frequency [17], we used two metrics to evaluate the qual-ity of the reconstructed image, namely the PSNR and SSIM[18–20].
The distortions of the global information are measured by aPSNR at low spatial frequency, and the distortions of the localinformation are measured by SSIM at high spatial frequency.Therefore, in order to describe the visual quality of the recon-structed image, these two measurements are combined using aproposed metric, denoted here PSNRNew. Using this approach,we will notice that a better visual quality is achieved com-pared with a direct compression scheme using the same globalbit-rate.
More specifically, low-frequency sub-image can be obtainedby smoothing the source image using the filtering by recon-struction. Similarly, high-frequency sub-image corresponds tothe detailed information on an image and can be easily obtained
by a simple subtraction of the low-frequency sub-image fromthe original image.
Figure 3 shows the principle of this decomposition. For agiven image I , and a structuring element S, the smoothed versionof the image by a structuring element S is obtained by openingreconstruction. As specified by the formula (4), it can be definedas:
ILow = RI (I �S) (7)
IHigh = I − ILow (8)
By choosing different compression ratios for both the low-frequency sub-image and the high-frequency sub-image (Fig. 4),one can compress the low- and high-frequency parts of an imageby:
CompLow = Ψ (ILow, α) (9)
CompHigh = Ψ (IHigh, β) (10)
where ψ is the compression operator, CompLow, CompHigh, α
andβ correspond, respectively, to the compressed low-frequencysub-image, high-frequency sub-image, bit-rate for low-fre-quency sub-image and the selected bit-rate for the high-frequency sub-image. Similarly, we can obtain the recon-structed image for low-frequency sub-image (respectively, high-frequency sub-image) using the decompression operator, whichis the inverse of the compression operator. The decompressionoperator can be obtained using the following equations:
IRECLow = Ψ−1(CompLow, α) (11)
COMPHighIHigh
COMPLow
ββ (bpp)
α (bpp)
ILow
JPEG2000 Encoder
JPEG2000 Encoder
Fig. 4 Compression of low and high-frequency sub-images at differentbit-rates by JPEG2000 encoder
Quality measurementof
reconstructed imageIREC
IRECHigh
IRECLow
COMPHigh
ββ (bpp)
α (bpp)
COMPLowJPEG2000
Decoder
JPEG2000Decoder
Fig. 5 Proposed reconstruction
123
SIViP
IRECHigh = Ψ−1(CompHigh, β) (12)
The original image can be reconstructed by summing these twosub-images. The proposed reconstruction is illustrated in Fig. 5.
4.1 Proposed image quality metrics
In order to enhance the visual quality of reconstructed image,we have used an analytical relation between PSNR and SSIMindex. Given a reference image f and a test image g, both ofsize M × N , the PSNR between f and g is defined by:
PSNRn,m = 20 log10
((2B − 1)√
MSEn,m
)(13)
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1-10
0
10
20
30
40
50
60
SSIM
PS
NR
New
(dB
)
σfg
=400
σfg
=100
σfg
=1
Fig. 6 Variation in the PSNR versus the SSIM for different fixed valuesof σ f g : theoretical situation
0.68 0.7 0.72 0.74 0.76 0.78 0.826.4
26.6
26.8
27
27.2
27.4
27.6
27.8
SSIM
PS
NR
New
(dB
)
Fig. 7 Variation in the PSNR versus the SSIM for the image ‘18’ fromKodak dataset: experimental situation
where
MSEn,m = 1
M.N
M−1∑i=0
N−1∑j=0
∥∥∥ f (i, j)− g(i, j)∥∥∥2
(14)
The PSNR value approaches the infinite as the mean squarederror (MSE) approaches zero. This shows that a higher PSNRvalue provides a higher image quality. At the other end ofthe scale, a small value of the PSNR implies high numeri-
COMPHigh
PSNRNew
COMPLow
Local distortion measurement(with SSIM)
Fused
metrics
Global distortion measurement (with PSNR)
Fig. 8 Framework of PSNRNew
RATE, k=1, n=1
n>Nk=k+1
K=N
end
Block of reconstructed
image for a given
structuring element Sk,
bitrate αn and βn
PSNR New caculated
n=n+1
PSNRNew(1,1)............................PSNRNew(1,N)
PSNRNew(2,1)............................PSNR New(2,N)
………………………………………………
………………………………………………
………………………………………………
PSNRNew(N,1)............................PSNRNew(N,N)
Fig. 9 The proposed algorithm
123
SIViP
Table 1 Optimization of alpha (α), beta (β), SSIM values as well as the diameter of the disk-shaped structuring element, using the proposedalgorithm
Rate (bpp) PSNRNew1 (db) SSIM1 PSNRNew2 (db) SSIM2 α (bpp) β (bpp) Diameter (pixel) Gain (dB)
0.005 26.36 0.589 27.30 0.596 0.003 0.002 11 0.93
0.007 26.41 0.626 27.51 0.636 0.005 0.002 20 1.10
0.011 26.42 0.695 27.84 0.713 0.009 0.002 18 1.42
0.015 26.65 0.731 28.01 0.753 0.013 0.002 20 1.36
0.019 27.16 0.769 28.20 0.781 0.017 0.002 17 1.03
0.021 27.45 0.787 28.21 0.798 0.019 0.002 17 0.75
0.023 27.79 0.806 28.30 0.815 0.021 0.002 14 0.50
Tests achieved at different bit-rates, on image “18” from Kodak datasetPSNRNew1: between the source image and the reconstructed image JPEG 2000 (directly) at total bit-rate (α + β)
PSNRNew2: between the source image and the reconstructed image by the proposed methodα and β correspond, respectively, to bit-rate for low-frequency sub-image and the selected bit-rate for the high-frequency sub-image. (α+β = Rate)SSIM1: Index similarity between test image ‘18’ and the reconstructed image JPEG 2000 (directly) at total bit-rate (α + β)
SSIM2: Index similarity between test image ‘18’ and the proposed methodRow 03 of Table 1 indicates the optimal parameters obtained at rate = 0.011: PSNRNew1 = 26.42 db,SSIM1 = 0.695, PSNRNew2 =27.84 db,SSIM2 = 0.713, α = 0.009 bpp, β = 0.002 bpp, diameter of structuring element = 18 pixels, gain = 1.42 dB
cal differences between images. The SSIM index is a qual-ity metric used to measure the similarity between two images,and it is considered to be correlated with the quality percep-tion of the HVS [18,20]. It is proposed as a good candi-date to improve PSNR and MSE, which have been shown tobe inconsistent regarding human eye perception [18]. SSIMis considered by modeling any image distortion as a combi-nation of three factors, which are loss of correlation, lumi-nance distortion, and contrast distortion [18]. It is definedas
SSIM ( f, g) = l ( f, g) .c ( f, g) .s ( f, g) (15)
where
l( f, g) = 2μ f μg + C1
μ2f + μ2
g + C1(16)
c( f, g) = 2σ f σg + C2
σ 2f + σ 2
g + C2(17)
s( f, g) = 2σ f g + C3
μ2f + μ2
g + C3(18)
The first term in Eq. 15 is the luminance comparison func-tion, which measures the closeness of the two images meanluminance (μ f and μg ). This factor is maximal and equal to1 only if μ f = μg . The second term is the contrast compari-son function, which measures the closeness of the contrast ofthe two images. Here, the contrast is measured by the standarddeviation σ f and σg . This term is maximal and equal to 1 ifand only if σ f = σg . The third term is the structure comparisonfunction, which measures the correlation coefficient between thetwo images f and g. Note that σ f g is the covariance betweenf and g. The positive values of the SSIM index are within therange [0, 1]. A value equals to 0 means no correlation betweenimages, whereas a value equals to 1 means that f = g. The pos-itive constants C1, C2, C3 are used to avoid a null denominator,
’ 01’ ’ 02’ ’ 03’ ’ 04’
’ 05’ ’ 06’ ’ 07’ ’ 08’
’ 09’ ’ 10’ ’ 11’ ’ 12’
’ 13’ ’ 14’ ’ 15’ ’ 16’
’ 17’ ’ 18’ ’ 19’ ’ 20’
’ 21’ ’ 22’ ’ 23’
Fig. 10 Kodak image dataset
123
SIViP
Table 2 Objective comparison between the proposed method, the reference method [12] and JPEG 2000 (directly)
Image set Rate = 0.007bpp, α = 0.005bpp, β = 0.002bpp
JPEG 2000 Reference [12] Proposed method
PSNRNew1(db)
SSIM1 PSNRNew2(db)
SSIM2 Diameter(pixels)
Gain(dB)
PSNRNew3(dB)
SSIM3 Diameter(pixels)
Gain(dB)
01 24.14 0.580 24.26 0.479 8 0.12 24.30 0.584 20 0.16
02 33.30 0.814 33.50 0.808 10 0.20 33.64 0.850 17 0.64
03 34.35 0.862 34.13 0.867 5 −0.21 34.30 0.894 20 0.24
04 31.96 0.795 31.84 0.785 6 −0.12 32.23 0.822 20 0.56
05 23.04 0.576 23.58 0.426 10 0.54 24.18 0.529 15 1.44
06 27.21 0.674 28.32 0.585 20 1.10 27.79 0.662 20 0.88
07 30.40 0.811 29.96 0.793 3 −0.43 30.14 0.818 10 0.04
08 21.67 0.625 22.64 0.487 10 0.96 22.80 0.583 17 1.42
09 31.32 0.857 31.25 0.876 1 −0.06 31.26 0.898 1 0.23
10 30.94 0.803 31.23 0.798 4 0.29 31.55 0.827 19 0.91
11 28.16 0.729 29.09 0.658 18 0.92 29.26 0.740 17 1.39
12 32.03 0.819 32.16 0.831 4 0.13 32.09 0.858 16 0.35
13 22.56 0.501 24.39 0.396 14 1.82 24.21 0.470 20 1.94
14 27.01 0.660 27.13 0.572 9 0.11 26.88 0.646 13 0.17
15 31.19 0.827 31.35 0.849 2 0.16 31.36 0.870 2 0.46
16 31.71 0.762 31.95 0.777 1 0.23 32.01 0.788 20 0.59
17 30.72 0.780 30.84 0.745 7 0.12 30.92 0.812 4 0.50
18 26.41 0.626 27.11 0.540 15 0.70 27.51 0.636 20 1.10
19 26.36 0.748 27.28 0.679 14 0.91 26.75 0.742 17 0.69
20 31.62 0.865 31.78 0.885 2 0.15 31.71 0.900 20 0.38
21 29.08 0.796 29.40 0.711 18 0.31 29.18 0.818 1 0.39
22 28.62 0.699 29.30 0.701 3 0.67 29.43 0.726 11 1.10
23 34.39 0.898 34.17 0.912 2 −0.22 34.00 0.921 19 −0.09
PSNRNew1: between the source image and the reconstructed image JPEG 2000 (directly) at total bit-rate (α + β)
PSNRNew2: between the source image and the reconstructed image using the reference [12]PSNRNew3: between the source image and the reconstructed image using the proposed methodSSIM1: Index similarity between test image and the reconstructed image using JPEG 2000 (directly) at total bit-rate (α + β)
SSIM2: Index similarity between test image and the reference [12]SSIM3: Index similarity between test image and the proposed method
if C3 = C2/2. The general form of the SSIM between signal fand g is defined as [18]:
SSIM( f, g) = (2μ f μg + C1)(2σ f g + C2)
(μ2f + μ2
g + C1)(σ2f + σ 2
g + C2)(19)
where C1 = (k1L)2, C2 = (k2 L)2 and k1 = 0.01 and k2 =0.03. L the dynamic range of the pixel values (typically this is2B −1). B is the bit depth used for noncompressed image coding(8 bpp).
To establish the relationship between the SSIM and thePSNR, we first derive the relationship between the SSIM andthe MSE, and then, we use that relation to link the SSIM to thePSNR. The MSE in Eq. 14 can be rewritten as:
MSE( f, g) = 1
M.N
M−1∑i=0
N−1∑j=0
( fi j − gi j )2 (20)
= 1
M.N
M−1∑i=0
N−1∑j=0
(( fi j − μ f )− (gi j − μg)
+(μ f − μg))2 (21)
= σ 2f + σ 2
g − 2σ f g + (μ f − μg)2 (22)
where σ 2f and σ 2
g are the variances of images f and g, and σ f g
the covariance between f and g:
σ 2f = 1
M.N
M−1∑i=0
N−1∑j=0
( fi j − μ f )2 (23)
123
SIViP
0.007 0.008 0.009 0.01 0.011 0.012 0.013 0.014 0.01531.8
32
32.2
32.4
32.6
32.8
33
Rate (BPP)
PS
NR
New
(dB
)Quality performance of the image 04
JPEG2000
Proposed Method
Reference[12]
0.007 0.008 0.009 0.01 0.011 0.012 0.013 0.014 0.01528
28.5
29
29.5
30
30.5
Rate(BPP)
PS
NR
(dB
)
Quality performance of the image 04
JPEG2000
Proposed Method
Reference[12]
0.008 0.01 0.012 0.014 0.016 0.018 0.0223.4
23.6
23.8
24
24.2
24.4
24.6
Rate (BPP)
PS
NR
New
(dB
)
Quality performance of the image 05
JPEG2000
Proposed Method
Reference[12]
0.008 0.01 0.012 0.014 0.016 0.018 0.0220.6
20.8
21
21.2
21.4
21.6
21.8
22
22.2
22.4
22.6
Rate(BPP)
PS
NR
(dB
)
Quality performance of the image 05
JPEG2000
Proposed Method
Reference[12]
Fig. 11 Comparison of performances versus bit-rates at low bit-rate, for the proposed, JPEG2000 and the reference method [12], using PSNRnew,and PSNR
σ f g = 1
M.N
M−1∑i=0
N−1∑j=0
( fi j − μ f )(gi j − μg) (24)
The PSNR defined in Eq. 13 can be rewritten as:
PSNRNew = 10 log10(255)2 − 10 log10
( a
SSIM+ b
)(25)
where
a = l( f, g)∗(2σ f g + C2)
b = (μ f − μg)2 − (2σ f g + C2).
Using l( f, g) = 1, which also means μ f = μg and C2 = 58.5,Eq. 25 is rewritten as:
PSNRNew =48.13 + 10 log10
[SSIM
(2σ f g +58.5) ∗ (1 − SSIM)
](26)
As indicated by Eq. 25, there is an interesting correlationbetween the PSNR and the SSIM. It shows that the values ofthe SSIM and those of the PSNR are not independent. Figure 6illustrates PSNR versus the SSIM, for various values of σ f g
in the interval[0 . . . 2552
]in Eq. (26). Figure 7 illustrates an
application example for the image ‘18’ from Kodak dataset.This novel image quality metric allows taking into considera-
tion both the global and local distortions existing in different spa-tial frequency components of the image. The global distortionis measured by the PSNR in the decomposed low sub-images,and the local distortion is measured by the SSIM metric in the
123
SIViP
decomposed high sub-images. We define the PSNRNew mea-surement of the low-frequency sub-image (respectively, high-frequency sub-image) to evaluate the visual influences of thelocal and global distortion. Finally, the image quality of recon-structed image is predicted by the fusion of both local and globalinformation using the PSNRNew metrics (Fig. 8). To obtain anoptimal PSNRNew, an algorithm has been implemented to assessthe visual image quality for the low bit-rate (Fig. 9).
4.2 An algorithm to optimize the proposed methodparameters
The proposed algorithm consists in finding the optimal alpha–beta and SSIM values as well as the diameter of the disk-shapedstructuring element that maximize a certain quality measure onthe reconstructed image.
It is used for the morphological reconstruction filtering dueto its higher smoothing capabilities and easy adaptation toany type of image. For implementation purpose, we have used“Jasper Software”, to compress and decompress test images.The following steps describe the operation of the proposedalgorithm:
Step 1: decompose the input image data into low and high-frequency sub-images.Step 2: for each decomposed image, computes PSNR for lowsub-images, and SSIM for high sub-images.Step 3: set a bit-rate as follows: RATE = αn + βn where αn =[0.01, . . . , 0.04] and βn = [0.04, . . . 0.01].Step 4: construction of a predefined set of structuring elementSk for k = 1 . . . .N .Step 5: each test image is filtered by opening by reconstruction,starting from the first structuring element in the set. The result-ing kth low- and high-frequency sub-images is iteratively com-pressed and then reconstructed by αn and βn for n = 1 . . . N .Step 6: PSNRNew of the reconstructed images is calculated andstructured into a K × N matrix by choosing k and n indices(Fig. 9).Step 7: PSNR of the same image compressed by JPEG2000(direct compression) and reconstructed at the RATE is cal-culated and then compared with the previously constructedPSNRNew matrix. The highest value of PSNRNew in the matrix,which is greater than the PSNR obtained from the direct com-pression at the RATE, has been selected.Step 8: the optimum structuring element size α and β valuesfor the tested image are extracted from the highest PSNRNew.
The results obtained by the proposed algorithm are summa-rized in Table 1.
5 Experiment results
In this section, we have considered 23 test original images fromKodak dataset (see Fig. 10). Dataset includes uncompressed768 × 512 grayscale images, with various statistical features.
Fig. 12 Performance comparison in terms of perceptual distortionmetric on zoomed Kodak image “01” at 0.007 bpp a reference image;b JPEG2000 with PSNRNew = 24.14 dB; c proposed method withPSNRNew = 24.30 dB, α = 0.005 bpp, and d = 20 pixels; d reference[12] with PSNRNew = 24.26 dB, α = 0.005 bpp
PSNRNew and PSNR are used to compare objectively at lowbit-rate, the performance of the proposed method with respectto both the reference one [12] and JPEG2000 (directly).
As shown in Table 2 and Fig. 11, it can be pointed out thatthe results obtained using the proposed method outperform thoseobtained by both JPEG 2000 (directly) and the reference method[12]. Moreover, Fig. 12 illustrates the visual effect of the imageKodak image “01”, highlighting best filtering result obtained bythe proposed method. Objectively, Fig. 11 illustrates the perfor-mance curves versus bite-rate, showing the superiority of thePSNRNew in comparison with other techniques [21].
6 Conclusion
In this work, we have presented a new scheme that allows animprovement in JPEG 2000 performances in lossy compressionmode. More specifically, morphological filtering by reconstruc-tion has been used. Based on two metrics, we have shown thatthe visual quality can be improved (by reducing the effect of arti-facts) compared with the direct use of JPEG2000, or with thereference method. From the experiment result, we have obtainedan optimal PSNRNew using a disk-shaped structuring elementfor image filtering, by opening reconstruction.
In terms of metrics, the study shown that a simple analyticalcorrelation exists between the PSNR and the SSIM. It appearsthat the values of the PSNR can be predicted from the SSIM andvice-versa. Moreover, we believe that the quality of the recon-structed image can be enhanced using other shapes of structuringelement that should be adapted to image features.
123
SIViP
References
1. Christopoulos, C.: The JPEG2000 still image coding system: anoverview. Proc. IEEE Trans. Consumer Electron. 46(4), 1103–1127(2000)
2. Adams, M.D.: The JPEG-2000 Still Image Compression Stan-dard, ISO/IEC JTC 1/SC 29/WG 1N 2412. Dept. of Electrical andComputer Engineering, University of Victoria, Canada (December2005)
3. Winkler, S.: Digital Video Quality: Vision Models and Metrics.Wiley, New York (2005)
4. Singh, S., Sharma, R.K., Sharma, M.K.: Tile boundary artifactsreduction of JPEG2000 compressed images. Proc. Comput. Sci.Inf. Technol. 04, 281–290 (2012)
5. Dherete, P., Durand, S., Froment, J., Rouge, B.: Best wavelet packetbasis for joint image deblurring-denoising and compression. In:Proceedings of the Mathematics of Data/Image Coding, Compres-sion and Encryption V, with Applications, pp. 2279–2283, Seattle,WA (Jan. 8–11 2003)
6. Tai, S.-C., Wang, C.-C., Huang, L.-S., Chen, Y.-R.: MorphologicalDe-ringing filter design for JEPG-2000. Proc. J. Inf. Sci. Eng 19(4),667–680 (2003)
7. Wang, C.-C., Hwang, C.B.: ‘Ringing Artifacts Reduction usingAdaptive Morphological filter for JPEG2000’, Proc; Journal ofComputers, Vol. 19, pp. 15–23 (2008)
8. Nosratinia, A.: Post-processing of JPEG-2000 images to removecompression artifacts. Proc. IEEE Signal Process. Lett. 10, 296–299 (2003)
9. Matheron, G.: Random Sets and Integral Geometry. Wiley, NewYork (1975)
10. Serra, J.: Image Analysis and Mathematical Morphology. Acad-emic Press, London (1982)
11. Koskinen, L., Jaakko, A.: Soft morphological filters: a robust mor-phological filtering method. J. Electron. Imaging 3, 60–70 (1994)
12. Zeybeck, E.H., Nait-Ali, A.: Improvement of JPEG2000 lossy com-pression performance using preliminary nonlinear-filtering. Proc.Int. J. Signal Process. 4, 24–30 (2008)
13. Parvati, K., Prakasa Rao, B.S., Mariya Das, M.: Image segmenta-tion using gray-scale morphology and marker-controlled watershedtransformation. Dis. Dyn. Nat. Soc. 2008, 1–8 (2008)
14. Podlasov, A., Ageenko, E.: Morphological Reconstruction ofSemantic Layers in Map Images. University of Joansuu, Depart-ment of computer science, Finland, Technical report (2004)
15. Salembier, P., Serra, J.: Flat zones filtering, connected operators,and filters by reconstruction. Proc. IEEE Trans Image Process. 4(8),53–60 (1995)
16. Vincent, L.: Morphological grayscale reconstruction: Definition,efficient algorithms and applications in image analysis. In: Proceed-ings of the International Computer Vision and Pattern RecognitionConference, pp. 176–201. Piscataway, NJ, USA (1993)
17. Guangyao, C., Luhong, L., Siwei, M., Zhao, D.: Image QualityAssessment Using Spatial Frequency Component. In: Proceedingsof the Pacific Rim Conference on Multimedia: Advances in Mul-timedia Information Processing, pp. 201–211, Berlin, Heidelberg(2009)
18. Wang, Z., Bovik, A.C.: A universal image quality index. Proc. IEEESignal Process. Lett. 9(3), 81–84 (2002)
19. Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multi-scale structural sim-ilarity for image quality assessment. In: Proceedings of the IEEEAsilomar Conference on Signals, Systems, and Computers, pp.1398–1402 (2003)
20. Hore, A., Ziou, D.: Image quality metrics: PSNR vs. SSIM. In: Pro-ceedings of the International Conference on Pattern Recognition,pp. 2366–2369. IEEE Computer Society, Washington, DC, USA(2010)
21. Ouni, T., Lassoued, A., Abid, M.: Lossless image compressionusing gradient based space filling curves (G-SFC). In: Proceedingsof the Springer, Signal, Image and Video Processing, Vol. 7 (2013)
123