SVD+DWT Compression
SVD+DWT Compression
6,700
Open access books available
182,000
International authors and editors
195M
Downloads
154
Countries delivered to
TOP 1%
most cited scientists
12.2%
Contributors from top 500 universities
http://dx.doi.org/10.5772/61335
Abstract
In this chapter, we propose a new lossy image compression technique that uses
singular value decomposition (SVD) and wavelet difference reduction (WDR)
technique followed by resolution enhancement using discrete wavelet transform
(DWT) and stationary wavelet transform (SWT). The input image is decomposed
into four different frequency subbands by using DWT. The low-frequency sub‐
band is the being compressed by using DWR and in parallel the high-frequency
subbands are being compressed by using SVD which reduces the rank by ignor‐
ing small singular values. The compression ratio is obtained by dividing the total
number of bits required to represent the input image over the total bit numbers
obtain by WDR and SVD. Reconstruction is carried out by using inverse of WDR
to obtained low-frequency subband and reconstructing the high-frequency sub‐
bands by using matrix multiplications. The high-frequency subbands are being
enhanced by incorporating the high-frequency subbands obtained by applying
SWT on the reconstructed low-frequency subband. The reconstructed low-fre‐
quency subband and enhanced high-frequency subbands are being used to gener‐
ate the reconstructed image by using inverse DWT. The visual and quantitative
experimental results of the proposed image compression technique are shown
and also compared with those of the WDR with arithmetic coding technique and
JPEG2000. From the results of the comparison, the proposed image compression
technique outperforms the WDR-AC and JPEG2000 techniques.
© 2015 The Author(s). Licensee InTech. This chapter is distributed under the terms of the Creative Commons
Attribution License (http://creativecommons.org/licenses/by/3.0), which permits unrestricted use, distribution,
and reproduction in any medium, provided the original work is properly cited.
36 Wavelet Transform and Some of Its Real-World Applications
1. Introduction
With the growing demand for multimedia applications, especially high-definition images,
efficient storage and transmission of images have been issues of great concern [1, 2, 3, 4]. Image
processing deals with the reduction of the amount of bits used to represent an image. Not only
that but also resolution of an image plays an important role in many image processing
applications, such as video resolution enhancement [5], feature extraction [6], and satellite
image resolution enhancement [7]. In general, there are two types of super resolution ap‐
proaches, multi-image super resolution and single image. Multiple-image super-resolution
algorithms, like [8], [9], [10] to name a few, receive a couple of low-resolution images of the
same scene as input and usually employ a registration algorithm to find the transformation
between them. This transformation information is then used along with the estimated blurring
parameters of the input low-resolution images, to combine them into a higher-scale framework
to produce a super-resolved output image. For multiple-image super-resolution algorithms to
work properly there should be subpixels displacements between input low-resolution images.
Furthermore, these subpixels displacements should be estimated properly by the registration
algorithm, which is usually a challenging task, especially when complicated motion of
nonrigid objects, like human body, needs to be modeled. These algorithms are guaranteed to
produce proper higher-resolution details; however, their improvement factors are usually
limited by factors close to 2 [11].
Single-image super-resolution algorithms, like [12, 13, 14], to name a few, do not have the
possibility of utilizing subpixel displacements, because they only have a single input. Instead,
they employ a kind of training algorithm to learn the relationship between a set of high-
resolution images and their low-resolution counterparts. This learned relationship is then used
to predict the missing high-resolution details of the input low-resolution images. Depending
on the relationship between the training low- and high-resolution images, these algorithms
can produce high-resolution images that are far better than their inputs, by improvement
factors that are much larger than 2 [15]. Hence, compression of an image and yet reconstructing
the image with good resolution is important. Information theory is playing an important role
in image compression. Information theory can be used in order to reduce the dimensionality
of data such as histogram [16, 17]
There are two categories of image compression techniques, namely lossless and lossy image
compression techniques [18, 19]. In lossless image compression, the original image can be
perfectly recovered from the compressed image while in lossy compression the original image
cannot be perfectively recovered from the compressed image because some information is lost
as a result of compression. Lossless compression is used in applications with high requirements
such as medical imaging. Lossy compression techniques are very popular because they offer
higher compression ratio. The objective of image compression is to achieve as much compres‐
sion as possible with little loss of information [20, 21].
Wavelets are also playing significant role in many image processing applications [12, 22, 23,
24]. The two-dimensional wavelet decomposition of an image is performed by applying the
one-dimensional DWT along the rows of the image first, and then the results are decomposed
Resolutıon Enhancement Based Image Compression Technique… 37
http://dx.doi.org/10.5772/61335
along the columns. This operation results in four decomposed subband images referred to
Low-Low (LL), Low-High (LH), High-Low (HL), and High-High (HH). The frequency
components of those subbands cover the full frequency spectrum of the original image. Figure
1 shows different subband images of Lena image where the top-left image is the LL subband
and the bottom-right image is the HH subband.
Figure 1. LL, LH, HL, and HH subbands of Lena image obtained by using DWT.
In this research work, a new lossy compression technique which employs singular value
decomposition (SVD) and wavelet difference reduction (WDR) is presented. SVD is a lossy
image compression technique which can be regarded as a quantization process where it
38 Wavelet Transform and Some of Its Real-World Applications
reduces the physcovisual redundancies of the image [25, 26]. In order to enhance the resolution
of the decompressed image, stationary wavelet transform (SWT) is used. WDR is one of the
state-of-the-art techniques in image compression which uses wavelet transform. It is a lossy
image compression technique which achieves compression by first taking the wavelet
transform of the input image and then applying the difference reduction method on the
transform values [27, 28, 29, 30].
Wavelet transform based techniques also play a significant role in many image processing
applications, in particular in resolution enhancement, and recently, many novel resolution
enhancement by using wavelet transforms have been proposed. Demirel and Anbarjafari [31]
proposed an image resolution enhancement technique based on the input image and interpo‐
lation of the high-frequency subband images obtained by DWT. In their technique, an SWT
technique is used in order to enhance the edges. Then, at the same time input image is
decomposed into four frequency subbands image by using DWT. After that the input image,
as well as the high-frequency subbands are interpolated. The high-frequency subbands of SWT
are used to modify the estimated high-frequency subbands. Finally, inverse DWT (IDWT) is
applied to combine all frequency subbands in order to generate a high-resolution image. Figure
2 shows the block diagram of the proposed method in [31].
LL
LH
SWT
HL
Estimated
+ LH
DWT
Interpolation
with factor 2
LH
LL
Interpolation with
factor α/2
high-frequency features. Two frequency subbands, LH and HL, were estimated based on
wavelet frame in order to get a high-resolution image. The LH and HL frequency subbands
were used to prepare their training sets. Then, they used the training set in order to estimate
wavelet coefficients for both LH and HL frequency subbands. Finally, the IDWT was used in
order to reconstruct a high-resolution image.
In [33], the authors used a complex wavelet-domain image resolution enhancement algorithm
based on the estimation of wavelet coefficients. Their method uses a dual-tree complex wavelet
transform (DT-CWT) in order to generate a high-resolution image. First, they estimate a set of
wavelet coefficients from the DT-CWT decomposition of the rough estimation of the high-
resolution image. Then, the inverse DT-CWT is used to combine the wavelet coefficients and
the low-resolution input image in order to reconstruct a high-resolution image. Figure 3 shows
the block diagram of the proposed method in [33].
Interpolation with a
factor α/2
Low frequency
subband images (n/2xm/2)
Super
Input DT-CWT
IDT-CWT resolved high
Image Inteprolation resolution
(nxm) with a image
factor α (αn x αm)
+75 +45 +15 -15 -45 -75 +75 +45 +15 -15 -45 -75
High frequency subband images Interpolated High frequency subband images
(n/2xm/2) (αn/2xαm/2)
Patel and Joshi [34] proposed a new learning-based approach for super resolution using DWT.
The novelty of their method lies in designing application-specific wavelet basis (filter coeffi‐
cients). First the filter coefficients and learning the high-frequency details in the wavelet
domain is used to initial estimate of super-resolution image. Then, they used a sparsely based
regularization framework, in which image there was degradation. Finally, the super-resolution
image is estimated by the initial super-resolution estimate and the estimated wavelet filter
coefficients. Their algorithm has some advantages such as avoiding the use of registered
images while learning the initial estimate, use of sparsity prior to preserving neighborhood
dependencies in super-resolution image and use of estimated wavelet filter coefficients to
represent an optimal point spread function to model image acquisition process. Figure 4
illustrates the block diagram of the proposed method in [34].
In [35], similar to the proposed method in [30], the authors used wavelet domain in order to
generate super-resolution image from a single low-resolution image. They proposed an
intermediate stage with the aim of estimating high-frequency subbands. The intermediate
stage consists of an edge preservation procedure and mutual interpolation between the input
low-resolution image and the HF subband images. Sparse mixing weights are calculated over
blocks of coefficients in an image, which provides a sparse signal representation in the low-
40 Wavelet Transform and Some of Its Real-World Applications
resolution image. Finally, they used IDWT to combine all frequency subbands in order to
reconstruct a high-resolution image. The block diagram of their proposed method is shown in
Fig. 5.
In the proposed compression technique, the input image is firstly decomposed into its different
frequency subbands by using 1 level DWT. The LL subband is then being compressed by using
DWR and the high-frequency subbands, i.e., LH, HL, and HH, are being compressed by using
SVD. The proposed technique has been tested on several well-known images such as, Lena,
Peppers, Boat, and Airfield. The results of this technique have been compared with those of
JPEG2000 and WDR with arithmetic coding techniques. The quantitative experimental results
based on PSNR show that the proposed technique overcomes the aforementioned techniques.
The SVD and WDR image compression techniques are discussed in the next section.
42 Wavelet Transform and Some of Its Real-World Applications
From a mathematical point of view, an image can be represented by a matrix, which consists
of one or three layers in the case the image is grayscale or RGB, respectively. The results
of the implementation of SVD on a grayscale image, which is represented by the single-
layer image A, are three matrices U, Σ, and V, where U and V are orthogonal, and Σ is a
diagonal matrix containing the singular values of A. In what follows, the SVD procedure
is briefly reviewed. The relation between the matrix A, and the decomposed components,
U, Σ, and V, can be mathematically presented through the formulation provided in Eqn.
(1), where the dimensions of all the matrices are shown, given that the dimensions of the
matrix A has been m×n [37, 38, 39]:
Eqn. (2) shows how a matrix Σ̄ p×q with smaller dimensions p ≤ m and q ≤ n can be used to
approximate the diagonal matrix with the dimensions m and n:
é ù
Sm´n = êS p´q 0 ú p £ m and q £ n (2)
êë 0 Oúû
Some columns of U and rows of V are then reduced in order to reconstruct the compressed
image by multiplication. This is shown in Eqn. (3):
U m´m = êéU m ´ p U
± ù
m ´ ( m - p ) ú and Vn´n = êéV n ´ q ±
V n ´ ( n - q ) úù (3)
ë û ë û
( )
T
Because the singular matrix has sorted singular values (in descending order), by using the
physcovisual concept, ignoring low singular value will not significantly reduce the visual
quality of the image. Figure 7 shows Lena’s picture being reconstructed by using different
amount of singular values. This characteristic that an image can be reconstructed by fewer
amounts of singular values makes SVD suitable for compression. Because after reconstruc‐
tion of the image the ignored singular values cannot be recovered, the compression by SVD is
lossy [33].
(a) (b)
(c) (d)
Figure 7. Lena’s image of size 256x256 reconstructed by Eq. (4) (a) orıginal Lena image; (b) reconstructed using128 σ;
(c) reconstructed using 64 σ; (d) reconstructed using32 σ.
The WDR is a compression technique, which is based on the difference reduction method. The
wavelet transform of the input image is first made; bit plane encoding is then applied to the
44 Wavelet Transform and Some of Its Real-World Applications
transform values. The bit plane encoding procedure starts with the initialization stage, where
a threshold To is chosen such that To is greater than all the transform values, and at least, one
of the transform values has a magnitude of To/2. The next stage is the initialization, where the
threshold T = Tk-1 is updated to T = Tk, where Tk = Tk-1/2. New significant transform values (w(i))
which are satisfying T ≤ | w(i) | ≤ 2T are then identified at the significant pass stage. The
transform values of these significant transform values are then encoded using the difference
reduction method. At the significant pass stage, already quantized values (wQ) which satisfy
|wQ| ≥ 2T are then refined in order to reduce error [27, 29, 30].
The proposed image compression technique is a lossy compression technique. Firstly, the
image is decomposed into its frequency subbands by using DWT. Among these subbands, LL
subband is being compressed by using WDR. The high-frequency subband images are being
compressed by using SVD. The number of singular values that are being used in order to
reconstruct the high-frequency subbands can be reduced into 1, i.e., the highest singular value
is enough to reconstruct the high-frequency subbands. If only one singular value is being used
in order to reconstruct a matrix, this means that only one column of U and V matrices are being
used. The qualitative loss is not psychovisually noticeable up to some point. In order to obtain
the compression ratio of the proposed technique, the total number of bits required to represent
the original image is divided by the total of number of bits which is obtained by adding the
number of bit streams of WDR for LL and that of the SVD compression for LH, HL, and HH.
Decompression is carried out by taking the inverse WDR (IWDR) of the bit streams in order
to reconstruct the LL subband and in parallel the matrix multiplications are conducted in order
to reconstruct LH, HL, and HH subbands. Due to the losses by ignoring low-valued singular
values, high-frequency subbands need to be enhanced. For this purpose, stationary wavelet
transform (SWT) is applied to the LL subband image which results in new low- and high-
frequency subbands. These high-frequency subbands will have the same direction as the ones
obtained by DWT (e.g., horizontal, vertical, and diagonal), so they will be added to the
respective ones reconstructed by matrix multiplications. Now, the LL subband image obtained
by IWDR and the enhanced LH, HL, and HH subbands are combined by using inverse DWT
(IDWT) in order to reconstruct the decompressed image. The enhancement of high-frequency
subbands by using SWT results in more sharpened decompressed image. The block diagram
of the proposed lossy image compression technique is shown in Fig. 8. The experimental
qualitative and quantitative results are represented and discussed in the next section.
As it was mentioned in the Introduction, for comparison purposes, the proposed lossy image
compression was tested on many benchmark images, namely, Lena, Pepper, Boats, Airfield,
Resolutıon Enhancement Based Image Compression Technique… 45
http://dx.doi.org/10.5772/61335
DWR
SVD based
Encoding
Compression
Compressed
DWT
bit stream
SVD based
Compression
SVD based
Compression
LL LL LH LL
Inverse of
LL SWT
WDR
Decoding
HLLL HHLL
Compressed
bit stream
IDWT
LH LH
Inverse of
SVD
procedure
HL
+ HL
HH HH
Enhanced
Figure 8. Block diagram of the proposed blocked based lossy image compression technique.
and Goldhill. All the input images were of resolutions 256 x 256 pixels, 8-bit grayscale. Tables
1, 2, and 3 provide a quantitative comparison between the proposed technique, JPEG2000, and
WDR [40, 41] based on PSNR values, in dB, for compression ratios 20:1, 40:1, and 80:1,
respectively.
The foregoing tables illustrate the superiority of the proposed method in terms of its capability
in leading to significantly higher PSNR values compared to the other techniques proposed,
previously, in the literature. It is worth noticing that the improvement in the PSNR values
brought about by considering the proposed method might better show its impact while
keeping in mind the fact that they are calculated in dB, meaning that a logarithmic function
determines them, which clarifies how considerable the difference between the actual values
has been. To be more clear, if one calculates the difference between the PSNR values obtained
using WDR and JPEG2000, and subsequently, that of JPEG2000 and the proposed method, it
can be seen that the latter is much higher than the former, although JPEG2000 has always been
deemed of significantly better performance than WDR. Thus, it can be concluded that the
proposed method makes an enormous enhancement to the PSNR values compared to the ones
obtained upon employing WDR or JPEG2000.
46 Wavelet Transform and Some of Its Real-World Applications
Techniques
Image
WDR JPEG2000 Proposed
Techniques
Image
WDR JPEG2000 Proposed
Techniques
Image
WDR JPEG2000 Proposed
In order to ensure the quality of the output of the proposed technique, and for visual illustra‐
tion, the images resulting from the implementation of the foregoing approach were obtained,
along with that of JPEG2000 and WDR. Figure 9 shows a part of the magnified Lena image
having been compressed using the foregoing approaches, separately, with compression ratio
40:1. As sought from the outset, the proposed method is competent enough to maintain the
quality of the image while compressing it, and at the same time, result in better PSNR, which
shows its capability in correctly deciding on a reasonable trade-off between the amount of data
needed to be transferred, or kept, and the visibility and authenticity of the details in the image
Resolutıon Enhancement Based Image Compression Technique… 47
http://dx.doi.org/10.5772/61335
blocks, which is, probably, the most tricky criterion in devising image compression algorithms.
As Fig. 9 illustrates, the overall quality of the Lena image being compressed by the proposed
method is satisfactory despite possessing much higher PSNR value compared to the JPEG2000
and WDR techniques, and the details are clear and visible, even better than the output of the
WDR.
(a) (b)
(c) (d)
Figure 9. Zoomed (a) original Lena image, and compressed images by using (b) JPEG2000, (c) WDR, and (d) the pro‐
posed image compression technique at 40:1 compression ratio.
5. Conclusion
In this research work, a new lossy image compression technique which uses singular value
decomposition and wavelet difference reduction techniques, followed by resolution enhance‐
ment, using discrete wavelet transform and stationary wavelet transform was proposed.
As the first step in the proposed image compression technique, the input image was decom‐
posed into four different frequency subbands using discrete wavelet transform. The low-
frequency subband was compressed using wavelet difference reduction, and in parallel, the
high-frequency subbands were compressed using singular value decomposition. The com‐
pression ratio was obtained by dividing the total number of bits required to represent the input
48 Wavelet Transform and Some of Its Real-World Applications
image over the total bit numbers obtained by wavelet difference reduction and singular value
decomposition.
Reconstruction was carried out using inverse wavelet difference reduction to obtain low-
frequency subband and reconstructing the high-frequency subbands using matrix multipli‐
cations. The high-frequency subbands were enhanced using high frequency obtained by
stationary wavelet transform. The reconstructed low-frequency subband and enhanced high-
frequency subbands were used to generate the reconstructed image using inverse discrete
wavelet transform.
The visual and quantitative experimental results of the proposed image compression technique
showed that the proposed image compression technique outperformed the wavelet difference
reduction and JPEG2000 techniques.
Acknowledgements
The research was supported by the ERDF program, “Estonian higher education information
and communications technology and research and development activities state program
2011-2015 (ICT program)” and Estonian Research Council grant (PUT638).
Author details
1 iCV Group, IMS Lab, Institute of Technology, University of Tartu, Tartu, Estonia
2 Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, United
Kingdom
References
[7] H. Demirel and G. Anbarjafari, "Discrete wavelet transform based satellite image res‐
olution enhancement," Geoscience and Remote Sensing, IEEE Transactions on, vol. 49,
no. 6, pp. 1997-2004, 2011.
[8] C. Liu and D. Sun, "On Bayesian Adaptive Video Super Resolution," Pattern Analysis
and Machine Intelligence, IEEE Transactions on, vol. 36, no. 2, pp. 346-360, 2014.
[9] G. Polatkan, M. Zhou, L. Carin, D. Blei and I. Daubechies, "A Bayesian Nonparamet‐
ric Approach to Image Super-Resolution," Pattern Analysis and Machine Intelligence,
IEEE Transactions on, vol. 37, no. 2, pp. 346-358, 2015.
[10] P. Rasti, H. Demirel and G. Anbarjafari, "Image resolution enhancement by using in‐
terpolation followed by iterative back projection," in Signal Processing and Communi‐
cations Applications Conference (SIU), 2013 21st, pp. 1-4, 2013.
[11] D. Glasner, S. Bagon and M. Irani, "Super-resolution from a single image," in Comput‐
er Vision, 2009 IEEE 12th International Conference on, pp. 349-356, 2009.
[14] Y. Zhu, Y. Zhang and A. Yuille, "Single Image Super-resolution Using Deformable
Patches," in Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on,
pp. 2917-2924 2014.
[16] H. Demirel and G. Anbarjafari, "Data fusion boosted face recognition based on prob‐
ability distribution functions in different colour channels," EURASIP Journal on Ad‐
vances in Signal Processing, vol. 2009, p. 25, 2009.
[17] H. Demirel and G. Anbarjafari, "Iris recognition system using combined colour statis‐
tics.," in Signal Processing and Information Technology, pp. 175-179, 2008.
[18] X. Zhang, "Lossy compression and iterative reconstruction for encrypted image," In‐
formation Forensics and Security, IEEE Transactions on, vol. 16, no. 1, pp. 53-58, 2011.
[19] F. Hussain and J. Jeong, "Efficient Deep Neural Network for Digital Image Compres‐
sion Employing Rectified Linear Neurons," Journal of Sensors, 2015.
[20] M. Boliek, "Beyond compression: a survey of functionality derived from still image
coding," in Signals, Systems and Computers, 2004. Conference Record of the Thirty-Seventh
Asilomar Conference on, pp. 1971-1974, 2003.
[21] R. C. Gonzalez and R. E. Woods, Digital Image Processing, Prentice Hall Upper Sad‐
dle River, NJ, 2002.
[25] J.-F. Yang and C.-L. Lu, "Combined Techniques of Singular Value Decomposition
and Vector Quantization for Image Coding," IEEE Transactions on Image Processing,
vol. 4, no. 8, pp. 1141-1146, 1995.
[26] M. D. Greenberg, Differential equations & Linear algebra, Prentice Hall, 2001.
[27] J. Tian and R. O. Wells Jr, "Embedded image coding using wavelet difference reduc‐
tion," in Wavelet image and video compression, pp. 289-301, 2002.
[28] J. Tian and R. O. Wells Jr, "A lossy image codec based on index coding," in IEEE Data
Compression Conference,p. 456, 1996.
Resolutıon Enhancement Based Image Compression Technique… 51
http://dx.doi.org/10.5772/61335
[29] J. Tian and R. O. Wells Jr, "Image data processing in the compressed wavelet do‐
main," in Signal Processing, 1996., 3rd International Conference on, pp. 978-981 1996.
[30] S. Raja and A. Suruliandi, "Image Compression Using WDR and ASWDR Techniques
With Different Wavelet Codecs," ACEEE Int. J. Inform. Technol. v01, pp. 23-26, 2011.
[31] H. Demirel and G. Anbarjafari, "Image resolution enhancement by using discrete and
stationary wavelet decomposition.," Image Processing, IEEE Transactions on, vol. 20,
no. 5, pp. 1458-1460, 2011.
[32] C. Kim, K. Choi, K. Hwang and J. B. Ra, "Learning-based superresolution using a
multi-resolution wavelet approach," in Iternational workshop on Advance Image Technol‐
ogy (IWAIT), 2009.
[33] H. Demirel and G. Anbarjafari, "Satellite image resolution enhancement using com‐
plex wavelet transform.," Geoscience and Remote Sensing Letters, IEEE, vol. 7, no. 1, pp.
123-126, 2010.
[34] R. C. Patel and M. V. Joshi, "Super-Resolution of Hyperspectral Images: Use of Opti‐
mum Wavelet Filter Coefficients and Sparsity Regularization," Geoscience and Remote
Sensing, IEEE Transactions on, vol. 53, no. 4, pp. 1728 - 1736, 2015.
[36] P. P. Gajjar and M. V. Joshi, "New learning based super-resolution: use of DWT and
IGMRF prior," Image Processing, IEEE Transactions on, vol. 19, no. 5, pp. 1201-1213,
2010.
[37] D. Kaiman, "A Singularly Valuable Decomposition," College Mathematics Journal, vol.
27, no. 1, pp. 2-23, 1996.
[38] L. Knockaert, B. De Backer and D. De Zutter, "SVD compression, unitary transforms,
and computational complexity," Signal Processing, IEEE Transactions on, vol. 47, no.
10, pp. 2724-2729, 1999.
[39] S. J. Nivedita, "Performance Analysis of SVD and SPIHT Algorithm for Image Com‐
pression Application," International Journal of Advanced Research in Computer Science
and Software Engineering, vol. 2, no. 2, 2012.
[40] J. S. Walker, T. Q. Nguyen and Y.-J. Chen, "A low-power, low-memory system for
wavelet-based image compression," Optical Engineering, Research Signposts, vol. 5, pp.
111-125, 2003.