IP Unit 5 Notes

The document discusses various techniques for compressing digital images including lossless techniques like run length encoding, differential coding, and predictive coding as well as lossy techniques involving quantization. It covers goals of image compression, definitions of compression ratio and data redundancy, components of a generic compression system, and specific methods like Huffman coding, arithmetic coding, and dictionary based coding.

Uploaded by

kartik gupta


Unit 5.

IMAGE COMPRESSION AND

RECOGNITION
IMAGE COMPRESSION
• Image compression deals with reducing the amount of data
required to represent a digital image by removing
redundant data.
• Images can be represented in digital format in many ways.
• Encoding the contents of a 2-D image in a raw bitmap (raster)
format is usually not economical and may result in very large
files.
• Since raw image representations usually require a large
amount of storage space (and proportionally long
transmission times in the case of file uploads/ downloads),
most image file formats employ some type of compression.
• The need to save storage space and shorten transmission
time, as well as the human visual system tolerance to a
modest amount of loss, have been the driving factors behind
image compression techniques.
Goal of image compression
• The goal of image compression is to reduce the amount of
data required to represent a digital image.
Data ≠ Information:
• Data and information are not synonymous terms!
• Data is the means by which information is conveyed.
• Data compression aims to reduce the amount of data
required to represent a given quality of information
while preserving as much information as possible.
• The same amount of information can be represented
by various amounts of data.
• Ex1: You have an extra class after 3.50 p.m.
• Ex2: An extra class has been scheduled for you after
the 7th hour.
• Ex3: After 3.50 p.m., you should attend the extra class.
Definition of compression ratio
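Compression ratio is commonly defined as C = b / b', where b is the number of bits in the original representation and b' the number of bits after compression; the relative data redundancy is then R = 1 - 1/C. A minimal sketch, assuming those standard definitions (the function names and the example image size are illustrative, not taken from the slides):

```python
def compression_ratio(original_bits: int, compressed_bits: int) -> float:
    """C = b / b': bits before compression divided by bits after."""
    if original_bits <= 0 or compressed_bits <= 0:
        raise ValueError("bit counts must be positive")
    return original_bits / compressed_bits


def relative_redundancy(original_bits: int, compressed_bits: int) -> float:
    """R = 1 - 1/C: fraction of the original data that is redundant."""
    return 1.0 - 1.0 / compression_ratio(original_bits, compressed_bits)


# Hypothetical example: a 256 x 256, 8-bit image compressed to 65,536 bits.
b = 256 * 256 * 8
b_prime = 65_536
print(compression_ratio(b, b_prime))    # 8.0
print(relative_redundancy(b, b_prime))  # 0.875
```

A ratio of 8 means that 87.5% of the original data was redundant under this measure.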
Definitions of Data Redundancy
• Two-dimensional intensity arrays suffer from three
principal types of data redundancies that can be
identified and exploited:
1. Coding redundancy
2. Spatial and temporal redundancy.
3. Irrelevant information.
Coding redundancy
• Code: a list of symbols (letters, numbers, bits, etc.)
• Code word: a sequence of symbols used to represent a piece of
information or an event (e.g., gray levels).
• Code word length: number of symbols in each code word.
Spatial and Temporal Redundancy
• In the corresponding 2-D intensity array:
1. All 256 intensities are equally probable.
As Fig. 8.2 shows, the histogram of the image is
uniform.
2. Because the intensity of each line was selected
randomly, its pixels are independent of one another
in the vertical direction.
3. Because the pixels along each line are identical, they
are maximally correlated (completely dependent on
one another) in the horizontal direction.
Irrelevant Information
• Most 2-D intensity arrays contain information that is
ignored by the human visual system and/or
extraneous to the intended use of the image.
• It is redundant in the sense that it is not used.
• The main components of the encoder in a generic image
compression system are:
• Mapper: transforms the input data into a (usually nonvisual)
format designed to reduce interpixel redundancies in the
input image. This operation is generally reversible and may or
may not directly reduce the amount of data required to
represent the image.
• Quantizer: reduces the accuracy of the mapper’s output in
accordance with some pre-established fidelity criterion.
Reduces the psychovisual redundancies of the input image.
This operation is not reversible and must be omitted if lossless
compression is desired.
• Symbol (entropy) encoder: creates a fixed- or variable-length
code to represent the quantizer’s output and maps the output
in accordance with the code. In most cases, a variable-length
code is used. This operation is reversible.
Some Basic Compression Methods
• Error-free compression
• Error-free compression techniques usually rely on entropy-based
encoding algorithms.
• The concept of entropy is mathematically described by the equation:
H(z) = -\sum_{j=1}^{J} P(a_j) \log_2 P(a_j)
where:
• a_j is a symbol produced by the information source
• P(a_j) is the probability of that symbol
• J is the total number of different symbols
• H(z) is the entropy of the source, in bits per symbol.
• The concept of entropy provides an upper bound on how much
compression can be achieved, given the probability distribution of
the source.
• In other words, it establishes a theoretical limit on the amount of
lossless compression that can be achieved using entropy encoding
techniques alone.
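The entropy of a source can be estimated from a probability list or from an image histogram. The sketch below assumes base-2 logarithms (so the result is in bits per symbol) and treats the normalized histogram as the symbol probabilities; the function names are illustrative.

```python
import math
from collections import Counter


def entropy_bits(probabilities):
    """H = -sum(p * log2(p)) over the symbols with nonzero probability."""
    return -sum(p * math.log2(p) for p in probabilities if p > 0)


def image_entropy(pixels):
    """First-order entropy estimate from a flat list of pixel values."""
    counts = Counter(pixels)
    total = len(pixels)
    return entropy_bits(c / total for c in counts.values())


print(entropy_bits([0.25, 0.25, 0.25, 0.25]))            # 2.0
print(image_entropy([0, 0, 0, 0, 255, 255, 255, 255]))   # 1.0
```

Four equally likely symbols need 2 bits each, while a two-level image with equal counts needs only 1 bit per pixel, however many bits the raw format spends on each pixel.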
Huffman Coding
• One of the most popular techniques for removing coding
redundancy is due to Huffman (Huffman [1952]).
• When coding the symbols of an information source
individually, Huffman coding yields the smallest possible
number of code symbols per source symbol.
• The first step in Huffman’s approach is to create a series
of source reductions by ordering the probabilities of the
symbols under consideration and combining the lowest
probability symbols into a single symbol that replaces
them in the next source reduction.
• Figure 8.7 illustrates this process for binary coding (K-ary
Huffman codes can also be constructed).
• The second step in Huffman’s procedure is to code each
reduced source, starting with the smallest source and
working back to the original source.
• The minimal-length binary code for a two-symbol source, of
course, consists of the symbols 0 and 1.
• As Fig. 8.8 shows, these symbols are assigned to the two
symbols on the right (the assignment is arbitrary; reversing
the order of the 0 and 1 would work just as well).
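• The average length of this code is the sum of the code word
lengths weighted by their symbol probabilities,
L_avg = \sum_{j=1}^{J} l(a_j) P(a_j),
where l(a_j) is the length of the code word assigned to a_j,
and the entropy of the source is 2.14 bits/symbol.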

• Huffman’s procedure creates the optimal code for a
set of symbols and probabilities subject to the
constraint that the symbols be coded one at a time.
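The two steps of Huffman's procedure (repeatedly merging the two least probable symbols, then assigning bits while working back) can be sketched with a priority queue. The six-symbol probabilities below are an assumption, chosen so that the source entropy comes out at the 2.14 bits/symbol quoted above; the exact code words depend on how ties are broken, but the average length does not.

```python
import heapq
import math
from itertools import count


def huffman_code(probabilities):
    """Return {symbol: bit string} for a dict {symbol: probability}."""
    if len(probabilities) == 1:
        return {sym: "0" for sym in probabilities}
    tiebreak = count()  # unique counter so the heap never compares dicts
    heap = [(p, next(tiebreak), {sym: ""}) for sym, p in probabilities.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        # Source reduction: merge the two least probable entries.
        p0, _, codes0 = heapq.heappop(heap)
        p1, _, codes1 = heapq.heappop(heap)
        # Code assignment: prepend one bit to every code word in each branch.
        merged = {s: "0" + c for s, c in codes0.items()}
        merged.update({s: "1" + c for s, c in codes1.items()})
        heapq.heappush(heap, (p0 + p1, next(tiebreak), merged))
    return heap[0][2]


# Assumed source probabilities (not reproduced in the slides).
source = {"a1": 0.1, "a2": 0.4, "a3": 0.06, "a4": 0.1, "a5": 0.04, "a6": 0.3}
code = huffman_code(source)
avg_len = sum(p * len(code[s]) for s, p in source.items())
entropy = -sum(p * math.log2(p) for p in source.values())
print(round(avg_len, 2))  # 2.2
print(round(entropy, 2))  # 2.14
```

Under these assumed probabilities the average code length (2.2 bits/symbol) stays slightly above the entropy, as expected when symbols are coded one at a time.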
Variable Length Coding (VLC)
• Most entropy-based encoding techniques rely on
assigning variable-length codewords to each symbol,
with the most likely symbols being assigned the shortest
codewords.
• In the case of image coding, the symbols may be raw
pixel values or the numerical values obtained at the
output of the mapper stage (e.g., differences between
consecutive pixels, run-lengths, etc.).
• The most popular entropy-based encoding technique is
the Huffman code.
• It provides the smallest number of information units (bits)
per source symbol when symbols are coded one at a time.
Arithmetic Coding
• Unlike the variable-length codes of the previous
two sections, arithmetic coding generates
nonblock codes.
• In arithmetic coding, which can be traced to the
work of Elias (see Abramson [1963]), a one-to-
one correspondence between source symbols
and code words does not exist.
• Instead, an entire sequence of source symbols (or
message) is assigned a single arithmetic code
word.
• Figure 8.12 illustrates the basic arithmetic coding
process.
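The interval-narrowing idea behind arithmetic coding can be sketched with exact fractions. This is an illustration only: the four-symbol source and the five-symbol message are assumptions, and practical coders use finite-precision integer arithmetic and an end-of-message symbol rather than a known message length.

```python
from fractions import Fraction


def build_intervals(probabilities):
    """Give each symbol its own [low, high) slice of [0, 1)."""
    intervals, low = {}, Fraction(0)
    for sym, p in probabilities.items():
        intervals[sym] = (low, low + p)
        low += p
    return intervals


def arithmetic_encode(message, probabilities):
    """Narrow [0, 1) once per symbol; any number in the result codes the message."""
    intervals = build_intervals(probabilities)
    low, high = Fraction(0), Fraction(1)
    for sym in message:
        width = high - low
        sym_low, sym_high = intervals[sym]
        low, high = low + width * sym_low, low + width * sym_high
    return low, high


def arithmetic_decode(value, length, probabilities):
    """Recover `length` symbols from a number inside the final interval."""
    intervals = build_intervals(probabilities)
    out = []
    for _ in range(length):
        for sym, (sym_low, sym_high) in intervals.items():
            if sym_low <= value < sym_high:
                out.append(sym)
                value = (value - sym_low) / (sym_high - sym_low)
                break
    return out


# Assumed source and message, for illustration.
probs = {"a1": Fraction(1, 5), "a2": Fraction(1, 5),
         "a3": Fraction(2, 5), "a4": Fraction(1, 5)}
msg = ["a1", "a2", "a3", "a3", "a4"]
low, high = arithmetic_encode(msg, probs)
print(float(low), float(high))  # 0.06752 0.0688
```

Any single number in [0.06752, 0.0688) represents the whole five-symbol message, which is why no one-to-one mapping between symbols and code words exists.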
LZW Coding
• In this section, we consider an error-free
compression approach that also addresses
spatial redundancies in an image.
• The technique, called Lempel-Ziv-Welch (LZW)
coding, assigns fixed-length code words to
variable length sequences of source symbols.
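A minimal LZW encoder sketch follows, assuming 8-bit pixels (so codes 0 to 255 are pre-loaded) and an unbounded dictionary; real implementations cap the dictionary size so that every code word fits a fixed number of bits. The 16-pixel test row is an illustrative input.

```python
def lzw_encode(pixels, alphabet_size=256):
    """Encode a sequence of pixel values as a list of dictionary codes."""
    dictionary = {(i,): i for i in range(alphabet_size)}
    next_code = alphabet_size
    current = ()          # longest sequence recognized so far
    codes = []
    for p in pixels:
        candidate = current + (p,)
        if candidate in dictionary:
            current = candidate              # keep growing the match
        else:
            codes.append(dictionary[current])
            dictionary[candidate] = next_code  # learn the new sequence
            next_code += 1
            current = (p,)
    if current:
        codes.append(dictionary[current])
    return codes


row = [39, 39, 126, 126] * 4
print(lzw_encode(row))
# [39, 39, 126, 126, 256, 258, 260, 259, 257, 126]
```

Sixteen pixels are represented by ten codes, because repeated pixel sequences are replaced by single dictionary entries.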
Run-length encoding (RLE)
• RLE is one of the simplest data compression
techniques.
• It consists of replacing a sequence (run) of
identical symbols by a pair containing the symbol
and the run length.
• It is used as the primary compression technique
in the 1-D CCITT Group 3 fax standard and in
conjunction with other techniques in the JPEG
image compression standard.
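Run-length encoding as described above can be sketched in a few lines. The (symbol, run length) pair layout and the sample row are illustrative assumptions, not the bit-level format of the fax or JPEG standards.

```python
from itertools import groupby


def rle_encode(symbols):
    """Replace each run of identical symbols by a (symbol, run_length) pair."""
    return [(sym, len(list(run))) for sym, run in groupby(symbols)]


def rle_decode(pairs):
    """Expand (symbol, run_length) pairs back into the original sequence."""
    return [sym for sym, length in pairs for _ in range(length)]


row = [0, 0, 0, 0, 255, 255, 0, 0, 0]
pairs = rle_encode(row)
print(pairs)  # [(0, 4), (255, 2), (0, 3)]
```

RLE only pays off when runs are long; on noisy data with no repeats, the pairs take more space than the original symbols.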
Differential coding
• Differential coding techniques exploit the
interpixel redundancy in digital images.
• The basic idea consists of applying a simple
difference operator to neighboring pixels to
calculate a difference image, whose values are
likely to fall within a much narrower range
than the original gray-level range.
• As a consequence of this narrower distribution –
and consequently reduced entropy – Huffman
coding or other VLC schemes will produce shorter
codewords for the difference image.
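The entropy reduction can be checked on a small example. The sketch below assumes a horizontal difference operator (each pixel minus its left neighbor, with the first pixel kept as-is) and a hypothetical 16-pixel ramp; both are illustrative choices, and the row is assumed to be non-empty.

```python
import math
from collections import Counter


def entropy_bits(values):
    """First-order entropy estimate of a list of values, in bits per value."""
    counts = Counter(values)
    n = len(values)
    return -sum(c / n * math.log2(c / n) for c in counts.values())


def difference_row(row):
    """Keep the first pixel, then store each pixel minus its left neighbor."""
    return [row[0]] + [row[i] - row[i - 1] for i in range(1, len(row))]


def restore_row(diffs):
    """Invert difference_row by accumulating the differences."""
    row = [diffs[0]]
    for d in diffs[1:]:
        row.append(row[-1] + d)
    return row


ramp = list(range(100, 116))   # 16 distinct gray levels
diffs = difference_row(ramp)   # 100 followed by fifteen 1s
print(entropy_bits(ramp))      # 4.0
print(entropy_bits(diffs) < entropy_bits(ramp))  # True
```

The ramp uses 16 equally likely values (4 bits each), while its difference row is dominated by a single value, so a variable-length code can represent it with far fewer bits.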
Predictive coding
• Predictive coding techniques constitute
another way of exploiting interpixel
redundancy, in which the basic idea is to
encode only the new information in each
pixel.
• This new information is usually defined as the
difference between the actual and the
predicted value of that pixel.
Lossless predictive coding
• Figure 8.33 shows the basic components of a lossless
predictive coding system.
• The system consists of an encoder and a decoder,
each containing an identical predictor.
• As successive samples of discrete time input signal,
f(n), are introduced to the encoder, the predictor
generates the anticipated value of each sample
based on a specified number of past samples.
• The output of the predictor is then rounded to the
nearest integer, denoted \hat{f}(n), and used to form the
difference or prediction error e(n) = f(n) - \hat{f}(n).
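A minimal sketch of the encoder/decoder pair follows, assuming a linear predictor whose coefficients weight the most recent samples (the default is the previous-sample predictor) and assuming the first samples are transmitted unchanged. Function names and the sample signal are illustrative.

```python
def predict(past, coeffs):
    """Linear prediction from the most recent samples, rounded to an integer."""
    return round(sum(a * s for a, s in zip(coeffs, reversed(past))))


def predictive_encode(samples, coeffs=(1.0,)):
    """Return the first len(coeffs) samples followed by prediction errors."""
    m = len(coeffs)
    errors = list(samples[:m])
    for n in range(m, len(samples)):
        errors.append(samples[n] - predict(samples[n - m:n], coeffs))
    return errors


def predictive_decode(errors, coeffs=(1.0,)):
    """Rebuild the samples by running the identical predictor."""
    m = len(coeffs)
    samples = list(errors[:m])
    for n in range(m, len(errors)):
        samples.append(errors[n] + predict(samples[n - m:n], coeffs))
    return samples


f = [100, 102, 103, 103, 101, 98]
e = predictive_encode(f)
print(e)                       # [100, 2, 1, 0, -2, -3]
print(predictive_decode(e))    # [100, 102, 103, 103, 101, 98]
```

Because the decoder forms the same predictions from already-decoded samples, adding the errors back reproduces the input exactly; the small error values are what the symbol encoder then compresses.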
Dictionary-based coding
• Dictionary-based coding techniques are based on the
idea of incrementally building a dictionary (table) while
receiving the data.
• Unlike VLC techniques, dictionary-based techniques
use fixed-length codewords to represent variable-
length strings of symbols that commonly occur
together.
• Consequently, there is no need to calculate, store, or
transmit the probability distribution of the source,
which makes these algorithms extremely convenient
and popular.
• The best-known variant of dictionary-based coding
algorithms is the LZW (Lempel-Ziv-Welch) encoding
scheme, used in popular multimedia file formats such
as GIF, TIFF, and PDF.
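The reason no dictionary or probability table has to be transmitted is that the decoder can rebuild the same dictionary from the codes alone. A minimal LZW decoder sketch, assuming 8-bit symbols and an unbounded dictionary (the input code list is an illustrative example):

```python
def lzw_decode(codes, alphabet_size=256):
    """Rebuild the symbol sequence, regrowing the dictionary on the fly."""
    if not codes:
        return []
    dictionary = {i: (i,) for i in range(alphabet_size)}
    next_code = alphabet_size
    previous = dictionary[codes[0]]
    out = list(previous)
    for code in codes[1:]:
        if code in dictionary:
            entry = dictionary[code]
        elif code == next_code:
            # Code refers to the entry being defined right now.
            entry = previous + (previous[0],)
        else:
            raise ValueError(f"invalid LZW code: {code}")
        out.extend(entry)
        dictionary[next_code] = previous + (entry[0],)
        next_code += 1
        previous = entry
    return out


codes = [39, 39, 126, 126, 256, 258, 260, 259, 257, 126]
print(lzw_decode(codes) == [39, 39, 126, 126] * 4)  # True
```

Each new dictionary entry is the previously decoded string plus the first symbol of the current one, which mirrors what the encoder added at the same step.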
Lossy compression
• Lossy compression techniques deliberately
introduce a certain amount of distortion to
the encoded image, exploiting the
psychovisual redundancies of the original
image.
• These techniques must find an appropriate
balance between the amount of error (loss)
and the resulting bit savings.
Quantization
• The quantization stage is at the core of any lossy image encoding
algorithm.
• Quantization, at the encoder side, means partitioning the
input data range into a smaller set of values.
• There are two main types of quantizers: scalar quantizers and
vector quantizers.
• A scalar quantizer partitions the domain of input values into a
smaller number of intervals.
• If the output intervals are equally spaced, which is the simplest way
to do it, the process is called uniform scalar quantization;
otherwise, for reasons usually related to minimization of total
distortion, it is called nonuniform scalar quantization.
• One of the most popular nonuniform quantizers is the Lloyd-Max
quantizer.
• Vector quantization (VQ) techniques extend the basic principles of
scalar quantization to multiple dimensions. Because of its fast
lookup capabilities at the decoder side, VQ-based coding schemes
are particularly attractive to multimedia applications.
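A uniform scalar quantizer can be sketched as follows, assuming non-negative inputs, equally wide intervals of size `step`, and midpoint reconstruction at the decoder (one common choice among several). The pixel values and step size are illustrative.

```python
def uniform_quantize(value, step):
    """Encoder side: map a value to the index of the interval it falls in."""
    return int(value // step)


def dequantize(index, step):
    """Decoder side: reconstruct every value in an interval as its midpoint."""
    return index * step + step / 2


pixels = [0, 7, 8, 100, 255]
step = 16  # 256 gray levels collapse to 16 intervals
indices = [uniform_quantize(p, step) for p in pixels]
recon = [dequantize(i, step) for i in indices]
print(indices)  # [0, 0, 0, 6, 15]
print(recon)    # [8.0, 8.0, 8.0, 104.0, 248.0]
```

The indices need 4 bits instead of 8, and the reconstruction error never exceeds half the step size; that error is the irreversible loss introduced by the quantizer.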
Transform coding
• The techniques discussed so far work directly on the pixel values
and are usually called spatial domain techniques.
• Transform coding techniques use a reversible, linear mathematical
transform to map the pixel values onto a set of coefficients, which
are then quantized and encoded.
• The key factor behind the success of transform-based coding
schemes is that many of the resulting coefficients for most natural
images have small magnitudes and can be quantized (or discarded
altogether) without causing significant distortion in the decoded
image.
• Different mathematical transforms, such as Fourier (DFT), Walsh-
Hadamard (WHT), and Karhunen-Loeve (KLT), have been considered
for the task.
• For compression purposes, the higher the capability of compressing
information in fewer coefficients, the better the transform; for that
reason, the Discrete Cosine Transform (DCT) has become the most
widely used transform coding technique.
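Energy compaction can be illustrated with a one-dimensional orthonormal DCT-II written in plain Python. This is a sketch under simplifying assumptions: a single 8-sample block (the sample values are hypothetical), a direct O(n^2) evaluation, and "keep the largest coefficients" standing in for a real quantization table.

```python
import math


def _scale(k, n):
    """Normalization factor that makes the DCT-II basis orthonormal."""
    return math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)


def dct(x):
    """Forward orthonormal DCT-II of a list of samples."""
    n = len(x)
    return [_scale(k, n) * sum(x[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                               for i in range(n))
            for k in range(n)]


def idct(c):
    """Inverse of dct(): rebuild the samples from the coefficients."""
    n = len(c)
    return [sum(_scale(k, n) * c[k] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                for k in range(n))
            for i in range(n)]


def keep_largest(coeffs, count):
    """Zero out all but the `count` largest-magnitude coefficients."""
    order = sorted(range(len(coeffs)), key=lambda k: abs(coeffs[k]), reverse=True)
    keep = set(order[:count])
    return [c if k in keep else 0.0 for k, c in enumerate(coeffs)]


block = [52, 55, 61, 66, 70, 61, 64, 73]   # hypothetical pixel row
coeffs = dct(block)
approx = idct(keep_largest(coeffs, 3))
print([round(v) for v in approx])
```

With only three of the eight coefficients retained, the reconstructed row stays close to the original because most of the block's energy sits in the first (low-frequency) coefficients.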
Wavelet coding
• Wavelet coding techniques are also based on the idea that
the coefficients of a transform that decorrelates the pixels
of an image can be coded more efficiently than the original
pixels themselves.
• The main difference between wavelet coding and DCT-based
coding is the omission of the first stage, the subdivision of
the image into subimages.
• Because wavelet transforms are capable of representing an
input signal with multiple levels of resolution, and yet
maintain the useful compaction properties of the DCT, the
subdivision of the input image into smaller subimages is no
longer necessary.
• Wavelet coding has been at the core of the latest image
compression standards, most notably JPEG 2000.
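The multiresolution idea can be sketched with the Haar wavelet, the simplest case: each level splits a signal into pairwise averages (a half-resolution approximation) and pairwise differences (details). This is an illustration only, assuming an unnormalized Haar step on a signal whose length is divisible by 2 at every level; JPEG 2000 itself uses longer filters.

```python
def haar_step(signal):
    """One level: pairwise averages (approximation) and differences (detail)."""
    approx = [(signal[i] + signal[i + 1]) / 2 for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / 2 for i in range(0, len(signal), 2)]
    return approx, detail


def haar_inverse(approx, detail):
    """Undo haar_step exactly."""
    out = []
    for a, d in zip(approx, detail):
        out.extend([a + d, a - d])
    return out


def haar_multilevel(signal, levels):
    """Apply haar_step repeatedly to the approximation part."""
    approx, details = list(signal), []
    for _ in range(levels):
        approx, d = haar_step(approx)
        details.append(d)
    return approx, details


row = [10, 12, 14, 16, 80, 82, 84, 86]
approx, details = haar_multilevel(row, 2)
print(approx)   # [13.0, 83.0]
print(details)  # [[-1.0, -1.0, -1.0, -1.0], [-2.0, -2.0]]
```

The whole row is processed at once, with no block subdivision, and the detail values are small, so they can be quantized coarsely.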
Image compression standards
• JPEG
• One of the most popular and comprehensive continuous
tone, still frame compression standards is the JPEG
standard.
• It defines three different coding systems:
• (1) a lossy baseline coding system, which is based on the
DCT and is adequate for most compression applications;
• (2) an extended coding system for greater compression, higher
precision, or progressive reconstruction applications; and
• (3) a lossless independent coding system for reversible
compression.
• To be JPEG compatible, a product or system must include
support for the baseline system.
• No particular file format, spatial resolution, or color
space model is specified.
MPEG
Unit 5.1

IMAGE COMPRESSION AND

RECOGNITION
Representation and Description
– Representing regions in 2 ways:
• Based on their external characteristics (its
boundary):
– Shape characteristics
• Based on their internal characteristics (its
region):
– Regional properties: color, texture, and …
• Both
– Describes the region based on a selected
representation:
• Representation -> boundary or textural features
• Description -> length, orientation, the number
of concavities in the boundary, statistical
measures of region.
• Invariant Description:
– Size (Scaling)
– Translation
– Rotation
Boundary (Border) Following:
• We need the boundary as an ordered sequence of points.
Polygonal Approximations Using Minimum-Perimeter
Polygons
MPP algorithm
1. The MPP bounded by a simply connected cellular
complex is not self-intersecting.
2. Every convex vertex of the MPP is a W vertex, but not
every W vertex of a boundary is a vertex of the MPP.
3. Every mirrored concave vertex of the MPP is a B vertex,
but not every B vertex of a boundary is a vertex of the
MPP.
4. All B vertices are on or outside the MPP, and all W
vertices are on or inside the MPP.
5. The uppermost, leftmost vertex in a sequence of
vertices contained in a cellular complex is always a W
vertex of the MPP.
Other Polygonal Approximation Approaches
Regional Descriptors
FIGURE 11.22 Infrared images of the Americas at night.
FIGURE 11.23 A region with two holes.
FIGURE 11.24 A region with three connected components.
FIGURE 11.25 Regions with Euler numbers equal to 0 and -1, respectively.
FIGURE 11.26 A region containing a polygonal network.
Example
Patterns and Pattern Classes
• A pattern is an arrangement of descriptors.
• The name feature is used often in the pattern
recognition literature to denote a descriptor.
• A pattern class is a family of patterns that
share some common properties.
• Pattern classes are denoted w_1, w_2, ..., w_W,
where W is the number of classes.
• Three common pattern arrangements used in
practice are vectors (for quantitative
descriptions) and strings and trees (for
structural descriptions).
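• Pattern vectors are represented by bold
lowercase letters, such as x, y, and z, and take
the form x = [x_1, x_2, ..., x_n]^T, where each
component x_i is the i-th descriptor and n is the
number of descriptors.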
Recognition Based on Matching
• Recognition techniques based on matching
represent each class by a prototype pattern
vector.
• An unknown pattern is assigned to the class to
which it is closest in terms of a predefined
metric.
Minimum distance classifier
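• Suppose that we define the prototype of each
pattern class to be the mean vector of the patterns
of that class:
m_j = \frac{1}{N_j} \sum_{x \in w_j} x,   j = 1, 2, ..., W
where N_j is the number of pattern vectors from class w_j.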
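A minimum distance classifier can be sketched as follows, assuming Euclidean distance as the predefined metric and class prototypes computed as per-class mean vectors. The two-class training set and the class labels are hypothetical.

```python
import math


def class_means(training):
    """Map each class label to the mean of its training pattern vectors."""
    means = {}
    for label, patterns in training.items():
        n = len(patterns)
        means[label] = [sum(component) / n for component in zip(*patterns)]
    return means


def classify(x, means):
    """Assign x to the class whose mean vector is closest (Euclidean)."""
    return min(means, key=lambda label: math.dist(x, means[label]))


# Hypothetical two-descriptor patterns for two classes.
training = {
    "w1": [[1.0, 1.0], [2.0, 1.0], [1.0, 2.0], [2.0, 2.0]],
    "w2": [[6.0, 6.0], [7.0, 6.0], [6.0, 7.0], [7.0, 7.0]],
}
means = class_means(training)
print(means)                        # {'w1': [1.5, 1.5], 'w2': [6.5, 6.5]}
print(classify([2.0, 3.0], means))  # w1
print(classify([5.0, 6.0], means))  # w2
```

This works well when the classes form compact clusters that are far apart relative to their spread; overlapping classes need a statistical classifier instead.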
