Open navigation menu

Scribd

0% found this document useful (0 votes)

10 views36 pages

Information Theory Module 4

Uploaded by

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views36 pages

Information Theory Module 4

Uploaded by

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 36

Module-4

Non Probability based Source Coding

Dr. Markkandan S

School of Electronics Engineering (SENSE)

Vellore Institute of Technology
Chennai

1
LEMPEL-ZIV ALGORITHM
Lempel-Ziv Algorithm

• Huffman coding requires symbol probabilities. But most real life scenarios do not
provide the symbol probabilities in advance. Huffman coding is optimal for DMS
source where the occurrence of one symbol does not alter the probabilities of the
subsequent symbols.
• It might be more efficient to use the statistical inter-dependence of the letters in
the alphabet along with their individual probabilities of occurrence.
• Lempel-ziv algorithm does not need the source statistics.
• It is variable -to-fixed length source coding algorithm and belongs to the class of
universal source coding algorithm

Dr. Markkandan S Module-4 Non Probability based Source Coding 3/36

Lempel-Ziv Algorithm

• The compression of an arbitrary sequence of bits is possible by coding a series of

0’s and 1’s as some previous such string (prefix string) plus one new bit
• The new string is formed by adding the new bit to the previously used prefix string
becomes a potential prefix string for further strings
• These variable length blocks are called as Phrases. The phrases are listed in
dictionary which stores the existing phrases and their locations.
• in encoding a new phrase, we specify the location of the existing phrase in the
dictionary and append the new letter

Dr. Markkandan S Module-4 Non Probability based Source Coding 4/36

Lempel-Ziv Algorithm : Example

Code the String 101011011010101011

1. We will begin by parsing it in to comma seperated phrases that represent strings
that can be represented by a previous string as a prefix, plus a bit
2. The first bit 1 has no predecessors, it has a null prefix string and the one extra bit
itself.
1,01011011010101011
3. The same for next bit 0 it doesnot have any prefix string
1,0,1011011010101011
4. So far our dictionary contains the strings ’1’ and ’0’. Next we encounter a 1, but
it already exists in our dictionary. Hence we proceed further. The 10 is obviously
combination of prefix 1 and 0 so we have
1,0,10,11011010101011
5. Continue this way, we will be parsed the string as 1,0,10,11,01,101,010,1011
Dr. Markkandan S Module-4 Non Probability based Source Coding 5/36
Lempel-Ziv Algorithm : Example

• Step-6: Construct the Dictionary as follows using the strings parsed. Since we
have 8 strings 3 bit bianry positions are used
String Position Number Position Number in Binary
1 1 001
0 2 010
10 3 011
11 4 100
01 5 101
101 6 110
010 7 111
1011 8 -

Dr. Markkandan S Module-4 Non Probability based Source Coding 6/36

Lempel-Ziv Algorithm : Example

Step-7: Check for Prefix Availability

String Position Number Position Number in Binary Prefix
1 1 001 No
0 2 010 No
10 3 011 1
11 4 100 1
01 5 101 0
101 6 110 10
010 7 111 01
1011 8 - 101

Dr. Markkandan S Module-4 Non Probability based Source Coding 7/36

Lempel-Ziv Algorithm : Example

Step-8: Identify the Position Number of Prefix

String Position Position No. in Prefix Position No. of
No. Binary prefix
1 1 001 No 000
0 2 010 No 000
10 3 011 1 001
11 4 100 1 001
01 5 101 0 010
101 6 110 10 011
010 7 111 01 101
1011 8 - 101 110

Dr. Markkandan S Module-4 Non Probability based Source Coding 8/36

Lempel-Ziv Algorithm : Example

Step-9: Write the Codeword such a way that, Position Number of that prefix with last
bit of the string we considered
String Position Position No. Prefix Position No. Code
No. in Binary of prefix word
1 1 001 No 000 0001
0 2 010 No 000 0000
10 3 011 1 001 0010
11 4 100 1 001 0011
01 5 101 0 010 0101
101 6 110 10 011 0111
010 7 111 01 101 1010
1011 8 - 101 110 1101

Dr. Markkandan S Module-4 Non Probability based Source Coding 9/36

Lempel-Ziv Algorithm : Example

• Hence for string, 101011011010101011.

• The coded word is 00010000001000110101011110101101.
• This code is not efficient, since codeword length is higher than given code.
• But the dictionary size will increase, if a large amount of data to be coded and
eventually less number of codes will be used.
• Lempel-Ziv algorithm is useful for large files

Dr. Markkandan S Module-4 Non Probability based Source Coding 10/36

Lempel-Ziv Algorithm : Example with strings

Code the String THIS IS HIS HIT

1. We will begin by parsing it in to comma seperated phrases that represent strings

that can be represented by a previous string as a prefix, plus a character
2. The first letter T has no predecessors, it has a null prefix string.
T , HIS IS HIS HIT
3. The same for next character H it doesnot have any prefix string
T , H, IS IS HIS HIT
4. Keep parsing, eventually we will get T , H, I , S, , IS, H, IS , HI , T

Dr. Markkandan S Module-4 Non Probability based Source Coding 11/36

Lempel-Ziv Algorithm : Example with strings

Step-5: Construct the Dictionary and codeword

String Position Prefix Position No. of Code
No. prefix word
T 1 No 0 0T
H 2 No 0 0H
I 3 No 0 0I
S 4 No 0 0S
5 No 0 0
IS 6 I 3 3S
H 7 5 5H
IS 8 IS 6 6
HI 9 H 2 2I
T 10 T 1 1T
Dr. Markkandan S Module-4 Non Probability based Source Coding 12/36
Lempel-Ziv Algorithm : Example with strings

The coded String for THIS IS HIS HIT is 0T 0H0I 0S0 3S5H6 2I 1T .
• Lempel-ziv algorithm is widely used in practice. The compress and uncompress
utilities of the UNIX operating system use a modified version of this algorithm.
• The standard algorithms for compressing binary files use codewords of 12 bits and
transmit 1 extra bit to indicate a new sequence.
• Using such code, Lempel-Ziv algorithm can compress transmissions of English text
by about 55 percent.

Dr. Markkandan S Module-4 Non Probability based Source Coding 13/36

RUN LENGTH ENCODING
RUN LENGTH ENCODING (RLE)

• Used to reduce the size of a repeating string of characters . This repeating string
is called run
• RLE encodes a run of symbols in to two bytes , a count and a symbol
• RLE can compress any type of data regardless of its information content, but the
content of data to be compressed affects the compression ratio.
• RLE cannot achieve high compression ratios compared to other compression
methods, but it is easy to implement and is quick to execute. It is supported by
most bit map file formats such as TIFF, JPG, BMP, PCX and FAX machines

Dr. Markkandan S Module-4 Non Probability based Source Coding 15/36

RUN LENGTH ENCODING (RLE)

• RLE used for compression of images in the PCX format.

• PCX was the initial image format in DOS environment
• Now PCX is replaced by JPEG, BMP, PNG compression methods

Dr. Markkandan S Module-4 Non Probability based Source Coding 16/36

EXAMPLE: RUN LENGTH ENCODING (RLE)

Consider the following bit stream

S=11111111111100000000000000000001110001100000.
1. This can be coded in to (12,1), (19,0),(3,1),(3,0), (2,1),(5,0)
2. Maximum repetition is 19, which can be coded in to 5 bits
3. The encoded bit stream is (01100,1), (10011,0), (00011,1), (00011,0), (00010,1),
(00101,0)
4. Number of transmitted bits : 44
5. Number of encoded bits: 6 symbols X 6 = 36 bits
6. Compression Ratio is 36:44 = 1:1.22

Dr. Markkandan S Module-4 Non Probability based Source Coding 17/36

RATE DISTORTION FUNCTION
Rate Distortion Function

• Although we live in an anlog world, most of the communication takes place in the
digital form. Since most natural sources are analog, they are first sampled,
quantized and then processed
• However, this representation of an arbitrary real number requires an infinite
number of bits. Thus a finite representation of a continuous random variable can
be never be perfect
• Consider an Analog message waveform x(t), which is a sample waveform of a
stochastic process X(t). Assuming X(t) is a band limited, statioanry process, it
can be represented by a sequence of non uniform samples taken at Nyquist rate.
• These samples are quantized in amplitude and encoded as a sequence of bianry
digits.

Dr. Markkandan S Module-4 Non Probability based Source Coding 19/36

Rate Distortion Function

A simple encoding strategy can be used to define L levels and encode every sample
using
R = log2 L bits, if L is a power of 2 or

R = ⌊log2 L⌋ + 1 bits, if L is not a power of 2

The Squared Error Distrotion is defined as

d(xk , x˜k ) = (xk − x˜k )2

In general distortion measure may be represented as

d(xk , x˜k ) = |xk − x˜k |p

Dr. Markkandan S Module-4 Non Probability based Source Coding 20/36

Distortion and Rate Distortion Function

The Distrotion between a sequence of n samples Xn , and their corresponding n

quantised Values X˜n is defined as
n
1X
D = E [d(Xn , X˜n )] = E [d(xk , x˜k )] = E [d(xk , x˜k )]
n
k=1
The minimum rate (in bits/souurce output) required to represent the output X of the
memoryless source with a distortion less than or equal to D is called the Rate
Distortion Function R(D) is defined as

R(D) = min I (X; X̃)

p(x̃|x):E [d(X,X̃)]≤D
The distortion rate function for a discrete time, memoryless Gaussian source is defined
as
Dg (R) = 2−2R σx2
Dr. Markkandan S Module-4 Non Probability based Source Coding 21/36
TRANSFORM CODING
Image Compression Methods

• High quality images are represented by very large data sets

• Applications that involve imagery seem to be inherently linked to immediate
human consumption, and so need to be fast in execution on computers and in
transmission.
• Imagery has the quality of higher redundancy than we can generally expect in
arbitrary data. For example, a pair of adjacent horizontal lines in an image are
nearly identical (typically), while, two adjacent lines in a book have essentially no
commonality.
• The human eye is very tolerant to approximation error in an image. Thus, it may
be possible to compress the image data in a manner in which the less important
information (to the human eye) can be dropped.

Dr. Markkandan S Module-4 Non Probability based Source Coding 23/36

Image Compression Methods

• That is, by trading off some of the quality of the image we might obtain lossy
compression, as opposed to the lossless compression
• Lossy compression can only be applied to data such as images and audio for which
human beings will tolerate some loss of fidelity.
• That is, by trading off some of the quality of the image we might obtain lossy
compression, as opposed to the lossless compression
• JPEG compression standard is actually a description of 29 distinct coding systems
fro compression images

Dr. Markkandan S Module-4 Non Probability based Source Coding 24/36

JPEG Standard for Lossless Compression

There are eight prediction methods available in the JPEG coding standards. One of the
eight (which is the no prediction option) is not used for the lossless coding option that
we are examining here. The other seven may be divided into the following categories:

• Predict the next pixel on the line as having the same value as the last one.
• Predict the next pixel on the line as having the same value as the pixel in this
position on the previous line (that is, above it).
• Predict the next pixel on the line as having a value related to a combination of
the previous, above and previous to the above pixel values. One such combination
is simply the average of the other three.

Dr. Markkandan S Module-4 Non Probability based Source Coding 25/36

JPEG Standard for Lossy Compression

• The JPEG standard includes a set of sophisticated lossy compression options

which resulted from much experimentation by the creators of JPEG with regard to
human acceptance of types of image distortion.
• The JPEG standard was the result of years of effort by the JPEG which was
formed as a joint effort by two large, standing, standards organizations, the
CCITT (The European telecommunications standards organization) and the ISO
(International Standards Organization).
• The stages of lossy compression algorithm are
• Lossy image Simplification - To Remove image complexity
• Lossless compression step - Based on predictive filtering
• Huffman or Arithmatic Coding

Dr. Markkandan S Module-4 Non Probability based Source Coding 26/36

JPEG Standard for Lossy Compression - DCT

• The lossy image simplification step is based on Discrete Cosine Transform(DCT)

N−1
X M−1
X πk πl
Y (k, l) = 4y (i, j)cos (2i + 1) cos (2j + 1)
2N 2M
i=0 j=0

Where input image is NXM Pixels, y(i,j) is the intensity of the pixel in row i and
column j.
• For most iamges, much of the signal energy lies at lower frequencies, which
appear in the upper left corner of the DCT.
• The lower right values represent higher frequencies, often small
• DCT is computationally intensive with complexity of O(N 2 ). Hence images are
divided in to blocks
Dr. Markkandan S Module-4 Non Probability based Source Coding 27/36
JPEG Standard for Lossy Compression - image Reduction

• DCT is applied to 8 by 8 pixel blocks of the image

• The 64 pixel values in each block are transformed by DCT into a new set of 64
values as DCT coefficients
• DCT coefficients represent spatial frequency of the image sub-block with AC and
DC coefficients.

Dr. Markkandan S Module-4 Non Probability based Source Coding 28/36

JPEG Standard for Lossy Compression - Image Reduction

• Due to nature of most natural iamges, maximum energy lies in low frequency as
opposed to high frequency.
• For lossy compression following steps are followed
1. First the lowest weights are trimmed by setting them to zero
2. The remaining weights are quantized (that is, rounded off to the nearest of some
number of discrete code represented values), some more coarsely than others
according to observed levels of sensitivity of viewers to these degradations.
3. Then several losssless compression methods are applied. DC coeeficients, vary slowly
from one block to next block, Hence predicion is performed
4. We have to send One DC coeeficient and difference between DC coefficients of
surrounding blocks

Dr. Markkandan S Module-4 Non Probability based Source Coding 29/36

JPEG Standard for Lossy Compression - Zig ZAg Coding

• The purpose of Zig-Zag coding is that we gradually move from the low frequency
to high frequency, avoiding abrupt jumps in the values.
• Zig-Zag coding will lead to long runs of 0’s, which are ideal for RLE followed by
Huffman or Arithmetic Coding

Dr. Markkandan S Module-4 Non Probability based Source Coding 30/36

Lossy Compression -JPEG

Dr. Markkandan S Module-4 Non Probability based Source Coding 31/36

Lossy Compression -Transform Coding

Dr. Markkandan S Module-4 Non Probability based Source Coding 32/36

Lossy Compression -Transform Coding

Dr. Markkandan S Module-4 Non Probability based Source Coding 33/36

Lossy Compression -Transform Coding

Dr. Markkandan S Module-4 Non Probability based Source Coding 34/36

Lossy Compression -Transform Coding

Dr. Markkandan S Module-4 Non Probability based Source Coding 35/36

Lossy Compression -Transform Coding

Dr. Markkandan S Module-4 Non Probability based Source Coding 36/36

You might also like

L117, L18, L19, L20, L21 - Module 5 - Source Coding - II
No ratings yet
L117, L18, L19, L20, L21 - Module 5 - Source Coding - II
53 pages
Information Theory: Mohamed Hamada
No ratings yet
Information Theory: Mohamed Hamada
44 pages
Analog & Digital Communication Presentation On Data Compression
No ratings yet
Analog & Digital Communication Presentation On Data Compression
31 pages
chapter 7
No ratings yet
chapter 7
70 pages
TOPIC
No ratings yet
TOPIC
18 pages
Image Compression
100% (1)
Image Compression
38 pages
ECEVSP L03 Compression2
No ratings yet
ECEVSP L03 Compression2
40 pages
Lec-2 Source Coding v3.0
No ratings yet
Lec-2 Source Coding v3.0
10 pages
Lecture 13 - Delta Coding
No ratings yet
Lecture 13 - Delta Coding
41 pages
Compression: Author: Paul Penfield, Jr. 2004 Massachusetts Institute of Technology Url: Start: Back: Next
No ratings yet
Compression: Author: Paul Penfield, Jr. 2004 Massachusetts Institute of Technology Url: Start: Back: Next
8 pages
Tutorial 8
No ratings yet
Tutorial 8
20 pages
Arithmetic, Run Length, Compression
No ratings yet
Arithmetic, Run Length, Compression
62 pages
Unit3 Ece Mmc 6th Sem (2)
No ratings yet
Unit3 Ece Mmc 6th Sem (2)
96 pages
Data Compression
No ratings yet
Data Compression
22 pages
chap2
No ratings yet
chap2
47 pages
Unit 1 Data Compression
No ratings yet
Unit 1 Data Compression
30 pages
Tanaman Indah Dan Bersih
No ratings yet
Tanaman Indah Dan Bersih
5 pages
Introduction To Data Compression - Guy E. Blelloch PDF
No ratings yet
Introduction To Data Compression - Guy E. Blelloch PDF
54 pages
Implementation of Lempel-Ziv Algorithm For Lossless Compression Using VHDL
No ratings yet
Implementation of Lempel-Ziv Algorithm For Lossless Compression Using VHDL
2 pages
Data Compression Unit-1 - 1
No ratings yet
Data Compression Unit-1 - 1
21 pages
Dce Easy Solution
0% (1)
Dce Easy Solution
87 pages
Lempel-Ziv Codes: 5 .1 Lemp El - Ziv P Ar Sing
No ratings yet
Lempel-Ziv Codes: 5 .1 Lemp El - Ziv P Ar Sing
12 pages
ECE359_Image Compression
No ratings yet
ECE359_Image Compression
42 pages
ut1ppt
No ratings yet
ut1ppt
77 pages
MM Unit-III - 0
No ratings yet
MM Unit-III - 0
22 pages
Forouzan6e ch11 PPTs Accessible
No ratings yet
Forouzan6e ch11 PPTs Accessible
119 pages
Mesleki Yeterlilik
No ratings yet
Mesleki Yeterlilik
106 pages
Data Compression
No ratings yet
Data Compression
113 pages
Source Coding Techniques: 1. Huffman Code. 2. Two-Pass Huffman Code. 3. Lemple-Ziv Code
No ratings yet
Source Coding Techniques: 1. Huffman Code. 2. Two-Pass Huffman Code. 3. Lemple-Ziv Code
111 pages
Main Techniques and Performance of Each Compression
No ratings yet
Main Techniques and Performance of Each Compression
23 pages
ICT
No ratings yet
ICT
10 pages
Question Bank: Information Coding Techniques
No ratings yet
Question Bank: Information Coding Techniques
10 pages
Data and Voice Coding
No ratings yet
Data and Voice Coding
20 pages
Chapter 4 - Introduction To Source Coding
No ratings yet
Chapter 4 - Introduction To Source Coding
72 pages
Lecture19 PDF
No ratings yet
Lecture19 PDF
8 pages
Compression PDF
No ratings yet
Compression PDF
55 pages
CH 6
No ratings yet
CH 6
21 pages
Group Presentation Digital Communication Systems
No ratings yet
Group Presentation Digital Communication Systems
29 pages
L15-Compression
No ratings yet
L15-Compression
63 pages
Multimedia Systems: Chapter 7: Data Compression
No ratings yet
Multimedia Systems: Chapter 7: Data Compression
25 pages
Note 2 - Source Coding Techniques
No ratings yet
Note 2 - Source Coding Techniques
129 pages
Compression 2
No ratings yet
Compression 2
70 pages
Agenda For The Lecture: C Himanshu Tyagi. Feel Free To Use With Acknowledgement
No ratings yet
Agenda For The Lecture: C Himanshu Tyagi. Feel Free To Use With Acknowledgement
7 pages
Information Theory: Dr. Muhammad Imran Farid
No ratings yet
Information Theory: Dr. Muhammad Imran Farid
32 pages
3 Source Coding
No ratings yet
3 Source Coding
31 pages
Nen Anh
No ratings yet
Nen Anh
36 pages
Lecture I: Data Compression Data Encoding: Efficient Information Encoding To
No ratings yet
Lecture I: Data Compression Data Encoding: Efficient Information Encoding To
48 pages
Algorithms For Data Compression in Wireless Computing Systems
No ratings yet
Algorithms For Data Compression in Wireless Computing Systems
7 pages
Chapter 3-Part II
100% (1)
Chapter 3-Part II
26 pages
Data Compression Chapter 7
No ratings yet
Data Compression Chapter 7
40 pages
Department of Information Technology Information Theory and Coding Question Bank Unit-I Part - A
No ratings yet
Department of Information Technology Information Theory and Coding Question Bank Unit-I Part - A
6 pages
Fundamentals of Compression: Prepared By: Haval Akrawi
No ratings yet
Fundamentals of Compression: Prepared By: Haval Akrawi
21 pages
3.source Coding Data Compression
No ratings yet
3.source Coding Data Compression
25 pages
Chapter 08
No ratings yet
Chapter 08
111 pages
Chapter 4 - Introduction To Source Coding PDF
No ratings yet
Chapter 4 - Introduction To Source Coding PDF
72 pages
Learn Programming Using C#
From Everand
Learn Programming Using C#
Taurius Litvinavicius
No ratings yet
Next Generation Excel: Modeling In Excel For Analysts And MBAs (For MS Windows And Mac OS)
From Everand
Next Generation Excel: Modeling In Excel For Analysts And MBAs (For MS Windows And Mac OS)
Isaac Gottlieb
No ratings yet
Precalculus: A Self-Teaching Guide
From Everand
Precalculus: A Self-Teaching Guide
Steve Slavin
4.5/5 (5)
Chapter 11 - MPEG Video Coding I - MPEG-1 and 2
No ratings yet
Chapter 11 - MPEG Video Coding I - MPEG-1 and 2
39 pages
Unit 4 Data Compression (1)
No ratings yet
Unit 4 Data Compression (1)
10 pages
Dokument - Pub Cardiff University Examination Paper Academic Year Flipbook PDF
No ratings yet
Dokument - Pub Cardiff University Examination Paper Academic Year Flipbook PDF
13 pages
Mais Informaçoes Do Arquivo
No ratings yet
Mais Informaçoes Do Arquivo
2 pages
Bandlimiting Filter U-Law or A-Law Compressor Linear PCM
No ratings yet
Bandlimiting Filter U-Law or A-Law Compressor Linear PCM
8 pages
Ic23 Unit03 Script
No ratings yet
Ic23 Unit03 Script
26 pages
Mdcs Lab Questions
No ratings yet
Mdcs Lab Questions
5 pages
Trace
No ratings yet
Trace
2,391 pages
Data Compression.ppt
No ratings yet
Data Compression.ppt
23 pages
Vector Quantization
100% (1)
Vector Quantization
25 pages
Assignment-5 Compression
No ratings yet
Assignment-5 Compression
2 pages
Untitled Document
No ratings yet
Untitled Document
2 pages
Image Compression: Presented by Nermine Salama & Mohamed Hagras
No ratings yet
Image Compression: Presented by Nermine Salama & Mohamed Hagras
24 pages
Erased Log by Sos
No ratings yet
Erased Log by Sos
4 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
Trace
No ratings yet
Trace
26 pages
Data Compressionquestion Bank kcs064
No ratings yet
Data Compressionquestion Bank kcs064
51 pages
27 tar gzip command
No ratings yet
27 tar gzip command
3 pages
Review of Data Compression and Different Techniques of Data Compression IJERTV2IS1106
No ratings yet
Review of Data Compression and Different Techniques of Data Compression IJERTV2IS1106
8 pages
HEIC vs HEI Image Files
No ratings yet
HEIC vs HEI Image Files
3 pages
Chapter 8
No ratings yet
Chapter 8
91 pages
Assignment Agmase
No ratings yet
Assignment Agmase
14 pages
Chapter 4 DIP
No ratings yet
Chapter 4 DIP
17 pages
Its Over Guantanamo Bay All Fucunig Lurfer All Alive Adminstration Bye Scum Rats You Deserve To Die
No ratings yet
Its Over Guantanamo Bay All Fucunig Lurfer All Alive Adminstration Bye Scum Rats You Deserve To Die
9 pages
Log
No ratings yet
Log
227 pages
Trace
No ratings yet
Trace
1,174 pages
Coding Integer-: Jpeg2000 Compression Standard Is The Coding
No ratings yet
Coding Integer-: Jpeg2000 Compression Standard Is The Coding
5 pages
MBR GPT Cheatsheet
No ratings yet
MBR GPT Cheatsheet
3 pages
SWE 423: Multimedia Systems: Chapter 7: Data Compression
No ratings yet
SWE 423: Multimedia Systems: Chapter 7: Data Compression
25 pages
Data Compression Home Assignment Questions
No ratings yet
Data Compression Home Assignment Questions
3 pages