IT Project Par 1

The document describes the implementation of Huffman coding in MATLAB across 5 sections. Section 1 extracts symbols and calculates probabilities from a text file. Section 2 generates the Huffman code dictionary. Section 3 calculates coding parameters. Section 4 encodes a file, and Section 5 decodes the encoded file and verifies successful decoding.

Uploaded by

Mina Emad

IT Major task part 1

Students:
Amr Khaled Mahmoud 19P8679
Mina Emad Maurice 19P
1. Code Sections:

1.1. The first section:

In this section we extract the symbols from the text file. The variable
‘Content’ stores the content of the text file as characters. The unique
function extracts the unique symbols (in our case, the characters) from the
text, so every distinct character counts as one symbol.

To calculate the probabilities, we first define a vector to store them. In
the for-loop that follows, we find the number of occurrences of each symbol
in the text file and divide it by the total number of characters to get that
symbol's probability. The ismember function marks every occurrence of the
symbol, and the combination of length and find counts the marked
positions.
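
The steps above can be sketched as follows. This is a minimal illustration
rather than the project's actual script: the text is a hard-coded string
instead of a file, and the names Symbols, Probabilities and occurrences are
assumptions chosen for the example.

```matlab
% Hypothetical input; the report reads this text from a file instead.
Content = 'abracadabra';

% unique returns the distinct characters of Content in sorted order.
Symbols = unique(Content);

% Define the probability vector first, one entry per symbol.
Probabilities = zeros(1, length(Symbols));

for k = 1:length(Symbols)
    % ismember marks every position of Content that holds this symbol;
    % length(find(...)) counts the marked positions.
    occurrences = length(find(ismember(Content, Symbols(k))));
    Probabilities(k) = occurrences / length(Content);
end

disp(Symbols);        % abcdr
disp(Probabilities);  % 5/11, 2/11, 1/11, 1/11, 2/11 shown as decimals
```

The probabilities sum to one, which is a quick sanity check before passing
them on to the code generation step.
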
1.2. The second section:

To begin generating the codes, we combine the symbols and their
probabilities in a single cell, so that each symbol stays connected to its
probability. We sort the cell, as Huffman coding requires, and then take a
copy of it because the algorithm modifies it as it runs.
Next we define a new cell to hold the generated code words. We use a cell
because its entries start out empty, so code bits can be concatenated onto
them. We also handle the special case where there is only one symbol by
assigning it the code word ‘1’.
To implement the Huffman algorithm we follow the same steps we used in our
course. First we sort the symbols in descending order of probability. Then
we take the last two entries and iterate through them. An entry may be a
single symbol (its length is equal to 1) or a combination of symbols. We
assign ‘0’ or ‘1’ directly to a single symbol, or to every component of a
combined entry.
At the end of each step we add the two probabilities, combine the two
entries into one, and delete the last row.
Finally, to generate the dictionary we use a cell that contains the symbols
in the first column and their codes in the second column.
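
A minimal sketch of this procedure is shown below. It is not the project's
code: the symbols and probabilities are made up, the working list stores
symbol indices (the names groups and probs are assumptions), and the choice
of giving ‘0’ to the second-to-last entry and ‘1’ to the last one is an
assumption as well.

```matlab
% Hypothetical symbols and probabilities for illustration.
Symbols = 'abcd';
Probabilities = [0.4 0.3 0.2 0.1];

n = length(Symbols);
Codes = cell(1, n);
Codes(:) = {''};             % empty code words that bits are concatenated onto

if n == 1
    Codes{1} = '1';          % special case: a single symbol gets the code '1'
else
    groups = num2cell(1:n);  % each entry lists the symbol indices it combines
    probs = Probabilities;   % working copy, edited while the algorithm runs
    while length(groups) > 1
        % Sort the entries in descending order of probability.
        [probs, order] = sort(probs, 'descend');
        groups = groups(order);
        % The last two entries are the least probable. Every symbol inside
        % them gets one more bit in front of its code word.
        for idx = groups{end-1}
            Codes{idx} = ['0' Codes{idx}];
        end
        for idx = groups{end}
            Codes{idx} = ['1' Codes{idx}];
        end
        % Combine the two entries, add their probabilities, drop the last row.
        groups{end-1} = [groups{end-1} groups{end}];
        probs(end-1) = probs(end-1) + probs(end);
        groups(end) = [];
        probs(end) = [];
    end
end

% Dictionary: symbols in the first column, code words in the second.
Dictionary = [num2cell(Symbols(:)) Codes(:)];
disp(Dictionary);
```

For these probabilities the code word lengths come out as 1, 2, 3 and 3 bits,
with the shortest code word going to the most probable symbol.
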
1.3. Section three

In this section we calculate the required parameters.
The average length (L_Avg) is the sum of the code lengths, each multiplied
by the probability of its symbol.
The entropy (H_of_S) is the sum of each symbol's probability multiplied by
the log base 2 of 1/probability.
The efficiency is the entropy divided by the average length.
The compression ratio represents how much compression we achieve per coded
symbol. We take 8 bits as the length of an uncoded character (ASCII stored
in one byte), so we take the ratio of the average length of our code to
this 8-bit length to find how much we have compressed.
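
The four parameters can be computed as in the sketch below. The
probabilities and code words are hypothetical, the names CodeLengths,
Efficiency and CompressionRatio are assumptions, and the compression ratio
is taken here as L_Avg divided by 8, which is one reading of the
description above.

```matlab
% Hypothetical probabilities with a prefix code for them.
Probabilities = [0.4 0.3 0.2 0.1];
Codes = {'1', '01', '000', '001'};

% Length in bits of each code word.
CodeLengths = cellfun(@length, Codes);

% Average length: code lengths weighted by symbol probability.
L_Avg = sum(Probabilities .* CodeLengths);

% Entropy: probability times log base 2 of 1/probability, summed.
H_of_S = sum(Probabilities .* log2(1 ./ Probabilities));

% Efficiency: entropy divided by average length (at most 1).
Efficiency = H_of_S / L_Avg;

% Assumption: coded bits per symbol relative to 8 bits per character.
CompressionRatio = L_Avg / 8;

fprintf('L_Avg = %.4f bits/symbol\n', L_Avg);
fprintf('H_of_S = %.4f bits/symbol\n', H_of_S);
fprintf('Efficiency = %.4f\n', Efficiency);
fprintf('Compression ratio = %.4f\n', CompressionRatio);
```

With these example values the average length is 1.9 bits per symbol and the
entropy is about 1.846 bits per symbol, so the efficiency is roughly 97%.
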
1.4. Section four

In this section we encode the trial.txt file with our generated code.
The for-loop runs over every character of the trial file and compares it
with the symbols in the dictionary. Once it finds the symbol equal to the
character, it inserts that symbol's code in its place. We then use the
combination of string and strjoin to join the codes into one string with no
spaces, and print it into a text file called ‘Tx.txt’.
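
The encoding loop might look like the sketch below. The dictionary and the
input text are hypothetical stand-ins for the generated dictionary and the
content of trial.txt, and the code words are kept as character vectors so
that strjoin alone is enough to join them (the project also uses string).

```matlab
% Hypothetical dictionary: symbols in column 1, code words in column 2.
Dictionary = {'a', '1'; 'b', '01'; 'c', '000'; 'd', '001'};
Content = 'abacad';          % stands in for the text read from trial.txt

Encoded = cell(1, length(Content));
for k = 1:length(Content)
    % Compare the character with each symbol in the dictionary.
    for s = 1:size(Dictionary, 1)
        if Content(k) == Dictionary{s, 1}
            Encoded{k} = Dictionary{s, 2};   % insert the code in its place
            break;
        end
    end
end

% Join the code words into one string with no spaces.
Tx = strjoin(Encoded, '');

% Print the bit stream into the text file.
fid = fopen('Tx.txt', 'w');
fprintf(fid, '%s', Tx);
fclose(fid);

disp(Tx);                    % 10110001001
```

The six characters are sent as 11 bits here, compared with 48 bits at 8
bits per character.
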
1.5. Section five

In this section we decode the file we previously encoded. We read the
received string into a variable called Rx, then loop over its length.
To decode, we add a single bit on every iteration of the loop and compare
the accumulated bits with the codes in our dictionary. Since we are using a
prefix code, this method is unambiguous: no code word is the beginning of
another code word.
In the end we compare the retrieved file (Retr.txt) with the original
content file (trial.txt) using strcmp, which returns a value of one, meaning
that our decoding was successful.
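
A sketch of this bit-by-bit decoding is given below. The dictionary, the
received bit stream and the original text are hypothetical, and the names
buffer, Decoded and Success are assumptions; the comparison is done on
in-memory strings rather than on the Retr.txt and trial.txt files.

```matlab
% Hypothetical dictionary: symbols in column 1, code words in column 2.
Dictionary = {'a', '1'; 'b', '01'; 'c', '000'; 'd', '001'};
Rx = '10110001001';          % received bit stream
Original = 'abacad';         % text the stream was encoded from

Decoded = '';
buffer = '';
for k = 1:length(Rx)
    % Add a single bit on every iteration.
    buffer = [buffer Rx(k)];
    % Compare the accumulated bits with every code word.
    match = find(strcmp(Dictionary(:, 2), buffer));
    if ~isempty(match)
        % Prefix code: a match cannot be the start of a longer code word.
        Decoded = [Decoded Dictionary{match, 1}];
        buffer = '';
    end
end

% strcmp returns 1 (true) when the decoded text equals the original.
Success = strcmp(Decoded, Original);
disp(Decoded);               % abacad
disp(Success);               % 1
```

If buffer is not empty after the loop, the bit stream ended in the middle of
a code word, which would indicate a corrupted or truncated input.
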
2.0. Results

2.1. Section one

Probabilities and Symbols:
2.2. Section two
Dictionary:
2.3. Section Three
Average Length

Info:
Efficiency:

Compression Ratio:

Entropy
2.4. Section four
Transmitted Bit stream as a string:
2.5. Section five
Retrieved (Decoded) File:

Comparison Value:
