ETE-DIP Solution
Faculty of Engineering
School of Computer Science and Engineering
Department of CSE
PhD End Term Examination: 2022-23
CS8012 – Image and Video Processing (MOOC)
Solution
Time: 2 Hours    Max. Marks: 40
Instructions to Candidates
Answer any five questions.
Missing data, if any, may be assumed suitably.
Scientific Calculator is allowed.
Q. 1 (a)
Solution:
Q. 1 (b) Solution:
Q. 2 (a) Solution: The following are the fundamental steps of Digital Image Processing (DIP):
1. Image Acquisition
Image acquisition is the first of the fundamental steps of DIP. In this stage, an image is acquired in digital form. Generally, pre-processing such as scaling is also done in this stage.
2. Image Enhancement
Image enhancement is the simplest and most attractive area of DIP. In this stage, details that are obscured, or simply the interesting features of an image, are highlighted, such as brightness and contrast.
3. Image Restoration
Image restoration deals with improving the appearance of an image. Unlike enhancement, which is subjective, restoration is objective, as it is based on mathematical or probabilistic models of image degradation.
4. Color Image Processing
Color image processing has become a prominent area because of the increased use of digital images on the internet. It includes color modeling, processing in a digital domain, etc.
5. Wavelets and Multiresolution Processing
In this stage, an image is represented in various degrees of resolution. The image is subdivided into smaller regions for data compression and for pyramidal representation.
6. Compression
Compression is a technique used to reduce the storage required for an image. It is a very important stage because it is necessary to compress data for internet use.
7. Morphological Processing
This stage deals with tools for extracting image components that are useful in the representation and description of shape.
8. Segmentation
In this stage, an image is partitioned into its constituent objects. Segmentation is one of the most difficult tasks in DIP. It is a lengthy process, and accurate segmentation is essential for the successful solution of imaging problems that require objects to be identified individually.
9. Representation and Description
Representation and description follow the output of the segmentation stage. That output is raw pixel data consisting of all the points of each region. Representation transforms this raw data into a form suitable for further processing, whereas description extracts information that differentiates one class of objects from another.
10. Object Recognition
In this stage, a label is assigned to an object based on its descriptors.
Knowledge Base
The knowledge base is the last element of DIP. It stores important information about the image, which limits the search processes. The knowledge base can be very complex, for example when the image database contains high-resolution satellite images.
Q. 3 (a) Solution: Equalization of the image histogram
Pixel Value | Number of Pixels | Cumulative × (L-1) | Round off to the nearest grey level
0 | 8  | 0.875 | 1
1 | 10 | 1.968 | 2
2 | 10 | 3.062 | 3
3 | 2  | 3.281 | 3
4 | 12 | 4.593 | 5
5 | 16 | 6.343 | 6
6 | 4  | 6.781 | 7
7 | 2  | 7     | 7
Now map the values from the source to the target. The final mapping between the source and the target histograms is shown below:
Table: Final mapping process

Pixel Value (grey levels) | H (mapping of the equalization) | S (mapping of the equalization of the target) | Map
0 | 1 | 0 | 4
1 | 2 | 0 | 4
2 | 3 | 0 | 5
3 | 3 | 0 | 5
4 | 5 | 2 | 6
5 | 6 | 4 | 6
6 | 7 | 6 | 7
7 | 7 | 7 | 7
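For reference, the computation behind the two tables can be sketched in a few lines of NumPy. This is only an illustrative check: it assumes 8 grey levels, takes the target-equalized levels S directly from the mapping table above (the target histogram itself is not reproduced in this solution), and breaks ties toward the higher grey level.

import numpy as np

# Source histogram from the question: number of pixels at grey levels 0..7
counts = np.array([8, 10, 10, 2, 12, 16, 4, 2])
L = 8                                            # number of grey levels

# Step 1: equalize the source histogram
cdf = np.cumsum(counts) / counts.sum()           # cumulative distribution
H = np.rint(cdf * (L - 1)).astype(int)           # -> [1, 2, 3, 3, 5, 6, 7, 7]

# Step 2: target-equalized levels S, taken from the mapping table above
S = np.array([0, 0, 0, 0, 2, 4, 6, 7])

# Step 3: map each source level to the target level whose S value is closest
# to H; ties are broken toward the higher grey level, as in the table
mapping = np.array([L - 1 - np.argmin(np.abs(S - h)[::-1]) for h in H])
print(H)        # [1 2 3 3 5 6 7 7]
print(mapping)  # [4 4 5 5 6 6 7 7]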
Q. 3 (b) Solution:
Q. 4 (a) Solution:
Step 1: Center the Data
Calculate the mean of the original image along each column and subtract it from the original image to center the data.
Mean of each column: mean_col = [10.5, 8.0, 7.0, 8.5]
Centered data: A_centered = [[1.5, -3.0, 1.0, -6.5], [-1.5, -5.0, -3.0, -2.5], [-3.5, 3.0, -6.0, 1.5], [3.5, 5.0, 8.0, 7.5]]
Step 2: Compute the Covariance Matrix
Compute the covariance matrix of the centered data using the formula cov_matrix = (1/N) * A_centered.T * A_centered, where N is the number of samples (4 in this case) and A_centered.T is the transpose of the centered data matrix.
Covariance matrix:
cov_matrix = [[12.5, 3.5, -4.5, -5.5],
[3.5, 13.0, -6.0, -4.0],
[-4.5, -6.0, 20.0, 12.0],
[-5.5, -4.0, 12.0, 12.5]]
Step 3: Compute the Eigenvectors and Eigenvalues
Compute the eigenvectors and eigenvalues of the covariance matrix. Eigenvectors represent the principal components of the data, and eigenvalues represent the amount of variance explained by each principal component.
Eigenvalues and eigenvectors of the covariance matrix:
Eigenvalues = [39.641836, 17.885057, 2.657107, 0.816999]
Eigenvectors = [[-0.382107, -0.460635, -0.784290, 0.170785],
[-0.337353, -0.203019, 0.251235, -0.879227],
[0.675234, -0.717861, 0.112516, -0.143346],
[0.524200, 0.462985, 0.553678, 0.445569]]
Step 4: Select Principal Components
Select the top k eigenvectors with the highest eigenvalues to form the principal components. In this case, we want to reduce the dimensionality of the image to 2, so we select the top 2 eigenvectors.
Selected eigenvectors (principal components):
PC1 = [-0.382107, -0.460635, -0.784290, 0.170785]
PC2 = [-0.337353, -0.203019, 0.251235, -0.879227]
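A minimal NumPy sketch of Steps 1 to 4 is given below. The original 4×4 matrix used here is reconstructed by adding the column means back to the centered data shown above; the exact eigenvalues and eigenvectors it produces may differ from the figures quoted, depending on the normalization convention (1/N versus 1/(N-1)) and on the sign convention chosen for the eigenvectors.

import numpy as np

# Original 4x4 image, reconstructed from A_centered + mean_col above
A = np.array([[12.0,  5.0,  8.0,  2.0],
              [ 9.0,  3.0,  4.0,  6.0],
              [ 7.0, 11.0,  1.0, 10.0],
              [14.0, 13.0, 15.0, 16.0]])

# Step 1: center the data (subtract the mean of each column)
mean_col = A.mean(axis=0)                      # [10.5, 8.0, 7.0, 8.5]
A_centered = A - mean_col

# Step 2: covariance matrix, cov = (1/N) * A_centered.T @ A_centered
N = A.shape[0]
cov_matrix = (A_centered.T @ A_centered) / N

# Step 3: eigenvalues and eigenvectors of the symmetric covariance matrix
eigvals, eigvecs = np.linalg.eigh(cov_matrix)  # returned in ascending order

# Step 4: keep the top k = 2 principal components (largest eigenvalues)
k = 2
order = np.argsort(eigvals)[::-1]
components = eigvecs[:, order[:k]]             # columns are PC1 and PC2

# Project the centered data onto the 2 principal components
A_reduced = A_centered @ components            # 4x2 reduced representation
print(components)
print(A_reduced)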
Q. 4 (b) Solution:
Q. 5 (a) Solution: Given an image, write down the 8-chain code.
[Figure: the given image with its marked start point is not reproduced here.]
Q. 5 (b) Solution: Scale Invariant Feature Transform (SIFT) is a feature extraction technique used in computer
vision to identify and describe local features in images. It is widely used for image matching and object recognition.
Here are the steps for using SIFT for image matching:
1. Extract keypoints from the images: The first step is to extract keypoints from the images.
Keypoints are the distinctive features in the image that are invariant to scale, orientation,
and illumination changes. SIFT algorithm detects keypoints by looking for local extrema in
the difference of Gaussian (DoG) scale-space representation of the image.
2. Assign orientations to keypoints: The next step is to assign an orientation to each
keypoint. This is done by calculating the gradient magnitude and orientation at each pixel
within a region around the keypoint. The orientation is then assigned to the keypoint
based on the dominant direction of the gradients.
3. Generate feature descriptors: The next step is to generate feature descriptors for each
keypoint. This is done by computing the gradient magnitude and orientation at a set of
points within a region around the keypoint. The resulting gradient orientations are then
used to generate a histogram of orientations, which is used as the feature descriptor.
4. Match keypoints: Once the keypoints and feature descriptors have been extracted from
the images, the next step is to match the keypoints between the images. This is typically
done using a nearest neighbor search to find the best matching keypoints between the
two sets of keypoints. The distance between the feature descriptors is used as the
similarity metric.
5. Filter out incorrect matches: In order to filter out incorrect matches, a ratio test is applied
to the nearest neighbor matches. This test compares the distance to the nearest and
second nearest neighbors and only accepts matches where the distance to the nearest
neighbor is significantly smaller than the distance to the second nearest neighbor.
6. Estimate transformation: Once the correct matches have been identified, a transformation
can be estimated between the two images. This transformation can be used to align the
images or to find correspondences between different views of the same object.
Overall, SIFT is a powerful technique for image matching, and can be used in a wide variety of
applications, including object recognition, image retrieval, and panorama stitching.
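As an illustration, a short OpenCV sketch of this pipeline is given below. The file names and the ratio-test threshold of 0.75 are assumptions made for the example, and it relies on OpenCV's built-in SIFT implementation.

import cv2
import numpy as np

# Load the two images to be matched (file names are placeholders)
img1 = cv2.imread("query.jpg", cv2.IMREAD_GRAYSCALE)
img2 = cv2.imread("scene.jpg", cv2.IMREAD_GRAYSCALE)

# Steps 1-3: detect keypoints and compute SIFT descriptors
sift = cv2.SIFT_create()
kp1, des1 = sift.detectAndCompute(img1, None)
kp2, des2 = sift.detectAndCompute(img2, None)

# Step 4: nearest-neighbour search, keeping the 2 best matches per descriptor
matcher = cv2.BFMatcher(cv2.NORM_L2)
matches = matcher.knnMatch(des1, des2, k=2)

# Step 5: ratio test to filter out ambiguous matches
good = [m for m, n in matches if m.distance < 0.75 * n.distance]

# Step 6: estimate a homography from the surviving correspondences
if len(good) >= 4:
    src = np.float32([kp1[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
    dst = np.float32([kp2[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)
    H, mask = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)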
Q. 6 (a) Solution: Block normalization and descriptor calculation in the Histogram of Oriented Gradients (HOG).
The gradient histograms are normalized to reduce the effect of lighting changes; common choices are the L2 norm and L2-Hys (Lowe-style clipped L2 norm). Now, we could simply normalize each 9×1 histogram vector, but it is better to normalize a bigger block of 16×16. A 16×16 block contains 4 histograms (each 8×8 cell results in one histogram), which can be concatenated to form a 36×1 vector and normalized. The 16×16 window then moves by 8 pixels, a normalized 36×1 vector is calculated over this new window, and the process is repeated over the image.
Calculate the HOG descriptor vector
To calculate the final feature vector for the entire image patch, the 36×1 vectors are concatenated into one giant vector.
Say the input patch is of size 64×64; then the 16×16 block has 7 positions horizontally and 7 positions vertically.
In one 16×16 block we have 4 histograms which, after normalization, are concatenated to form a 36×1 vector.
This block moves over 7 positions horizontally and vertically, totalling 7×7 = 49 positions.
So, when we concatenate them all into one giant vector, we obtain a 36×49 = 1764-dimensional vector.
This vector is then used to train classifiers such as an SVM and to perform object detection.
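As a quick check of the arithmetic above, a short sketch using OpenCV's HOGDescriptor is shown below. The parameters (64×64 window, 16×16 blocks, 8-pixel stride, 8×8 cells, 9 bins) mirror the example, and the random input patch is just a stand-in for a real image patch.

import cv2
import numpy as np

# A 64x64 grayscale patch (random data as a stand-in for a real patch)
patch = np.random.randint(0, 256, (64, 64), dtype=np.uint8)

# HOG with a 64x64 window, 16x16 blocks, 8-pixel stride, 8x8 cells, 9 bins
hog = cv2.HOGDescriptor((64, 64), (16, 16), (8, 8), (8, 8), 9)

descriptor = hog.compute(patch)
print(descriptor.size)   # 7 * 7 * 4 * 9 = 1764, as derived above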
Q. 6 (b) Solution: Discontinuity-based segmentation methods divide an image into regions based on the presence of discontinuities, such as edges or abrupt color changes. Similarity-based segmentation methods, on the other hand, divide an image into regions based on the similarity between neighboring pixels. Here is a brief explanation of the different types of discontinuity-based and similarity-based segmentation methods:
1. Discontinuity-based segmentation methods:
a. Edge-based segmentation: This method segments an image based on the presence of edges. Edges are points in an image where there is a sudden change in intensity or color. Edge-based segmentation methods use edge detection techniques to find these points and then group neighboring edge points into regions.
b. Line-based segmentation: This method segments an image based on the presence of straight lines. Line-based segmentation methods use line detection techniques to find straight lines in an image and then group pixels that are close to these lines into regions.
2. Similarity-based segmentation methods:
a. Region-based segmentation: This method segments an image based on the homogeneity of regions. Homogeneous regions are areas of an image that have similar intensity or color. Region-based segmentation methods use a variety of techniques to group pixels into regions, such as clustering algorithms or graph-based methods.
b. Region growing segmentation: This method starts with a set of seed pixels and then iteratively grows each region by adding neighboring pixels that are similar to the seed pixels. The similarity between pixels can be based on various criteria such as intensity, color, or texture.
c. K-means clustering: This method clusters pixels into regions based on their similarity in color or intensity. K-means clustering is an unsupervised learning algorithm that groups pixels into k clusters based on their similarity (a short sketch follows this list).
d. Watershed segmentation: This method segments an image into regions based on the topography of the image. The image is treated as a topographic map, in which pixel intensities form mountains and valleys, and the watershed algorithm then floods the valleys to create regions.
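As an illustration of similarity-based segmentation, here is a small OpenCV sketch of K-means color clustering, as referenced in the K-means item above. The input file name and the choice of K = 4 are assumptions made for the example.

import cv2
import numpy as np

# Read a color image (file name is a placeholder)
img = cv2.imread("scene.jpg")

# Treat every pixel as a 3-D color sample
pixels = img.reshape(-1, 3).astype(np.float32)

# Cluster the pixels into K groups by color similarity
K = 4
criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 10, 1.0)
_, labels, centers = cv2.kmeans(pixels, K, None, criteria, 10,
                                cv2.KMEANS_RANDOM_CENTERS)

# Replace each pixel by its cluster centre to obtain the segmented image
segmented = centers[labels.flatten()].astype(np.uint8).reshape(img.shape)
cv2.imwrite("segmented.jpg", segmented)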
In summary, discontinuity-based segmentation methods use discontinuities in intensity, color, or
texture to segment images, while similarity-based segmentation methods group pixels based on
their similarity in intensity, color, or texture. Both types of methods have their advantages and
disadvantages, and the choice of method depends on the specific application and the type of image
being segmented.