0% found this document useful (0 votes)

2 views

3.1 - Image Fundamentals

The document covers fundamental concepts in data science, focusing on image processing and computer vision. It discusses image representation, pixel structures, convolution operations, and the importance of scaling and cropping images. Additionally, it introduces convolutional neural networks (CNNs) as a key topic for further study.

Uploaded by

toufeeqdata

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

3.1 - Image Fundamentals

Uploaded by

toufeeqdata

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Current Topics in Data Science

Image Fundamentals
Leandro L. Minku
Video Presentation
Assignment
• Presentation about a data science problem of your choice
• Max 5 min.

• Your proposed approach to solve it.

• No need to implement or run this approach, just propose it!

• It can be any data science approach, not restricted to the

approaches learned in this module.

• Recommendation: note that one of the marking criteria is the

suitability of the proposed approach.
• Explain why you believe your proposed approach is suitable!

2
Module Topics
1. Time series

2. Natural language processing

3. Computer vision

4. Fairness

3
Computer Vision
• Image fundamentals.
• Convolutional Neural Networks and variations.

[Image source: https://miro.medium.com/v2/resize: t:720/format:webp/1*uAeANQIOQPqWZnnuH-VEyw.jpeg] 4

fi
Computer Vision
• Image fundamentals.
• Convolutional Neural Networks and variations.
• Image classi cation, image segmentation.

[Image source: Deep Learning with Python book] 5

fi
Computer Vision
• Image fundamentals.
• Convolutional Neural Networks and variations.
• Image classi cation, image segmentation.

[Image source: Deep Learning with Python book] 6

fi
Outline
• Pixels — the building blocks of images.

• Grayscale and RGB colour space.

• Images as arrays.

• Scaling, aspect ratios, cropping.

• Kernels and convolutions.

7
8
Pixels

• An image can be seen as a grid.

• Pixel is the colour or intensity of a position in the grid.

• Smallest building block of an image.

9
Pixel Representation
• Grayscale
• Scalar between 0 (black) and 255 (white), i.e., in
{0,…,255}.

10
Pixel Representation
• Colour
• Various colour representations exist.

• Red, Green, Blue (RGB): each pixel has three channels.

• R: scalar in {0,…,255}.
• G: scalar in {0,…,255}.
• B: scalar in {0,…,255}.

11
RGB: Three Channels

[Image source: Deep Learning for Computer Vision with Python book] 12
Images as Arrays:
1 Channel (2d Matrices)

• Images are matrices, indexed

by (row, column)

M= M1,1, M1,2, M1,3, …

M2,1, M2,2, M2,3, …
M3,1, M2,1, M2,2, …
…

13
Images as Arrays:
1 Channel (2d Matrices)

• Images are arrays, indexed by

(row, column)

M= M[0,0], M[0,1], M[0,2], …

M[1,0], M[1,1], M[1,2], …
M[2,0], M[2,1], M[2,2], …
…

14
Images as Arrays:
3 Channels (3d Tensors)

• Images are arrays, indexed by

(row, column, depth)

M[0,0,2],
M[0,0,1],
M[0,0,0],

Note: the above corresponds to the

OpenCV library, which actually stores
pixels in BGR. Other libraries like
matplotlib assume RGB, such that the red
pixel would be M[0,0,0].
15
Image Resolution

Height

Width

• Level of detail.
• Resolution of 1000 x 750 means Note: a pixel may have one or more
• 1000 pixels wide and channels (depth of image). This is
not considered for the resolution.
• 750 pixels tall.
• I.e., 750,000 pixels in total. 16
Demo
• image-fundamentals.ipynb
• Load an image with OpenCV library
• Convert from BGR to RGB
• Convert image to Grayscale
• Plot each channel of the image

17
Scaling (Resizing)
• Increasing or decreasing the size of an image in terms of
width and height.
• We will often need to do this when using machine learning.

• Aspect ratio: width / height.

Note: aesthetically, maintaining the aspect ratio

is important, but it is not always the case for
image machine learning problems.

[Image source: Deep Learning for Computer Vision with Python book] 18
Cropping

19
Demo
• image-fundamentals.ipynb
• Rescaling an image
• Rescaling by factor
• Cropping an image

20
Image Convolution
• Application of a lter to an image, e.g., to:
• Blur.
• Sharpen.
• Detect edges.
• Etc.

21
fi
Image Convolution
• Application of a lter to an image, e.g., to:
• Blur.
• Sharpen.
• Detect edges.
• Etc.

22
fi
Image Convolution
• Application of a lter to an image, e.g., to:
• Blur.
• Sharpen.
• Detect edges.
• Etc.

23
fi
Convolution
• Based on an element-wise multiplication of two matrices,
followed by a sum.

0 -1 0 93 139 101

-1 5 -1 ⋆ 26 252 196 =
0 -1 0 135 230 18

Note: mathematically,
(0 x 93) + (-1 x 139) + (0 x 101) this operation is
called cross-
+(-1 x 26) + (5 x 252) + (-1 x 196) correlation, but the
computer vision uses
+ (0 x 135) + (-1 x 230) + (0 x 18) the term convolution

≈ 669
for it.
24
Convolution
• Operation between a “large” matrix (image) and a “small” matrix ( lter,
a.k.a., kernel) that produces another matrix (image).

• The kernel slides over the image from left to right, top to bottom.

0 -1 0 93 139 101 4 3 5 4 3 5 4

-1 5 -1 669 196 3
26 252 2 5 3 2 5 3

0 -1 0 135 230 18 2 4 3 2 4 3 2

Note: the convolution is 1 3 5 4 3 5 4 3 5 4

performed by applying the kernel 7 2 5 3 2 5 3 2 5 3
to the original values of the
1 4 3 2 4 3 2 4 3 2
image!
1 3 5 4 3 5 4 3 5 4
Note: convolution is usually
applied separately to different 7 2 5 3 2 5 3 2 5 3
channels. 1 4 3 2 4 3 2 4 3 2

Note: the resolution of the new 1 3 5 4 3 5 4 3 5 4

image is smaller. fi
25
Zero Padding

0 0 0 0 0 0 0 0 0 0 0 0

0 -1 0 0 93 139 101 4 3 5 4 3 5 4 0

-1 5 -1 0 26 252 196 3 2 5 3 2 5 3 0

0 -1 0 0 135 230 18 2 4 3 2 4 3 2 0
0 1 3 5 4 3 5 4 3 5 4 0

0 7 2 5 3 2 5 3 2 5 3 0
0 1 4 3 2 4 3 2 4 3 2 0
0 1 3 5 4 3 5 4 3 5 4 0
Note: other forms of padding
exist, like replicating the values at 0 7 2 5 3 2 5 3 2 5 3 0
the borders.
0 1 4 3 2 4 3 2 4 3 2 0
0 1 3 5 4 3 5 4 3 5 4 0
0 0 0 0 0 0 0 0 0 0 0 0
26
Intensities Outside Range
[0,255]
• Clip to the nearest allowed value, e.g.:
• -1 becomes 0.
• 300 becomes 255.

• Rescale, e.g.:
• values between -1 and 300 are rescaled to the range
{0,…,255}.

old_value − old_min
new_value = 255 ×
old_max − old_min
• E.g.: rescale value of 150 considering that the values are
between -1 and 300.
150 - (-1)
= 255 x ≈ 128
300 - (-1) 27
Demo
• image-fundamentals.ipynb
• Applying convolutions

28
Examples of Kernels
Sharpen Blur
0 -1 0 1/9 1/9 1/9

-1 5 -1 1/9 1/9 1/9

0 -1 0 1/9 1/9 1/9

Edge
Laplacian Sobel X Sobel Y
Detection
-1 -1 -1 0 1 0 -1 0 1 -1 -2 -1

-1 8 -1 1 -4 1 -2 0 2 0 0 0

-1 -1 -1 0 1 0 -1 0 1 1 2 1

29
Kernels and Computer
Vision Problems
• Kernels were typically designed to create features to be given
as inputs to machine learning algorithms.

• Designing kernels is challenging and problem-dependent.

• What about learning kernels automatically?

30
Summary
• Images are represented as arrays storing their pixels.
• The width is represented by the columns.
• The height by the rows.
• The channels by the depth.

• Image resolution corresponds to the level of detail in number of

pixels.

• Scaling, aspect ratio, cropping.

• Convolution, lters/kernels, padding, dealing with values

outside range.

• Next: Convolutional Neural Networks (CNNs).

31
fi
References
• Deep Learning for Computer Vision with Python, Chp 3 and
Chp 11 until 11.1.4 (inclusive).
• If you are keen on programming, you can also read Section
11.1.5.
• If you are not keen on programming, you can still read this
section, but you don’t need to pay attention at the coding
part.
• No matter if you read or don’t read this section, you need to
understand the fact that we are sliding the kernel over the
image and may need padding, as explained in this lecture.

Digital Image Representation - Unit1
No ratings yet
Digital Image Representation - Unit1
26 pages
AWS vs. Azure vs. Google: Cloud Comparison (2019 Update)
67% (3)
AWS vs. Azure vs. Google: Cloud Comparison (2019 Update)
16 pages
SKM4213 C3 Pre-Processing
No ratings yet
SKM4213 C3 Pre-Processing
80 pages
Image Processing
No ratings yet
Image Processing
37 pages
Computer Vision Course Lecture 2
No ratings yet
Computer Vision Course Lecture 2
53 pages
Image Summary
No ratings yet
Image Summary
30 pages
ISYE 8803 - Kamran - M2 - Image Processing
No ratings yet
ISYE 8803 - Kamran - M2 - Image Processing
54 pages
11.image Processing in Agriculture
No ratings yet
11.image Processing in Agriculture
37 pages
Images, Neural Networks, CNNs
No ratings yet
Images, Neural Networks, CNNs
26 pages
P3-Arithmetic and Logic Operations
No ratings yet
P3-Arithmetic and Logic Operations
35 pages
Image Processing: Dept - of Instrumentation Science University of Pune
No ratings yet
Image Processing: Dept - of Instrumentation Science University of Pune
54 pages
Lect3 PDF
No ratings yet
Lect3 PDF
47 pages
Week-16 Lecture-32
No ratings yet
Week-16 Lecture-32
65 pages
Image Processing: Point Processing Filters Dithering Image Compositing Image Compression
No ratings yet
Image Processing: Point Processing Filters Dithering Image Compositing Image Compression
51 pages
MMC UNIT 1 Part1_chapter 3
No ratings yet
MMC UNIT 1 Part1_chapter 3
56 pages
Lec 2 Image Processing
No ratings yet
Lec 2 Image Processing
69 pages
T2310 - TDS3651 - L02 - Manipulating Pixels
No ratings yet
T2310 - TDS3651 - L02 - Manipulating Pixels
56 pages
Content-Based Image Retrieval (CBIR) : Match
No ratings yet
Content-Based Image Retrieval (CBIR) : Match
71 pages
3.2. Image Compression (Part 1)
No ratings yet
3.2. Image Compression (Part 1)
29 pages
Computer Vision
No ratings yet
Computer Vision
38 pages
UNIT-II Computer Graphics
No ratings yet
UNIT-II Computer Graphics
91 pages
Convolutional_Networks_2024
No ratings yet
Convolutional_Networks_2024
44 pages
Image Processin G: Landsat 7 Image of The Retreating Malaspina Glacier, Alaska
No ratings yet
Image Processin G: Landsat 7 Image of The Retreating Malaspina Glacier, Alaska
22 pages
Lecture 1 Machine Vision 1 2022
No ratings yet
Lecture 1 Machine Vision 1 2022
78 pages
IT5409 Ch5 Segmentation v2
No ratings yet
IT5409 Ch5 Segmentation v2
64 pages
Image Enhancement Lesson6 20thFeb2024
No ratings yet
Image Enhancement Lesson6 20thFeb2024
59 pages
BENC 4483enhancement and Restoration
No ratings yet
BENC 4483enhancement and Restoration
89 pages
Digital Signal and Image Processing Using Matlab: Compiled by
No ratings yet
Digital Signal and Image Processing Using Matlab: Compiled by
41 pages
Lec 1 Print
No ratings yet
Lec 1 Print
13 pages
To Opencv: Marvin Smith
100% (2)
To Opencv: Marvin Smith
29 pages
MPA2
No ratings yet
MPA2
56 pages
Prof. Dr. Ashwaq Mahmood Image Processing 2024-2025
No ratings yet
Prof. Dr. Ashwaq Mahmood Image Processing 2024-2025
24 pages
1 (Autosaved)
No ratings yet
1 (Autosaved)
139 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
161 pages
Chapter 1 DIP
No ratings yet
Chapter 1 DIP
45 pages
Pcomms
No ratings yet
Pcomms
24 pages
4-+Computer+Vision+24-10-2023.pdf_083200
No ratings yet
4-+Computer+Vision+24-10-2023.pdf_083200
26 pages
Image Representation and Discretization PDF
No ratings yet
Image Representation and Discretization PDF
34 pages
Lecture4 Arithmetic Logical
No ratings yet
Lecture4 Arithmetic Logical
20 pages
Lecture - 2 Multimedia
No ratings yet
Lecture - 2 Multimedia
35 pages
Lecture 8 Segmentation
No ratings yet
Lecture 8 Segmentation
54 pages
Lecture - 3 - Image - Processing - Toolbox - IPT - Matlab
No ratings yet
Lecture - 3 - Image - Processing - Toolbox - IPT - Matlab
4 pages
06. Mathematical Tools - Contd.,
No ratings yet
06. Mathematical Tools - Contd.,
31 pages
Chapter 2 Digital Image Fundamentalsn Final
No ratings yet
Chapter 2 Digital Image Fundamentalsn Final
85 pages
Chap 10
No ratings yet
Chap 10
73 pages
Physics 4 Halv, No
No ratings yet
Physics 4 Halv, No
47 pages
Two-Dimensional Wavelets: ECE 802 Spring 2010
No ratings yet
Two-Dimensional Wavelets: ECE 802 Spring 2010
61 pages
Image Processing in Matlab: TBMI02 Medical Image Analysis
No ratings yet
Image Processing in Matlab: TBMI02 Medical Image Analysis
24 pages
Baraniuk IMA Compression June07 Final
No ratings yet
Baraniuk IMA Compression June07 Final
87 pages
Dip Unit 4
No ratings yet
Dip Unit 4
58 pages
2.0 Display Graphics V1
No ratings yet
2.0 Display Graphics V1
20 pages
Fundamental Image Processing Steps: sensors/cameras/CCD Cameras Etc.. in Digital Form
No ratings yet
Fundamental Image Processing Steps: sensors/cameras/CCD Cameras Etc.. in Digital Form
44 pages
RAT292 M3 Part 2 Sensors and Actuators
No ratings yet
RAT292 M3 Part 2 Sensors and Actuators
55 pages
DIP chapter 4
No ratings yet
DIP chapter 4
25 pages
Physics 4
No ratings yet
Physics 4
93 pages
ACFrOgCfX9ATrHm9ZSjs1HLKnJCXmmPcIwFi Y7hVAv6zU1Li3igjIXOOLtGhffODBql8a993YAsc3gM SE8bidlMJr2eFkl9eJB0BU8jcLD6iWrroxwbp1 X9yQtpQks6r8vMLEnR-ORk02lgVJ
No ratings yet
ACFrOgCfX9ATrHm9ZSjs1HLKnJCXmmPcIwFi Y7hVAv6zU1Li3igjIXOOLtGhffODBql8a993YAsc3gM SE8bidlMJr2eFkl9eJB0BU8jcLD6iWrroxwbp1 X9yQtpQks6r8vMLEnR-ORk02lgVJ
20 pages
Multimedia Systems-L4
No ratings yet
Multimedia Systems-L4
26 pages
Img Mod1 Session4 PDF
No ratings yet
Img Mod1 Session4 PDF
49 pages
Digital Image Fundamentals
No ratings yet
Digital Image Fundamentals
50 pages
Computer Graphics in Python
From Everand
Computer Graphics in Python
Martin McBride
No ratings yet
SVG Drawing with HTML5
From Everand
SVG Drawing with HTML5
Hussein Qutbi
No ratings yet
Sumit Singh Resumee
No ratings yet
Sumit Singh Resumee
1 page
Balsemão - 2021 - On The Heuristic Value of Luhmann's Systems Theory
No ratings yet
Balsemão - 2021 - On The Heuristic Value of Luhmann's Systems Theory
4 pages
The One Hundred Layers Tiramisu: Fully Convolutional Densenets For Semantic Segmentation
No ratings yet
The One Hundred Layers Tiramisu: Fully Convolutional Densenets For Semantic Segmentation
9 pages
Can Computers Think
100% (1)
Can Computers Think
43 pages
Machine Learning-AI For A Business Problem
No ratings yet
Machine Learning-AI For A Business Problem
16 pages
Experiment 3.3
No ratings yet
Experiment 3.3
3 pages
Real Time Object Detection Using OpenCV and YOLO
No ratings yet
Real Time Object Detection Using OpenCV and YOLO
8 pages
Khushi PPT
No ratings yet
Khushi PPT
6 pages
Chapter 3
No ratings yet
Chapter 3
15 pages
ChatGPT Assignments To Use in Your Classroom Today
No ratings yet
ChatGPT Assignments To Use in Your Classroom Today
145 pages
Synopsis of Courses
No ratings yet
Synopsis of Courses
18 pages
On Hydroinformatics Masters in Ihe Delft v09
No ratings yet
On Hydroinformatics Masters in Ihe Delft v09
41 pages
Machine Learning Marking Criteria Portfolio Part 3
No ratings yet
Machine Learning Marking Criteria Portfolio Part 3
1 page
Pysentimiento: A Python Toolkit For Sentiment Analysis and Socialnlp Tasks
No ratings yet
Pysentimiento: A Python Toolkit For Sentiment Analysis and Socialnlp Tasks
4 pages
Project 1 (Final Copy) Checked
No ratings yet
Project 1 (Final Copy) Checked
46 pages
Artificial Intelligence Ai and Human rights-QA0224534ENN
No ratings yet
Artificial Intelligence Ai and Human rights-QA0224534ENN
119 pages
06 _ Day 06 _ Omagic _ Cgi
No ratings yet
06 _ Day 06 _ Omagic _ Cgi
32 pages
Image Detection and Segmentation Using YOLO v5 For
No ratings yet
Image Detection and Segmentation Using YOLO v5 For
6 pages
Recruitment Report 2022 2023 PDF 1668475931
No ratings yet
Recruitment Report 2022 2023 PDF 1668475931
16 pages
Book Summary
No ratings yet
Book Summary
35 pages
UNIT-1 PPT AI - dum
No ratings yet
UNIT-1 PPT AI - dum
70 pages
UGRD-AI6100 - MIDTERM EXAM - Attempt PERFECT
No ratings yet
UGRD-AI6100 - MIDTERM EXAM - Attempt PERFECT
11 pages
2024 IMMC Problem
No ratings yet
2024 IMMC Problem
8 pages
Deepseek report
No ratings yet
Deepseek report
12 pages
Lpe2501 Writing Portfolio Task 2 (Paraphrase & Summary Form - Final)
No ratings yet
Lpe2501 Writing Portfolio Task 2 (Paraphrase & Summary Form - Final)
8 pages
Q3.Information Communication Technology
No ratings yet
Q3.Information Communication Technology
11 pages
C3.Ai A New Technology Stack
No ratings yet
C3.Ai A New Technology Stack
22 pages
Assignment-1
No ratings yet
Assignment-1
2 pages
Câu Lệnh Promt Hình Ảnh
No ratings yet
Câu Lệnh Promt Hình Ảnh
72 pages

3.1 - Image Fundamentals

Uploaded by

3.1 - Image Fundamentals

Uploaded by

Current Topics in Data Science

• Your proposed approach to solve it.

• No need to implement or run this approach, just propose it!

• It can be any data science approach, not restricted to the

• Recommendation: note that one of the marking criteria is the

2. Natural language processing

[Image source: https://miro.medium.com/v2/resize: t:720/format:webp/1*uAeANQIOQPqWZnnuH-VEyw.jpeg] 4

[Image source: Deep Learning with Python book] 5

[Image source: Deep Learning with Python book] 6

• Grayscale and RGB colour space.

• Scaling, aspect ratios, cropping.

• Kernels and convolutions.

• An image can be seen as a grid.

• Pixel is the colour or intensity of a position in the grid.

• Smallest building block of an image.

• Red, Green, Blue (RGB): each pixel has three channels.

• Images are matrices, indexed

M= M1,1, M1,2, M1,3, …

• Images are arrays, indexed by

M= M[0,0], M[0,1], M[0,2], …

• Images are arrays, indexed by

Note: the above corresponds to the

• Aspect ratio: width / height.

Note: aesthetically, maintaining the aspect ratio

Note: the convolution is 1 3 5 4 3 5 4 3 5 4

Note: the resolution of the new 1 3 5 4 3 5 4 3 5 4

-1 5 -1 1/9 1/9 1/9

0 -1 0 1/9 1/9 1/9

• Designing kernels is challenging and problem-dependent.

• What about learning kernels automatically?

• Image resolution corresponds to the level of detail in number of

• Scaling, aspect ratio, cropping.

• Convolution, lters/kernels, padding, dealing with values

• Next: Convolutional Neural Networks (CNNs).

You might also like