
ESTD. 2001
PRATHYUSHA ENGINEERING COLLEGE


DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

LAB MANUAL

CCS349 - IMAGE AND VIDEO ANALYTICS LABORATORY

(Regulation 2021, V Semester)


(Odd Semester)

ACADEMIC YEAR: 2023 - 2024

PREPARED BY
B. Gunasundari,
Assistant Professor / CSE
CCS349 IMAGE AND VIDEO ANALYTICS LAB

Exp.no. 1
T-pyramid of an image
Aim:
To write a program that computes the T-pyramid of an image.

Algorithm:
1. Import the required libraries, OpenCV (cv2) and matplotlib.
2. Load an image "img1.jpg" from the specified file path.
3. Initialize a variable layer as a copy of the loaded image. This copy will be repeatedly
downsampled to create the pyramid levels.
4. Iterate through a loop four times (from i = 0 to 3).
5. Inside the loop:
• Create a subplot in a 2x2 grid (total of 4 subplots).
• Downsample the layer using cv2.pyrDown(). This operation reduces the
image size by half, creating a new level of the pyramid.
• Display the downsampled image using plt.imshow() within the current
subplot.
• Also display each level in its own OpenCV window using cv2.imshow(),
with the window title taken from the current loop index i.
6. After the loop, call plt.show() to render the 2x2 grid of pyramid levels.
7. Finally, the code calls cv2.waitKey(0) to keep the OpenCV windows open until a key
is pressed, and then it closes the OpenCV windows with cv2.destroyAllWindows().

Program:

import cv2
import matplotlib.pyplot as plt

img = cv2.imread(r"E:\Backup 14.4.23\image\img1.jpg")

layer = img.copy()

for i in range(4):
    plt.subplot(2, 2, i + 1)
    # each pyrDown call halves the image dimensions
    layer = cv2.pyrDown(layer)
    plt.imshow(layer)
    cv2.imshow(str(i), layer)

plt.show()
cv2.waitKey(0)
cv2.destroyAllWindows()
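As a quick sanity check (a sketch, not part of the required program), the size of each pyramid level can be printed; every cv2.pyrDown call roughly halves both dimensions, so a 512x512 input gives levels of about 256x256, 128x128, 64x64 and 32x32.

import cv2

img = cv2.imread(r"E:\Backup 14.4.23\image\img1.jpg")  # same path as above
layer = img.copy()
for i in range(4):
    layer = cv2.pyrDown(layer)
    # shape is (height, width, channels); printed as width x height
    print(f"level {i + 1}: {layer.shape[1]}x{layer.shape[0]}")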

Result:

Thus the program that computes the T-pyramid of an image is executed successfully.

Exp. No.2
QUAD TREE

Aim:
To write a program that derives the quad tree representation of an image using the
homogeneity criterion of equal intensity.

Algorithm:
1. Define a Quadtree Node structure to represent each node in the quadtree. Each node
should contain the following information:
• Position (x, y): The top-left corner of the node within the image.
• Size: The width and height of the node.
• Color: The dominant color of the node.
• Children: An array or a dictionary to store child nodes.
• Termination Condition: A condition that determines when to stop subdividing.
2. Initialize the quadtree by creating the root node, which represents the entire image.
3. Define the termination condition, which could be based on a threshold for color
similarity, a maximum depth, or any other criterion. If the termination condition is met,
mark the current node as a leaf node.
4. If the termination condition is not met, subdivide the current node into four quadrants,
each representing a subregion of the image:
• Divide the current node's size by 2.
• Create four child nodes, one for each quadrant.
• Determine the dominant color for each quadrant.
• Recursively apply the quadtree algorithm to each child node.
5. Repeat this process until the termination condition is met for every node; a worked
sketch of the quadrant split appears below.
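The split in step 4 can be illustrated on plain coordinates before reading the full program. A minimal sketch (the helper name split_bbox is hypothetical; the program's split_quadrant method performs the same computation on PIL bounding boxes):

def split_bbox(bbox):
    # bbox is (left, top, right, bottom)
    left, top, right, bottom = bbox
    mid_x = left + (right - left) / 2
    mid_y = top + (bottom - top) / 2
    return [
        (left, top, mid_x, mid_y),      # upper-left
        (mid_x, top, right, mid_y),     # upper-right
        (left, mid_y, mid_x, bottom),   # bottom-left
        (mid_x, mid_y, right, bottom),  # bottom-right
    ]

print(split_bbox((0, 0, 512, 512)))
# [(0, 0, 256.0, 256.0), (256.0, 0, 512, 256.0),
#  (0, 256.0, 256.0, 512), (256.0, 256.0, 512, 512)]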

Program:
import numpy as np
import cv2
from PIL import Image, ImageDraw

MAX_DEPTH = 8
DETAIL_THRESHOLD = 13  # assumed value; the threshold is illegible in the original
SIZE_MULT = 1

def average_colour(image):
    # convert image to a numpy array
    image_arr = np.asarray(image)

    # get average colour of the whole image
    avg_color_per_row = np.average(image_arr, axis=0)
    avg_color = np.average(avg_color_per_row, axis=0)

    return (int(avg_color[0]), int(avg_color[1]), int(avg_color[2]))

def weighted_average(hist):
    total = sum(hist)
    error = value = 0

    if total > 0:
        value = sum(i * x for i, x in enumerate(hist)) / total
        error = sum(x * (value - i) ** 2 for i, x in enumerate(hist)) / total
        error = error ** 0.5

    return error

def get_detail(hist):
    red_detail = weighted_average(hist[:256])
    green_detail = weighted_average(hist[256:512])
    blue_detail = weighted_average(hist[512:768])

    # standard luminance weights
    detail_intensity = red_detail * 0.2989 + green_detail * 0.5870 + blue_detail * 0.1140

    return detail_intensity

class Quadrant():
    def __init__(self, image, bbox, depth):
        self.bbox = bbox
        self.depth = depth
        self.children = None
        self.leaf = False

        # crop image to quadrant size
        image = image.crop(bbox)
        hist = image.histogram()

        self.detail = get_detail(hist)
        self.colour = average_colour(image)

    def split_quadrant(self, image):
        left, top, width, height = self.bbox

        # get the middle coords of the bbox
        middle_x = left + (width - left) / 2
        middle_y = top + (height - top) / 2

        # split this quadrant into 4 new quadrants
        upper_left = Quadrant(image, (left, top, middle_x, middle_y), self.depth + 1)
        upper_right = Quadrant(image, (middle_x, top, width, middle_y), self.depth + 1)
        bottom_left = Quadrant(image, (left, middle_y, middle_x, height), self.depth + 1)
        bottom_right = Quadrant(image, (middle_x, middle_y, width, height), self.depth + 1)

        # add the new quadrants as children
        self.children = [upper_left, upper_right, bottom_left, bottom_right]
class QuadTree():
    def __init__(self, image):
        self.width, self.height = image.size

        # keep track of max depth achieved by recursion
        self.max_depth = 0

        # start compression
        self.start(image)

    def start(self, image):
        # create initial root
        self.root = Quadrant(image, image.getbbox(), 0)

        # build quadtree
        self.build(self.root, image)

    def build(self, root, image):
        if root.depth >= MAX_DEPTH or root.detail <= DETAIL_THRESHOLD:
            if root.depth > self.max_depth:
                self.max_depth = root.depth

            # mark the quadrant as a leaf and stop recursing
            root.leaf = True
            return

        # split the quadrant if there is too much detail
        root.split_quadrant(image)

        for child in root.children:
            self.build(child, image)

    def create_image(self, custom_depth, show_lines=False):
        # create blank image canvas
        image = Image.new('RGB', (self.width, self.height))
        draw = ImageDraw.Draw(image)
        draw.rectangle((0, 0, self.width, self.height), (0, 0, 0))

        leaf_quadrants = self.get_leaf_quadrants(custom_depth)

        # draw a rectangle the size of each leaf quadrant
        for quadrant in leaf_quadrants:
            if show_lines:
                draw.rectangle(quadrant.bbox, quadrant.colour, outline=(0, 0, 0))
            else:
                draw.rectangle(quadrant.bbox, quadrant.colour)

        return image

    def get_leaf_quadrants(self, depth):
        if depth > self.max_depth:
            raise ValueError("A depth larger than the tree's depth was given")

        quadrants = []

        # search recursively down the quadtree
        self.recursive_search(self, self.root, depth, quadrants.append)

        return quadrants

    def recursive_search(self, tree, quadrant, max_depth, append_leaf):
        # append if the quadrant is a leaf
        if quadrant.leaf == True or quadrant.depth == max_depth:
            append_leaf(quadrant)

        # otherwise keep recursing
        elif quadrant.children != None:
            for child in quadrant.children:
                self.recursive_search(tree, child, max_depth, append_leaf)

    def create_gif(self, file_name, duration=1000, loop=0, show_lines=False):
        gif = []
        end_product_image = self.create_image(self.max_depth, show_lines=show_lines)

        for i in range(self.max_depth):
            image = self.create_image(i, show_lines=show_lines)
            gif.append(image)

        # add extra copies of the final image at the end
        for _ in range(4):
            gif.append(end_product_image)

        gif[0].save(
            file_name,
            save_all=True,
            append_images=gif[1:],
            duration=duration, loop=loop)

if __name__ == '__main__':
    # image_path = "./images/eye.jpg"
    image_path = r"E:\Backup 14.4.23\image\img1.jpg"

    # load image
    image = Image.open(image_path)
    image = image.resize((image.size[0] * SIZE_MULT, image.size[1] * SIZE_MULT))

    # create quadtree
    quadtree = QuadTree(image)

    # create image with a custom depth (value assumed; illegible in the original)
    depth = 7
    image = quadtree.create_image(depth, show_lines=False)
    # show_lines value assumed; the call is truncated in the original
    quadtree.create_gif("mountain_quadtree.gif", show_lines=True)

    # show image
    # image.show()

    image.save(r"E:\Backup 14.4.23\image\img11.jpg")

Result:
Thus the program that derives the quad tree representation of an image using the homogeneity
criterion of equal intensity is executed successfully.
Exp. No.3
GEOMETRIC TRANSFORMATION OF IMAGE

Aim:
To develop programs for the following geometric transforms: (a) Rotation (b) Change
of scale (c) Skewing (d) Affine transform calculated from three pairs of corresponding
points

Algorithm:
(a) Rotation

1. Import the Pillow library:


• The code starts by importing the Image module from the Pillow library.
2. Open the original image:
• It opens an image from the file path "E:\Backup 14.4.23\image\img1.jpg" and
assigns it to the Original_Image variable.
3. Rotate the image by 150 degrees:
• The rotate method is used to rotate the original image by 150 degrees, and the
result is stored in rotated_image1.
4. Rotate the image by 90 degrees (counter-clockwise):
• The transpose method is used with the argument Image.ROTATE_90 to rotate
the original image by 90 degrees counter-clockwise. The result is stored in
rotated_image2.
5. Rotate the image by 60 degrees:
• The rotate method is used to rotate the original image by 60 degrees, and the
result is stored in rotated_image3.
6. Display the rotated images:
• The show method is called on each of the rotated images to display them.

(b) Change of scale

1. Import the OpenCV library:


• The code starts by importing the OpenCV library.
2. Read the original image:
• It reads an image from the file path "E:\Backup 14.4.23\image\img1.jpg"
using cv2.imread with the flag cv2.IMREAD_UNCHANGED. This flag
loads the image as-is, including the alpha channel if it exists.
3. Print the original image dimensions:
• The code prints the original image's dimensions (height, width, and number of
channels) using img.shape.
4. Calculate the new dimensions for resizing:
• The code calculates the new dimensions for resizing based on a specified scale
percentage. In this case, the image is resized to 40% of its original size.
5. Resize the image:
• The cv2.resize function is used to resize the image to the new dimensions (dim)
using the specified interpolation method (cv2.INTER_AREA). The
interpolation method is often used for downscaling to ensure better quality.
6. Print the resized image dimensions:
• The code prints the dimensions of the resized image using resized.shape.
7. Display the resized image:
• The resized image is displayed using cv2.imshow.
8. Wait for a key press and close the window:
• The code waits for a key press with cv2.waitKey().
• It then closes all OpenCV windows using cv2.destroyAllWindows().
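As a worked example of the arithmetic in steps 4 and 5 (the 800x600 input size is hypothetical):

# 40% of a hypothetical 800x600 image
scale_percent = 40
width = int(800 * scale_percent / 100)   # 320
height = int(600 * scale_percent / 100)  # 240
dim = (width, height)
print(dim)  # (320, 240)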

(c) Skewing

1. Import the necessary libraries:


• numpy for numerical operations.
• skimage for image processing.
• deskew to perform skew detection and correction.
2. Read an image:
• It reads an image from the file path "E:\imageoutput.jpg" using io.imread from
scikit-image.
3. Convert the image to grayscale:
• The code converts the color image to grayscale using rgb2gray from scikit-
image.
4. Determine the skew angle:
• The determine_skew function from the deskew library is used to
automatically determine the skew angle in the grayscale image.
5. Rotate the image to correct the skew:
• The code rotates the original image by the determined angle using rotate from
scikit-image. This corrects the skew in the image.
• The result is multiplied by 255 to ensure that pixel values remain in the range
[0, 255].
6. Save the corrected image:
• The corrected image is saved to "E:\imageoutput1.jpg" using io.imsave. It's
cast to the np.uint8 data type to ensure the correct data type for image saving.
(d) Affine transform calculated from three pairs of corresponding points

1. Import necessary libraries:


• The code imports OpenCV (cv2) for image processing, NumPy (np) for
numerical operations, and Matplotlib (plt) for displaying images.
2. Read and convert the image:
• The code reads an image from the file path "E:/img.jpg" using cv2.imread. It
is then converted to the RGB color space using cv2.cvtColor.
3. Define source and target points:
• pt1 contains the coordinates of the three vertices of a triangular region in the
source image.
• pt2 contains the corresponding coordinates for the three vertices in the output
image, defining how the triangular region should be transformed.
4. Create a transformation matrix:
• The cv2.getAffineTransform function is used to calculate the affine
transformation matrix (Mat) based on the source (pt1) and target (pt2) points.
5. Apply the affine transformation:
• cv2.warpAffine is used to apply the affine transformation to the original image
(img) using the transformation matrix (Mat). The result is stored in the dst
variable.
6. Display the original and the transformed images:
• The code uses Matplotlib to plot the original image and the transformed
image side by side.
7. Show the images:
• The images are displayed using plt.show().

Program:

(a) Rotation

from PIL import Image

Original_Image = Image.open(r"E:\Backup 14.4.23\image\img1.jpg")

# Rotate Image By 150 Degree


rotated_image1 = Original_Image.rotate(150)

rotated_image2 = Original_Image.transpose(Image.ROTATE_90)

rotated_image3 = Original_Image.rotate(60)

rotated_image1.show()
rotated_image2.show()
rotated_image3.show()
(b) Change of scale

import cv2

img = cv2.imread(r"E:\Backup 14.4.23\image\img1.jpg", cv2.IMREAD_UNCHANGED)

print('Original Dimensions : ',img.shape)

scale_percent = 40  # percent of original size

width = int(img.shape[1] * scale_percent / 100)
height = int(img.shape[0] * scale_percent / 100)
dim = (width, height)

# resize image
resized = cv2.resize(img, dim, interpolation=cv2.INTER_AREA)

print('Resized Dimensions : ',resized.shape)

cv2.imshow("Resized image", resized)


cv2.waitKey(0)
cv2.destroyAllWindows()

(c) Skewing

import numpy as np
from skimage import io
from skimage.color import rgb2gray
from skimage.transform import rotate
from deskew import determine_skew

image = io.imread(r'E:\imageoutput.jpg')
grayscale = rgb2gray(image)
angle = determine_skew(grayscale)
rotated = rotate(image, angle, resize=True) * 255
io.imsave(r'E:\imageoutput1.jpg', rotated.astype(np.uint8))

(d) Affine transform calculated from three pairs of corresponding points

# Importing OpenCV
import cv2
# Importing numpy
import numpy as np
# Importing matplotlib.pyplot
import matplotlib.pyplot as plt

# Reading the image
img = cv2.imread(r"E:/img.jpg")
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
rows, cols, ch = img.shape

# Coordinates of the triangular vertices in the source image
pt1 = np.float32([[50, 50],
                  [200, 50],
                  [50, 200]])
# Coordinates of the corresponding triangular vertices in the output image
pt2 = np.float32([[10, 100],
                  [200, 50],
                  [100, 250]])

# Creating a transformation matrix
Mat = cv2.getAffineTransform(pt1, pt2)
dst = cv2.warpAffine(img, Mat, (cols, rows))

plt.figure(figsize=(10, 5))  # figure height assumed; truncated in the original
# Plotting the input image
plt.subplot(121)
plt.imshow(img)
plt.title('Input')
# Plotting the output image
plt.subplot(122)
plt.imshow(dst)
plt.title('Output')

plt.show()
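As an optional check on the matrix from step 4 (a sketch reusing pt1, pt2 and Mat from the program above), each source vertex expressed in homogeneous form [x, y, 1] should map onto the corresponding target vertex:

# append a column of ones so each row becomes [x, y, 1]
ones = np.ones((3, 1), dtype=np.float32)
homogeneous = np.hstack([pt1, ones])

# applying the 2x3 affine matrix maps the source triangle onto the target one
mapped = homogeneous @ Mat.T
print(np.allclose(mapped, pt2))  # True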

Output:

[Side-by-side Matplotlib plot: the input image on the left and the affine-transformed output on the right.]

Result:
Thus the programs for the geometric transforms (a) Rotation (b) Change of scale (c)
Skewing (d) Affine transform calculated from three pairs of corresponding points are
executed successfully.

Exp. No.4
Object Detection and Recognition
Aim:

To develop program to implement Object Detection and Recognition

Algorithm:
1. Import necessary libraries:
• cv2 for OpenCV functions.
• google.colab.patches for displaying images in a Colab notebook.
2. Load and resize an input image:
• Read an image from a file named 'image.jpg'.
• Resize the image to a size of 640x450 pixels.
3. Define the paths to the model and class label files:
• weights contains the path to the frozen inference graph file (the pre-trained
model).
• model contains the path to the model configuration file.
• coco_names.txt contains the class labels for the COCO dataset.
4. Load the MobileNet SSD model:
• Use cv2.dnn.readNetFromTensorflow to load the model using the provided
weights and model files.
5. Load class labels:
• Read class labels from the 'coco_names.txt’ file and store them in the
class_names list.
6. Create a blob from the input image:
• Prepare the image for inference using cv2.dnn.blobFromImage.
7. Pass the blob through the network:
• Set the blob as input to the network.
• Use net.forward() to obtain the output predictions.
8. Process the detection results:
• Loop over the detected objects in the output.
• For each detection, check the confidence score (probability).
• If the confidence is below 50%, continue to the next detection.
9. Draw bounding boxes and labels:
• Extract the (x, y) coordinates of the bounding box.
• Draw a green rectangle around the detected object.
• Extract the class ID to identify the object's name.
• Draw the object's name and the probability as text above the bounding box.
10. Display the resulting image:
• Use cv2_imshow to display the image with bounding boxes and labels.
• cv2.waitKey() waits for a key press.
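Steps 8 and 9 index into the rows of the network output. The sketch below spells out the assumed layout of one detection row for this model (seven values, with coordinates normalized to [0, 1]) and how it scales back to pixels; the numbers are hypothetical:

# assumed layout: [batch id, class id, confidence, left, top, right, bottom]
detection = [0.0, 1.0, 0.87, 0.10, 0.20, 0.45, 0.80]  # hypothetical values

h, w = 450, 640  # the resized image dimensions used in the program
box = [int(a * b) for a, b in zip(detection[3:7], [w, h, w, h])]
print(box)  # [64, 90, 288, 360]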

Program:

from google.colab.patches import cv2_imshow

import cv2

image = cv2.imread('image.jpg')
image = cv2.resize(image, (640, 450))
h = image.shape[0]
w = image.shape[1]

# paths to the weights and model files
weights = "frozen_inference_graph.pb"
model = "ssd_mobilenet_v3_large_coco_2020_01_14.pbtxt"

# load the MobileNet SSD model trained on the COCO dataset
net = cv2.dnn.readNetFromTensorflow(weights, model)

# load the class labels the model was trained on
class_names = []
with open("coco_names.txt", "r") as f:
    class_names = f.read().strip().split("\n")

# create a blob from the image
blob = cv2.dnn.blobFromImage(
    image, 1.0 / 127.5, (320, 320), [127.5, 127.5, 127.5])

# pass the blob through our network and get the output predictions
net.setInput(blob)
output = net.forward()  # shape: (1, 1, 100, 7)

# loop over the number of detected objects
for detection in output[0, 0, :, :]:  # output[0, 0, :, :] has a shape of: (100, 7)
    # the confidence of the model regarding the detected object
    probability = detection[2]

    # if the confidence of the model is lower than 50%,
    # we do nothing (continue looping)
    if probability < 0.5:
        continue

    # perform element-wise multiplication to get
    # the (x, y) coordinates of the bounding box
    box = [int(a * b) for a, b in zip(detection[3:7], [w, h, w, h])]
    box = tuple(box)

    # draw the bounding box of the object
    cv2.rectangle(image, box[:2], box[2:], (0, 255, 0), thickness=2)

    # extract the ID of the detected object to get its name
    class_id = int(detection[1])

    # draw the name of the predicted object along with the probability
    label = f"{class_names[class_id - 1].upper()} {probability * 100:.2f}%"
    cv2.putText(image, label, (box[0], box[1] + 15),
                cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 255, 0), 2)

cv2_imshow(image)
cv2.waitKey()
Result:
Thus the program to implement Object Detection and Recognition is executed
successfully and output is verified.

Exp. No.5

MOTION ANALYSIS USING MOVING EDGES

Aim:

To develop a program for motion analysis using moving edges.

Algorithm:

1. Import necessary libraries:


• cv2 for OpenCV functions.
• numpy for numerical operations.
• google.colab.patches for displaying images in a Colab notebook.
2. Open a video file for reading:
• cv2.VideoCapture is used to open a video file named "yolodetection.mp4" for
reading.
3. Get video properties:
• Retrieve the frame width and frame height of the video.
4. Define the codec for the output video:
• cv2.VideoWriter_fourcc is used to specify the codec for the output video. In
this case, it's set to 'XVID'.
5. Create a VideoWriter object for the output video:
• A VideoWriter object is created to write the processed video to "output.mp4"
with a frame rate of 5.0 frames per second and a frame size of 1250x720.
6. Read the first two frames of the video:
• cap.read() is used to read the first two frames of the video.
7. Start processing the video in a loop:
• The code enters a loop that processes each frame of the video.
8. Calculate the difference between consecutive frames:
• Calculate the absolute difference between frame1 and frame2 to identify areas
of motion.
9. Convert the difference frame to grayscale:
• Convert the difference frame to grayscale using cv2.cvtColor.
10. Apply Gaussian blur:
• Apply Gaussian blur to the grayscale frame to reduce noise.
11. Apply thresholding:
• Apply a threshold to the blurred frame to create a binary image where motion
areas are white.
12. Dilate the thresholded image:
• Dilate the thresholded image to make the white areas more prominent.
13. Find contours of motion:
• Find contours in the dilated image.
14. Iterate through the detected contours:
• For each contour, check its area. If the area is less than 900, it's likely not a
significant region and is skipped.
• If the area is significant, draw a green bounding box around the moving object
and display "Movement" status text.
15. Resize the frame:
• Resize the frame to a fixed size of 1250x720.
16. Write the frame to the output video:
• Write the processed frame to the output video using out.write().
17. Display the frame with bounding boxes:
• Display the frame with bounding boxes using cv2_imshow.
18. Update the frames for the next iteration:
• Set framel to the previous frame2.
• Read the next frame into frame2.
19. Check for the 'Esc' key (27) to exit the loop:
• Check if the 'Esc' key is pressed to exit the loop.
20. Release resources:
• Release OpenCV windows and the video capture and writer objects.
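Before the full program, a minimal sketch of the frame differencing in step 8, using two tiny synthetic frames:

import cv2
import numpy as np

frame1 = np.zeros((4, 4), dtype=np.uint8)
frame2 = frame1.copy()
frame2[1, 1] = 200  # a single "moving" pixel

# nonzero exactly where the two frames differ
print(cv2.absdiff(frame1, frame2))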
Program:

from google.colab.patches import cv2_imshow

import cv2
import numpy as np

cap = cv2.VideoCapture('/content/yolodetection.mp4')
frame_width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
frame_height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))

fourcc = cv2.VideoWriter_fourcc('X', 'V', 'I', 'D')
out = cv2.VideoWriter("output.mp4", fourcc, 5.0, (1250, 720))

ret, frame1 = cap.read()
ret, frame2 = cap.read()
print(frame1.shape)

while cap.isOpened():
    diff = cv2.absdiff(frame1, frame2)
    gray = cv2.cvtColor(diff, cv2.COLOR_BGR2GRAY)
    blur = cv2.GaussianBlur(gray, (5, 5), 0)
    _, thresh = cv2.threshold(blur, 20, 255, cv2.THRESH_BINARY)
    dilated = cv2.dilate(thresh, None, iterations=3)
    contours, _ = cv2.findContours(dilated, cv2.RETR_TREE,
                                   cv2.CHAIN_APPROX_SIMPLE)

    for contour in contours:
        (x, y, w, h) = cv2.boundingRect(contour)

        if cv2.contourArea(contour) < 900:
            continue
        cv2.rectangle(frame1, (x, y), (x + w, y + h), (0, 255, 0), 2)
        cv2.putText(frame1, "Status: {}".format('Movement'), (10, 20),
                    cv2.FONT_HERSHEY_SIMPLEX,
                    1, (0, 0, 255), 3)
    # cv2.drawContours(frame1, contours, -1, (0, 255, 0), 2)

    image = cv2.resize(frame1, (1250, 720))

    out.write(image)
    cv2_imshow(frame1)
    frame1 = frame2
    ret, frame2 = cap.read()

    if cv2.waitKey(40) == 27:
        break

cv2.destroyAllWindows()
cap.release()
out.release()

Result:
Thus the program for motion analysis using moving edges is executed successfully and
output is verified.

Exp. No.6

FACIAL DETECTION AND RECOGNITION


Aim:

To develop a program for Facial Detection and Recognition


Algorithm:

1. Install the required libraries:


• The code begins by installing the face_recognition library and the dlib library.
These libraries are used for face recognition and deep learning-based image
processing.
2. Import necessary libraries:
• face_recognition for facial recognition functionality.
• cv2 for OpenCV functions.
• numpy for numerical operations.
• os for file and directory operations.
3. Define the path to the directory containing known face images:
• The path variable points to a directory named "train" which contains known
face images.
4. Initialize lists for known names and their encodings:
• Two lists, known_names and known_name_encodings, are created to store
the names of known individuals and their face encodings.
5. Load known face images and compute face encodings:
• Loop through the images in the specified directory.
• Load each image using fr.load_image_file.
• Compute the face encoding for each image using fr.face_encodings.
• Add the name and encoding to the respective lists.
6. Load and process the test image:
• Load the test image using cv2.imread.
• Use fr.face_locations and fr.face_encodings to locate and encode the faces in
the test image.
7. Compare face encodings to known faces:
• For each detected face in the test image, compare its encoding to the known
face encodings using fr.compare_faces.
• Find the best match using np.argmin on the computed face distances.
8. Label and draw bounding boxes around recognized faces:
• Label each recognized face with the corresponding name.
• Draw a rectangle around each face and display the name.
9. Display the processed image with recognized faces:
• Display the image with bounding boxes and recognized names using
cv2_imshow.
10. Save the output image:
• Save the processed image with recognized faces to the specified output path using
cv2.imwrite.
11. Wait for a key press and close OpenCV windows:
• Wait for a key press (0) to keep the window open.
• Release OpenCV resources and close the window using cv2.waitKey and
cv2.destroyAllWindows.
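The program below assumes the "train" directory holds one image per known person, with the person's name as the file name. A sketch of that assumed layout (the file names are hypothetical):

# /content/drive/MyDrive/facer/train/
#     alice.jpg  -> known name "Alice"
#     bob.jpg    -> known name "Bob"
import os
print(os.path.splitext(os.path.basename("alice.jpg"))[0].capitalize())  # Alice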
Program:

!pip install face_recognition

from google.colab.patches import cv2_imshow

!pip install dlib

import face_recognition as fr
import cv2
import numpy as np
import os

path = "/content/drive/MyDrive/facer/train/"

known_names = []
known_name_encodings = []

images = os.listdir(path)
for _ in images:
    image = fr.load_image_file(path + _)
    image_path = path + _
    encoding = fr.face_encodings(image)[0]

    known_name_encodings.append(encoding)
    known_names.append(os.path.splitext(os.path.basename(image_path))[0].capitalize())

print(known_names)

test_image = "/content/drive/MyDrive/facer/test/test.jpg"

image = cv2.imread(test_image)
# image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)

face_locations = fr.face_locations(image)
face_encodings = fr.face_encodings(image, face_locations)

for (top, right, bottom, left), face_encoding in zip(face_locations, face_encodings):
    matches = fr.compare_faces(known_name_encodings, face_encoding)
    name = ""

    face_distances = fr.face_distance(known_name_encodings, face_encoding)
    best_match = np.argmin(face_distances)

    if matches[best_match]:
        name = known_names[best_match]

    cv2.rectangle(image, (left, top), (right, bottom), (0, 0, 255), 2)
    cv2.rectangle(image, (left, bottom - 15), (right, bottom), (0, 0, 255), cv2.FILLED)
    font = cv2.FONT_HERSHEY_DUPLEX
    cv2.putText(image, name, (left + 6, bottom - 6), font, 1.0, (255, 255, 255), 1)

cv2_imshow(image)
cv2.imwrite("/content/drive/MyDrive/facer/output.jpg", image)
cv2.waitKey(0)
cv2.destroyAllWindows()

Result:
Thus the program for Facial Detection and Recognition is executed successfully and
output is verified.
Exp. no:7

HAND GESTURE RECOGNITION

Aim:

To develop a program to recognize hand gesture.

Algorithm:

1. Import necessary libraries:


• cv2 for OpenCV functions.
• mediapipe for hand tracking and landmarks detection.
2. Open a video capture source:
• Capture video from the default camera (webcam) using cv2.VideoCapture(0).
3. Initialize MediaPipe Hand tracking:
• Create instances of mpHands.Hands for hand tracking and mpDraw for
drawing landmarks.
4. Define finger and thumb coordinates:
• fingerCoordinates is a list of tuples that define the landmarks for the fingertips.
Each tuple contains two landmark indices: the tip and the base of each finger.
• thumbCoordinate is a tuple that defines the landmarks for the thumb tip and
base.
5. Start an infinite loop for video processing:
• Continuously capture frames from the camera.
6. Read and process the captured frame:
• Read a frame from the camera using cap.read().
• Convert the frame from BGR to RGB color space, as MediaPipe requires
RGB input.
7. Process and detect hand landmarks:
• Use hands.process(imgRGB) to process the RGB image and detect hand
landmarks.
• Extract the landmarks from the results if hands are detected.
8. Draw hand landmarks and connections:
• If hands are detected, draw the hand landmarks and connections on the frame
using mpDraw.draw_landmarks.
9. Extract and visualize hand points:
• Extract the (x, y) coordinates of hand landmarks and store them in handPoints.
• Draw circles at the detected hand points on the frame.
10. Count the number of raised fingers:
• Check the relative positions of specific finger tip and base landmarks to
determine if a finger is raised. Increment upCount for each raised finger.
• Additionally, check the thumb position to see if it is raised.
11. Display the finger count:
• Draw the finger count on the frame using cv2.putText.
12. Display the processed frame:
• Show the processed frame with hand landmarks and finger count using
cv2.imshow.
13. Wait for a key press and update the frame:
• Wait for 1 millisecond using cv2.waitKey(1) to allow the frame to be displayed
and updated in the loop.
14. Close the OpenCV window:
• The loop continues until you press a key to exit the program. When a key is
pressed, the program closes the OpenCV window.
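MediaPipe's hand model returns 21 landmarks per detected hand, with fingertip indices 4 (thumb), 8 (index), 12 (middle), 16 (ring) and 20 (pinky). Each pair in fingerCoordinates compares a fingertip against a joint lower on the same finger, as this sketch shows:

fingerCoordinates = [(8, 6), (12, 10), (16, 14), (20, 18)]
for tip, joint in fingerCoordinates:
    print(f"fingertip landmark {tip} is compared against joint landmark {joint}")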

Program:

import cv2
import mediapipe as mp

cap = cv2.VideoCapture(0)
mpHands = mp.solutions.hands
hands = mpHands.Hands()
mpDraw = mp.solutions.drawing_utils

fingerCoordinates = [(8, 6), (12, 10), (16, 14), (20, 18)]
thumbCoordinate = (4, 2)

while True:
    success, img = cap.read()
    imgRGB = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
    results = hands.process(imgRGB)
    multiLandMarks = results.multi_hand_landmarks

    if multiLandMarks:
        handPoints = []
        for handLms in multiLandMarks:
            mpDraw.draw_landmarks(img, handLms, mpHands.HAND_CONNECTIONS)

            for idx, lm in enumerate(handLms.landmark):
                # print(idx, lm)
                h, w, c = img.shape
                cx, cy = int(lm.x * w), int(lm.y * h)
                handPoints.append((cx, cy))

        for point in handPoints:
            cv2.circle(img, point, 10, (0, 0, 255), cv2.FILLED)

        upCount = 0
        for coordinate in fingerCoordinates:
            if handPoints[coordinate[0]][1] < handPoints[coordinate[1]][1]:
                upCount += 1
        if handPoints[thumbCoordinate[0]][0] > handPoints[thumbCoordinate[1]][0]:
            upCount += 1

        cv2.putText(img, str(upCount), (150, 150), cv2.FONT_HERSHEY_PLAIN, 12,
                    (255, 0, 0), 12)

    cv2.imshow("Finger Counter", img)
    cv2.waitKey(1)

Output:

Result:

Thus the program to recognize hand gesture is executed successfully and output is
verified.

ADDITIONAL EXPERIMENTS
Exp. No.8

EDGE DETECTION

Aim:

To develop a program for detecting the edges of an image.

Algorithm:
1. Import the OpenCV library:
• The code starts by importing the OpenCV library.
2. Load an image:
• It loads an image named "penguin.jpg" using cv2.imread and assigns it to the
image variable.
3. Apply Canny edge detection:
• The Canny edge detection algorithm is applied to the loaded image using the
cv2.Canny function. The parameters 200 and 300 are used as the low and high
thresholds, respectively, for edge detection.
4. Save the resulting image:
• The edge-detected image is saved under the name 'edges_Penguins.jpg' using
cv2.imwrite.
5. Display the edge-detected image:
• The code uses cv2.imshow to display the edge-detected image.
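The two thresholds drive Canny's hysteresis step: gradients above the high threshold become strong edges, gradients below the low threshold are discarded, and in-between pixels are kept only when connected to a strong edge. A minimal sketch comparing a looser setting against the manual's values:

import cv2

image = cv2.imread("penguin.jpg")
loose = cv2.Canny(image, 50, 150)    # lower thresholds: more (noisier) edges
strict = cv2.Canny(image, 200, 300)  # the manual's values: fewer, cleaner edges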

Program:

import cv2

image = cv2.imread("penguin.jpg")
cv2.imwrite('edges_Penguins.jpg', cv2.Canny(image, 200, 300))

cv2.imshow('edges', cv2.imread('edges_Penguins.jpg'))
cv2.waitKey(0)

Output:
Result:

Thus the program for detecting the edges of an image is executed successfully and output
is verified.

Exp. No.9

SMOOTHING AND BLURRING

Aim:

To develop a program to apply smoothing and blurring to an image.

Algorithm:

1. Import OpenCV and NumPy:
• The code starts by importing the OpenCV library as cv2 and the NumPy library
as np.
2. Read an image:
• It reads an image from the file path "E:\Backup 14.4.23\image\lab\pen.jpg"
using cv2.imread and stores it in the image variable.
3. Create a kernel for averaging (blur):
• The code defines a 5x5 kernel of ones (all values set to 1) using NumPy.
• The division by 25 is to normalize the kernel so that the sum of the values is 1,
making it an average filter.
4. Apply the filter to the image:
• The cv2.filter2D function is used to apply the filter to the input image. It takes
the source image (image), the depth (ddepth), and the kernel (kernel2) as
parameters. The ddepth of -1 indicates that the output image should have the
same depth as the input image.
5. Display the original and filtered images:
• The code displays both the original image and the filtered (blurred) image using
cv2.imshow.
6. Wait for a key press and close the windows:
• The code waits for a key press with cv2.waitKey().
• It then closes all OpenCV windows using cv2.destroyAllWindows().
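Since the normalized 5x5 kernel of ones is exactly a box (averaging) filter, the same result can be expected from OpenCV's built-in cv2.blur; a minimal sketch of that equivalence, using the same image path as the program below:

import cv2
import numpy as np

image = cv2.imread(r"E:\Backup 14.4.23\image\lab\pen.jpg")
kernel = np.ones((5, 5), np.float32) / 25
manual = cv2.filter2D(src=image, ddepth=-1, kernel=kernel)
builtin = cv2.blur(image, (5, 5))

# both are 5x5 box filters; any difference is at most a rounding artifact
print(cv2.absdiff(manual, builtin).max())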

Program:

import cv2
import numpy as np

# Reading the image
image = cv2.imread(r"E:\Backup 14.4.23\image\lab\pen.jpg")

# Creating a 5x5 averaging kernel
kernel2 = np.ones((5, 5), np.float32) / 25

# Applying the filter
img = cv2.filter2D(src=image, ddepth=-1, kernel=kernel2)

# Showing the original and the filtered image
cv2.imshow('Original', image)
cv2.imshow('Kernel Blur', img)

cv2.waitKey()
cv2.destroyAllWindows()

Output:

[Two windows are displayed: the original image and the blurred image.]

Result:
Thus the program to apply smoothing and blurring to an image is executed successfully
and output is verified.
