Chapter One-3

Class note

Uploaded by

Solomonic Eman

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views

Chapter One-3

Class note

Uploaded by

Solomonic Eman

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

Chapter One

1.1 What is Computer Vision?

Computer vision is a field of Artificial Intelligence (AI) that enables computers and
systems to derive meaningful information from digital images, videos and other visual
inputs — and take actions or make recommendations based on that information. If AI
enables computers to think, computer vision enables them to see, observe and understand.

1.1.1 How does Computer Vision Work?

Computer vision mainly performs three steps(figure 1), which are:
1. Capturing an Image
A computer vision software or application always includes a digital camera or
CCTV to capture the image. So, firstly it captures the image and puts it as a digital
file that consists of Zero and one's.
2. Processing the image
In the next step, different CV algorithms are used to process the digital data stored
in a file. These algorithms determine the basic geometric elements and generate
the image using the stored digital data.
3. Analyzing and taking required action
Finally, the CV analyses the data, and according to this analysis, the system takes
the required action for which it is designed.

Figure 1:Steps of Computer Vision

1.2 What is Image?
An image is defined as a two-dimensional function(x,y), where x and y are spatial(plane)
coordinates, and the amplitude of F at any pair of coordinates (x,y) is called the intensity
of that image at that point. When x,y, and amplitude values of F are finite, we call it a
digital image.

1.3 Related Fields in Computer Vision

Computer vision is a multidisciplinary field that intersects with various other areas of
study. Here are some related fields in computer vision:
1. Machine Learning and Deep Learning: Machine learning and deep learning are
essential components of computer vision. Techniques such as convolutional neural
networks (CNNs) and recurrent neural networks (RNNs) are widely used for
image and video analysis.
2. Image Processing: Image processing deals with manipulating and enhancing
images to improve their quality or extract useful information. Many computer
vision applications involve image processing techniques.
3. Pattern Recognition: Pattern recognition is the study of recognizing patterns
within data. In computer vision, this involves recognizing objects, shapes, or
gestures in images or videos.
4. Robotics: Computer vision plays a significant role in robotics, enabling robots to
perceive and navigate their environment. It's used in tasks like object detection,
localization, and mapping (SLAM).
5. Natural Language Processing (NLP): Combining computer vision with NLP
allows for multimodal analysis, where both text and visual data are used together,
enabling applications like image captioning and sentiment analysis of images.
6. Biometrics: Biometrics involves recognizing individuals based on their unique
physical or behavioral characteristics. Face recognition and fingerprint analysis
are examples of computer vision applications in biometrics.
7. Augmented Reality (AR) and Virtual Reality (VR): Computer vision is
essential in AR and VR for tracking the user's movements and the environment to
provide an immersive experience.
8. Medical Imaging: In the field of medical imaging, computer vision is used for
tasks like image segmentation, tumor detection, and disease diagnosis through the
analysis of medical images like X-rays, MRIs, and CT scans.
9. Autonomous Vehicles: Computer vision is a key technology in the development
of self-driving cars. It's used for tasks like lane detection, object detection, and
path planning.
10. Remote Sensing: Remote sensing involves collecting data about the Earth's
surface from a distance, often using satellite imagery. Computer vision is used for
land cover classification, monitoring natural disasters, and more.
11. Agriculture: In precision agriculture, computer vision is used for crop
monitoring, yield prediction, and disease detection in plants.
12. Security and Surveillance: Computer vision is used for video surveillance, facial
recognition, and anomaly detection in security applications.
13. Entertainment and Gaming: The gaming industry often uses computer vision for
motion capture, gesture recognition, and creating immersive experiences.
14. Geographic Information Systems (GIS): Computer vision is used to extract
geographical information from images and satellite data, assisting in map creation
and geospatial analysis.
15. Artificial Intelligence Ethics and Bias: As computer vision systems become
more pervasive, ethical considerations related to privacy, bias, and fairness in AI
are gaining importance.
16. Hardware Development: The development of specialized hardware, such as
GPUs (Graphics Processing Unit) and TPUs (Tensor Processing Unit), plays a
crucial role in accelerating the computation required for computer vision tasks.
These fields often overlap, and interdisciplinary research and collaboration are
common in computer vision, as solutions to real-world problems frequently require expertise
from multiple domains.

1.4 Computer Vision vs Image Processing

Computer vision and image processing are closely related but distinct fields within the
domain of visual information processing. Here's a breakdown of their differences:
1.4.1 Computer Vision
Goal: Computer vision focuses on enabling computers to interpret and understand the visual
world, much like human vision. It's about replicating human vision and cognition in
machines.
Tasks: Computer vision involves higher-level tasks such as object recognition, scene
understanding, image understanding, and even making decisions based on visual input. This
can include tasks like object detection, image segmentation, facial recognition, and 3D
reconstruction.
Complexity: Computer vision tasks often involve understanding the context and semantics of
images. It requires higher-level reasoning and can involve machine learning and AI
techniques, including deep learning.
Applications: Computer vision is used in various applications, including autonomous
vehicles, robotics, medical image analysis, and augmented reality. It often deals with real-
world, dynamic environments.
Example: An autonomous car using computer vision might identify pedestrians, other
vehicles, traffic signs, and road conditions to make driving decisions.

1.4.2 Image Processing

Goal: Image processing is primarily focused on manipulating and enhancing images to
extract information or improve image quality. It deals with the modification of the visual
content itself.
Tasks: Image processing tasks include tasks like image filtering, noise reduction, image
sharpening, contrast enhancement, and color correction. It's often concerned with low-level
operations on images.
Complexity: Image processing typically involves simpler operations that are applied to
individual pixels or small neighborhoods of pixels within an image. It's not concerned with
high-level interpretation or recognition.
Applications: Image processing is used in applications like image compression, medical
image enhancement, and improving the quality of photos. It's commonly applied in scenarios
where the goal is to improve the visual quality or extract specific information from images.
Example: A medical image processing application might enhance the contrast in an X-ray
image to make it easier for a radiologist to identify abnormalities.
In summary, computer vision is broader and more complex, focusing on
understanding and interpreting visual data to make higher-level decisions, often utilizing
artificial intelligence and machine learning techniques. Image processing, on the other hand,
is more about enhancing and manipulating the visual content to improve its quality or extract
specific information, usually using low-level operations. While they have distinct goals and
applications, they can also complement each other, with image processing techniques often
being a part of the preprocessing step in computer vision tasks to prepare images for higher-
level analysis.

1.5 Applications of Computer vision and Image processing

Computer vision and image processing have a wide range of applications across
various industries. Here are some of the key applications in both fields:

1.5.1 Applications of Computer Vision:

Object Recognition and Detection: Computer vision is used to recognize and locate objects
in images or video streams, such as pedestrian detection in autonomous vehicles, facial
recognition for security, and barcode scanning in retail.
Image Segmentation: This involves dividing an image into meaningful parts, which is used
in medical image analysis, industrial quality control, and scene understanding in robotics.
Gesture Recognition: Computer vision enables the interpretation of hand and body gestures,
which is useful in applications like sign language translation, gaming, and human-computer
interaction.
Face Recognition: Facial recognition is widely used in security, access control, and
authentication systems, as well as in social media for tagging photos.
Optical Character Recognition (OCR): OCR technology is used to convert printed or
handwritten text in images into machine-readable text. Applications include digitizing printed
documents, license plate recognition, and reading text from images for the visually impaired.
Robotics: Computer vision plays a crucial role in enabling robots to perceive their
environment and navigate, allowing them to perform tasks like object manipulation and
autonomous navigation.
Augmented Reality (AR) and Virtual Reality (VR): AR and VR applications use computer
vision for tracking users' movements and the environment, enabling immersive experiences
and overlaying digital information on the real world.
Medical Image Analysis: Computer vision is used in medical imaging to analyze X-rays,
MRIs, CT scans, and other medical images for tasks like tumor detection, organ
segmentation, and disease diagnosis.
Autonomous Vehicles: Self-driving cars rely on computer vision for tasks like lane
detection, obstacle detection, and road sign recognition.
Agriculture: Computer vision is used for crop monitoring, yield prediction, and disease
detection in precision agriculture.
Industrial Quality Control: Computer vision systems are used to inspect and ensure the
quality of manufactured products, such as checking for defects in electronics, textiles, and
food items.

1.5.2 Applications of Computer Vision:

Image Enhancement: Image processing techniques are used to enhance the quality of
images, improve contrast, reduce noise, and adjust brightness and color levels.
Image Compression: Compression algorithms are used to reduce the size of images for
efficient storage and transmission, as in JPEG and PNG formats.
Image Restoration: Restoring old or damaged images by removing scratches, dust, or other
imperfections is a common application.
Image Filtering: Techniques like edge detection and blurring are used for various
applications, including feature extraction and special effects in photography.
Color Correction: Adjusting the color balance, saturation, and contrast of images to improve
their visual quality in photography and video editing.
Morphological Operations: These are used in applications like document processing and
shape analysis to extract or manipulate image features.
Image Registration: Aligning multiple images of the same scene, often used in medical
imaging for comparing images taken at different times or with different modalities.
Panorama Stitching: Combining multiple images to create a single, wide-angle or
panoramic view, used in photography and virtual tours.
Biometric Identification: Image processing is used in biometric systems for fingerprint
recognition, iris scanning, and hand geometry.
Remote Sensing: Analyzing satellite and aerial imagery for applications in agriculture, urban
planning, and environmental monitoring.
Both computer vision and image processing have a significant impact on numerous
industries, making tasks more efficient, accurate, and accessible, and enabling a wide range
of innovative applications.

1.6 Image processing Example

Image Denoising:
Description: Image denoising is the process of removing unwanted noise or artifacts
from an image, resulting in a cleaner and more visually appealing representation of the
original content. Noise can be caused by factors such as low lighting conditions, sensor
limitations in cameras, or transmission errors in digital images.
Image Processing Techniques:
Filtering: Filtering techniques such as median filtering or Gaussian smoothing are commonly
used to reduce noise. Median filtering replaces each pixel value with the median value of its
local neighborhood, which can effectively remove impulsive noise. Gaussian smoothing
applies a weighted average to each pixel's neighborhood, reducing high-frequency noise.
Wavelet Transform: The wavelet transform is used for multiresolution analysis, and it helps
to separate image content from noise in different scales and orientations. This makes it useful
for denoising applications.
Non-local Means Denoising: This technique compares similar patches of the image to
estimate the noise level and then reduces noise while preserving image details. It is effective
in reducing random noise.
Example:
Imagine you have a digital photograph taken in low light conditions. As a result, the image
contains noticeable speckles, which are a type of noise. To improve the image, you can apply
image denoising techniques:
Gaussian Smoothing: You apply a Gaussian filter to the image, which blurs it slightly,
reducing high-frequency noise. The resulting image appears smoother and cleaner.
Wavelet Transform: Using a wavelet denoising algorithm, you decompose the image into
different scales and remove noise in the high-frequency components while preserving the
main features.
Non-local Means Denoising: This technique identifies similar patches in the image and
averages them, reducing noise while maintaining the essential image features.
After applying these techniques, you obtain a cleaner, less noisy version of the original
image, making it more visually appealing and suitable for various applications, such as
photography, medical imaging, or video processing.
Image denoising is just one of the many image processing applications, demonstrating how
this field plays a critical role in improving image quality and making images more suitable
for analysis, visualization, or further processing.
1.7 Fundamentals steps in image processing
The fundamental steps in image processing typically include the following stages,
which are essential for various image manipulation and analysis tasks:
1. Image Acquisition: Image acquisition is the process of capturing an image using
devices like cameras, scanners, or sensors. It’s the first step where the raw data
(pixels) is obtained from the real world or a digital source.
2. Preprocessing: Image preprocessing involves operations to prepare the image for
subsequent analysis or enhancement.
Common preprocessing steps include:
Image Scaling: Resizing the image to a specific dimension.
Image Cropping: Removing unwanted parts of the image.
Color Conversion: Converting between color spaces (e.g., RGB to grayscale).
Noise Reduction: Reducing noise using filters or techniques like averaging.
Image Enhancement: Adjusting contrast, brightness, and sharpness.
3. Image Segmentation:
Image segmentation divides the image into meaningful regions or objects.
It helps identify and isolate areas of interest within the image.
Techniques like thresholding, edge detection, or clustering are used for segmentation.
4. Feature Extraction:
Feature extraction involves selecting relevant information or attributes from the
segmented regions.
These features can be characteristics like color, texture, shape, or edges.
Feature extraction is crucial for subsequent analysis and classification tasks.
5. Image Analysis:
Image analysis encompasses tasks like object recognition, classification, or
measurement.
This step uses the extracted features to derive information or make decisions.
Machine learning, pattern recognition, and computer vision techniques are often
employed here.

6. Post-processing:
Post-processing may involve further adjustments or operations to refine the results.
Techniques like filtering, morphological operations, and spatial transformations can
be applied.
7. Visualization and Display:
The final processed image can be displayed, saved, or transmitted for interpretation
and decision-making.
Visualization can include generating visual representations, reports, or interactive
displays.
8. Storage and Management:
Processed images are often stored in databases or file systems for future reference and
retrieval.
Effective management ensures that images are organized and accessible.
9. Interpretation and Decision-Making:
The ultimate goal of image processing is often to extract information, make decisions,
or gain insights from the processed images.
Interpretation can be performed by humans or automated systems.
These steps represent a typical image processing workflow. However, the specific
techniques and tools used in each step can vary based on the application and the desired
outcome. Moreover, advancements in deep learning and neural networks have led to more
automated and integrated pipelines for image analysis, where the feature extraction and
analysis stages may overlap or be performed jointly.

Image Processing Answers
No ratings yet
Image Processing Answers
6 pages
Chapter One
No ratings yet
Chapter One
47 pages
CV Unit 1
No ratings yet
CV Unit 1
30 pages
1 Intro to CV
No ratings yet
1 Intro to CV
76 pages
CV 1
No ratings yet
CV 1
21 pages
Unit 1
No ratings yet
Unit 1
20 pages
CV&IP Chapter 1
No ratings yet
CV&IP Chapter 1
27 pages
UNIT-I_Introduction to Computer Vision
No ratings yet
UNIT-I_Introduction to Computer Vision
45 pages
CVIP-Module-01-Reviewer
No ratings yet
CVIP-Module-01-Reviewer
20 pages
Computer Vision
No ratings yet
Computer Vision
2 pages
Computer Vision
No ratings yet
Computer Vision
10 pages
Computer Vision
No ratings yet
Computer Vision
30 pages
Image Processing & Computer Vision
No ratings yet
Image Processing & Computer Vision
21 pages
Chapter One
No ratings yet
Chapter One
17 pages
Class - Notes Computer Vision
No ratings yet
Class - Notes Computer Vision
3 pages
Computer Vision
No ratings yet
Computer Vision
10 pages
Introduction of Computer Vision
No ratings yet
Introduction of Computer Vision
5 pages
Computer Vision Advancement Rebecca
No ratings yet
Computer Vision Advancement Rebecca
17 pages
Computer Vision
No ratings yet
Computer Vision
14 pages
computer-vision-al-701
No ratings yet
computer-vision-al-701
50 pages
Chapter 1 PDF
No ratings yet
Chapter 1 PDF
17 pages
Image Processing and Computer Vision (Notes)
No ratings yet
Image Processing and Computer Vision (Notes)
64 pages
How Computer Vision is Used in Everyday Life
No ratings yet
How Computer Vision is Used in Everyday Life
5 pages
six applications (ai project)
No ratings yet
six applications (ai project)
9 pages
Lec 1 - 2
No ratings yet
Lec 1 - 2
39 pages
Raz Report Final
No ratings yet
Raz Report Final
37 pages
DOC-20241121-WA0194.
No ratings yet
DOC-20241121-WA0194.
7 pages
Computer Vision Powerpoint Presentation PDF
No ratings yet
Computer Vision Powerpoint Presentation PDF
10 pages
Computer Vision Report
No ratings yet
Computer Vision Report
31 pages
computer vision copy
No ratings yet
computer vision copy
28 pages
IT5409 Ch1 Intro
No ratings yet
IT5409 Ch1 Intro
14 pages
New Seminar
No ratings yet
New Seminar
11 pages
Ai Pra
No ratings yet
Ai Pra
15 pages
AI and Computer Vision Bundle
No ratings yet
AI and Computer Vision Bundle
75 pages
Chapter 1 [CV & IP]
No ratings yet
Chapter 1 [CV & IP]
41 pages
Computer Vision and Image Processing
No ratings yet
Computer Vision and Image Processing
23 pages
Introduction To CVIP
No ratings yet
Introduction To CVIP
33 pages
Image Processing and Computer Vision: Goals
No ratings yet
Image Processing and Computer Vision: Goals
14 pages
Ch01_Introduction_to_computer_vision_and_image_processing_1 (1)
No ratings yet
Ch01_Introduction_to_computer_vision_and_image_processing_1 (1)
29 pages
IT5409 Ch1 Intro New Template
No ratings yet
IT5409 Ch1 Intro New Template
14 pages
108103174
No ratings yet
108103174
1,559 pages
Computer Vision Presentation Updated
No ratings yet
Computer Vision Presentation Updated
15 pages
Computer Vision
No ratings yet
Computer Vision
10 pages
two
No ratings yet
two
4 pages
Image Processing and Computer Vision Both Are Very Exciting Field of Computer Science
No ratings yet
Image Processing and Computer Vision Both Are Very Exciting Field of Computer Science
56 pages
502355296-Computer-Vision-Presentation-AI
No ratings yet
502355296-Computer-Vision-Presentation-AI
16 pages
A Comprehensive Guide to Computer Vision
No ratings yet
A Comprehensive Guide to Computer Vision
6 pages
The Fascinating Field of Computer Vision
No ratings yet
The Fascinating Field of Computer Vision
8 pages
Computer Vision: Chapter 1: Introduction
No ratings yet
Computer Vision: Chapter 1: Introduction
7 pages
A Brief Introduction To Computer Vision
No ratings yet
A Brief Introduction To Computer Vision
3 pages
Computer Vision
No ratings yet
Computer Vision
13 pages
Computer Vision
No ratings yet
Computer Vision
13 pages
Computer Vision Class Notes
No ratings yet
Computer Vision Class Notes
4 pages
Group 17 Computer Vision @Lcd-1
No ratings yet
Group 17 Computer Vision @Lcd-1
25 pages
Idt
No ratings yet
Idt
15 pages
CV
No ratings yet
CV
2 pages
Unit_4_Notes_1
No ratings yet
Unit_4_Notes_1
3 pages
Computer Vision
No ratings yet
Computer Vision
8 pages
CV_UNIT_1
No ratings yet
CV_UNIT_1
17 pages
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet
Smart Camera: Revolutionizing Visual Perception with Computer Vision
From Everand
Smart Camera: Revolutionizing Visual Perception with Computer Vision
Fouad Sabry
No ratings yet
Chapter 3 Syntax Analysis Full Reading Material
No ratings yet
Chapter 3 Syntax Analysis Full Reading Material
76 pages
02 CSS
No ratings yet
02 CSS
26 pages
Chapter Four - Dynamic Programming
No ratings yet
Chapter Four - Dynamic Programming
85 pages
School Ofinformatics and Electrical Enginering Department of Computer Science
No ratings yet
School Ofinformatics and Electrical Enginering Department of Computer Science
70 pages
CSC566 - A3 - Wan Azra Sofea Batrisyia Binti Jasman
No ratings yet
CSC566 - A3 - Wan Azra Sofea Batrisyia Binti Jasman
6 pages
Review Questions in Photography
No ratings yet
Review Questions in Photography
43 pages
Abstract Blue Cube PowerPoint Templates
No ratings yet
Abstract Blue Cube PowerPoint Templates
49 pages
Logbook
No ratings yet
Logbook
72 pages
Chapter 1
No ratings yet
Chapter 1
58 pages
Mako G DataSheet G-507 V1.2.2 en
No ratings yet
Mako G DataSheet G-507 V1.2.2 en
5 pages
Konica R 2 PDF
No ratings yet
Konica R 2 PDF
10 pages
SAMSUNG TFT Pantallas LCD PDF
No ratings yet
SAMSUNG TFT Pantallas LCD PDF
55 pages
Photoshop Post-Processing Master 3D Renders - Tutorial
100% (1)
Photoshop Post-Processing Master 3D Renders - Tutorial
13 pages
Landsat 7 Natural Colors
No ratings yet
Landsat 7 Natural Colors
33 pages
Correction
No ratings yet
Correction
24 pages
DM 10203 PPT
No ratings yet
DM 10203 PPT
11 pages
Digital Image Processing: Lab 4: Image Enhancement in The Spatial Domain
No ratings yet
Digital Image Processing: Lab 4: Image Enhancement in The Spatial Domain
22 pages
Ghost Adventures S13e04 Reseda House of Evil HDTV x264-w4f
No ratings yet
Ghost Adventures S13e04 Reseda House of Evil HDTV x264-w4f
2 pages
BMI - General Overview & Products - Compact Rev06
No ratings yet
BMI - General Overview & Products - Compact Rev06
68 pages
The LEE System Brochure
No ratings yet
The LEE System Brochure
33 pages
Comp Science Presentation.
No ratings yet
Comp Science Presentation.
14 pages
Lecture#1-Introduction To DIP
No ratings yet
Lecture#1-Introduction To DIP
19 pages
Geometric Calibration of The Hasselblad H3D Medium Format Camera
No ratings yet
Geometric Calibration of The Hasselblad H3D Medium Format Camera
6 pages
DSJ-NA Explosion proof body camera introduction1(2025-01-13 01_47_51)
No ratings yet
DSJ-NA Explosion proof body camera introduction1(2025-01-13 01_47_51)
6 pages
KMS2404055 - Fatima Ismael - Gani L. Abpi College - P3.91 Outdoor Rental Type
No ratings yet
KMS2404055 - Fatima Ismael - Gani L. Abpi College - P3.91 Outdoor Rental Type
2 pages
The Complete Guide To Macro and Close Up Photography 1st Edition Cyrill Harnischmacher All Chapter Instant Download
100% (7)
The Complete Guide To Macro and Close Up Photography 1st Edition Cyrill Harnischmacher All Chapter Instant Download
70 pages
Relationship Between Pixels
No ratings yet
Relationship Between Pixels
22 pages
Compatibility Chart Manfrotto MVG220
No ratings yet
Compatibility Chart Manfrotto MVG220
19 pages
ISO Meets Aperture and Shutter Speed: Before and After The Click Photography Training
No ratings yet
ISO Meets Aperture and Shutter Speed: Before and After The Click Photography Training
8 pages
MX9C Color Doppler System
No ratings yet
MX9C Color Doppler System
4 pages
CG 7
No ratings yet
CG 7
8 pages
(Ebook) Speedliter's Handbook: Learning to Craft Light with Canon Speedlites by Syl Arena ISBN 9780321711052, 032171105X - The full ebook with complete content is ready for download
100% (2)
(Ebook) Speedliter's Handbook: Learning to Craft Light with Canon Speedlites by Syl Arena ISBN 9780321711052, 032171105X - The full ebook with complete content is ready for download
55 pages
Food Photography 101: Mastering Manual Mode
No ratings yet
Food Photography 101: Mastering Manual Mode
5 pages