Conference Paper

The document discusses a project on real-time object detection using Deep Learning, specifically through Convolutional Neural Networks (CNN) and OpenCV. It highlights the importance of this technology for aiding visually challenged individuals and its applications in various fields such as self-driving cars and video surveillance. The methodology involves training a dataset to identify objects in images and videos, ultimately producing labeled outputs with bounding boxes around detected objects.

Uploaded by

dhana

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views3 pages

Conference Paper

Uploaded by

dhana

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Real-Time Object Detection using Deep Learning

S . Dhanalakshmi , Excel Engineering College (Autonomous) , Komarapalayam.

Abstract
Central the convolutional neural network is the convolutional layer that gives
Real time object detection is a vast, vibrant and sophisticated
the network its name. This layer performs an operation known as
area of computer vision aimed towards object identification and
“convolution”.
recognition. Object detection detects the semantic objects of a class
In the context of a convolutional neural network, a convolution may be a
objects using OpenCV (Open source Computer Vision), which is a
linear operation that involves the multiplication of a group of weights with the
library of programming functions mainly trained towards real time
input, very similar to a standard neural network. as long as the technique was
computer visionin digital images and videos. Visually challenged
designed for two-dimensional input, the multiplication is performed between
people cannot distinguish the objects around them. The main aim
an array of input file and a two-dimensional array of weights, called a filter or
behind this real time object detection is to help the blind to overcome
a kernel.
their difficulty. Real time object detection finds its uses in the areas
like tracking objects, video surveillance, pedestrian detection, people
The filter is smaller than the input file and therefore the before the sort of
counting, self-driving cars, face detection, ball tracking in sports and
multiplication applied between a filter-sized patch of the input and the filter
many more. This is achieved using Convolution Neural Networks,
may be a scalar product. A scalar product is that the element-wise
which is a representative tool of Deep learning. This project acts as an
multiplication between the filter-sized patch of the input and filter, which is
aiding tool for visually challenged people
then summed, always leading to one value. Because it leads to 1 value, the
operation is conventionally represented and mentioned because the
Keywords: Convolutional Neural Network, OpenCV, Deep Learning.
“scalar product”.

Using a filter smaller than the input is intentional because it allows an

I. INTRODUCTION equivalent filter (set of weights) to be multiplied by the input array multiple
Object detection is a technology to detect various objects in digital times at distinct points on the input. Specifically, the filter is applied
images and videos too. It is mainly helpful within the self- driving cars, systematically to every overlapping part or filter-sized patch of the input file,
face detection, etc., where the objects are to be continuously monitored. left to right, top to bottom.
The algorithm or the technique involved for object detection during this
project is Convolutional Neural Networks which is a class of Deep
learning. This uses MobileNet SSD technique during which MobileNetis
a neural network used for image classification and recognition whereas
SSD is a framework that is used to realize the multibox detector. The Input CN Output
mixture of both MobileNet and SSD can do object detection. The main image
advantage or purpose of choosing Deep learning is that we do not need
to do feature extraction from data as compared to machine learning. npiinn

The Haar-like trait play a crucial role in detecting the objects in a picture.
This systematic application of an equivalent filter across a picture may be a
They scan the entire picture starting from the top left and compares
powerful idea. If the filter is meant to detect a selected sort of feature within
every small box with the trained data. In this way, even small-detailed
the input, then the appliance of that filter systematically across the whole
objects present within the imagesare identified.
input image allows the filter a chance to get that feature anywhere within the
image.
II. METHODOLOGY This capability is usually represented and mentioned as translation invariance,
e.g. the total altogether concern in whether the feature is present instead of
Deep learning, a subset of machine learning which in turn is a subset of where it should had been present.
artificial intelligence (AI) has networks capable of learning things from
the data that is unstructured or unlabeled. The approach utilized in this
project is Convolutional Neural Networks (CNN). It uses the Haar-
cascade classifiers which help us in the detection of objects.

1. CNN:
The convolutional neural network, or CNN for brief, could also be a
specialized kind of neural network model designed for working with
two-dimensional image data, although they're going to be used with one-
dimensional and three-dimensional data.

Fig1.2. Image classification using CNN

2. OpenCV:
Open CV stands for open source computer vision. it's a group of libraries all the thing s are identified and every object is surrounded by an oblong box
in Python. it's a tool by which we will be able to manipulate the pictures , and therefore the name of the object is additionally displayed. we'll be only
like image scaling, etc. This supports and helps us in developing real observing the output video stream but not the input video stream.
time computing applications. It mainly concentrates and targets on image
processing, video capture and analysis. It includes several features like
face detection and also object detection. Currently OpenCV supports III. RESULT
differing types of programming languages like C++, Python, Java etc., Here, in this project we’ve considered around 15 to 20 objects to be detected
and it's available on various platforms including Windows, Linux, OS X, during the training. Some of those include ‘person’, ‘car’, ‘train’, ‘bird’,
Android ‘sofa’, ‘dog’, ‘’plant’, ‘aero plane’, ‘bicycle’, ‘bus’, ‘motorbike’, etc.
etc.
The output of this project displays the objects detected with a rectangular box
around the object with a label indicating it’s name and therefore the exactness
3. Training the data set:
with which the object has been detected on the top of it. It can dig out any
The data set is typically the gathering of knowledge . the info set could
number of objects existing during a single image with certainty
also be collection of images or alphabets or numbers or documents and
files too. the info set we used for the thingdetection is that the collection
of images of all the objects that are to be identified. Several different
images of every and each object is typically present within the data set.
If there are more number of images like each object within the datasets
then the accuracy are often improved. The important thing that's to be
remembered is that the info within the data set must be labelled. there'll
be actually 3 data set. they're the training data set, the validation dataset
and therefore the other one is testing data set. The training data set will
usually contains around 85-90% of the entire labelled data. This training
dataset are going to be training our machine and therefore
the model is obtained by training the info set. The validation data set
consists of around 5-10% of the entire labelled data.

4. Developing a real time object detector:

For developing a true time object detector using deep learning and open
cv we'd like to access our web cam during a really effective way then the
thing detection is to be applied to each and every frame. we should
always install open cv in our systems.The deep neural network module
should be installed. Firstly, we should always always import all the
specified packages:

1. From imutils.video we'll import VideoStream

2. From imutils.video we'll import FPS
3. we'll import numpy as np
4. we'll import argparse
5. we'll import imutils
6. we'll import time
7. we'll import cv2

The next step is to construct the argument parse then we should always
parse the arguments.
--prototxt: provide path to the Caffe prototxt file.
--model: provide path to the pre-trained model.
--confidence: The minimum probability threshold to filter weak
detections. The default value is given as 20%.
The next step is to initialize CLASS labels and corresponding random

COLORS.
Each object when it's detected, it's surrounded by a box with some
predefined colour. Thus, we assign each object a specific color.
After that we'll load our model and that we will provide the regard to our
prototxt and also to our model files. With the assistance of imutils we'll
read the video and that we will set the amount of frames per second.
Now with this some predefined number of frames are going to be loaded
per second. Eachframe is analogous to the image. Now these images are
going to be given because the inputs to the model. The model will
process the input image and produces the output image which consists of
labels. in additional practical sense the input raw image is given to the
model. Now the model process the input image. within the output image
APPLICATION
VI. REFERENCES
Here are a some of the future implementation of object detection. 1. Geethapriya S, N. Duraimurugan, S.P. Chokkalingam, “Real-Time Object
1. Face detections and recognition: Detection with Yolo”, International
Face detection perhaps be a separate class of object detection. We Journal of Engineering and Advanced Technology (IJEAT)
wonder how some applications like Facebook, Faceapp, etc., detect and 2. Abdul Vahab, Maruti S Naik, Prasanna G Raikar an Prasad S R4,
recognize our faces. this is often a sample example of object detection in “Applications of Object Detection System”, International Research Journal of
our day to day life. Face detection is already in use in our lifestyle to Engineering and Technology (IRJET)
unlock our mobile phones and for other security systems to scale back 3. Hammad Naeem, Jawad Ahmad and Muhammad Tayyab, “Real-Time
rate . Object Detection and Tracking”,IEEE
4. Meera M K, & Shajee Mohan B S. 2016, "Object recognition in images",
International Conference on InformationScience (ICIS).
2. Object tracking: 5. Astha Gautam, Anjana Kumari, Pankaj Singh: "The Concept of
Object detection is additionally utilized in tracking objects like tracking Object Recognition", International Journal of Advanced Research in
an individual and his actions, continuously Computer Science and Software Engineering, Volume 5, Issue 3,
March 2015
monitoring a ball within the game of Football or Cricket. As there's an 6. Joseph Redmon, Santosh Divvala, Ross Girshick, “You Only
enormous interest for people in these games, these tracking techniques Look Once: Unified, Real-Time Object Detection”, The IEEE
enables them to know it during a better way and obtain some additional Conference on Computer Vision and Pattern Recognition
information. Tracking of the ball is of maximal importance in any ball- (CVPR),2016,pp. 779-
based games to automatically record the movement of the ball and adjust 788
the video frame accordingly. 7. V. Gajjar, A. Gurnani and Y. Khandhediya, "Human Detection and
Tracking for Video Surveillance: A Cognitive Science Approach," in
3. Self-driving cars: 2017 IEEE International Conference on Computer Vision Workshops,
this is often one among the main evolutions of the planet and is that the
2017.
best example why we'd like object detection. so as for a car to travel to
the specified destination automatically with none human interference or
to form decisions whether to accelerate or to use brakes and to spot the
objects around it. this needs object detection.

4. Emotions detection:
this permits the system to spot the type of emotion the person puts on his
face. the corporate Apple has already tried to use this by detecting the
emotion of the user and converting it into a respective emoji within the
smart phone.

5. Biometric identification through retina scan:

Retina scan through iris code is one among the techniques utilized in
high security systems because it is one among the
foremost accurate and unique biometric.

6. Smart text search and text selection (Google lens)

In recent times, we've encountered an application in smart phones called
google lens. this will recognize the text and also images and search the
relevant information within the browser without much effort.

V. CONCLUSION
Deep-learning based object detection has been a search hotspot in recent
years. This project starts on generic object detection pipelines which
give base architectures for other related tasks. With the assistance of this
the 3 other common tasks, namely object detection, face detection and
pedestrian detection, are often accomplished. Authors accomplished this
by combing 2 things: Object detection with deep learning and OpenCV
and Efficient, threaded video streams with OpenCV. The camera sensor
noise and lightening condition can change the result because it can create
problem in recognizing the objects. generally, this whole process
requires GPU’s rather than CPU’s. But we’ve done using CPU’s and
executes in much less time, making it efficient. Object Detection
algorithms act as a mixture of both image classification and object
localization. It takes the given image as input and produces the output
having the bounding boxes adequate to the amount of objects present
within the image with the category label attached to every bounding box
at the highest. It projects the scenario of the bounding box up the shape
of position, height and width.

Big Science 1 Student Book
100% (2)
Big Science 1 Student Book
112 pages
Grade 9 Tech, Math & NS 2024 - Teacher - S Book - MST
100% (9)
Grade 9 Tech, Math & NS 2024 - Teacher - S Book - MST
27 pages
Cat and Dog Classification Using CNN Fin
No ratings yet
Cat and Dog Classification Using CNN Fin
34 pages
PE 12 Module 7 For Student
67% (3)
PE 12 Module 7 For Student
21 pages
Final Report Yolo Voice
No ratings yet
Final Report Yolo Voice
94 pages
Part 2
No ratings yet
Part 2
225 pages
Sample Mini Project in Deep Learning
No ratings yet
Sample Mini Project in Deep Learning
61 pages
M. e Report
No ratings yet
M. e Report
56 pages
SoS'25 Midterm - Report
No ratings yet
SoS'25 Midterm - Report
14 pages
Module V-Deep Learning
No ratings yet
Module V-Deep Learning
19 pages
Sepm Exp. 0-5
No ratings yet
Sepm Exp. 0-5
14 pages
Fs1 Episode 9 - 16 - Nadela, Ma - Mannyros P
No ratings yet
Fs1 Episode 9 - 16 - Nadela, Ma - Mannyros P
61 pages
Sorting of Objects Using Image Processing
No ratings yet
Sorting of Objects Using Image Processing
6 pages
Deep Residual Learning
No ratings yet
Deep Residual Learning
80 pages
A Review On Various Methodologies Used For Vehicle Classification, Helmet Detection and Number Plate Recognition
No ratings yet
A Review On Various Methodologies Used For Vehicle Classification, Helmet Detection and Number Plate Recognition
9 pages
Autonomous Car
100% (1)
Autonomous Car
12 pages
Unit 3
No ratings yet
Unit 3
17 pages
Project Report Final 1
No ratings yet
Project Report Final 1
63 pages
Nivetha Me Phase1rep
No ratings yet
Nivetha Me Phase1rep
57 pages
M10 - Introduction To TensorFlow, Deep Learning and Application
No ratings yet
M10 - Introduction To TensorFlow, Deep Learning and Application
25 pages
W11 Lecture ITS69204 Image Recognition
No ratings yet
W11 Lecture ITS69204 Image Recognition
44 pages
Nursing Care of A Family With An Infant
100% (1)
Nursing Care of A Family With An Infant
26 pages
Object Tracking
No ratings yet
Object Tracking
50 pages
Psychotherapy For Psychosis Integrating Cognitive Behavioral and Psychodynamic Treatment Complete Chapter Download
100% (11)
Psychotherapy For Psychosis Integrating Cognitive Behavioral and Psychodynamic Treatment Complete Chapter Download
14 pages
Object Detection Using Convolutional Neural Network Transfer Learning
No ratings yet
Object Detection Using Convolutional Neural Network Transfer Learning
11 pages
Real Time Object Detection Using Deep Learning Andmachine Learning Project
No ratings yet
Real Time Object Detection Using Deep Learning Andmachine Learning Project
56 pages
Part B Eti-1
No ratings yet
Part B Eti-1
7 pages
SSRN 4286087
No ratings yet
SSRN 4286087
7 pages
Fyp Zainab 1
No ratings yet
Fyp Zainab 1
16 pages
Nivetha Me P2 PPT
No ratings yet
Nivetha Me P2 PPT
18 pages
Object Detection With Deep Learning - A Review Summary
No ratings yet
Object Detection With Deep Learning - A Review Summary
11 pages
Wen Wen 2021 Thesis
No ratings yet
Wen Wen 2021 Thesis
114 pages
ObjectDetectionPhase2 Demo
No ratings yet
ObjectDetectionPhase2 Demo
16 pages
1 Realtimeobjectdetection
No ratings yet
1 Realtimeobjectdetection
6 pages
6th June 2011 - 20
No ratings yet
6th June 2011 - 20
17 pages
Object Detection Using Deep CNNs Trained On Synthetic Images
No ratings yet
Object Detection Using Deep CNNs Trained On Synthetic Images
8 pages
MVS - Expt8 Object Detection and Reconstruction Using CNN
No ratings yet
MVS - Expt8 Object Detection and Reconstruction Using CNN
5 pages
Visual Image Understanding
No ratings yet
Visual Image Understanding
7 pages
Vitamin Deficiency Detection (Base Paper)
No ratings yet
Vitamin Deficiency Detection (Base Paper)
3 pages
Sagar Paper
No ratings yet
Sagar Paper
4 pages
Object Detection Using OpenCV and Python
No ratings yet
Object Detection Using OpenCV and Python
5 pages
Project Report (2) RRRRRRRRRRR
No ratings yet
Project Report (2) RRRRRRRRRRR
10 pages
2024 TESAS New Intakes and Continuing Students Final v20
No ratings yet
2024 TESAS New Intakes and Continuing Students Final v20
85 pages
Object Detection Using ELAN
No ratings yet
Object Detection Using ELAN
6 pages
Object Detection Using Deep Learning
No ratings yet
Object Detection Using Deep Learning
45 pages
Team-4 DL
No ratings yet
Team-4 DL
5 pages
Real Time Object Detection in Surveillance Cameras With 2xjeq74wam
No ratings yet
Real Time Object Detection in Surveillance Cameras With 2xjeq74wam
8 pages
Object Detection Using CNN
No ratings yet
Object Detection Using CNN
5 pages
Object Detection Using CNN
No ratings yet
Object Detection Using CNN
6 pages
CH 1
No ratings yet
CH 1
8 pages
2802 8020 1 PB
No ratings yet
2802 8020 1 PB
3 pages
Development of Framework For Detecting Smoking Scenes
No ratings yet
Development of Framework For Detecting Smoking Scenes
5 pages
Second Progress Report UID - 17BCS2127
No ratings yet
Second Progress Report UID - 17BCS2127
13 pages
Supreme Court: Epicharis T Garcia in Her Own Behalf. Bengzon, Villegas, Zarraga, Narciso and Cudala For Respondents
No ratings yet
Supreme Court: Epicharis T Garcia in Her Own Behalf. Bengzon, Villegas, Zarraga, Narciso and Cudala For Respondents
46 pages
Oracle HCM Cloud Training Outline
No ratings yet
Oracle HCM Cloud Training Outline
6 pages
SR22804211151
No ratings yet
SR22804211151
8 pages
Fruit Old
No ratings yet
Fruit Old
37 pages
Wundt in History
No ratings yet
Wundt in History
314 pages
Journal of Infusion Nursing
No ratings yet
Journal of Infusion Nursing
5 pages
Cooperating Teacher Evaluation
No ratings yet
Cooperating Teacher Evaluation
3 pages
Multiple Intelligences
No ratings yet
Multiple Intelligences
161 pages
A Deep Learning Based Assistant For The Visually Impaired
No ratings yet
A Deep Learning Based Assistant For The Visually Impaired
11 pages
Realtime Visual Recognition in Deep Convolutional Neural Networks
No ratings yet
Realtime Visual Recognition in Deep Convolutional Neural Networks
13 pages
Real Time Object Detection System Using Deep Learning: University Institute of Engineering, Chandigarh University
No ratings yet
Real Time Object Detection System Using Deep Learning: University Institute of Engineering, Chandigarh University
6 pages
Deep Learning Approach For Object Detection Using CNN: Abstract
No ratings yet
Deep Learning Approach For Object Detection Using CNN: Abstract
7 pages
Realtime Object Detection Using SSD
No ratings yet
Realtime Object Detection Using SSD
8 pages
Real-Time Object Detection Using Deep Learning and Open CV
No ratings yet
Real-Time Object Detection Using Deep Learning and Open CV
4 pages
Minor Project
No ratings yet
Minor Project
21 pages
Aryabhatt Circular
100% (1)
Aryabhatt Circular
2 pages
Real Time Object Recognition and Classification
No ratings yet
Real Time Object Recognition and Classification
6 pages
Photto VMM - T NG H P
No ratings yet
Photto VMM - T NG H P
125 pages
1 TYBCOM SEM 6 COMMERCE-VI HUMAN RESOURCE MANAGEMEN-pages-1
No ratings yet
1 TYBCOM SEM 6 COMMERCE-VI HUMAN RESOURCE MANAGEMEN-pages-1
8 pages
Adjectives and Prepositions Complete Worksheet
No ratings yet
Adjectives and Prepositions Complete Worksheet
5 pages
Smart Shopping System IEEE PAPER TYK EDI Group 9unique
No ratings yet
Smart Shopping System IEEE PAPER TYK EDI Group 9unique
6 pages
Naturopathy Applicant'S Profile Form
100% (2)
Naturopathy Applicant'S Profile Form
2 pages
Enc1501 2025 - Assessment 2 - (Finalz) Q
No ratings yet
Enc1501 2025 - Assessment 2 - (Finalz) Q
5 pages
Computer Networks Syllabus
No ratings yet
Computer Networks Syllabus
3 pages
Final Project Paper Akash
No ratings yet
Final Project Paper Akash
5 pages
Project Bibu Action Research
No ratings yet
Project Bibu Action Research
6 pages
Project Detecto!: A Real-Time Object Detection Model
No ratings yet
Project Detecto!: A Real-Time Object Detection Model
3 pages
GMOA Annual Report 2011/2012
No ratings yet
GMOA Annual Report 2011/2012
76 pages
Consensus On Psychiatry History and Diagnostic Formulation Endorsed by IPS IToP
No ratings yet
Consensus On Psychiatry History and Diagnostic Formulation Endorsed by IPS IToP
14 pages
3 - Software Engineering
No ratings yet
3 - Software Engineering
29 pages
Dbms Model Question Set-2
No ratings yet
Dbms Model Question Set-2
2 pages
The Arts Vocabulary For IELTS
No ratings yet
The Arts Vocabulary For IELTS
7 pages
Qualitative Research 1 Paul
No ratings yet
Qualitative Research 1 Paul
6 pages
Backshift of Tenses
No ratings yet
Backshift of Tenses
3 pages
Joy Ezeigbo Personal Statement For PGDC
No ratings yet
Joy Ezeigbo Personal Statement For PGDC
1 page
TensorFlow in 1 Day: Make your own Neural Network
From Everand
TensorFlow in 1 Day: Make your own Neural Network
Krishna Rungta
3.5/5 (10)
Deep Learning: Fundamentals and Applications
From Everand
Deep Learning: Fundamentals and Applications
Fouad Sabry
No ratings yet