SlideShare a Scribd company logo
Object Detection with deep learning
SUBMITTED TO: MR. B. SURESH SUBMITTED BY:
HIMANSHU MAURYA(9917102004)
SUSHANT SHRIVASTAVA(9917102023)
BHUVNESH KUMAR BHARDWAJ(9917102028)
1. INTRODUCTION TO
OBJECT DETECTION
○ Object detection is scanning and searching for an object in an image or
a video.
Fig. 1 Object detection
Literature Review.
• Object detection is a common term for computer vision techniques classifying and
locating objects in an image. Modern object detection is largely based on use of
convolutional neural networks Some of the most relevant system types today are Faster
R-CNN, R-FCN, Multibox Single Shot Detector (SSD) and YOLO (You Only Look
Once) [1]. Original R-CNN method worked by running a neural net classifier on samples
cropped from images using externally computed box proposals (=samples cropped with
externally computed box proposals; feature extraction done on all the cropped samples).
This approach was computationally expensive due to many crops.
• Single Shot Multibox Detector (SSD) differs from the R-CNN based approaches by not
requiring a second stage per-proposal classification operation. This makes it fast enough
for real-time detection applications. However, this comes with a price of reduced
precision . “SSD with MobileNet” refers to a model where model meta architecture is
SSD and the feature extractor type is MobileNet.
2. Generic object detection
● Generic object detection aims at locating and classifying
existing object in any one image and labelling them with
rectangular BBs to show the confidences of existences.
Fig. 2 Generic object detection
3. Basic architecture of CNN
Convolutional Neural Network (CNN) is a Deep Learning
algorithm which can take in an input image, assign importance to
various aspects/objects in the image and be able to differentiate
one from the other.[2]
Fig. 3 Basic architecture of CNN
4. Building the CNN
● Convolution
● Polling
● Flattening
4.1 Convolution
● Convolution preserves the spatial relationship between pixels
by learning image features using small squares of input data.
FIG. 4.1 Convolution
4.2 POOLING
● It reduces the dimensionality of each feature map but retains
the most important information.
FIG. 4.2 POOLING
4.3 FLATTENING
● Here the matrix is converted into a linear array so that to input
it into the nodes of our neural network.
FIG. 4.3 FLATTENING
5. Dataset & Preprocessing
COCO stands for Common Objects in Context, this dataset contains around 330K labelled images. COCO is
a large-scale object detection, segmentation, and captioning dataset.[3]
5.1 Features of dataset
· Object segmentation
· Recognition in context
· 330K images (>200K labeled)
· 1.5 million object instances
· 80 object categories
· 91 stuff categories
5.2 Data Preprocessing
● Since the model is pre trained, there is no need for data Preprocessing.
6. What is SSD?
● SSD(Single Shot Detector) is a is designed for object
detection in real-time.
FIG 5. Single Shot Detector.
7. Object detection using SSD algorithm.
● It is a three steps Process:
1. Region Proposal
2. Feature Generation
3. Classification
FIG. 6 Object detection using SSD
8. SSD FRAMEWORK
● Multi-scale feature maps for detection.
● Convolutional predictors for detection.
● Default boxes and aspect ratios.
FIG. 7 SSD FRAMEWORK
9. Feature extraction
● In this stage ,each region proposal is warped or cropped into
a fixed resolution and the SSD module is utilized to extract
features.
FIG. 8 Feature extraction
10. Classification and Localization
● Classify each region using MobileNet V1 Architecture for each
category by passing feature vector created from feature extraction
and scored region are then adjusted with bounding box regression.
● This architecture uses depthwise separable convolutions which
significantly reduces the number of parameters when compared to
the network with normal convolutions.
FIG. 9 Depth Wise Separable
Convolution
11. MobileNet V1 Architecture
● It uses Separable Convolution to reduce the model size and
complexity.
● Smaller model size: Fewer number of parameters.
● Smaller complexity: Fewer Multiplications and Additions
(Multi-Adds).
Fig. 10 MobileNet V1 Architecture
12. Advantages of MobileNet V1 Architecture
● The main advantages is their accuracy in image recognition
problem.
● It takes less time.
● Improve the quality of candidate bounding boxes.
13. Tools And Libraries
● Anaconda — Anaconda is a free and open source distribution of the Python and R programming languages
for data science and machine learning related applications.
● Spyder — Spyder is an open source cross-platform IDE for scientific programming in the Python language.
● Tensorflow — TensorFlow is an open-source software library for dataflow programming across a range of
tasks.
● NumPy- NumPy is a Python package which stands for ‘Numerical Python’. It is the core library for scientific
computing, which contains a powerful n-dimensional array object, provide tools for integrating C, C++ etc.
● Matplotlib- Matplotlib is a Python 2D plotting library which produces publication quality figures in a variety of
hardcopy formats and interactive environments across platforms.
● Urllib - Urllib is a Python module that can be used for opening URLs. It defines functions and classes to help in URL
actions. With Python you can also access and retrieve data from the internet like XML, HTML, JSON, etc.
References
1. Zhong-Qiu Zhao , Member, IEEE, Peng Zheng, Shou-Tao Xu, and Xindong Wu , Fellow, IEEE(2016)
2. https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning-
99760835f148
3. http://cocodataset.org/#home
LINKS TO FIGURES:-
1.
2. https://towardsdatascience.com/going-deep-into-object-detection-bed442d92b34
3. https://medium.com/datadriveninvestor/convolutional-neural-network-cnn-simplified-ecafd4ee52c5
4. https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53
5. https://www.researchgate.net/figure/The-architecture-of-Single-Shot-Multibox-Detector-SSD-It-considers- only-two-
stage-by_fig9_327491507
6. Wei Liu1, Dragomir Anguelov2, Dumitru Erhan3, Christian Szegedy3, Scott Reed4, Cheng-Yang Fu1, Alexander C.
Berg1(2016)
7. Sermanet,P.,Eigen,D.,Zhang,X.,Mathieu,M.,Fergus,R.,LeCun,Y.: Overfeat:Integrated recognition, localization and
detection using convolutional networks. In: ICLR. (2014)
8. https://towardsdatascience.com/cnn-application-on-structured-data-automated-feature-extraction-8f2cd28d9a7e
9. https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-
3bd2b1164a53
10.https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning-
99760835f148
https://machinethink.net/blog/object-detection/
THANK YOU.
Ad

More Related Content

What's hot (20)

Object tracking presentation
Object tracking  presentationObject tracking  presentation
Object tracking presentation
MrsShwetaBanait1
 
Object Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksObject Detection using Deep Neural Networks
Object Detection using Deep Neural Networks
Usman Qayyum
 
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
Taegyun Jeon
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognition
Ashiq Ullah
 
Deep learning based object detection
Deep learning based object detectionDeep learning based object detection
Deep learning based object detection
chettykulkarni
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basics
Brodmann17
 
Real Time Object Tracking
Real Time Object TrackingReal Time Object Tracking
Real Time Object Tracking
Vanya Valindria
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition
Intel Nervana
 
Object detection and Instance Segmentation
Object detection and Instance SegmentationObject detection and Instance Segmentation
Object detection and Instance Segmentation
Hichem Felouat
 
Deep learning for object detection
Deep learning for object detectionDeep learning for object detection
Deep learning for object detection
Wenjing Chen
 
Yolo
YoloYolo
Yolo
NEHA Kapoor
 
Object detection
Object detectionObject detection
Object detection
Somesh Vyas
 
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Simplilearn
 
Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deep Learning for Computer Vision: Object Detection (UPC 2016)Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deep Learning for Computer Vision: Object Detection (UPC 2016)
Universitat Politècnica de Catalunya
 
Deep sort and sort paper introduce presentation
Deep sort and sort paper introduce presentationDeep sort and sort paper introduce presentation
Deep sort and sort paper introduce presentation
경훈 김
 
Image segmentation
Image segmentationImage segmentation
Image segmentation
Kuppusamy P
 
Moving object detection
Moving object detectionMoving object detection
Moving object detection
Raviraj singh shekhawat
 
Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learning
Antonio Rueda-Toicen
 
Object classification using CNN & VGG16 Model (Keras and Tensorflow)
Object classification using CNN & VGG16 Model (Keras and Tensorflow) Object classification using CNN & VGG16 Model (Keras and Tensorflow)
Object classification using CNN & VGG16 Model (Keras and Tensorflow)
Lalit Jain
 
Yolo
YoloYolo
Yolo
Bang Tsui Liou
 
Object tracking presentation
Object tracking  presentationObject tracking  presentation
Object tracking presentation
MrsShwetaBanait1
 
Object Detection using Deep Neural Networks
Object Detection using Deep Neural NetworksObject Detection using Deep Neural Networks
Object Detection using Deep Neural Networks
Usman Qayyum
 
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection
Taegyun Jeon
 
Introduction to Object recognition
Introduction to Object recognitionIntroduction to Object recognition
Introduction to Object recognition
Ashiq Ullah
 
Deep learning based object detection
Deep learning based object detectionDeep learning based object detection
Deep learning based object detection
chettykulkarni
 
Deep learning based object detection basics
Deep learning based object detection basicsDeep learning based object detection basics
Deep learning based object detection basics
Brodmann17
 
Real Time Object Tracking
Real Time Object TrackingReal Time Object Tracking
Real Time Object Tracking
Vanya Valindria
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition
Intel Nervana
 
Object detection and Instance Segmentation
Object detection and Instance SegmentationObject detection and Instance Segmentation
Object detection and Instance Segmentation
Hichem Felouat
 
Deep learning for object detection
Deep learning for object detectionDeep learning for object detection
Deep learning for object detection
Wenjing Chen
 
Object detection
Object detectionObject detection
Object detection
Somesh Vyas
 
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Convolutional Neural Network - CNN | How CNN Works | Deep Learning Course | S...
Simplilearn
 
Deep sort and sort paper introduce presentation
Deep sort and sort paper introduce presentationDeep sort and sort paper introduce presentation
Deep sort and sort paper introduce presentation
경훈 김
 
Image segmentation
Image segmentationImage segmentation
Image segmentation
Kuppusamy P
 
Image segmentation with deep learning
Image segmentation with deep learningImage segmentation with deep learning
Image segmentation with deep learning
Antonio Rueda-Toicen
 
Object classification using CNN & VGG16 Model (Keras and Tensorflow)
Object classification using CNN & VGG16 Model (Keras and Tensorflow) Object classification using CNN & VGG16 Model (Keras and Tensorflow)
Object classification using CNN & VGG16 Model (Keras and Tensorflow)
Lalit Jain
 

Similar to Object detection with deep learning (20)

Object Detetcion using SSD-MobileNet
Object Detetcion using SSD-MobileNetObject Detetcion using SSD-MobileNet
Object Detetcion using SSD-MobileNet
IRJET Journal
 
kanimozhi2019.pdf
kanimozhi2019.pdfkanimozhi2019.pdf
kanimozhi2019.pdf
AshrafDabbas1
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detection
vivatechijri
 
Text and Object Recognition using Deep Learning for Visually Impaired People
Text and Object Recognition using Deep Learning for Visually Impaired PeopleText and Object Recognition using Deep Learning for Visually Impaired People
Text and Object Recognition using Deep Learning for Visually Impaired People
ijtsrd
 
6. PRESENTATION REAL TIME OBJECT DETECTION.pptx
6. PRESENTATION REAL TIME OBJECT DETECTION.pptx6. PRESENTATION REAL TIME OBJECT DETECTION.pptx
6. PRESENTATION REAL TIME OBJECT DETECTION.pptx
ajajkhan16
 
Real Time Object Detection And Recognization.pdf
Real Time Object Detection And Recognization.pdfReal Time Object Detection And Recognization.pdf
Real Time Object Detection And Recognization.pdf
DevidasBhere
 
Scene recognition using Convolutional Neural Network
Scene recognition using Convolutional Neural NetworkScene recognition using Convolutional Neural Network
Scene recognition using Convolutional Neural Network
DhirajGidde
 
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
ijscai
 
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
ijscai
 
Unsupervised learning models of invariant features in images: Recent developm...
Unsupervised learning models of invariant features in images: Recent developm...Unsupervised learning models of invariant features in images: Recent developm...
Unsupervised learning models of invariant features in images: Recent developm...
IJSCAI Journal
 
IRJET- Real-Time Object Detection using Deep Learning: A Survey
IRJET- Real-Time Object Detection using Deep Learning: A SurveyIRJET- Real-Time Object Detection using Deep Learning: A Survey
IRJET- Real-Time Object Detection using Deep Learning: A Survey
IRJET Journal
 
Dataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problemsDataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problems
PetteriTeikariPhD
 
REVIEW ON OBJECT DETECTION WITH CNN
REVIEW ON OBJECT DETECTION WITH CNNREVIEW ON OBJECT DETECTION WITH CNN
REVIEW ON OBJECT DETECTION WITH CNN
IRJET Journal
 
OBJECT IDENTIFICATION
OBJECT IDENTIFICATIONOBJECT IDENTIFICATION
OBJECT IDENTIFICATION
IRJET Journal
 
ppt - of a project will help you on your college projects
ppt - of a project will help you on your college projectsppt - of a project will help you on your college projects
ppt - of a project will help you on your college projects
vikaspandey0702
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learning
pratik pratyay
 
slide-171212080528.pptx
slide-171212080528.pptxslide-171212080528.pptx
slide-171212080528.pptx
SharanrajK22MMT1003
 
IRJET - Direct Me-Nevigation for Blind People
IRJET -  	  Direct Me-Nevigation for Blind PeopleIRJET -  	  Direct Me-Nevigation for Blind People
IRJET - Direct Me-Nevigation for Blind People
IRJET Journal
 
Devanagari Digit and Character Recognition Using Convolutional Neural Network
Devanagari Digit and Character Recognition Using Convolutional Neural NetworkDevanagari Digit and Character Recognition Using Convolutional Neural Network
Devanagari Digit and Character Recognition Using Convolutional Neural Network
IRJET Journal
 
Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...
IRJET Journal
 
Object Detetcion using SSD-MobileNet
Object Detetcion using SSD-MobileNetObject Detetcion using SSD-MobileNet
Object Detetcion using SSD-MobileNet
IRJET Journal
 
A Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detectionA Literature Survey: Neural Networks for object detection
A Literature Survey: Neural Networks for object detection
vivatechijri
 
Text and Object Recognition using Deep Learning for Visually Impaired People
Text and Object Recognition using Deep Learning for Visually Impaired PeopleText and Object Recognition using Deep Learning for Visually Impaired People
Text and Object Recognition using Deep Learning for Visually Impaired People
ijtsrd
 
6. PRESENTATION REAL TIME OBJECT DETECTION.pptx
6. PRESENTATION REAL TIME OBJECT DETECTION.pptx6. PRESENTATION REAL TIME OBJECT DETECTION.pptx
6. PRESENTATION REAL TIME OBJECT DETECTION.pptx
ajajkhan16
 
Real Time Object Detection And Recognization.pdf
Real Time Object Detection And Recognization.pdfReal Time Object Detection And Recognization.pdf
Real Time Object Detection And Recognization.pdf
DevidasBhere
 
Scene recognition using Convolutional Neural Network
Scene recognition using Convolutional Neural NetworkScene recognition using Convolutional Neural Network
Scene recognition using Convolutional Neural Network
DhirajGidde
 
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
ijscai
 
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
UNSUPERVISED LEARNING MODELS OF INVARIANT FEATURES IN IMAGES: RECENT DEVELOPM...
ijscai
 
Unsupervised learning models of invariant features in images: Recent developm...
Unsupervised learning models of invariant features in images: Recent developm...Unsupervised learning models of invariant features in images: Recent developm...
Unsupervised learning models of invariant features in images: Recent developm...
IJSCAI Journal
 
IRJET- Real-Time Object Detection using Deep Learning: A Survey
IRJET- Real-Time Object Detection using Deep Learning: A SurveyIRJET- Real-Time Object Detection using Deep Learning: A Survey
IRJET- Real-Time Object Detection using Deep Learning: A Survey
IRJET Journal
 
Dataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problemsDataset creation for Deep Learning-based Geometric Computer Vision problems
Dataset creation for Deep Learning-based Geometric Computer Vision problems
PetteriTeikariPhD
 
REVIEW ON OBJECT DETECTION WITH CNN
REVIEW ON OBJECT DETECTION WITH CNNREVIEW ON OBJECT DETECTION WITH CNN
REVIEW ON OBJECT DETECTION WITH CNN
IRJET Journal
 
OBJECT IDENTIFICATION
OBJECT IDENTIFICATIONOBJECT IDENTIFICATION
OBJECT IDENTIFICATION
IRJET Journal
 
ppt - of a project will help you on your college projects
ppt - of a project will help you on your college projectsppt - of a project will help you on your college projects
ppt - of a project will help you on your college projects
vikaspandey0702
 
Real Time Object Dectection using machine learning
Real Time Object Dectection using machine learningReal Time Object Dectection using machine learning
Real Time Object Dectection using machine learning
pratik pratyay
 
IRJET - Direct Me-Nevigation for Blind People
IRJET -  	  Direct Me-Nevigation for Blind PeopleIRJET -  	  Direct Me-Nevigation for Blind People
IRJET - Direct Me-Nevigation for Blind People
IRJET Journal
 
Devanagari Digit and Character Recognition Using Convolutional Neural Network
Devanagari Digit and Character Recognition Using Convolutional Neural NetworkDevanagari Digit and Character Recognition Using Convolutional Neural Network
Devanagari Digit and Character Recognition Using Convolutional Neural Network
IRJET Journal
 
Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...Detection of a user-defined object in an image using feature extraction- Trai...
Detection of a user-defined object in an image using feature extraction- Trai...
IRJET Journal
 
Ad

Recently uploaded (20)

ML_Unit_V_RDC_ASSOCIATION AND DIMENSIONALITY REDUCTION.pdf
ML_Unit_V_RDC_ASSOCIATION AND DIMENSIONALITY REDUCTION.pdfML_Unit_V_RDC_ASSOCIATION AND DIMENSIONALITY REDUCTION.pdf
ML_Unit_V_RDC_ASSOCIATION AND DIMENSIONALITY REDUCTION.pdf
rameshwarchintamani
 
Relations and Functions – Understanding the Foundation of Mathematics.pptx
Relations and Functions – Understanding the Foundation of Mathematics.pptxRelations and Functions – Understanding the Foundation of Mathematics.pptx
Relations and Functions – Understanding the Foundation of Mathematics.pptx
srmvalliammaicse2
 
Dynamics of Structures with Uncertain Properties.pptx
Dynamics of Structures with Uncertain Properties.pptxDynamics of Structures with Uncertain Properties.pptx
Dynamics of Structures with Uncertain Properties.pptx
University of Glasgow
 
Design of Variable Depth Single-Span Post.pdf
Design of Variable Depth Single-Span Post.pdfDesign of Variable Depth Single-Span Post.pdf
Design of Variable Depth Single-Span Post.pdf
Kamel Farid
 
Compiler Design_Code generation techniques.pptx
Compiler Design_Code generation techniques.pptxCompiler Design_Code generation techniques.pptx
Compiler Design_Code generation techniques.pptx
RushaliDeshmukh2
 
Analog electronic circuits with some imp
Analog electronic circuits with some impAnalog electronic circuits with some imp
Analog electronic circuits with some imp
KarthikTG7
 
Efficient Algorithms for Isogeny Computation on Hyperelliptic Curves: Their A...
Efficient Algorithms for Isogeny Computation on Hyperelliptic Curves: Their A...Efficient Algorithms for Isogeny Computation on Hyperelliptic Curves: Their A...
Efficient Algorithms for Isogeny Computation on Hyperelliptic Curves: Their A...
IJCNCJournal
 
Introduction to FLUID MECHANICS & KINEMATICS
Introduction to FLUID MECHANICS &  KINEMATICSIntroduction to FLUID MECHANICS &  KINEMATICS
Introduction to FLUID MECHANICS & KINEMATICS
narayanaswamygdas
 
Novel Plug Flow Reactor with Recycle For Growth Control
Novel Plug Flow Reactor with Recycle For Growth ControlNovel Plug Flow Reactor with Recycle For Growth Control
Novel Plug Flow Reactor with Recycle For Growth Control
Chris Harding
 
Machine Learning basics POWERPOINT PRESENETATION
Machine Learning basics POWERPOINT PRESENETATIONMachine Learning basics POWERPOINT PRESENETATION
Machine Learning basics POWERPOINT PRESENETATION
DarrinBright1
 
NOMA analysis in 5G communication systems
NOMA analysis in 5G communication systemsNOMA analysis in 5G communication systems
NOMA analysis in 5G communication systems
waleedali330654
 
Compiler Design_Syntax Directed Translation.pptx
Compiler Design_Syntax Directed Translation.pptxCompiler Design_Syntax Directed Translation.pptx
Compiler Design_Syntax Directed Translation.pptx
RushaliDeshmukh2
 
sss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptx
sss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptx
sss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptx
ajayrm685
 
SICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introductionSICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introduction
fabienklr
 
Compiler Design_Code Optimization tech.pptx
Compiler Design_Code Optimization tech.pptxCompiler Design_Code Optimization tech.pptx
Compiler Design_Code Optimization tech.pptx
RushaliDeshmukh2
 
2025 Apply BTech CEC .docx
2025 Apply BTech CEC                 .docx2025 Apply BTech CEC                 .docx
2025 Apply BTech CEC .docx
tusharmanagementquot
 
New Microsoft PowerPoint Presentation.pdf
New Microsoft PowerPoint Presentation.pdfNew Microsoft PowerPoint Presentation.pdf
New Microsoft PowerPoint Presentation.pdf
mohamedezzat18803
 
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G..."Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
Infopitaara
 
Evonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdfEvonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdf
szhang13
 
Autodesk Fusion 2025 Tutorial: User Interface
Autodesk Fusion 2025 Tutorial: User InterfaceAutodesk Fusion 2025 Tutorial: User Interface
Autodesk Fusion 2025 Tutorial: User Interface
Atif Razi
 
ML_Unit_V_RDC_ASSOCIATION AND DIMENSIONALITY REDUCTION.pdf
ML_Unit_V_RDC_ASSOCIATION AND DIMENSIONALITY REDUCTION.pdfML_Unit_V_RDC_ASSOCIATION AND DIMENSIONALITY REDUCTION.pdf
ML_Unit_V_RDC_ASSOCIATION AND DIMENSIONALITY REDUCTION.pdf
rameshwarchintamani
 
Relations and Functions – Understanding the Foundation of Mathematics.pptx
Relations and Functions – Understanding the Foundation of Mathematics.pptxRelations and Functions – Understanding the Foundation of Mathematics.pptx
Relations and Functions – Understanding the Foundation of Mathematics.pptx
srmvalliammaicse2
 
Dynamics of Structures with Uncertain Properties.pptx
Dynamics of Structures with Uncertain Properties.pptxDynamics of Structures with Uncertain Properties.pptx
Dynamics of Structures with Uncertain Properties.pptx
University of Glasgow
 
Design of Variable Depth Single-Span Post.pdf
Design of Variable Depth Single-Span Post.pdfDesign of Variable Depth Single-Span Post.pdf
Design of Variable Depth Single-Span Post.pdf
Kamel Farid
 
Compiler Design_Code generation techniques.pptx
Compiler Design_Code generation techniques.pptxCompiler Design_Code generation techniques.pptx
Compiler Design_Code generation techniques.pptx
RushaliDeshmukh2
 
Analog electronic circuits with some imp
Analog electronic circuits with some impAnalog electronic circuits with some imp
Analog electronic circuits with some imp
KarthikTG7
 
Efficient Algorithms for Isogeny Computation on Hyperelliptic Curves: Their A...
Efficient Algorithms for Isogeny Computation on Hyperelliptic Curves: Their A...Efficient Algorithms for Isogeny Computation on Hyperelliptic Curves: Their A...
Efficient Algorithms for Isogeny Computation on Hyperelliptic Curves: Their A...
IJCNCJournal
 
Introduction to FLUID MECHANICS & KINEMATICS
Introduction to FLUID MECHANICS &  KINEMATICSIntroduction to FLUID MECHANICS &  KINEMATICS
Introduction to FLUID MECHANICS & KINEMATICS
narayanaswamygdas
 
Novel Plug Flow Reactor with Recycle For Growth Control
Novel Plug Flow Reactor with Recycle For Growth ControlNovel Plug Flow Reactor with Recycle For Growth Control
Novel Plug Flow Reactor with Recycle For Growth Control
Chris Harding
 
Machine Learning basics POWERPOINT PRESENETATION
Machine Learning basics POWERPOINT PRESENETATIONMachine Learning basics POWERPOINT PRESENETATION
Machine Learning basics POWERPOINT PRESENETATION
DarrinBright1
 
NOMA analysis in 5G communication systems
NOMA analysis in 5G communication systemsNOMA analysis in 5G communication systems
NOMA analysis in 5G communication systems
waleedali330654
 
Compiler Design_Syntax Directed Translation.pptx
Compiler Design_Syntax Directed Translation.pptxCompiler Design_Syntax Directed Translation.pptx
Compiler Design_Syntax Directed Translation.pptx
RushaliDeshmukh2
 
sss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptx
sss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptx
sss1.pptxsss1.pptxsss1.pptxsss1.pptxsss1.pptx
ajayrm685
 
SICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introductionSICPA: Fabien Keller - background introduction
SICPA: Fabien Keller - background introduction
fabienklr
 
Compiler Design_Code Optimization tech.pptx
Compiler Design_Code Optimization tech.pptxCompiler Design_Code Optimization tech.pptx
Compiler Design_Code Optimization tech.pptx
RushaliDeshmukh2
 
New Microsoft PowerPoint Presentation.pdf
New Microsoft PowerPoint Presentation.pdfNew Microsoft PowerPoint Presentation.pdf
New Microsoft PowerPoint Presentation.pdf
mohamedezzat18803
 
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G..."Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
"Feed Water Heaters in Thermal Power Plants: Types, Working, and Efficiency G...
Infopitaara
 
Evonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdfEvonik Overview Visiomer Specialty Methacrylates.pdf
Evonik Overview Visiomer Specialty Methacrylates.pdf
szhang13
 
Autodesk Fusion 2025 Tutorial: User Interface
Autodesk Fusion 2025 Tutorial: User InterfaceAutodesk Fusion 2025 Tutorial: User Interface
Autodesk Fusion 2025 Tutorial: User Interface
Atif Razi
 
Ad

Object detection with deep learning

  • 1. Object Detection with deep learning SUBMITTED TO: MR. B. SURESH SUBMITTED BY: HIMANSHU MAURYA(9917102004) SUSHANT SHRIVASTAVA(9917102023) BHUVNESH KUMAR BHARDWAJ(9917102028)
  • 2. 1. INTRODUCTION TO OBJECT DETECTION ○ Object detection is scanning and searching for an object in an image or a video. Fig. 1 Object detection
  • 3. Literature Review. • Object detection is a common term for computer vision techniques classifying and locating objects in an image. Modern object detection is largely based on use of convolutional neural networks Some of the most relevant system types today are Faster R-CNN, R-FCN, Multibox Single Shot Detector (SSD) and YOLO (You Only Look Once) [1]. Original R-CNN method worked by running a neural net classifier on samples cropped from images using externally computed box proposals (=samples cropped with externally computed box proposals; feature extraction done on all the cropped samples). This approach was computationally expensive due to many crops. • Single Shot Multibox Detector (SSD) differs from the R-CNN based approaches by not requiring a second stage per-proposal classification operation. This makes it fast enough for real-time detection applications. However, this comes with a price of reduced precision . “SSD with MobileNet” refers to a model where model meta architecture is SSD and the feature extractor type is MobileNet.
  • 4. 2. Generic object detection ● Generic object detection aims at locating and classifying existing object in any one image and labelling them with rectangular BBs to show the confidences of existences. Fig. 2 Generic object detection
  • 5. 3. Basic architecture of CNN Convolutional Neural Network (CNN) is a Deep Learning algorithm which can take in an input image, assign importance to various aspects/objects in the image and be able to differentiate one from the other.[2] Fig. 3 Basic architecture of CNN
  • 6. 4. Building the CNN ● Convolution ● Polling ● Flattening
  • 7. 4.1 Convolution ● Convolution preserves the spatial relationship between pixels by learning image features using small squares of input data. FIG. 4.1 Convolution
  • 8. 4.2 POOLING ● It reduces the dimensionality of each feature map but retains the most important information. FIG. 4.2 POOLING
  • 9. 4.3 FLATTENING ● Here the matrix is converted into a linear array so that to input it into the nodes of our neural network. FIG. 4.3 FLATTENING
  • 10. 5. Dataset & Preprocessing COCO stands for Common Objects in Context, this dataset contains around 330K labelled images. COCO is a large-scale object detection, segmentation, and captioning dataset.[3] 5.1 Features of dataset · Object segmentation · Recognition in context · 330K images (>200K labeled) · 1.5 million object instances · 80 object categories · 91 stuff categories 5.2 Data Preprocessing ● Since the model is pre trained, there is no need for data Preprocessing.
  • 11. 6. What is SSD? ● SSD(Single Shot Detector) is a is designed for object detection in real-time. FIG 5. Single Shot Detector.
  • 12. 7. Object detection using SSD algorithm. ● It is a three steps Process: 1. Region Proposal 2. Feature Generation 3. Classification FIG. 6 Object detection using SSD
  • 13. 8. SSD FRAMEWORK ● Multi-scale feature maps for detection. ● Convolutional predictors for detection. ● Default boxes and aspect ratios. FIG. 7 SSD FRAMEWORK
  • 14. 9. Feature extraction ● In this stage ,each region proposal is warped or cropped into a fixed resolution and the SSD module is utilized to extract features. FIG. 8 Feature extraction
  • 15. 10. Classification and Localization ● Classify each region using MobileNet V1 Architecture for each category by passing feature vector created from feature extraction and scored region are then adjusted with bounding box regression. ● This architecture uses depthwise separable convolutions which significantly reduces the number of parameters when compared to the network with normal convolutions. FIG. 9 Depth Wise Separable Convolution
  • 16. 11. MobileNet V1 Architecture ● It uses Separable Convolution to reduce the model size and complexity. ● Smaller model size: Fewer number of parameters. ● Smaller complexity: Fewer Multiplications and Additions (Multi-Adds). Fig. 10 MobileNet V1 Architecture
  • 17. 12. Advantages of MobileNet V1 Architecture ● The main advantages is their accuracy in image recognition problem. ● It takes less time. ● Improve the quality of candidate bounding boxes.
  • 18. 13. Tools And Libraries ● Anaconda — Anaconda is a free and open source distribution of the Python and R programming languages for data science and machine learning related applications. ● Spyder — Spyder is an open source cross-platform IDE for scientific programming in the Python language. ● Tensorflow — TensorFlow is an open-source software library for dataflow programming across a range of tasks. ● NumPy- NumPy is a Python package which stands for ‘Numerical Python’. It is the core library for scientific computing, which contains a powerful n-dimensional array object, provide tools for integrating C, C++ etc. ● Matplotlib- Matplotlib is a Python 2D plotting library which produces publication quality figures in a variety of hardcopy formats and interactive environments across platforms. ● Urllib - Urllib is a Python module that can be used for opening URLs. It defines functions and classes to help in URL actions. With Python you can also access and retrieve data from the internet like XML, HTML, JSON, etc.
  • 19. References 1. Zhong-Qiu Zhao , Member, IEEE, Peng Zheng, Shou-Tao Xu, and Xindong Wu , Fellow, IEEE(2016) 2. https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning- 99760835f148 3. http://cocodataset.org/#home LINKS TO FIGURES:- 1. 2. https://towardsdatascience.com/going-deep-into-object-detection-bed442d92b34 3. https://medium.com/datadriveninvestor/convolutional-neural-network-cnn-simplified-ecafd4ee52c5 4. https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53 5. https://www.researchgate.net/figure/The-architecture-of-Single-Shot-Multibox-Detector-SSD-It-considers- only-two- stage-by_fig9_327491507 6. Wei Liu1, Dragomir Anguelov2, Dumitru Erhan3, Christian Szegedy3, Scott Reed4, Cheng-Yang Fu1, Alexander C. Berg1(2016) 7. Sermanet,P.,Eigen,D.,Zhang,X.,Mathieu,M.,Fergus,R.,LeCun,Y.: Overfeat:Integrated recognition, localization and detection using convolutional networks. In: ICLR. (2014) 8. https://towardsdatascience.com/cnn-application-on-structured-data-automated-feature-extraction-8f2cd28d9a7e 9. https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way- 3bd2b1164a53 10.https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning- 99760835f148 https://machinethink.net/blog/object-detection/