Presentation1

The document discusses various algorithms for 2D and 3D object detection, categorizing them into traditional, anchor-based, anchor-free, and transformer-based methods. It highlights the strengths and weaknesses of each approach, such as the speed and accuracy of YOLO and SSD, as well as the computational demands of DETR and ViT. The comparison emphasizes the trade-offs between accuracy, speed, and complexity in object detection models.

Uploaded by

yayesiy213

Different Algorithms of 2D and 3D Object Detection
Team 3
Jua
Traditional Object Detection

Three stages:
• Region proposal
  • Regions of Interest (RoI)
  • Different window sizes
• Feature extraction
  • Local Binary Pattern (LBP)
  • Histogram of Oriented Gradients (HOG)
  • Scale-Invariant Feature Transform (SIFT)
• Classification and regression
  • Classification
  • Bounding box regression

Flaws:
• Slow speed
• Low accuracy
• High computational overhead
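
The region proposal stage with different window sizes can be sketched as an exhaustive sliding-window search. This is a minimal illustration, not a specific published detector: the function name `sliding_windows`, the window sizes, and the stride are all assumptions chosen for the example.

```python
def sliding_windows(image_w, image_h, window_sizes, stride):
    """Yield (x, y, w, h) candidate regions for every window size."""
    for w, h in window_sizes:
        # Slide each window over the image, keeping it fully inside the frame.
        for y in range(0, image_h - h + 1, stride):
            for x in range(0, image_w - w + 1, stride):
                yield (x, y, w, h)


# Hypothetical 64x48 image scanned with two window sizes.
windows = list(sliding_windows(64, 48, [(16, 16), (32, 32)], stride=16))
print(len(windows))
```

Each candidate window then needs its own feature extraction (LBP, HOG, or SIFT) and classification, so the cost grows with the number of sizes and with finer strides, which is where the slow speed and high computational overhead come from.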
Classification of algorithms

Anchor-Based: RCNN, YOLO, SSD

Anchor-Free: Keypoint-based, Anchor-point-based

Transformer-Based: DETR, ViT

RCNN (Region-Based CNN)

RCNN Overview:
• Introduced by Ross Girshick et al.
• Region Proposal: Uses selective search to propose regions (bounding boxes)
• CNN-Based Feature Extraction: Extracts features for each region
• Classification: Uses classifiers like SVM for object classification
Limitations:
• Slow due to separate steps for region proposal, feature extraction, and classification
• Not real-time
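Fast RCNN and Faster RCNN
Fast RCNN:
• Runs the CNN once per image and shares the feature map across regions, so feature extraction and classification happen in a single forward pass
• Uses ROI Pooling to turn each region into a fixed-size feature for faster computation
Faster RCNN:
• Introduces the Region Proposal Network (RPN) to generate proposals, replacing selective search
• Significantly faster than RCNN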
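
ROI Pooling can be sketched in a few lines. The version below is a simplified illustration on a plain nested list, assuming integer ROI coordinates with an exclusive end and an ROI that lies inside the feature map; `roi_max_pool` is a hypothetical name, not a library function.

```python
def roi_max_pool(feature_map, roi, out_h, out_w):
    """Max-pool the region roi = (x0, y0, x1, y1) into an out_h x out_w grid."""
    x0, y0, x1, y1 = roi
    roi_h, roi_w = y1 - y0, x1 - x0
    pooled = []
    for i in range(out_h):
        # Split the ROI rows into out_h bins; every bin covers at least one row.
        ys = y0 + (i * roi_h) // out_h
        ye = max(ys + 1, y0 + ((i + 1) * roi_h) // out_h)
        row = []
        for j in range(out_w):
            xs = x0 + (j * roi_w) // out_w
            xe = max(xs + 1, x0 + ((j + 1) * roi_w) // out_w)
            row.append(max(feature_map[y][x]
                           for y in range(ys, ye)
                           for x in range(xs, xe)))
        pooled.append(row)
    return pooled


# Toy 4x4 feature map holding the values 0..15.
feature_map = [[r * 4 + c for c in range(4)] for r in range(4)]
full = roi_max_pool(feature_map, (0, 0, 4, 4), 2, 2)     # 4x4 region
smaller = roi_max_pool(feature_map, (1, 1, 4, 4), 2, 2)  # 3x3 region
print(full, smaller)
```

Both regions come out as 2x2 grids even though their sizes differ, which is what lets a single classifier head process every proposal from the shared feature map.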
[Figure panels: Fast RCNN, Faster RCNN]
YOLO (You Only Look Once)
YOLO Overview:
• Single-stage detector: Predicts classes and bounding boxes in one pass, with no separate region proposal stage
• Speed: Real-time object detection
• Divides the image into grid cells and predicts bounding boxes for each cell
Strengths:
• Extremely fast
• Real-time detection for video processing
Weaknesses:
• Struggles with detecting small or overlapping objects
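
The grid-cell idea can be illustrated with a short sketch of how an object is assigned to the cell that contains its center. This follows the scheme of the original YOLO paper as an assumption; the function name `responsible_cell`, the 448x448 image, and the 7x7 grid are illustrative values only.

```python
def responsible_cell(box, image_w, image_h, grid_size):
    """Return (row, col, dx, dy): the grid cell holding the box center
    and the center's offset inside that cell, each offset in [0, 1)."""
    x_min, y_min, x_max, y_max = box
    cx = (x_min + x_max) / 2 / image_w  # normalized center x
    cy = (y_min + y_max) / 2 / image_h  # normalized center y
    col = min(int(cx * grid_size), grid_size - 1)
    row = min(int(cy * grid_size), grid_size - 1)
    return row, col, cx * grid_size - col, cy * grid_size - row


# Two hypothetical overlapping objects in a 448x448 image with a 7x7 grid.
first = responsible_cell((100, 150, 200, 250), 448, 448, 7)
second = responsible_cell((120, 170, 190, 240), 448, 448, 7)
print(first[:2], second[:2])
```

Both centers land in the same cell here. When a cell can only describe a limited number of objects, overlapping or very small objects compete for the same slot, which is one way to read the weakness listed above.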
SSD (Single Shot MultiBox Detector)
SSD Overview:
• Combines YOLO’s speed with better accuracy for small objects
• Predicts objects at multiple scales using feature maps from different layers
• No need for a separate region proposal network like in Faster RCNN
Strengths:
• Good balance between speed and accuracy
• Multi-scale detection improves performance for small objects
Weaknesses:
• Still not as accurate as two-stage detectors like Faster RCNN
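
Multi-scale prediction can be made concrete with the linear scale rule described in the SSD paper, where each feature map is given default boxes of a different relative size. The sketch below assumes the commonly quoted values s_min = 0.2 and s_max = 0.9 and three aspect ratios; the function names are hypothetical and real configurations vary.

```python
import math


def default_box_scales(num_maps, s_min=0.2, s_max=0.9):
    """Linearly spaced box scales, one per feature map (needs num_maps >= 2)."""
    step = (s_max - s_min) / (num_maps - 1)
    return [s_min + step * k for k in range(num_maps)]


def default_box_shapes(scale, aspect_ratios=(1.0, 2.0, 0.5)):
    """(width, height) pairs, relative to the image, for one scale."""
    return [(scale * math.sqrt(ar), scale / math.sqrt(ar))
            for ar in aspect_ratios]


scales = default_box_scales(6)
for s in scales:
    print(round(s, 2), [(round(w, 2), round(h, 2))
                        for w, h in default_box_shapes(s)])
```

Early, high-resolution feature maps get the small scales and are responsible for small objects, while later, coarser maps get the large scales.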
Anchor-Free Keypoint-Based Detection
Overview:
• Anchor-Free: Does not use predefined anchor boxes
• Detects objects by keypoints (like center points or object corners)
• Examples: CornerNet, CenterNet
Strengths:
• Eliminates the complexity of anchor design
• More flexible for varying object shapes
Weaknesses:
• May struggle with overlapping objects or cluttered scenes
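
Center-point detection can be sketched as peak picking on a heatmap: a cell counts as an object center if its score passes a threshold and no neighbouring cell scores higher. This is a simplified, CenterNet-style illustration; the function name, the threshold, and the toy heatmap are assumptions.

```python
def heatmap_peaks(heatmap, threshold=0.5):
    """Return (row, col, score) for every cell that passes the threshold
    and is at least as large as each of its (up to 8) neighbours."""
    rows, cols = len(heatmap), len(heatmap[0])
    peaks = []
    for r in range(rows):
        for c in range(cols):
            score = heatmap[r][c]
            if score < threshold:
                continue
            neighbours = [
                heatmap[rr][cc]
                for rr in range(max(0, r - 1), min(rows, r + 2))
                for cc in range(max(0, c - 1), min(cols, c + 2))
                if (rr, cc) != (r, c)
            ]
            if all(score >= n for n in neighbours):
                peaks.append((r, c, score))
    return peaks


# Hypothetical 4x4 center heatmap produced by a network.
heatmap = [
    [0.1, 0.2, 0.1, 0.0],
    [0.2, 0.9, 0.3, 0.1],
    [0.1, 0.3, 0.2, 0.6],
    [0.0, 0.1, 0.4, 0.7],
]
peaks = heatmap_peaks(heatmap)
print(peaks)
```

The cell scoring 0.6 is suppressed because its neighbour scores 0.7. The same mechanism can merge two genuinely distinct objects whose centers are adjacent, which matches the weakness with overlapping objects and clutter.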
Anchor-Free Anchor-Point-Based Detection
Overview:
• Instead of using predefined anchor boxes, feature-map locations act as anchor points that predict the box directly (in FCOS, the distances to the four box sides)
• Faster and simpler as it removes the need for a predefined grid of anchors
• Examples: FCOS, CrossDet
Strengths:
• Improves the efficiency of object detection
• Reduces false positives from misaligned anchors
Weaknesses:
• May lose some localization accuracy compared to anchor-based models
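
The anchor-point idea can be sketched with the box encoding used by FCOS, where a point predicts its distances to the left, top, right, and bottom sides of the box. The helper names below are hypothetical, and the sketch leaves out the per-level size ranges and center-ness score that the full method uses.

```python
def encode_ltrb(point, box):
    """Distances from a point to the left, top, right and bottom box sides."""
    px, py = point
    x_min, y_min, x_max, y_max = box
    return (px - x_min, py - y_min, x_max - px, y_max - py)


def decode_ltrb(point, distances):
    """Recover (x_min, y_min, x_max, y_max) from a point and its distances."""
    px, py = point
    left, top, right, bottom = distances
    return (px - left, py - top, px + right, py + bottom)


def is_positive(point, box):
    """A point is a training positive only if it lies strictly inside the box."""
    return all(d > 0 for d in encode_ltrb(point, box))


box = (30, 20, 90, 70)
target = encode_ltrb((50, 40), box)
print(target, decode_ltrb((50, 40), target), is_positive((10, 10), box))
```

No anchor shapes, scales, or aspect ratios appear anywhere in this encoding, which is the simplification the slide refers to.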
DETR (DEtection TRansformer)
Overview:
• Transformer-based approach for object detection
• Uses Transformers to model object detection as a set prediction problem
• No need for non-maximum suppression or anchor boxes
Strengths:
• Simplified architecture
• Strong performance in detecting objects with complex relationships
Weaknesses:
• Requires large amounts of data and computational resources
• Slower convergence compared to CNN-based models
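
Set prediction means each ground-truth object is matched to exactly one prediction before the loss is computed. The sketch below finds that one-to-one matching by brute force over permutations and uses only an L1 box distance as the cost. Both are simplifications assumed for illustration: DETR combines class, L1, and generalized IoU terms in its cost, and practical implementations typically use the Hungarian algorithm (for example `scipy.optimize.linear_sum_assignment`) instead of enumeration.

```python
from itertools import permutations


def l1_cost(pred, target):
    """Sum of absolute coordinate differences between two boxes."""
    return sum(abs(p - t) for p, t in zip(pred, target))


def match_predictions(preds, targets):
    """Return (assignment, cost) where assignment[j] is the index of the
    prediction matched to target j and the total cost is minimal.
    Requires len(preds) >= len(targets)."""
    best_cost, best_assignment = float("inf"), None
    for assignment in permutations(range(len(preds)), len(targets)):
        cost = sum(l1_cost(preds[i], targets[j])
                   for j, i in enumerate(assignment))
        if cost < best_cost:
            best_cost, best_assignment = cost, assignment
    return best_assignment, best_cost


# Hypothetical boxes as (cx, cy, w, h) in normalized coordinates.
preds = [(0.5, 0.5, 0.2, 0.2), (0.1, 0.1, 0.1, 0.1), (0.8, 0.2, 0.3, 0.3)]
targets = [(0.8, 0.25, 0.3, 0.3), (0.5, 0.5, 0.25, 0.2)]
assignment, cost = match_predictions(preds, targets)
print(assignment)
```

The prediction left unmatched (index 1 here) is trained to output "no object". Because every object claims exactly one prediction, duplicates are penalized during training and no non-maximum suppression step is needed afterwards.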
ViT (Vision Transformer)
Overview:
• Vision Transformer applies the transformer architecture (originally for NLP) to image data
• Divides images into patches and processes them like sequences of words
Strengths:
• Strong performance for large datasets
• Captures long-range dependencies in images
Weaknesses:
• Requires substantial training data
• Computationally expensive
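
The patch step can be sketched on a tiny single-channel image stored as a nested list. The function name `image_to_patches` is hypothetical, and the sketch stops before the linear projection and position embeddings that a full Vision Transformer applies to each patch.

```python
def image_to_patches(image, patch_size):
    """Split an H x W image (list of rows) into flattened
    patch_size x patch_size patches, in row-major order."""
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image sides must be divisible by patch_size")
    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            patches.append([
                image[y][x]
                for y in range(top, top + patch_size)
                for x in range(left, left + patch_size)
            ])
    return patches


# Toy 4x4 image holding the values 0..15, cut into 2x2 patches.
image = [[r * 4 + c for c in range(4)] for r in range(4)]
patches = image_to_patches(image, 2)
print(patches)
```

Each flattened patch plays the role of one word in the sequence. A 224x224 image with 16x16 patches gives 14 x 14 = 196 tokens, and self-attention compares every token with every other, which is one source of the computational cost listed above.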
Comparison of 2-D Object Detection Models

RCNN: • Accurate but slow

Faster RCNN: • Balanced speed and accuracy

YOLO: • Real-time but less accurate

SSD: • More accurate than YOLO on small objects, but less accurate than Faster RCNN

Anchor-Free Methods: • Flexible and efficient, but struggle with clutter

DETR: • Simplified, no anchors, but computationally heavy

ViT: • Best for large-scale data but expensive to train
