Object Detection Using Overfeat

The document discusses object detection using a convolutional neural network. It describes Overfeat, which uses a pretrained model, sliding window detection, image pyramids, and fully connected layers as convolutions. Overfeat was the winner of the localization task at ILSVRC2013 and used these techniques along with non-maximum suppression for detection.

Uploaded by

Sprout Gigs

Object Detection using Convolutional Neural Network

Overfeat
Idea 4: Sliding window (spatial output) + Pretrained Model + Image Pyramid + FC as convnet

1. Train a multi-class classification and localization model - to understand the generalization property of the bounding box (Bbox)

2. Use it as a pretrained model for the object detection task - multi-class detection

3. Sliding window - to detect multiple instances of an object

4. Image pyramid - to detect objects of varying size

5. FC as convnet - to overcome the CNN constraint (fixed input size)

6. NMS - for the final prediction

[Figure: illustration from the Overfeat paper]
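
The "FC as convnet" step can be illustrated with a small sketch in plain Python. It is not the Overfeat implementation: it assumes a single-channel feature map, and the names `fc_forward` and `fc_as_conv` are made up for this example. It shows that a fully connected layer trained on a k x k input can be slid over a larger map like a k x k convolution, which yields one output vector per window position (the spatial output).

```python
def fc_forward(patch, weights, bias):
    """Fully connected layer on one k x k patch (single channel, an assumption).

    weights holds one k x k weight grid per output unit, bias one value per unit.
    """
    k = len(patch)
    return [
        b + sum(w[r][c] * patch[r][c] for r in range(k) for c in range(k))
        for w, b in zip(weights, bias)
    ]


def fc_as_conv(feature_map, weights, bias):
    """Apply the same FC weights as a k x k convolution (stride 1, no padding).

    Returns a grid with (H - k + 1) rows and (W - k + 1) columns; each cell
    holds the FC output vector for the window whose top-left corner is there.
    """
    k = len(weights[0])
    height, width = len(feature_map), len(feature_map[0])
    return [
        [
            fc_forward([row[j:j + k] for row in feature_map[i:i + k]], weights, bias)
            for j in range(width - k + 1)
        ]
        for i in range(height - k + 1)
    ]


# Hypothetical toy values: two output units over 2 x 2 windows of a 3 x 4 map.
weights = [[[1, 0], [0, 1]], [[1, 1], [1, 1]]]
bias = [0, 1]
feature_map = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]]
spatial_output = fc_as_conv(feature_map, weights, bias)  # 2 x 3 grid of vectors
```

A real network would compute all positions in one convolution pass and share the overlapping computation; the explicit loop here only makes the equivalence visible.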

Overfeat - Testing
Idea 4: Sliding window (spatial output) + Pretrained Model + Image Pyramid + FC as convnet

[Figure: an image pyramid and a sliding window feed the pretrained localization model (VGG 16 feature extraction, with FC1 and FC2 applied as convnet), which outputs classification and regression.]
Overfeat - Experiment

• Winner of the localization task of the ImageNet Large Scale Visual Recognition Challenge 2013 (ILSVRC2013)

Paper link: https://arxiv.org/abs/1312.6229

Overfeat - Experiment - Classification and Localization - Training

• ImageNet 2012 dataset

• 1000 classes

• Classification and localization tasks trained on a modified AlexNet

• ILSVRC 2013 winner of the classification and localization task

• 3rd rank in the detection task

[Figure: example ImageNet classes (Vehicle, Dog, Craft)]
Overfeat - Experiment - Classification and Localization - Training

[Figure: an image with its bounding box from the ImageNet 2012 dataset (1000 classes) goes through the modified AlexNet (feature extraction, FC1, FC2), which has two outputs: classification (class k) and bounding box regression (localization).]

• ILSVRC 2013 winner of the classification and localization task

Overfeat - Experiment - Object Detection - Training

• Training without a background class leads to a lot of false positive (FP) predictions.

• To avoid FP, one additional class is used.

• Training data for the background class is taken randomly from regions where no object is present.

• Training of classification and localization is done on 20 + 1 classes.

• Base dimension for training is 245 x 245.

[Figure: 245 x 245 training input to the model, which has classification and regression (localization) outputs and is trained on C + 1 classes (+ 1 for background)]
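
One way to collect background training data is sketched below in plain Python. The slides do not say how object-free regions are chosen, so the zero-overlap test, the retry limit, and the function names are all assumptions made for illustration.

```python
import random


def boxes_overlap(a, b):
    """True if boxes given as (x1, y1, x2, y2) share any area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def sample_background_crop(img_w, img_h, object_boxes, size=245,
                           max_tries=100, rng=random):
    """Pick a random size x size crop that touches no annotated object.

    Returns (x1, y1, x2, y2), or None if no object-free crop was found
    within max_tries attempts (assumed policy, not from the slides).
    """
    if img_w < size or img_h < size:
        return None
    for _ in range(max_tries):
        x1 = rng.randint(0, img_w - size)
        y1 = rng.randint(0, img_h - size)
        crop = (x1, y1, x1 + size, y1 + size)
        if not any(boxes_overlap(crop, box) for box in object_boxes):
            return crop
    return None


# Hypothetical 800 x 600 image with one annotated object in the top-left corner.
crop = sample_background_crop(800, 600, [(0, 0, 200, 200)], rng=random.Random(0))
```

Crops returned this way would be labelled with the extra background class, giving the C + 1 class training set described above.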
Overfeat - Experiment - Object Detection - Inference

• Pretrained classification and localization network trained on C + 1 classes

• Spatial output is computed over an image pyramid; the base dimension of a prediction is 245 x 245

• 6-scale image pyramid with a 1:2 resolution factor

• FC as ConvNet

• Resolution / subsampling ratio / effective stride = 36
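
The relation between input size, base dimension, and effective stride can be sketched as simple window counting. This is a simplified formula assumed for illustration (the function name is hypothetical); rounding inside the real network's pooling layers can give different values for some input sizes.

```python
def spatial_output_size(height, width, base=245, stride=36):
    """Approximate count of base x base window positions along each axis.

    Assumes one prediction per full window, moved by `stride` pixels.
    """
    if height < base or width < base:
        raise ValueError("input is smaller than the base dimension")
    rows = (height - base) // stride + 1
    cols = (width - base) // stride + 1
    return rows, cols


base_scale = spatial_output_size(245, 245)    # a single prediction
larger_scale = spatial_output_size(389, 461)  # a grid of predictions
```

With the base input the network produces a single prediction; larger pyramid scales produce a grid of predictions, one per window position.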

Overfeat - Experiment - Resolution

Spatial output for 1 class only:

Resolution / subsampling ratio / effective stride    Input size    Spatial output
36                                                   389 x 461     5 x 7
36                                                   317 x 386     3 x 5
18                                                   317 x 386     6 x 10
Overfeat - Experiment - Image Pyramid - Spatial Output
6-scale image pyramid with a 1:2 resolution factor (resolution / effective stride = 36)

Input size    Spatial output
245 x 245     1 x 1 x (C+1)
281 x 317     2 x 3 x (C+1)
317 x 386     3 x 5 x (C+1)
389 x 461     5 x 7 x (C+1)
425 x 497     6 x 7 x (C+1)
464 x 569     7 x 10 x (C+1)

Credit: https://www.youtube.com/watch?v=9I6nzfx_kpE&list=PL1GQaVhO4f_jLxOokW7CS5kY_J1t1T17S&ab_channel=Cogneethi
Overfeat - Experiment - Object Detection - Inference
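Spatial output over all 6 scales (resolution / effective stride = 36):

1x1 + 2x3 + 3x5 + 5x7 + 6x7 + 7x10 = 169 positions, each with C+1 class scores

[Figure: the 245 x 245 windows from all scales are reduced by NMS]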
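
The count of candidate windows can be checked with a few lines of Python. The grid sizes are the six spatial outputs listed in these slides, including the 1 x 1 output of the 245 x 245 base scale; the function name and the choice of C = 20 are only for illustration.

```python
def count_candidates(spatial_outputs, num_classes):
    """Return (window positions, class scores) summed over all pyramid scales."""
    positions = sum(rows * cols for rows, cols in spatial_outputs)
    return positions, positions * (num_classes + 1)  # +1 for background


pyramid_outputs = [(1, 1), (2, 3), (3, 5), (5, 7), (6, 7), (7, 10)]
positions, scores = count_candidates(pyramid_outputs, num_classes=20)
# positions == 169, scores == 169 * 21 == 3549
```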

Overfeat - Experiment - Object Detection - Inference
[Figure: each query image of the pyramid goes through the modified AlexNet classification and localization model, which produces a classification output (class scores for C+1 classes) and a bounding box regression output (4 coordinates per class).]

Input size    Classification output    Regression output
245 x 245     1 x 1 x (C+1)            1 x 1 x 4 x (C+1)
281 x 317     2 x 3 x (C+1)            2 x 3 x 4 x (C+1)
317 x 386     3 x 5 x (C+1)            3 x 5 x 4 x (C+1)
389 x 461     5 x 7 x (C+1)            5 x 7 x 4 x (C+1)
425 x 497     6 x 7 x (C+1)            6 x 7 x 4 x (C+1)
464 x 569     7 x 10 x (C+1)           7 x 10 x 4 x (C+1)

Credit: https://www.youtube.com/watch?v=9I6nzfx_kpE&list=PL1GQaVhO4f_jLxOokW7CS5kY_J1t1T17S&ab_channel=Cogneethi
[Figure: a query input (resolution = 36) goes through the model; the classification output gives a confidence and the regression output gives a bounding box (box resolution = 12).]

Credit: https://arxiv.org/abs/1312.6229
Overfeat - Experiment - Object Detection - Inference

[Figure: spatial outputs are passed through NMS to produce the final prediction.]

Credit: https://arxiv.org/abs/1312.6229
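
The slides name NMS without specifying a variant. Below is a minimal sketch of the common greedy, IoU-based form in plain Python; the 0.5 threshold, the (x1, y1, x2, y2) box format, and the function names are assumptions, not details taken from the slides or the paper.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, iou_threshold=0.5):
    """Greedy NMS: keep the best box, drop boxes overlapping it, repeat.

    Returns the indices of the kept boxes, highest score first.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        order = [i for i in order if iou(boxes[best], boxes[i]) <= iou_threshold]
    return keep


# Hypothetical detections: two overlapping boxes and one separate box.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
scores = [0.9, 0.8, 0.7]
kept = nms(boxes, scores)  # the second box is suppressed by the first
```

In a multi-class detector this would typically be run per class, so that boxes of different classes do not suppress each other.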
Overfeat - Experiment - Object Detection - Research Paper Result

• 3rd Rank for Detection task

Paper link: https://arxiv.org/abs/1312.6229

Overfeat - Object Detection - Drawbacks
[Figure: the model's predictions include the background class]

Overfeat - Object Detection - Drawbacks

• Each inference takes 2 seconds

• Computationally inefficient and expensive (2013)

• A lot of background region is processed unnecessarily because of the sliding window (dense sampling) approach

• The sliding window approach creates a lot of FP, and therefore a lower mAP

• Can we find a way to look only at those regions where background is not present?

• Can we get more accurate predictions and increase mAP?

• RPNN

Credit: https://www.pyimagesearch.com/2020/06/29/opencv-selective-search-for-object-detection/
