
ECE 685D HW3

Submission Instructions:
1. Upload your Jupyter Notebook (.ipynb file).
2. Export all outputs and necessary derivations as one or multiple PDF files and upload them.
LLM policy:
The use of large language models (LLMs) is not allowed for this assignment.

1 Problem 1: Object Detection with Convolutional Neural Networks (CNNs)
In this task, you will train a CNN model for a simplified human-face detection problem. The dataset
contains 2204 images and a .csv file of bounding box coordinates. It can be downloaded from
https://www.kaggle.com/datasets/sbaghbidi/human-faces-object-detection/data.
Please divide your dataset into training and testing sets with a ratio of 4:1. After training, evaluate the
model's performance on the test set.

1.1 Data Preprocessing (15 pts)


Typically, bounding box annotations for object detection tasks are provided in formats like .xml, .json,
or .csv. Your task is to implement a custom dataset class for object detection by inheriting from
the torch.utils.data.Dataset class. The dataset should return an image along with its corresponding
bounding box when the __getitem__ function is called. Additionally, spatial transformations (e.g.,
random affine transformations, random elastic deformations) should be applied to both the image
and its bounding box as part of the function call. You may use any library to accomplish this task.
After implementing the dataset, sample from the dataset you implemented and visualize the bounding
boxes overlaid on the images. Verify that the bounding boxes are correctly positioned and that any
spatial transformations are applied consistently to both the images and their respective bounding boxes.
Sample and plot twice to make sure that your augmentations are indeed random and consistent.
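
One possible implementation is sketched below, using Albumentations to keep the image and box warps in sync. The CSV column names (image_name, x0, y0, x1, y1) and the 224x224 resize are assumptions for illustration; check the downloaded file and adjust.

    # Sketch of a paired image/box dataset. Column names below are
    # assumptions; verify them against the actual .csv file.
    import os
    import numpy as np
    import pandas as pd
    import torch
    from PIL import Image
    from torch.utils.data import Dataset
    import albumentations as A
    from albumentations.pytorch import ToTensorV2

    class FaceDetectionDataset(Dataset):
        def __init__(self, csv_path, image_dir, train=True):
            self.df = pd.read_csv(csv_path)
            self.image_dir = image_dir
            # Declaring bbox_params makes Albumentations apply the same
            # spatial warp to the box that it applies to the pixels.
            self.transform = A.Compose(
                [
                    A.Resize(224, 224),
                    A.Affine(scale=(0.9, 1.1), translate_percent=0.05,
                             rotate=(-10, 10), p=0.5 if train else 0.0),
                    A.Normalize(),  # ImageNet mean/std by default
                    ToTensorV2(),
                ],
                bbox_params=A.BboxParams(format="pascal_voc",
                                         label_fields=["labels"]),
            )

        def __len__(self):
            return len(self.df)

        def __getitem__(self, idx):
            row = self.df.iloc[idx]
            img = np.array(Image.open(
                os.path.join(self.image_dir, row["image_name"])).convert("RGB"))
            box = [row["x0"], row["y0"], row["x1"], row["y1"]]
            # Mild affine parameters keep the face box inside the frame;
            # a box pushed fully outside would be dropped from out["bboxes"].
            out = self.transform(image=img, bboxes=[box], labels=[0])
            x0, y0, x1, y1 = out["bboxes"][0]
            # Normalize coordinates to [0, 1] so the regression targets
            # are independent of the resize resolution.
            target = torch.tensor([x0, y0, x1, y1], dtype=torch.float32) / 224.0
            return out["image"], target

Sampling the same index twice and plotting both results is a quick check that the augmentation is random yet stays consistent between image and box.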

1.2 Object Detection with Pre-trained Feature Extractor (25 pts)


Once your dataset is ready, you can download a pre-trained classifier (e.g., one trained on ImageNet)
from any source to use as your feature extraction backbone. This backbone will be used to extract latent
representations from your images. These latent representations will then serve as the input to a module
that you will train to predict the coordinates (or transformed coordinates) of the bounding boxes. Note
that the weights of the feature extraction backbone should remain frozen and must not be modified
during this process. Evaluate the performance of your bounding box regression network on the test set
using IoU (see Fig. 1) as the metric, and plot a few examples of your output.
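
As a concrete illustration, a frozen torchvision ResNet-18 with a small regression head might look like the sketch below; the backbone choice and head sizes are assumptions, not requirements.

    # Frozen-backbone box regressor; ResNet-18 is one choice of backbone.
    import torch
    import torch.nn as nn
    from torchvision import models

    class BoxRegressor(nn.Module):
        def __init__(self):
            super().__init__()
            backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
            # Drop the classification layer; keep conv stages + global pool,
            # which yield a 512-d feature vector per image.
            self.backbone = nn.Sequential(*list(backbone.children())[:-1])
            for p in self.backbone.parameters():
                p.requires_grad = False  # frozen, as Section 1.2 requires
            self.head = nn.Sequential(
                nn.Flatten(),
                nn.Linear(512, 256),
                nn.ReLU(),
                nn.Linear(256, 4),
                nn.Sigmoid(),  # matches boxes normalized to [0, 1]
            )

        def forward(self, x):
            return self.head(self.backbone(x))

With this setup, pass only model.head.parameters() to the optimizer and keep the backbone in eval() mode during training so BatchNorm running statistics stay frozen as well.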

1.3 Object Detection with End-to-End Fine-Tuning (15 pts)


Next, unfreeze the weights of your feature extraction network and ensure that the feature extraction
backbone and the bounding box regression module are connected with gradients. Train both the backbone
and the bounding box prediction module together and observe whether there is any improvement in model
performance. Finally, report the Intersection over Union (IoU) for the model and visualize a few
examples of your bounding box predictions.
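
Continuing the sketch from Section 1.2, the switch to end-to-end fine-tuning only requires unfreezing the backbone and rebuilding the optimizer over all parameters; the reduced learning rate below is a common heuristic, not a value prescribed by the assignment.

    # Unfreeze the backbone so gradients flow end-to-end.
    for p in model.backbone.parameters():
        p.requires_grad = True
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-5)  # assumed lr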

Note: For simplicity, most images in this dataset are selected to contain just one object
of interest. It is acceptable for the bounding box regressor to predict only one bounding box.
However, think carefully about how you want to represent your bounding box. If the images
are too large to fit into GPU memory, consider using torch.cuda.amp.autocast()
or reducing your image size and batch size.

[Figure 1: IoU for a bounding box]
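
As Fig. 1 indicates, IoU is the area of intersection divided by the area of union of the predicted and ground-truth boxes. A minimal batched version for axis-aligned [x0, y0, x1, y1] boxes could be:

    import torch

    def iou(box_a: torch.Tensor, box_b: torch.Tensor) -> torch.Tensor:
        # box_a, box_b: (N, 4) tensors in [x0, y0, x1, y1] order.
        x0 = torch.maximum(box_a[:, 0], box_b[:, 0])
        y0 = torch.maximum(box_a[:, 1], box_b[:, 1])
        x1 = torch.minimum(box_a[:, 2], box_b[:, 2])
        y1 = torch.minimum(box_a[:, 3], box_b[:, 3])
        # Clamp handles disjoint boxes, whose intersection is empty.
        inter = (x1 - x0).clamp(min=0) * (y1 - y0).clamp(min=0)
        area_a = (box_a[:, 2] - box_a[:, 0]) * (box_a[:, 3] - box_a[:, 1])
        area_b = (box_b[:, 2] - box_b[:, 0]) * (box_b[:, 3] - box_b[:, 1])
        return inter / (area_a + area_b - inter + 1e-8)  # eps avoids 0/0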

1.4 What if there are multiple objects? (10 pts)


In the previous dataset, we assumed that each image contains exactly one object. Would your current
model still work if there were zero objects, or multiple objects from different categories, in the images?
Propose a modification that would enable your model to handle such cases and explain how this adjustment
addresses or mitigates the issue. Cite relevant literature where applicable.

2 Problem 2: Optimizers from Scratch


2.1 Optimizer Implementation (20 pts)
In this problem, you will implement several commonly used optimizers from scratch. Train a CNN for MNIST
classification on cross-entropy loss with L1-regularization using the following optimizers. Show the
correctness of your implementation by reporting the classification accuracy of your model after training.

1. Momentum method with parameter β = 0.9 (5 pts)

2. Nesterov's Accelerated Gradient (NAG) with parameter β = 0.95 (5 pts)

3. RMSprop with parameters β = 0.95, γ = 1, and ϵ = 10⁻⁸ (10 pts)

4. Adam with parameters β1 = 0.9, β2 = 0.999, and ϵ = 10⁻⁸ (10 pts)


Note: You may use the Autograd package from PyTorch to compute the gradients when building your
optimizers. However, you are NOT allowed to use any built-in optimizers.
Hint: Kingma et al. state in [1] an alternative implementation of Adam that has lower clarity but
higher computational efficiency (see the last paragraph before Section 2.1 of that paper).
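
For concreteness, one way to structure a from-scratch optimizer is sketched below for Adam, following the update rule in [1]: gradients come from autograd, but no torch.optim machinery is used. The other three optimizers fit the same skeleton with different per-parameter state. The L1 penalty is easiest to handle by adding λ·Σ|w| directly to the loss so autograd differentiates it.

    import torch

    class MyAdam:
        def __init__(self, params, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
            self.params = [p for p in params if p.requires_grad]
            self.lr, self.b1, self.b2, self.eps = lr, beta1, beta2, eps
            # First and second moment estimates, one slot per parameter.
            self.m = [torch.zeros_like(p) for p in self.params]
            self.v = [torch.zeros_like(p) for p in self.params]
            self.t = 0  # step counter for bias correction

        def zero_grad(self):
            for p in self.params:
                if p.grad is not None:
                    p.grad.zero_()

        @torch.no_grad()
        def step(self):
            self.t += 1
            for p, m, v in zip(self.params, self.m, self.v):
                if p.grad is None:
                    continue
                m.mul_(self.b1).add_(p.grad, alpha=1 - self.b1)
                v.mul_(self.b2).addcmul_(p.grad, p.grad, value=1 - self.b2)
                m_hat = m / (1 - self.b1 ** self.t)  # bias-corrected moments
                v_hat = v / (1 - self.b2 ** self.t)
                p.add_(m_hat / (v_hat.sqrt() + self.eps), alpha=-self.lr)

The training loop then mirrors the familiar API: opt.zero_grad(); loss.backward(); opt.step().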

2.2 Comparing Optimizer Efficiency (15 pts)


After implementing your optimizers, set your batch size to each of {4, 8, 16, 32}. Then compare the changes
in training and validation loss for each optimizer at several learning rates (chosen at your discretion)
over the course of training, creating one set of plots per batch size. A sketch of the experiment loop follows.
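
In the sketch below, build_cnn and run_training are hypothetical helpers standing in for your model constructor and training routine (the latter assumed to return per-epoch training and validation losses), and the learning-rate grid is only an example.

    import matplotlib.pyplot as plt

    batch_sizes = [4, 8, 16, 32]
    lrs = [1e-1, 1e-2, 1e-3]  # example grid; choose your own
    optimizers = {"momentum": MyMomentum, "nag": MyNAG,   # hypothetical classes
                  "rmsprop": MyRMSprop, "adam": MyAdam}   # built like MyAdam

    for bs in batch_sizes:
        fig, ax = plt.subplots()
        for name, opt_cls in optimizers.items():
            for lr in lrs:
                model = build_cnn()  # hypothetical model constructor
                opt = opt_cls(model.parameters(), lr=lr)
                train_loss, val_loss = run_training(model, opt, batch_size=bs)
                ax.plot(val_loss, label=f"{name}, lr={lr}")
        ax.set_xlabel("epoch")
        ax.set_ylabel("validation loss")
        ax.set_title(f"batch size {bs}")
        ax.legend()
        fig.savefig(f"optimizers_bs{bs}.png")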

References
[1] Diederik P. Kingma and Jimmy Ba. "Adam: A Method for Stochastic Optimization". arXiv preprint arXiv:1412.6980 (2014).
