Object Classification using CNN
Lalit Jain https://www.linkedin.com/in/lalit7jain/
Rohit Agarwal https://www.linkedin.com/in/rohitag04/
Shivam Goel https://www.linkedin.com/in/goelshi/
Hina Gandhi https://www.linkedin.com/in/hina-gandhi-52834356/
Big Data & Engineering Systems
Under the guidance of Professor Dino Konstantopoulos
Agenda
1. Case Study
2. Building a Convolutional Neural Network
3. Time Complexity
4. Using VGG16 model
5. Demo
6. Results
7. Limitations & Challenges
8. Future Scope
Case Study
Build a CNN model that can be trained on
the fly to classify objects
Approach
1. Use the Google API to download 100 images for a keyword entered in the front end
2. Prepare the images by cleaning and augmentation
3. Train a Convolutional Neural Network on the downloaded images
using Keras and TensorFlow
4. Classify the image
Technology Stack:
Programming Language: Python
Front End: Django Framework
Machine Specifications:
Libraries: Keras, Tensorflow, VGG16, etc
Convolutional Neural Network
Images are a matrix of pixel values
Convolution Operation
Relu Activation
Pooling (Max)
Putting it all together
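The three building blocks above can be sketched in a few lines of plain Python. This is only an illustration, not the project's Keras code: the 5x5 image, the 3x3 vertical-edge kernel, and the function names are assumptions chosen for the example, and the convolution uses no padding and a stride of 1.

```python
def convolve2d(image, kernel):
    """Slide the kernel over the image (no padding, stride 1) and sum the products."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + a][j + b] * kernel[a][b]
                for a in range(kh) for b in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def relu(feature_map):
    """Replace every negative value with 0."""
    return [[max(0, v) for v in row] for row in feature_map]


def max_pool(feature_map, size=2):
    """Keep the largest value of each non-overlapping size x size window.

    Trailing rows and columns that do not fill a whole window are dropped.
    """
    h = len(feature_map) // size * size
    w = len(feature_map[0]) // size * size
    return [
        [
            max(feature_map[i + a][j + b]
                for a in range(size) for b in range(size))
            for j in range(0, w, size)
        ]
        for i in range(0, h, size)
    ]


# Hypothetical 5x5 single-channel image and 3x3 vertical-edge kernel.
image = [
    [1, 1, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 1, 1, 1],
    [0, 0, 1, 1, 0],
    [0, 1, 1, 0, 0],
]
kernel = [
    [-1, 0, 1],
    [-1, 0, 1],
    [-1, 0, 1],
]

feature_map = convolve2d(image, kernel)  # [[2, 0, -2], [3, 2, -2], [3, 1, -2]]
activated = relu(feature_map)            # [[2, 0, 0], [3, 2, 0], [3, 1, 0]]
pooled = max_pool(activated)             # [[3]]
```

A real CNN layer learns the kernel values during training and applies many kernels per layer; the data flow per kernel is the same as here.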
About the Dataset
We used two datasets:
Static Model:
The first dataset covers natural calamities; we trained our
models on 4 classes: “earthquake”, “hurricane”, “tornado” and “volcanic
eruption”.
Dynamic Model:
The second dataset covers personality identification and works on the
fly. During the demo, we will add one personality on the fly in addition to the
existing classes (Donald Trump, Morgan Freeman).
Use case and Scope
1. Using the static model, identify one of the 4 natural calamities
2. Using the trained dynamic model, identify the personality
Out of Scope / Limitations:
1. Identifying a class the model was not trained on
2. Returning 0 as output when the image matches none of the trained classes
3. Misclassification of images due to training on uncleaned images and
inaccurate labels
DEMO
Data augmentation for improving the model
By applying random transformations to our training set, we artificially enlarge our
dataset with new, unseen images.
shear_range: Randomly shear the image with an intensity of up to 0.2
zoom_range: Randomly zoom the image in or out by up to 0.2 (20%)
horizontal_flip: Randomly flip inputs horizontally
rescale: Multiply every pixel value by the rescaling factor (1./255)
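To make two of these options concrete, here is a minimal plain-Python sketch of what rescaling and a random horizontal flip do to a tiny grayscale image. It is not the Keras implementation (which also handles shear and zoom with interpolation); the function names, the `probability` parameter, and the 2x3 image are assumptions made for the illustration.

```python
import random


def rescale(image, factor=1.0 / 255):
    """Multiply every pixel by the factor, mapping 0..255 to roughly 0..1."""
    return [[pixel * factor for pixel in row] for row in image]


def random_horizontal_flip(image, rng, probability=0.5):
    """Mirror the image left to right with the given probability."""
    if rng.random() < probability:
        return [row[::-1] for row in image]
    return [row[:] for row in image]


# Hypothetical 2x3 grayscale image with pixel values in 0..255.
image = [
    [0, 51, 255],
    [102, 204, 0],
]

scaled = rescale(image)
# probability=1.0 forces the flip so the effect is visible.
flipped = random_horizontal_flip(image, random.Random(0), probability=1.0)
unchanged = random_horizontal_flip(image, random.Random(0), probability=0.0)
```

Because the flip is applied at random each time an image is drawn, the network sees a slightly different version of the same picture in different epochs.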
Building CNN from Scratch
Basic CNN model (the model that was
developed from scratch) gave ~74% accuracy.
CNN training complexity
Time: 8-9 Hours
Processor: CPU
Dataset: 2000 images
Classes: 2
Can we improve it?
VGG16 Pre-Trained Model
Citation: https://arxiv.org/pdf/1409.1556.pdf
The model achieves 92.7%
top-5 test accuracy on
ImageNet, a dataset of over
14 million images; the
benchmark subset used for this
result covers 1000 classes.
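Top-5 accuracy counts a prediction as correct when the true class is among the five highest-scoring classes. The sketch below shows the computation on a toy example; the function name, the scores, and the labels are made up for illustration and are unrelated to the VGG16 evaluation itself.

```python
def top_k_accuracy(scores, labels, k=5):
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted from highest to lowest score, truncated to k.
        top_k = sorted(range(len(row)), key=lambda c: row[c], reverse=True)[:k]
        hits += label in top_k
    return hits / len(labels)


# Hypothetical scores for 3 samples over 3 classes, with their true labels.
scores = [
    [0.10, 0.60, 0.30],
    [0.50, 0.20, 0.30],
    [0.20, 0.30, 0.50],
]
labels = [2, 1, 2]

top1 = top_k_accuracy(scores, labels, k=1)  # only the third sample is a hit
top2 = top_k_accuracy(scores, labels, k=2)  # first and third samples are hits
```

With only three classes the example uses k=1 and k=2; with 1000 classes, k=5 is the usual choice.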
Using the VGG16 Model
We created our own sequential model and added it on top of VGG16.
Improvement
Using this approach, the accuracy was improved to 87%.
Time to run the model was still 2-3 hours.
Can we improve the
time?
Using GPU machine
Running the model on an AWS instance powered by an Nvidia GK104 GPU (4 GB)
The epochs that took 1 hour to train on 140 images now execute in under a minute, as
shown in the next slide, with an accuracy of 95%
Results
Evolution of accuracy on training (blue) and validation (green) sets for 1 to 30 epochs:
CHALLENGES
1. The Google Custom Search API returns only 10 images per request.
2. Large images had to be compressed before writing them to disk.
3. The basic CNN model took more than 8 hours to train from scratch.
4. The total number of images should be the same in every class, and the batch size should be the
GCD of the total image counts in the train and test sets.
5. Installing Django and other software on the GPU instance required a lot of
configuration.
6. Tried to install Quiver Engine to visualize each convolutional layer.
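The batch-size rule in challenge 4 can be checked with a short calculation. The sketch below uses hypothetical counts of 140 training images and 60 test images; the function name is also an assumption. Choosing the greatest common divisor guarantees that both sets split into whole batches with no partial batch left over.

```python
import math


def common_batch_size(n_train, n_test):
    """Largest batch size that divides both image counts evenly."""
    return math.gcd(n_train, n_test)


# Hypothetical image counts for the train and test sets.
n_train, n_test = 140, 60

batch_size = common_batch_size(n_train, n_test)  # 20
train_steps = n_train // batch_size              # 7 full batches per epoch
test_steps = n_test // batch_size                # 3 full batches per evaluation
```

Any smaller common divisor (here 10, 5, 4, 2 or 1) also avoids partial batches; the GCD is simply the largest such value.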
Future Scope
1. Facial key point detection
2. Analysis of satellite images for disaster detection
3. Real Time Criminal Detection through Video analysis
4. And many more…
References
https://districtdatalabs.silvrback.com/modern-methods-for-sentiment-analysis
https://www.zybuluo.com/HaomingJiang/note/462804
https://xkcd.com/1425/
http://cs231n.stanford.edu/reports/2016/pdfs/267_Report.pdf
https://en.wikipedia.org/wiki/Kernel_(image_processing)
https://adeshpande3.github.io/adeshpande3.github.io/A-Beginner's-Guide-To-Understanding-Convolutional-Neural-Networks-Part-2/
http://culurciello.github.io/tech/2016/06/04/nets.html
https://en.wikipedia.org/wiki/Convolutional_neural_network
THANK YOU
Code & Report Available at: https://github.com/Lalit7Jain/ObjectClassification_CNN_Keras
