
Inception Architecture

Understanding the Inception (GoogLeNet) Architecture

Figure 1. GoogLeNet (Inception) architecture (Source: image from the original paper)

Inception Architecture & Applying It To A Real-World Dataset
Fun fact: the Inception model takes its name from a famous internet meme.
Index Of Contents
· Introduction
· The architecture of Inception V1
· How does this architecture reduce dimensionality?
· What is different in the Inception V3 network from the Inception V1 network?
· Using transfer learning (pre-trained Inception network) on an image classification problem
· References

Introduction
A more powerful deep neural network can be built by increasing the number of layers in the network.

This approach has two problems: increasing the number of layers may lead to overfitting, especially if you have limited labeled training data, and it increases the computational requirement.

Inception networks were created with the idea of increasing the capability of a deep neural network while efficiently using computational resources.
From the original paper:

We propose a deep convolutional neural network architecture codenamed “Inception”, which was responsible for setting the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC 2014). The main hallmark of this architecture is the improved utilization of the computing resources inside the network. This was achieved by a carefully crafted design that allows for increasing the depth and width of the network while keeping the computational budget constant.

Inception networks are released in versions, each with some improvement over the previous one.

The architecture of Inception V1


Consider the images of peacocks below. The area of the image occupied by the peacock varies between the two images, so selecting the right kernel size becomes a difficult choice.

A large kernel size captures the global distribution of the image, while a small kernel size captures more local information.

The Inception architecture makes it possible to use filters of multiple sizes without increasing the depth of the network.

The different filters are applied in parallel instead of being stacked sequentially one after the other.

This is known as the naive version of the Inception module. The problem with this version was its huge number of parameters. To mitigate this, the authors came up with the architecture below (a minimal sketch follows).
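
For intuition, here is a minimal Keras sketch of such a module with 1×1 reductions on the expensive branches; the input shape and filter counts are illustrative assumptions, not values from the article.

from tensorflow.keras import Input, layers

# Minimal sketch of an Inception module (filter counts are illustrative)
inputs = Input(shape=(28, 28, 192))

# 1x1 convolution branch
branch1 = layers.Conv2D(64, (1, 1), padding='same', activation='relu')(inputs)

# 1x1 reduction followed by a 3x3 convolution
branch2 = layers.Conv2D(96, (1, 1), padding='same', activation='relu')(inputs)
branch2 = layers.Conv2D(128, (3, 3), padding='same', activation='relu')(branch2)

# 1x1 reduction followed by a 5x5 convolution
branch3 = layers.Conv2D(16, (1, 1), padding='same', activation='relu')(inputs)
branch3 = layers.Conv2D(32, (5, 5), padding='same', activation='relu')(branch3)

# 3x3 max pooling followed by a 1x1 convolution
branch4 = layers.MaxPooling2D((3, 3), strides=(1, 1), padding='same')(inputs)
branch4 = layers.Conv2D(32, (1, 1), padding='same', activation='relu')(bran4 := branch4)

# All branches keep the same spatial size, so they concatenate along channels
outputs = layers.concatenate([branch1, branch2, branch3, branch4], axis=-1)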
How does this architecture reduce dimensionality?
Adding a 1×1 convolution before a 5×5 convolution reduces the number of channels of the input that reaches the 5×5 convolution, which in turn reduces the number of parameters and the computational requirement.

Let me explain with an example.
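
Here is a rough worked calculation; the channel counts are assumptions for illustration, not figures from the article.

# Parameters of a direct 5x5 convolution on a 192-channel input with 32 filters:
direct = 5 * 5 * 192 * 32                        # = 153,600 weights

# Parameters when a 1x1 convolution first reduces 192 channels to 16:
reduced = 1 * 1 * 192 * 16 + 5 * 5 * 16 * 32     # = 3,072 + 12,800 = 15,872 weights

print(direct, reduced)                           # roughly a 10x reduction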


What is different in the Inception V3 network from the Inception V1 network?
Inception V3 is an extension of the V1 module. It uses techniques like factorizing larger convolutions into smaller ones (for example, a 5×5 convolution is factorized into two 3×3 convolutions) and asymmetric factorization (for example, factorizing a 3×3 filter into a 1×3 filter followed by a 3×1 filter).
These factorizations are done with the aim of reducing the number of parameters used in every Inception module.
Below is an image of the Inception V3 module; a sketch of the factorizations follows.
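
To make the idea concrete, here is a minimal Keras sketch of both factorizations; the input shape and filter counts are assumptions for illustration.

from tensorflow.keras import Input, layers

inputs = Input(shape=(35, 35, 64))   # assumed input shape

# Factorizing a 5x5 convolution into two stacked 3x3 convolutions:
# two 3x3 kernels cover the same 5x5 receptive field with
# 2 * (3*3) = 18 weights per channel pair instead of 5*5 = 25.
x = layers.Conv2D(64, (3, 3), padding='same', activation='relu')(inputs)
x = layers.Conv2D(64, (3, 3), padding='same', activation='relu')(x)

# Asymmetric factorization of a 3x3 convolution into 1x3 followed by 3x1:
# 3 + 3 = 6 weights per channel pair instead of 3*3 = 9.
y = layers.Conv2D(64, (1, 3), padding='same', activation='relu')(inputs)
y = layers.Conv2D(64, (3, 1), padding='same', activation='relu')(y)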
Using transfer learning (pre-trained Inception network) on an image classification problem
I will solve the same problem we solved in the last article (link at the start of this article) to compare the performance of a vanilla CNN with that of a pre-trained Inception network.

In case you have not read the previous article: we are trying to classify images into 6 different classes, the training data is fairly balanced, and with a convolutional neural network we were able to achieve a validation accuracy of 77%.

Let us now use an Inception model and train only the new layers we add on top of it, as below.
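
The train_generator and validation_generator used in the training step come from the previous article's data pipeline. As a minimal sketch of how they might be set up (the directory paths and batch size are placeholder assumptions):

from tensorflow.keras.preprocessing.image import ImageDataGenerator

# Rescale pixel values to [0, 1]
train_datagen = ImageDataGenerator(rescale=1./255)
validation_datagen = ImageDataGenerator(rescale=1./255)

train_generator = train_datagen.flow_from_directory(
    'data/train',                # placeholder path to the training images
    target_size=(150, 150),      # matches the InceptionV3 input size below
    batch_size=32,
    class_mode='categorical')    # 6 classes, one-hot labels

validation_generator = validation_datagen.flow_from_directory(
    'data/validation',           # placeholder path to the validation images
    target_size=(150, 150),
    batch_size=32,
    class_mode='categorical')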

# Import necessary libraries from TensorFlow
from tensorflow.keras.applications.inception_v3 import InceptionV3
from tensorflow.keras.optimizers import RMSprop
from tensorflow.keras import layers
from tensorflow.keras import Model

# Path to the pre-trained InceptionV3 weights file (without the top classification layer)
local_weights_file = '../input/inception-weights/inception_v3_weights_tf_dim_ordering_tf_kernels_notop.h5'

# Initialize the InceptionV3 model without the top layer (for feature extraction)
pre_trained_model = InceptionV3(input_shape=(150, 150, 3),  # input image size (150x150, 3 color channels)
                                include_top=False,          # exclude the top (classification) layer
                                weights=None)               # do not load default weights initially

# Load the pre-trained weights into the model from the file
pre_trained_model.load_weights(local_weights_file)

# Freeze the pre-trained layers so they are not updated during training
for layer in pre_trained_model.layers:
    layer.trainable = False

# Take the output of the 'mixed7' convolutional layer as the feature map
last_layer = pre_trained_model.get_layer('mixed7')
print('last layer output shape: ', last_layer.output_shape)
last_output = last_layer.output

# Add custom layers on top of the pre-trained model for classification
x = layers.Flatten()(last_output)               # flatten the 3D output of 'mixed7' to a 1D vector
x = layers.Dense(1024, activation='relu')(x)    # fully connected layer with 1024 neurons and ReLU activation
x = layers.Dropout(0.2)(x)                      # dropout with a 20% rate to prevent overfitting
x = layers.Dense(6, activation='softmax')(x)    # output layer with 6 units, one per class

# Create the final model from the pre-trained input and the custom layers on top
model = Model(pre_trained_model.input, x)

# Compile the model with the RMSprop optimizer, categorical cross-entropy loss, and accuracy metric
model.compile(optimizer=RMSprop(learning_rate=0.0001),  # 'learning_rate' replaces the deprecated 'lr' argument
              loss='categorical_crossentropy',          # loss for multi-class classification
              metrics=['acc'])                          # track accuracy during training

# Train the model using the training and validation data generators
history = model.fit(train_generator,                       # training data generator
                    epochs=10,                             # train for 10 epochs
                    validation_data=validation_generator)  # validation data for evaluation

We were able to get a validation accuracy of 90% by using the above architecture!
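
As a quick sanity check, the validation accuracy can be read back from the history object after training; a minimal sketch, assuming the fit call above has completed:

# The key is 'val_acc' because the model was compiled with metrics=['acc']
val_acc = history.history['val_acc']
print('best validation accuracy: {:.2%}'.format(max(val_acc)))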
