
3.4 Architecture of Inception-ResNetV2
In this process, feature extraction is performed using a modified CNN (Inception-ResNetV2). The initial layers of the Inception-ResNetV2 model are designed to extract low-level features such as dots, lines, and edges. The deeper layers of the network then extract mid-level features such as sharpness, texture, and shadowing in specific portions of the image. Finally, the deepest layers extract high-level features such as shape from the rice leaf image in order to detect the presence of disease.
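This low/mid/high-level hierarchy can be inspected directly. The sketch below (not the authors' code) uses PyTorch forward hooks on a tiny stand-in network; the layer indices and the 299×299 input size are illustrative assumptions:

```python
# Illustrative sketch: capturing feature maps at different depths of a CNN
# with PyTorch forward hooks. The tiny network below is a hypothetical
# stand-in for the early/middle/deep stages of Inception-ResNetV2.
import torch
import torch.nn as nn

net = nn.Sequential(
    nn.Conv2d(3, 32, 3, stride=2),    # early layer: low-level features (edges, dots)
    nn.ReLU(),
    nn.Conv2d(32, 64, 3, stride=2),   # middle layer: mid-level features (texture)
    nn.ReLU(),
    nn.Conv2d(64, 128, 3, stride=2),  # deep layer: high-level features (shape)
    nn.ReLU(),
)

features = {}

def save_output(name):
    def hook(module, inputs, output):
        features[name] = output.detach()
    return hook

net[0].register_forward_hook(save_output("low"))
net[2].register_forward_hook(save_output("mid"))
net[4].register_forward_hook(save_output("high"))

x = torch.randn(1, 3, 299, 299)       # Inception-style 299x299 RGB input
net(x)
for name, fmap in features.items():
    print(name, tuple(fmap.shape))    # spatial size shrinks, depth grows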
Inception-ResNetV2 is a convolutional neural architecture that builds on the Inception family of architectures but incorporates residual connections (replacing the filter-concatenation stage of the Inception architecture). It uses a new Inception module, called the Inception-ResNet module, which combines the benefits of both Inception and residual networks. These Inception-ResNet modules allow for a deeper network with fewer parameters and better performance. The network also uses a batch normalization layer after each convolutional layer, which improves its stability and performance.
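A minimal sketch of this convolution-then-batch-normalization pattern, written here in PyTorch (the helper name and default arguments are our own, not from the paper):

```python
# Minimal conv -> batch norm -> ReLU unit, as described above.
import torch.nn as nn

def conv_bn(in_ch, out_ch, kernel_size, stride=1, padding=0):
    """Convolution followed by batch normalization and an activation."""
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size, stride=stride,
                  padding=padding, bias=False),  # bias is redundant before BN
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
    )
```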

The Inception-ResNetV2 architecture combines two recent deep-learning ideas: residual connections and the Inception architecture. This hybrid deep learning model has the advantages of a residual network while retaining the unique characteristics of the multi-convolutional core of the Inception network. The Inception-ResNet network is thus a hybrid inspired both by Inception and by the performance of ResNet. The operations involved in the Inception-ResNetV2-based model are explained in the following subsections.

3.4.1 Inception
Conventional convolutional neural networks typically use convolutional and pooling layers to extract features from the input data. However, such networks are limited in capturing local and global features simultaneously, as they typically focus on one or the other. An Inception module is a building block used in the Inception network architecture for CNNs, intended to solve the problem of learning a combination of local and global features from the input data. The idea behind the Inception module is to learn a variety of feature maps at different scales and concatenate them into a more comprehensive representation of the input. This allows the network to capture a wide range of features, both low-level and high-level, which improves performance on tasks such as image classification. Inception was also designed to be more efficient and faster to train than other deep convolutional neural networks.
The basic structure of an Inception module is a set of convolutional filters of different sizes applied in parallel to the input data; the output of each filter is concatenated with the others to form a single output feature map. The module also includes a max-pooling layer, which takes the maximum value from each of a set of non-overlapping regions of the input. This reduces the spatial dimensionality of the data and provides a degree of translation invariance. The use of multiple parallel filters and max-pooling layers allows the Inception module to extract features at different scales and resolutions, improving the network's ability to recognize patterns in the input data and thus its overall performance.
In the design of Inception-ResNetV2, feature extraction is performed using Inception structural designs. The main benefit of the Inception design is that it provides a significant quality gain at only a modest increase in computational requirements compared to shallower and narrower networks. By employing effective factorization techniques, the Inception design reduces the computational complexity while easing this limitation.

The Inception module consists of convolutions of different sizes that allow the network to process features at different spatial scales. For dimensionality reduction, 1×1 convolutions are applied before the more expensive 3×3 and 5×5 convolutions, as shown in Fig. 2 below. In many problems, a deeper network is needed to process features at different spatial scales; this flexibility can be incorporated into convolutional neural networks by introducing Inception blocks.

Fig. 2 Dimensionality reduction using 1×1 convolutions
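The following PyTorch sketch shows such a module (the branch widths are illustrative assumptions, not the exact Inception-ResNetV2 values): 1×1 convolutions reduce channel depth before the costlier 3×3 and 5×5 branches, a pooling branch runs in parallel, and all branch outputs are concatenated along the channel axis:

```python
# Minimal Inception-style module with 1x1 dimensionality reduction.
import torch
import torch.nn as nn

class InceptionModule(nn.Module):
    def __init__(self, in_ch):
        super().__init__()
        self.branch1 = nn.Conv2d(in_ch, 64, 1)          # plain 1x1 branch
        self.branch3 = nn.Sequential(
            nn.Conv2d(in_ch, 48, 1),                    # 1x1 reduction
            nn.Conv2d(48, 64, 3, padding=1),            # then 3x3
        )
        self.branch5 = nn.Sequential(
            nn.Conv2d(in_ch, 16, 1),                    # 1x1 reduction
            nn.Conv2d(16, 32, 5, padding=2),            # then 5x5
        )
        self.branch_pool = nn.Sequential(
            nn.MaxPool2d(3, stride=1, padding=1),       # pooling branch
            nn.Conv2d(in_ch, 32, 1),
        )

    def forward(self, x):
        # Concatenate all branch outputs along the channel dimension.
        return torch.cat([self.branch1(x), self.branch3(x),
                          self.branch5(x), self.branch_pool(x)], dim=1)

y = InceptionModule(192)(torch.randn(1, 192, 35, 35))
print(y.shape)  # (1, 192, 35, 35): 64 + 64 + 32 + 32 output channels
```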


Each Inception block is followed by a 1×1 convolution filter without activation, called the filter-expansion layer. Its purpose is to scale up (increase) the dimensionality of the filter bank to match the depth of the input to the next layer; that is, it compensates for the dimensionality reduction performed inside the Inception block. The pooling layers inside the Inception blocks were replaced by residual connections; pooling operations remain only in the reduction blocks. The residual connections add the output of the convolution operations of the Inception module to its input. For this residual addition to work, the input and the output after convolution must have the same dimensions; hence, 1×1 convolutions are used after the original convolutions to match the depth sizes (the depth is changed by the convolutions). Inception modules thus comprise a series of smaller convolutional and pooling layers, combined so that the network can learn spatial features at multiple scales from the input data.
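A sketch of this residual pattern in PyTorch (channel sizes and branch layout are illustrative assumptions): the branch outputs are concatenated, a linear 1×1 filter-expansion convolution (no activation) restores the input depth, and the result is added back to the block input before the final activation:

```python
# Minimal Inception-ResNet-style block with a 1x1 filter-expansion layer.
import torch
import torch.nn as nn

class InceptionResNetBlock(nn.Module):
    def __init__(self, in_ch):
        super().__init__()
        self.branch1 = nn.Conv2d(in_ch, 32, 1)
        self.branch3 = nn.Sequential(
            nn.Conv2d(in_ch, 32, 1),
            nn.Conv2d(32, 32, 3, padding=1),
        )
        # 1x1 expansion WITHOUT activation: maps the concatenated branches
        # (32 + 32 channels) back to in_ch so the residual add is valid.
        self.expand = nn.Conv2d(64, in_ch, 1)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x):
        branches = torch.cat([self.branch1(x), self.branch3(x)], dim=1)
        return self.act(x + self.expand(branches))   # residual connection

out = InceptionResNetBlock(256)(torch.randn(1, 256, 17, 17))
print(out.shape)  # (1, 256, 17, 17): depth unchanged, as the addition requires
```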

In the feature-extractor portion, the convolutional layers use filters of different sizes: 1×1 filters are used for dimensionality reduction or for restoring the dimensions of the feature maps, while larger filters are factorized into smaller (2×(3×3)) and asymmetric (1×3/3×1 and 1×7/7×1) types. The distinctive Inception blocks are shown in Fig. 3(a), 3(b) and 3(c).

3.4.2 Factorization into smaller convolutions


This section describes the various methods of factorizing convolutions in different contexts, with the purpose of improving computational efficiency. Because the Inception model is fully convolutional, each weight corresponds to one multiplication per activation; as a result, any reduction in computational cost also reduces the parameter count. With appropriate factorization, the parameters become more disentangled, which results in faster training. Furthermore, the computation and memory savings allow a larger filter bank while still permitting each model replica to be trained on a single machine.
The Inception-ResNet architecture incorporates three different Inception modules and two reduction blocks. Because this network is a hybrid between Inception v2 and ResNet, its key property is that the output of the Inception module is added to the input (i.e., the data from the previous layer). For this to work, the output of the Inception module and the input from the previous layer must have the same dimensions, so factorization becomes important for matching these dimensions. Inception v2 sought to further reduce computational cost by factorizing the filters. However, too much factorization can result in loss of information: if a filter cannot accommodate all the data from the previous layer, data lying outside the dimensions of the filter's convolution will be lost or left unfiltered. Hence, factorization methods were used to improve computational complexity while still producing efficient performance.
Factorization simply means reducing the convolution filters to smaller sizes to reduce computational cost. For example, a 5×5 convolution can be broken down into two stacked layers of 3×3 convolutions: a 5×5 convolution is 25/9 ≈ 2.78 times more expensive than a 3×3 convolution, so the two 3×3 layers together cost only 18/25 = 0.72 of the single 5×5, a saving of about 28%. This makes the network deeper, even more so for larger convolutions, and too deep a network will eventually result in loss of information, since more factorization is needed, as mentioned in the previous paragraph. Another factorization method replaces n×n convolutions with a 1×n convolution followed by an n×1 convolution; for example, a 5×5 convolution is replaced by a 1×5 convolution and a 5×1 convolution. This makes the network broader instead of longer (deeper) and hence reduces the chance of information loss across layers of factorization, while still reducing the computational cost in general.
Factorizing 5×5 convolutions into two 3×3 convolution operations improves computational speed. Although this may seem counterintuitive, stacking two 3×3 convolutions is in fact cheaper than a single 5×5 convolution, which leads to a boost in performance. This is illustrated in Fig. 4 given below.

Fig. 4 Factorization of a 5×5 convolution into two 3×3 convolution operations
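The saving can be verified with a quick cost count. Assuming, as a simplification, the same number of channels C on both paths, the cost per output position is proportional to the kernel area:

```latex
\frac{\mathrm{cost}(5\times5)}{\mathrm{cost}(3\times3)}
  = \frac{25\,C^{2}}{9\,C^{2}} = \frac{25}{9} \approx 2.78,
\qquad
\frac{2\times\mathrm{cost}(3\times3)}{\mathrm{cost}(5\times5)}
  = \frac{18}{25} = 0.72
```

That is, two stacked 3×3 convolutions cost about 72% of a single 5×5 convolution, the roughly 28% saving quoted above.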

Moreover, convolutions of filter size n×n are factorized into a combination of 1×n and n×1 convolutions. For example, a 3×3 convolution is equivalent to first performing a 1×3 convolution and then performing a 3×1 convolution on its output. This method was found to be about 33% cheaper than the single 3×3 convolution. The filter banks in the module were also expanded (made wider instead of deeper) to remove the representational bottleneck: if the module were made deeper instead, there would be an excessive reduction in dimensions, and hence loss of information.
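A minimal sketch of this asymmetric factorization in PyTorch (the helper name and the choice of intermediate width are our own illustrative assumptions):

```python
# Asymmetric factorization: replace one n x n convolution with a 1 x n
# convolution followed by an n x 1 convolution. Per output position the
# kernel cost drops from n*n to 2n multiplications (for n = 3: from 9 to 6,
# i.e. about 33% cheaper, matching the text).
import torch.nn as nn

def factorized_conv(in_ch, out_ch, n=3):
    """1 x n followed by n x 1, preserving the spatial size via padding."""
    mid = out_ch  # intermediate width; a design choice, not fixed by the paper
    return nn.Sequential(
        nn.Conv2d(in_ch, mid, kernel_size=(1, n), padding=(0, n // 2)),
        nn.Conv2d(mid, out_ch, kernel_size=(n, 1), padding=(n // 2, 0)),
    )
```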

The above three principles were used to build three different types of Inception modules (called modules A, B and C here, in the order they were introduced; these names are used for clarity and are not the official names).

3.4.4 Inception-ResNet V2

Fig. 5 shows the detailed architecture of our proposed network, Inception-ResNetV2.


The Inception-ResNetV2 architecture consists of three main blocks, namely A, B, and C, each containing a different number of stacked Inception blocks. To identify the optimal number of Inception modules in each block, we tried various combinations and found that a network with 3 Inception blocks in A, 5 blocks in B, and 2 blocks in C outperforms the other combinations. The first layer of the Inception-ResNetV2 architecture, referred to as the stem, is introduced before the Inception blocks A, B and C. Although Inception v2 sought to lengthen or broaden the network for more efficient performance, reduction blocks were introduced specifically for this purpose (to regulate the breadth and/or depth of the network).

Fig. 5 Detailed architecture of Inception-ResNetV2


For the Inception part of the network, there are 3 Inception blocks A at the 35×35 grid with 288 filters each. This is reduced to a 17×17 grid with 768 filters using the reduction technique, and is followed by 5 Inception blocks B of the factorized Inception modules. This is then reduced to an 8×8 grid with 1280 filters using the reduction technique. At the coarsest 8×8 level, there are 2 Inception blocks C with a concatenated output filter-bank size of 2048 for each tile.
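The overall stacking can be summarized as a skeleton (a PyTorch sketch under stated assumptions: the stem, block, and reduction modules are placeholders to be filled in with the Inception-ResNet modules; the grid sizes and filter counts in the comments are those quoted above):

```python
# High-level skeleton of the block stacking described in the text.
import torch.nn as nn

def build_inception_resnet(stem, block_a, reduction_a,
                           block_b, reduction_b, block_c):
    """Assemble stem -> 3xA -> reduction -> 5xB -> reduction -> 2xC."""
    return nn.Sequential(
        stem,                              # 299x299 input -> 35x35 feature grid
        *[block_a() for _ in range(3)],    # 3 x block A at 35x35, 288 filters
        reduction_a,                       # 35x35 -> 17x17, 768 filters
        *[block_b() for _ in range(5)],    # 5 x block B at 17x17
        reduction_b,                       # 17x17 -> 8x8, 1280 filters
        *[block_c() for _ in range(2)],    # 2 x block C at 8x8, 2048-filter output
    )
```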

Fig. 6 Block diagram of Inception-ResNetv2


A minor variation between the residual and non-residual Inception variants is that, in Inception-ResNetV2, batch normalization is used only on top of the traditional layers, not on top of the summations. It is reasonable to assume that a thorough use of batch normalization would be beneficial; however, every model replica needs to be trained on an individual graphics processing unit (GPU), and by avoiding batch normalization on the summations, the Inception block count can be increased substantially within the available memory. The Inception-ResNetV2 architecture has three basic structures: the convolutional layer, the activation layer, and the pooling layer.
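A short sketch of this batch-normalization placement (our own PyTorch rendering of the layout just described, with illustrative channel counts):

```python
# BN follows the convolution inside the block, but no BN is applied
# after the residual summation.
import torch.nn as nn
import torch.nn.functional as F

class BNPlacement(nn.Module):
    def __init__(self, ch=256):
        super().__init__()
        self.conv = nn.Conv2d(ch, ch, 3, padding=1, bias=False)
        self.bn = nn.BatchNorm2d(ch)   # BN on top of the convolutional layer

    def forward(self, x):
        y = self.bn(self.conv(x))      # batch-normalized branch
        return F.relu(x + y)           # residual summation: no BN afterwards
```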
