cnn-notes-architecture

Convolutional Neural Networks (CNNs) are deep learning models designed for processing grid-like data, excelling in tasks such as image classification and object detection. The architecture consists of multiple layers, including convolutional, pooling, and fully connected layers, which work together to extract and transform features from input data. CNNs have numerous applications across various fields, including image classification, object detection, and medical imaging.

Uploaded by

nssurlec

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

cnn-notes-architecture

Uploaded by

nssurlec

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Convolutional Neural Networks

Convolutional Neural Networks (CNNs) are a class of deep learning models

specifically designed for processing grid-like data, such as images, videos, and
audio. CNNs are inspired by the biological visual cortex and are highly effective
for tasks like image classification, object detection, segmentation, and more. The
architecture of a CNN is composed of multiple layers, each serving a specific
purpose in extracting and transforming features from the input data.
Key Components of CNN Architecture
1. Input Layer:
- The input layer takes in the raw data, such as an image (e.g., a 3D tensor of
height × width × channels for RGB images).
2. Convolutional Layers:
- These layers apply convolution operations to the input using learnable filters
(kernels).
- Each filter detects specific features (e.g., edges, textures, patterns) in the input.
- Multiple filters are used to extract different features, producing a stack of
feature maps as output.
3. Activation Function:
- After convolution, an activation function (e.g., ReLU) is applied to introduce
non-linearity into the model.
- ReLU (Rectified Linear Unit) is the most commonly used activation function:

𝑅𝑒𝐿𝑈(𝑥) = max(𝑜, 𝑥)
4. Pooling Layers:
- Pooling layers down sample the feature maps, reducing their spatial
dimensions while retaining the most important information.
- Common types of pooling:
- Max Pooling: Selects the maximum value in each pooling region.
- Average Pooling: Computes the average value in each pooling region.
- Pooling reduces computational complexity and helps prevent overfitting.
5. Fully Connected (Dense) Layers:
- After multiple convolutional and pooling layers, the feature maps are flattened
into a 1D vector and passed to fully connected layers.
- These layers combine the extracted features to make predictions (e.g.,
classifying an image into categories).
6. Output Layer:
- The output layer produces the final predictions, such as class probabilities in
classification tasks.
- Common activation functions for the output layer:
- Softmax: For multi-class classification.
- Sigmoid: For binary classification.
Typical CNN Architecture
A typical CNN architecture consists of the following sequence of layers:
1. Input Layer:
- Takes in the raw image or data.
2. Convolutional Block:
- Convolutional Layer: Applies filters to extract features.
- Activation Function: Introduces non-linearity (e.g., ReLU).
- Pooling Layer: Down samples the feature maps.
3. Repeat Convolutional Blocks:
- Multiple convolutional blocks are stacked to extract hierarchical features (e.g.,
edges → textures → shapes → objects).
4. Flatten Layer:
- Converts the 3D feature maps into a 1D vector for input to fully connected
layers.
5. Fully Connected Layers:
- Combines features to make predictions.
6. Output Layer:
- Produces the final output (e.g., class probabilities).
Example: LeNet-5 (Early CNN Architecture)
LeNet-5, developed by Yann LeCun in 1998, is one of the earliest CNN
architectures used for handwritten digit recognition. Its architecture is as follows:
1. Input Layer: Grayscale image (32x32 pixels).
2. Convolutional Layer: 6 filters of size 5x5, stride 1.
3. Pooling Layer: Max pooling with 2x2 window, stride 2.
4. Convolutional Layer: 16 filters of size 5x5, stride 1.
5. Pooling Layer: Max pooling with 2x2 window, stride 2.
6. Fully Connected Layers: 120 → 84 neurons.
7. Output Layer: 10 neurons (for digit classification).
Modern CNN Architectures
Over the years, more advanced CNN architectures have been developed,
including:
1. AlexNet:
- Introduced ReLU activation, dropout, and data augmentation.
- Won the ImageNet competition in 2012.
2. VGGNet:
- Uses smaller 3x3 filters and deeper networks (e.g., VGG16, VGG19).
3. ResNet (Residual Networks):
- Introduces skip connections (residual blocks) to enable training of very deep
networks (e.g., ResNet-50, ResNet-101).
4. Inception (GoogLeNet):
- Uses multi-scale filters within the same layer (Inception modules).
5. Efficient Net:
- Balances depth, width, and resolution for efficient scaling.
Advantages of CNNs
1. Automatic Feature Extraction:
- CNNs learn relevant features directly from the data, eliminating the need for
manual feature engineering.
2. Parameter Sharing:
- Filters are shared across the input, reducing the number of parameters and
computational cost.
3. Translation Invariance:
- CNNs can detect features regardless of their position in the input.
4. Hierarchical Feature Learning:
- Early layers detect low-level features (e.g., edges), while deeper layers detect
high-level features (e.g., objects).
Applications of CNNs
1. Image Classification:
- Assigning labels to images (e.g., cat vs. dog).
2. Object Detection:
- Locating and classifying objects within an image (e.g., YOLO, Faster R-
CNN).
3. Semantic Segmentation:
- Assigning a label to each pixel in an image (e.g., identifying roads, buildings).
4. Face Recognition:
- Identifying or verifying individuals from images or videos.
5. Medical Imaging:
- Detecting diseases from X-rays, MRIs, etc.
6. Natural Language Processing (NLP):
- Text classification, sentiment analysis, etc.
In summary, CNNs are powerful and versatile models that have revolutionized
computer vision and other fields. Their ability to automatically learn hierarchical
features from data makes them a cornerstone of modern deep learning.

Cv Ppt Mt101
No ratings yet
Cv Ppt Mt101
16 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
8 pages
MODULE_05_CNN_ARCTITECTURE
No ratings yet
MODULE_05_CNN_ARCTITECTURE
7 pages
Typical CNN (Convolutional Neural Network) Architecture: CHARAN S (1VE20CA005) Cse-Ai, Svce
No ratings yet
Typical CNN (Convolutional Neural Network) Architecture: CHARAN S (1VE20CA005) Cse-Ai, Svce
13 pages
CNN
No ratings yet
CNN
5 pages
Sommaire CNN Presentation
No ratings yet
Sommaire CNN Presentation
10 pages
Convolution Neural Network
No ratings yet
Convolution Neural Network
74 pages
Reviewer - Convolutional Neural Networks (CNNs) - Muqaddas Bin Tahir
No ratings yet
Reviewer - Convolutional Neural Networks (CNNs) - Muqaddas Bin Tahir
8 pages
DL CNN
No ratings yet
DL CNN
7 pages
MODULE 5
No ratings yet
MODULE 5
20 pages
Cnn Remake
No ratings yet
Cnn Remake
1 page
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
15 pages
Introduction to Convolutional Neural Networks
No ratings yet
Introduction to Convolutional Neural Networks
4 pages
DL 4
No ratings yet
DL 4
4 pages
Ch-3 Convolutional Neural Networks (CNNs)
No ratings yet
Ch-3 Convolutional Neural Networks (CNNs)
11 pages
Cnn
No ratings yet
Cnn
9 pages
AD3501-DL-UNIT 2 NOTES
No ratings yet
AD3501-DL-UNIT 2 NOTES
29 pages
DL UNIT 3
No ratings yet
DL UNIT 3
27 pages
Department of Information Science and Engineering Technical Seminar (18Css84) Convolutional Neural Networks
No ratings yet
Department of Information Science and Engineering Technical Seminar (18Css84) Convolutional Neural Networks
15 pages
UNIT - 2
No ratings yet
UNIT - 2
31 pages
UNIT 2 Self Notes
No ratings yet
UNIT 2 Self Notes
10 pages
DL_MOD3
No ratings yet
DL_MOD3
102 pages
DL_U4
No ratings yet
DL_U4
7 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
21 pages
What is CNN
No ratings yet
What is CNN
2 pages
Unit III
No ratings yet
Unit III
60 pages
CNN Model Introduction and Overview
No ratings yet
CNN Model Introduction and Overview
2 pages
UNIT -4 DL
No ratings yet
UNIT -4 DL
19 pages
Convolutional Neural Network - Wikipedia
No ratings yet
Convolutional Neural Network - Wikipedia
21 pages
UNIT-III DeepLearning Notes
No ratings yet
UNIT-III DeepLearning Notes
30 pages
MRS Sot Seminar Report
No ratings yet
MRS Sot Seminar Report
16 pages
CV Unit V
No ratings yet
CV Unit V
18 pages
deep learning u3
No ratings yet
deep learning u3
3 pages
nn-jaguar-lava-122
No ratings yet
nn-jaguar-lava-122
10 pages
DL-UNIT-3
No ratings yet
DL-UNIT-3
12 pages
Convolutional Neural Network in DIP
No ratings yet
Convolutional Neural Network in DIP
2 pages
Convolutional Neural Networks (CNN)
No ratings yet
Convolutional Neural Networks (CNN)
7 pages
Convolutional Neural Networks_100629
No ratings yet
Convolutional Neural Networks_100629
3 pages
Unit III
No ratings yet
Unit III
8 pages
Convolution Neural Network
No ratings yet
Convolution Neural Network
12 pages
What Should You Consider or Pay Attention To When Preparing A Data Set
No ratings yet
What Should You Consider or Pay Attention To When Preparing A Data Set
7 pages
Module 05
No ratings yet
Module 05
10 pages
Advancements in Image Classification Using Convolutional Neural Network
No ratings yet
Advancements in Image Classification Using Convolutional Neural Network
8 pages
CNN notes unit-3
No ratings yet
CNN notes unit-3
12 pages
5 Layers of A Convolutional Neural Network
No ratings yet
5 Layers of A Convolutional Neural Network
15 pages
Assignment-6 STC-DL
No ratings yet
Assignment-6 STC-DL
17 pages
3 # Deep Learning
No ratings yet
3 # Deep Learning
36 pages
Unit 2
No ratings yet
Unit 2
20 pages
AD3501-DL-Unit 2
No ratings yet
AD3501-DL-Unit 2
33 pages
Computer Vision With CNNs
No ratings yet
Computer Vision With CNNs
3 pages
PEC CS 802C Deep Learning
No ratings yet
PEC CS 802C Deep Learning
13 pages
Assignment 5_ _Implementing Image Classification using Deep Learning
No ratings yet
Assignment 5_ _Implementing Image Classification using Deep Learning
8 pages
Seminar
No ratings yet
Seminar
16 pages
CNN
No ratings yet
CNN
9 pages
DL-Unit-3 final
No ratings yet
DL-Unit-3 final
25 pages
CO2_CNN_3
No ratings yet
CO2_CNN_3
31 pages
cnn-190813145957
No ratings yet
cnn-190813145957
34 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
35 pages
CNN Notes
No ratings yet
CNN Notes
10 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
13 Scale Space Combined
No ratings yet
13 Scale Space Combined
31 pages
Raspberry Pi Object Counting
No ratings yet
Raspberry Pi Object Counting
5 pages
04 - 05 - 06 - Unit I - Image Sensing and Acqusition - Sampling Quantization - Relationship Between Pixels
No ratings yet
04 - 05 - 06 - Unit I - Image Sensing and Acqusition - Sampling Quantization - Relationship Between Pixels
38 pages
DOC-20241207-WA0004.
No ratings yet
DOC-20241207-WA0004.
13 pages
Darkvisionnet: Low-Light Imaging Via Rgb-Nir Fusion With Deep Inconsistency Prior
No ratings yet
Darkvisionnet: Low-Light Imaging Via Rgb-Nir Fusion With Deep Inconsistency Prior
9 pages
T24156 LAB Assignment 4
No ratings yet
T24156 LAB Assignment 4
9 pages
DIP Lab Report 11
No ratings yet
DIP Lab Report 11
11 pages
Unit IV
No ratings yet
Unit IV
3 pages
Sizer Sander Lay Out - 4
No ratings yet
Sizer Sander Lay Out - 4
9 pages
3D Stereo Camera
No ratings yet
3D Stereo Camera
7 pages
09.color Based Image Segmentation Using Adaptive Thresholding
No ratings yet
09.color Based Image Segmentation Using Adaptive Thresholding
6 pages
CSC566-Tutorial Thresholding
No ratings yet
CSC566-Tutorial Thresholding
6 pages
Moving Object Tracking in Video Using MATLAB
No ratings yet
Moving Object Tracking in Video Using MATLAB
5 pages
2020 An Ensemble Architecture of Deep Convolutional
No ratings yet
2020 An Ensemble Architecture of Deep Convolutional
22 pages
CU4073 -SET 4
No ratings yet
CU4073 -SET 4
2 pages
Multimodal Medical Image Fusion Under Nonsubsampled Contourlet Transform Domain
No ratings yet
Multimodal Medical Image Fusion Under Nonsubsampled Contourlet Transform Domain
5 pages
R-20 DIP Question Bank
No ratings yet
R-20 DIP Question Bank
3 pages
CGV 3rd Week Lab Programs
No ratings yet
CGV 3rd Week Lab Programs
6 pages
Vision Ugguide PDF
No ratings yet
Vision Ugguide PDF
710 pages
Comparison of HOG, MSER, SIFT, FAST, LBP and CANNY Features For Cell Detection in Histopathological Images
No ratings yet
Comparison of HOG, MSER, SIFT, FAST, LBP and CANNY Features For Cell Detection in Histopathological Images
6 pages
3 +배출량+산정계획서+작성+가이드라인 (2024 2)
No ratings yet
3 +배출량+산정계획서+작성+가이드라인 (2024 2)
257 pages
A Review Paper On Vehicle Number Plate Recognition IJERTV8IS040246
No ratings yet
A Review Paper On Vehicle Number Plate Recognition IJERTV8IS040246
5 pages
Edge Detection: From Matlab and Simulink To Real Time With Ti Dsps
No ratings yet
Edge Detection: From Matlab and Simulink To Real Time With Ti Dsps
22 pages
Materi Pengolahan Citra Digital 4c Sesi 11-12 Image Transformations
No ratings yet
Materi Pengolahan Citra Digital 4c Sesi 11-12 Image Transformations
16 pages
Week 2 PDF
No ratings yet
Week 2 PDF
53 pages
Photogrammetry: Dr. Razak Zakariya Lecturer Department of Marine Science FMSM UMT by
100% (1)
Photogrammetry: Dr. Razak Zakariya Lecturer Department of Marine Science FMSM UMT by
12 pages
Digital Image Processing
No ratings yet
Digital Image Processing
8 pages
Inpaint Anything: Segment Anything Meets Image Inpainting
No ratings yet
Inpaint Anything: Segment Anything Meets Image Inpainting
7 pages
Digital Image and Video Processing Nov 2022
No ratings yet
Digital Image and Video Processing Nov 2022
5 pages
Histogram
No ratings yet
Histogram
10 pages

cnn-notes-architecture

Uploaded by

cnn-notes-architecture

Uploaded by

Convolutional Neural Networks

Convolutional Neural Networks (CNNs) are a class of deep learning models

You might also like