0% found this document useful (0 votes)

14 views10 pages

Implementing A Convolutional Neural Network CNN 1718899610

Uploaded by

kalexan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views10 pages

Implementing A Convolutional Neural Network CNN 1718899610

Uploaded by

kalexan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Implementing a

Custom CNN (Convolutional Neural Network)

for Image Classification
from Scratch in Python

1 ANSHUMAN JHA
Implementing a Custom CNN (Convolutional Neural Network)
for Image Classification from Scratch in Python

Table of Contents
1. Introduction to CNN (Convolutional Neural Network)
2. The Structure of a Convolutional Neural Network
3. Implementation in Python
a. Image Preprocessing
b. Building the Convolutional Layer
c. Implementing the Pooling Layer
d. Creating the Fully Connected Layer
e. Putting It All Together
f. Training and Evaluation
4. Conclusion

2 ANSHUMAN JHA
Implementing a Custom CNN (Convolutional Neural Network)
for Image Classification from Scratch in Python

1. Introduction to Convolutional Neural Networks

Convolutional Neural Networks (CNNs) are a type of deep learning algorithm specifically designed for image
recognition and classification tasks. They have achieved remarkable success in computer vision, powering
applications like facial recognition, self-driving cars, and medical image analysis. This post will guide you
through building a custom CNN from scratch in Python, focusing on image preprocessing, convolutional layers,
pooling layers, and fully connected layers without relying on high-level libraries such as TensorFlow or
PyTorch.

2. The Structure of a Convolutional Neural Networks

The Structure illustrates the following components and their connections:

1. Input Layer:
- Input image of size 64x64x3.

2. First Convolutional Layer:

- Convolution operation with filters of size 3x3x3 and 8 filters.
- ReLU activation function.
- Padding: 1, Stride: 1.

3. First Pooling Layer:

- Max-pooling operation with a 2x2 filter and stride of 2.

4. Second Convolutional Layer:

- Convolution operation with filters of size 3x3x8 and 16 filters.
- ReLU activation function.
- Padding: 1, Stride: 1.

5. Second Pooling Layer:

- Max-pooling operation with a 2x2 filter and stride of 2.

6. Flatten Layer:
- Flatten the output of the second pooling layer into a 1D array.

7. Fully Connected Layer:

- Fully connected layer with an output size of 10 (assuming 10 classes for classification).

8. Output Layer:
- Softmax activation function to produce a probability distribution over the 10 classes.

3 ANSHUMAN JHA
Implementing a Custom CNN (Convolutional Neural Network)
for Image Classification from Scratch in Python

4 ANSHUMAN JHA
Implementing a Custom CNN (Convolutional Neural Network)
for Image Classification from Scratch in Python
3. Implementation in Python
Let's implement a simple CNN (Convolutional Neural Network) in Python to classify image.

Step 1: Image Preprocessing

Before feeding images into a CNN, they need to be preprocessed. This includes resizing, normalization, and
converting them into a suitable format.

Loading and Preprocessing Images

import numpy as np
import cv2
import os

def load_images_from_folder(folder, img_size):

images = []
labels = []
for filename in os.listdir(folder):
img = cv2.imread(os.path.join(folder, filename))
if img is not None:
img = cv2.resize(img, (img_size, img_size))
img = img / 255.0 # Normalize to [0,1]
images.append(img)
labels.append(filename.split('_')[0]) # Assuming filenames contain labels
return np.array(images), np.array(labels)

# Example usage
images, labels = load_images_from_folder('path_to_images', 64)

In this example, cv2 is used to read and resize images, while the pixel values are normalized to the range [0, 1].

5 ANSHUMAN JHA
Implementing a Custom CNN (Convolutional Neural Network)
for Image Classification from Scratch in Python

Step 2: Building the Convolutional Layer

The convolutional layer is the core building block of a CNN. It applies a number of filters to the input image to
extract features.

Convolutional Layer Implementation

def conv2d(X, W, b, stride=1, padding=0):

(n_H_prev, n_W_prev, n_C_prev) = X.shape
(f, f, n_C_prev, n_C) = W.shape
n_H = int((n_H_prev - f + 2 * padding) / stride) + 1
n_W = int((n_W_prev - f + 2 * padding) / stride) + 1
Z = np.zeros((n_H, n_W, n_C))

X_pad = np.pad(X, ((padding, padding), (padding, padding), (0, 0)), mode='constant')

for h in range(n_H):
for w in range(n_W):
for c in range(n_C):
vert_start = h * stride
vert_end = vert_start + f
horiz_start = w * stride
horiz_end = horiz_start + f

X_slice = X_pad[vert_start:vert_end, horiz_start:horiz_end, :]

Z[h, w, c] = np.sum(X_slice * W[:, :, :, c]) + b[c]

return Z

# Example usage
X = np.random.randn(64, 64, 3) # Example input
W = np.random.randn(3, 3, 3, 8) # 8 filters of size 3x3x3
b = np.random.randn(8)
Z = conv2d(X, W, b)

This function performs a 2D convolution operation with the specified filters, stride, and padding. The input X is
padded to ensure the output dimensions match the desired size.

6 ANSHUMAN JHA
Implementing a Custom CNN (Convolutional Neural Network)
for Image Classification from Scratch in Python

Step 3: Implementing the Pooling Layer

Pooling layers reduce the spatial dimensions of the input, which helps in reducing the computational
complexity and prevents overfitting.

Max-Pooling Layer Implementation

def max_pooling(X, f=2, stride=2):

(n_H_prev, n_W_prev, n_C) = X.shape
n_H = int(1 + (n_H_prev - f) / stride)
n_W = int(1 + (n_W_prev - f) / stride)
Z = np.zeros((n_H, n_W, n_C))

for h in range(n_H):
for w in range(n_W):
for c in range(n_C):
vert_start = h * stride
vert_end = vert_start + f
horiz_start = w * stride
horiz_end = horiz_start + f

X_slice = X[vert_start:vert_end, horiz_start:horiz_end, c]

Z[h, w, c] = np.max(X_slice)

return Z

# Example usage
X = np.random.randn(64, 64, 8) # Example input
Z = max_pooling(X)

This function performs max-pooling, which extracts the maximum value from each patch of the input.

Step 4: Creating the Fully Connected Layer

The fully connected (FC) layer is a standard neural network layer where each neuron is connected to every
neuron in the previous layer. This layer combines features learned by convolutional and pooling layers to make
predictions.

Fully Connected Layer Implementation

This function performs a linear transformation of the input X using weights W and biases b.
def fully_connected(X, W, b):
Z = np.dot(W, X) + b
return Z

# Example usage
X = np.random.randn(64 * 64 * 8) # Flattened input
W = np.random.randn(10, 64 * 64 * 8) # 10 output classes
b = np.random.randn(10)
Z = fully_connected(X, W, b)

7 ANSHUMAN JHA
Implementing a Custom CNN (Convolutional Neural Network)
for Image Classification from Scratch in Python

Step 5: Putting It All Together

We will now combine the above components to build a simple CNN model.

CNN Model Implementation

def cnn_forward(X, parameters):

W1, b1, W2, b2, W3, b3 = parameters
Z1 = conv2d(X, W1, b1, stride=1, padding=1)
A1 = np.maximum(0, Z1) # ReLU activation
P1 = max_pooling(A1, f=2, stride=2)

Z2 = conv2d(P1, W2, b2, stride=1, padding=1)

A2 = np.maximum(0, Z2) # ReLU activation
P2 = max_pooling(A2, f=2, stride=2)

F = P2.flatten()
Z3 = fully_connected(F, W3, b3)

return Z3

# Initialize parameters
W1 = np.random.randn(3, 3, 3, 8)
b1 = np.random.randn(8)
W2 = np.random.randn(3, 3, 8, 16)
b2 = np.random.randn(16)
W3 = np.random.randn(10, 16 * 16 * 16)
b3 = np.random.randn(10)

parameters = (W1, b1, W2, b2, W3, b3)

# Example usage
X = np.random.randn(64, 64, 3) # Example input
Z3 = cnn_forward(X, parameters)

In this example, we define a simple CNN model with two convolutional layers followed by ReLU activation
and max-pooling, and a final fully connected layer for classification.

8 ANSHUMAN JHA
Implementing a Custom CNN (Convolutional Neural Network)
for Image Classification from Scratch in Python

Step 6: Training and Evaluation

Training a CNN involves optimizing the weights and biases using backpropagation and gradient descent. For
simplicity, we'll use random values for our input and target, and demonstrate the forward pass.

Training Loop (Simplified)

def compute_loss(Y_hat, Y):

m = Y.shape[0]
loss = -np.sum(Y * np.log(Y_hat + 1e-8)) / m
return loss

def softmax(Z):
expZ = np.exp(Z - np.max(Z))
return expZ / expZ.sum(axis=0, keepdims=True)

# Example usage
# Example target (one-hot encoded)
Y = np.eye(10)[np.random.choice(10, 1)].T
Y_hat = softmax(Z3) # Applying softmax to the output of the CNN
loss = compute_loss(Y_hat, Y)

This code computes the cross-entropy loss, which is commonly used for classification tasks.

7. Conclusion
In this post, we built a custom CNN from scratch in Python, implementing each layer manually. This process
helps in understanding the inner workings of CNNs and provides a foundation for using higher-level libraries
for more complex tasks
By following this guide, you should have a clear understanding of how to build and understand a CNN from
scratch, providing a solid foundation for further learning and development in the field of deep learning and
computer vision.

9 ANSHUMAN JHA
Implementing a Custom CNN (Convolutional Neural Network)
for Image Classification from Scratch in Python

Constructive comments and feedback are welcomed

10 ANSHUMAN JHA

Experiment 10-1
No ratings yet
Experiment 10-1
3 pages
Building A Convolutional Neural Network Using Tensorflow Keras
No ratings yet
Building A Convolutional Neural Network Using Tensorflow Keras
10 pages
Experiment 10 1
No ratings yet
Experiment 10 1
3 pages
MLOA Exp 1 - C121
No ratings yet
MLOA Exp 1 - C121
18 pages
Guddu jha_organized
No ratings yet
Guddu jha_organized
3 pages
UNIT_I CHP_5
No ratings yet
UNIT_I CHP_5
26 pages
CNN with TensorFlow and Keras
No ratings yet
CNN with TensorFlow and Keras
11 pages
222001822E
No ratings yet
222001822E
5 pages
CNN Implementation in Python
No ratings yet
CNN Implementation in Python
7 pages
Lab 5 - Intro To Convolutional Neural Networks
No ratings yet
Lab 5 - Intro To Convolutional Neural Networks
52 pages
UNIT III DEEP LEARNING
No ratings yet
UNIT III DEEP LEARNING
31 pages
Image Classification using MNIST Dataset
No ratings yet
Image Classification using MNIST Dataset
28 pages
03 Convolution Neural Networks and Computer Vision With Tensorflow
No ratings yet
03 Convolution Neural Networks and Computer Vision With Tensorflow
21 pages
Digit Recognizer Using CNN
No ratings yet
Digit Recognizer Using CNN
4 pages
Deep Learning Lab With Output
No ratings yet
Deep Learning Lab With Output
12 pages
21BCP167_AI_9
No ratings yet
21BCP167_AI_9
10 pages
Convolutional Neural Networks : Covnets
No ratings yet
Convolutional Neural Networks : Covnets
22 pages
Document 1
No ratings yet
Document 1
2 pages
Convolutional Neural Networks - Part 1
No ratings yet
Convolutional Neural Networks - Part 1
44 pages
Bonus 1 - TF2.0 Practical Advanced Cheat Sheet PDF
No ratings yet
Bonus 1 - TF2.0 Practical Advanced Cheat Sheet PDF
17 pages
Ch VI _ Convolutional Neural Network_24
No ratings yet
Ch VI _ Convolutional Neural Network_24
33 pages
Assignment3 AL
No ratings yet
Assignment3 AL
23 pages
Deep Learning
No ratings yet
Deep Learning
46 pages
Artificial Intelligence Mini Project
No ratings yet
Artificial Intelligence Mini Project
5 pages
Introduction to Genetic Algorithm Neural Networks
No ratings yet
Introduction to Genetic Algorithm Neural Networks
44 pages
lab 10 practical
No ratings yet
lab 10 practical
8 pages
CO2_CNN_3
No ratings yet
CO2_CNN_3
31 pages
7 CNNWithCustomImage
No ratings yet
7 CNNWithCustomImage
11 pages
DEL AAT front sheet (2)
No ratings yet
DEL AAT front sheet (2)
8 pages
Convolutional Neural Networks: Objectives
No ratings yet
Convolutional Neural Networks: Objectives
10 pages
Lab 09
No ratings yet
Lab 09
5 pages
Solution Methodology
No ratings yet
Solution Methodology
4 pages
CS5242 Assignment 2
No ratings yet
CS5242 Assignment 2
12 pages
IMPLEMENT A NEURAL NETWORK USING PYTHON
No ratings yet
IMPLEMENT A NEURAL NETWORK USING PYTHON
4 pages
Deep Learning Lab Practicals
No ratings yet
Deep Learning Lab Practicals
24 pages
Structure of Convolutional Neural Networks - Deep Learning
No ratings yet
Structure of Convolutional Neural Networks - Deep Learning
12 pages
Computer_Vision_Material
No ratings yet
Computer_Vision_Material
21 pages
CNN Model Construction
No ratings yet
CNN Model Construction
22 pages
CNN Project
No ratings yet
CNN Project
16 pages
UNIT 2 Self Notes
No ratings yet
UNIT 2 Self Notes
10 pages
Introduction to ANN with steps 10 25
No ratings yet
Introduction to ANN with steps 10 25
30 pages
03_pytorch_computer_vision
No ratings yet
03_pytorch_computer_vision
29 pages
cat_dog_classification_CNN_Model
No ratings yet
cat_dog_classification_CNN_Model
13 pages
nndl
No ratings yet
nndl
20 pages
Convolutional Neural Networks: Convolutions, Pooling and Cnns. Neural Architectures For Computer Vision
No ratings yet
Convolutional Neural Networks: Convolutions, Pooling and Cnns. Neural Architectures For Computer Vision
64 pages
Report23 24
No ratings yet
Report23 24
55 pages
Assignment No 2 - OCR CNN
No ratings yet
Assignment No 2 - OCR CNN
2 pages
Deep Learning Lab Manual
100% (10)
Deep Learning Lab Manual
30 pages
ENG21CS0302 - SGAN
No ratings yet
ENG21CS0302 - SGAN
7 pages
MICCAI Educational Challenge
No ratings yet
MICCAI Educational Challenge
3 pages
AIML Lab 3
No ratings yet
AIML Lab 3
17 pages
Explore the Implementation of CNNs in Python
No ratings yet
Explore the Implementation of CNNs in Python
10 pages
NN & DL Lab Manual 1-1
No ratings yet
NN & DL Lab Manual 1-1
23 pages
IMPLEMENT A NEURAL NETWORK USING PYTHON
No ratings yet
IMPLEMENT A NEURAL NETWORK USING PYTHON
5 pages
"I C U N N ": Mage Lassification Sing Eural Etworks
No ratings yet
"I C U N N ": Mage Lassification Sing Eural Etworks
15 pages
Why Convolutions?: Till Now in MLP
No ratings yet
Why Convolutions?: Till Now in MLP
38 pages
Brain Tumour Classification
No ratings yet
Brain Tumour Classification
10 pages
1729492946538
No ratings yet
1729492946538
10 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
From Everand
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
Fouad Sabry
No ratings yet
Grade 1 - Pause & Think Online - Lesson Slides
No ratings yet
Grade 1 - Pause & Think Online - Lesson Slides
15 pages
Assistive and Rehab Technologies - Divya
No ratings yet
Assistive and Rehab Technologies - Divya
49 pages
Jawaban ASAS 25
No ratings yet
Jawaban ASAS 25
173 pages
Computer Organization and Assembly Language
No ratings yet
Computer Organization and Assembly Language
12 pages
400V 18.5KW Motor Thermal Curve
No ratings yet
400V 18.5KW Motor Thermal Curve
1 page
Sample-Asia Data
No ratings yet
Sample-Asia Data
139 pages
AI With Python MCQ's
No ratings yet
AI With Python MCQ's
4 pages
Yemen WTTX Site Solution
No ratings yet
Yemen WTTX Site Solution
8 pages
Add a patron
No ratings yet
Add a patron
3 pages
Linux - Equivalent of Export Command in Windows - Super User
No ratings yet
Linux - Equivalent of Export Command in Windows - Super User
1 page
300-4L User Guide PDF
No ratings yet
300-4L User Guide PDF
14 pages
OFM April 1 Population Estimates
No ratings yet
OFM April 1 Population Estimates
7 pages
Papers in Quantitative Finance March 2024 1712238549
No ratings yet
Papers in Quantitative Finance March 2024 1712238549
27 pages
Sonatest Manual
No ratings yet
Sonatest Manual
117 pages
In House Training Report: Basic Safety
No ratings yet
In House Training Report: Basic Safety
11 pages
Fs44a & Fs54a NM-C171 NM-C171 PDF
No ratings yet
Fs44a & Fs54a NM-C171 NM-C171 PDF
50 pages
MSS Agent Expectation Document
No ratings yet
MSS Agent Expectation Document
2 pages
P.RG A4202G-o - User - Manual - HBK 939800029
No ratings yet
P.RG A4202G-o - User - Manual - HBK 939800029
165 pages
Pechakucha Presentation
No ratings yet
Pechakucha Presentation
7 pages
1.3.23 FlexStack-Plus Hot-Swappable Stacking Module (C2960X-STACK) + Cable 0.5 M - Datasheet
No ratings yet
1.3.23 FlexStack-Plus Hot-Swappable Stacking Module (C2960X-STACK) + Cable 0.5 M - Datasheet
3 pages
Get Automation for Food Engineering Food Quality Quantization and Process Control 1st Edition Lev Nelik free all chapters
No ratings yet
Get Automation for Food Engineering Food Quality Quantization and Process Control 1st Edition Lev Nelik free all chapters
41 pages
Lesson Plan in Media Information Technology
0% (1)
Lesson Plan in Media Information Technology
4 pages
Compaq nx9420
No ratings yet
Compaq nx9420
41 pages
AAI JE IT PRACTICE QUESTIONS
No ratings yet
AAI JE IT PRACTICE QUESTIONS
19 pages
Basic ITK Customization Concept Part-02
No ratings yet
Basic ITK Customization Concept Part-02
1 page
6.7.12 Packet Tracer - Configure Cisco Devices For Syslog, NTP, and SSH Operations - ILM
No ratings yet
6.7.12 Packet Tracer - Configure Cisco Devices For Syslog, NTP, and SSH Operations - ILM
4 pages
Accessing Mysql Using Pdo: This Work Is Licensed Under A
No ratings yet
Accessing Mysql Using Pdo: This Work Is Licensed Under A
28 pages
Tooryalai CV
No ratings yet
Tooryalai CV
2 pages
Full Learn PowerShell Scripting in A Month of Lunches: Write and Organize Scripts and Tools, 2nd Edition James Petty Ebook All Chapters
100% (2)
Full Learn PowerShell Scripting in A Month of Lunches: Write and Organize Scripts and Tools, 2nd Edition James Petty Ebook All Chapters
49 pages
Loceria Colombiana S.A.S. 22050101 1,535,985.00 890916575 Distribuidora de Vinos y Licores SAS 22050101 2,000.25
No ratings yet
Loceria Colombiana S.A.S. 22050101 1,535,985.00 890916575 Distribuidora de Vinos y Licores SAS 22050101 2,000.25
4 pages