0% found this document useful (0 votes)
5 views

DL 4

Convolutional Neural Networks (CNNs) are a specialized type of deep learning architecture used primarily in computer vision for tasks like image classification and object detection. They consist of multiple layers, including convolutional, pooling, and fully connected layers, which work together to extract features from visual data while being robust to transformations. Despite their advantages, such as high accuracy and end-to-end training, CNNs can be computationally expensive and require large amounts of labeled data.

Uploaded by

sumathipogiri
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views

DL 4

Convolutional Neural Networks (CNNs) are a specialized type of deep learning architecture used primarily in computer vision for tasks like image classification and object detection. They consist of multiple layers, including convolutional, pooling, and fully connected layers, which work together to extract features from visual data while being robust to transformations. Despite their advantages, such as high accuracy and end-to-end training, CNNs can be computationally expensive and require large amounts of labeled data.

Uploaded by

sumathipogiri
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

A Convolutional Neural Network (CNN) is a type of Deep Learning neural network architecture

commonly used in Computer Vision. Computer vision is a field of Artificial Intelligence that enables a
computer to understand and interpret the image or visual data.

When it comes to Machine Learning, Artificial Neural Networks perform really well. Neural Networks
are used in various datasets like images, audio, and text. Different types of Neural Networks are used
for different purposes, for example for predicting the sequence of words we use Recurrent Neural
Networks more precisely an LSTM, similarly for image classification we use Convolution Neural
networks. In this blog, we are going to build a basic building block for CNN. Neural Networks: Layers
and Functionality

In a regular Neural Network there are three types of layers:

1. Input Layers: It’s the layer in which we give input to our model. The number of neurons in
this layer is equal to the total number of features in our data (number of pixels in the case of
an image).

2. Hidden Layer: The input from the Input layer is then fed into the hidden layer. There can be
many hidden layers depending on our model and data size. Each hidden layer can have
different numbers of neurons which are generally greater than the number of features. The
output from each layer is computed by matrix multiplication of the output of the previous
layer with learnable weights of that layer and then by the addition of learnable biases
followed by activation function which makes the network nonlinear.

3. Output Layer: The output from the hidden layer is then fed into a logistic function like
sigmoid or softmax which converts the output of each class into the probability score of each
class.

The data is fed into the model and output from each layer is obtained from the above step is
called feedforward, we then calculate the error using an error function, some common error
functions are cross-entropy, square loss error, etc. The error function measures how well the
network is performing. After that, we backpropagate into the model by calculating the derivatives.
This step is called Backpropagation which basically is used to minimize the loss.

Convolution Neural Network

Convolutional Neural Network (CNN) is the extended version of artificial neural networks (ANN)
which is predominantly used to extract the feature from the grid-like matrix dataset. For example
visual datasets like images or videos where data patterns play an extensive role.

CNN Architecture

Convolutional Neural Network consists of multiple layers like the input layer, Convolutional layer,
Pooling layer, and fully connected layers.
Image source: Deep Learning Udacity

The Convolutional layer applies filters to the input image to extract features, the Pooling layer
downsamples the image to reduce computation, and the fully connected layer makes the final pr

ediction. The network learns the optimal filters through backpropagation and gradient descent.

Example: Applying CNN to an Image

Let’s consider an image and apply the convolution layer, activation layer, and pooling layer operation
to extract the inside feature.

Input image:

ANY IMAGE(PASTE)

Step:

 import the necessary libraries

 set the parameter

 define the kernel

 Load the image and plot it.

 Reformat the image

 Apply convolution layer operation and plot the output image.

 Apply activation layer operation and plot the output image.

 Apply pooling layer operation and plot the output image.


Advantages and Disadvantages of Convolutional Neural Networks (CNNs)

Advantages of CNNs:

1. Good at detecting patterns and features in images, videos, and audio signals.

2. Robust to translation, rotation, and scaling invariance.

3. End-to-end training, no need for manual feature extraction.

4. Can handle large amounts of data and achieve high accuracy.

Disadvantages of CNNs:

1. Computationally expensive to train and require a lot of memory.

2. Can be prone to overfitting if not enough data or proper regularization is used.

3. Requires large amounts of labeled data.

4. Interpretability is limited, it’s hard to understand what the network has learned.

Convolutional neural networks (CNNs) are a type of deep learning neural network that is specifically
designed

for image processing and computer vision tasks. CNNs are inspired by the structure and function of
the human

visual cortex, which is the part of the brain that is responsible for processing visual information.

CNNs have a number of advantages over other types of neural networks for image processing and
computer

vision tasks:

• They are able to extract features from images that are invariant to translation, rotation, and scaling.

This means that the same features can be detected even if the object in the image is in a different

position or orientation.

• They are able to extract features from images that are hierarchically organized. This means that
they

can start by extracting simple features, such as edges and corners, and then use those features to

extract more complex features, such as faces and objects.

• They are computationally efficient. This is because the convolution operation at the heart of CNNs

can be implemented using the fast Fourier transform (FFT).

CNNs have revolutionized the field of computer vision. They are now used in a wide range of
applications,

including:
• Image classification: CNNs can be used to classify images into different categories, such as cats,
dogs,

and cars.

• Object detection: CNNs can be used to detect objects in images, such as pedestrians, cars, and
traffic

signs.

• Facial recognition: CNNs can be used to recognize faces in images.

• Medical imaging: CNNs can be used to analyze medical images, such as X-rays and MRI scans, to

diagnose diseases and identify abnormalities.

• Natural language processing: CNNs can be used to extract features from text, which can then be
used

for tasks such as sentiment analysis and machine translation.

CNNs are a powerful tool for a wide range of applications. They are still under active development,
and

new applications for CNNs are being discovered all the time.

Here are some specific examples of how CNNs are being used today:

• Facebook uses CNNs to recognize faces in photos.

• Google uses CNNs to power its image search engine.

• Tesla uses CNNs to power its self-driving cars.

• Doctors use CNNs to analyze medical images and diagnose diseases.

• Researchers are using CNNs to develop new methods for machine translation and text analysis.

CNNs are a powerful and versatile tool that is transforming the way we interact with the world
around us.

You might also like