
Deep Learning


WEEK 6

Session agenda
• Introduction to Deep Learning with a case study discussion
• Understanding nodes and layers
• Loss function
• Activation function
• Forward and Backward propagation
• Gradient descent
• Manipulating Deep Neural Networks


• Non-convex function
• Transfer learning
• Natural Language Processing

Why Deep Learning?


• Deep learning is the first class of algorithms that scales with data: its performance keeps getting better as you feed it more data.
• Almost all of the value of deep learning today comes from supervised learning, i.e. learning from labeled data.
Linear equation

This function is defined as a weighted sum of its inputs.


Simple Threshold Function (Step Function)
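A minimal sketch of these two pieces in Python (the inputs, weights, and bias below are made-up values):

```python
# A single neuron as a weighted sum of its inputs followed by a
# simple threshold (step) activation.
import numpy as np

def weighted_sum(x, w, b):
    # z = w1*x1 + w2*x2 + ... + wn*xn + b
    return np.dot(w, x) + b

def step(z, threshold=0.0):
    # Simple threshold (step) function: outputs 1 if z crosses the threshold.
    return 1 if z >= threshold else 0

x = np.array([2.0, 3.0])    # example inputs
w = np.array([0.5, -0.2])   # example weights
b = 0.1                     # bias
print(step(weighted_sum(x, w, b)))  # -> 1 (0.5*2 - 0.2*3 + 0.1 = 0.5 >= 0)
```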

Sigmoid Function


Loss Function

The loss function measures the error between the predicted output and the desired output; the goal of training is to minimize this error.
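As an illustration, here is a sigmoid output paired with the binary cross-entropy loss, one common choice for classification (the logits and labels are made up):

```python
import numpy as np

def sigmoid(z):
    # Squashes any real number into (0, 1), so it can be read as a probability.
    return 1.0 / (1.0 + np.exp(-z))

def binary_cross_entropy(y_true, y_pred, eps=1e-12):
    # Loss is small when the predicted probability is close to the true label.
    y_pred = np.clip(y_pred, eps, 1 - eps)
    return -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))

y_true = np.array([1, 0, 1])
y_pred = sigmoid(np.array([2.0, -1.5, 0.3]))
print(binary_cross_entropy(y_true, y_pred))
```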

Activation Function
Perceptron


Is ReLU faster than tanh?


The main idea is to keep the gradient non-zero so that training can eventually recover. ReLU is also less computationally expensive than tanh and sigmoid because it involves simpler mathematical operations.
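A short sketch contrasting the three activations (illustrative only):

```python
# ReLU is just a comparison and a max, while tanh and sigmoid need exponentials.
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z = np.linspace(-3, 3, 7)
print(relu(z))      # zero for negative inputs, identity for positive
print(np.tanh(z))   # saturates at -1 and 1, gradient shrinks at the tails
print(sigmoid(z))   # saturates at 0 and 1, gradient shrinks at the tails
```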
Gradient Descent


Adjusted values of w and b


Gradient descent is the essence of the learning process: through it the machine learns what values of weights and biases minimize the cost function. It does this by iteratively comparing its predicted output for a set of data to the true output, in a process called training.

The alpha term in front of the partial derivative is called the learning rate and is a measure of how big a step to take at each iteration.
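A toy sketch of one such update for a single weight and bias on one squared-error example (the data point and learning rate are made up); the rule being applied is w ← w − alpha * ∂Loss/∂w, and similarly for b:

```python
# Minimal gradient descent sketch for a linear model y_pred = w*x + b.
def gradient_descent_step(w, b, x, y, alpha=0.05):
    y_pred = w * x + b                 # forward pass
    dw = 2 * (y_pred - y) * x          # partial derivative of (y_pred - y)^2 w.r.t. w
    db = 2 * (y_pred - y)              # partial derivative w.r.t. b
    return w - alpha * dw, b - alpha * db  # step against the gradient

w, b = 0.0, 0.0
for _ in range(100):
    w, b = gradient_descent_step(w, b, x=2.0, y=5.0)
print(w, b)  # approaches values where 2*w + b ≈ 5
```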
Training model
Forward Propagation

First, a linear combination of the inputs, weights, and biases is computed at each neuron in a layer. At each neuron/node, this linear combination is then passed through an activation function that introduces nonlinearity into the model. This process, by which the inputs are propagated through the weights and biases to the output, is called forward propagation. After arriving at the predicted output, the value of the loss function for the training example is calculated.
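A minimal sketch of a forward pass through one hidden layer; the layer sizes, random weights, and the choice of ReLU plus sigmoid are illustrative assumptions:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, W1, b1, W2, b2):
    z1 = W1 @ x + b1        # linear combination at each hidden neuron
    a1 = np.maximum(0, z1)  # ReLU introduces nonlinearity
    z2 = W2 @ a1 + b2       # linear combination at the output neuron
    return sigmoid(z2)      # predicted probability

rng = np.random.default_rng(0)
x = rng.normal(size=3)                            # 3 input features
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)     # hidden layer of 4 neurons
W2, b2 = rng.normal(size=(1, 4)), np.zeros(1)     # single output neuron
print(forward(x, W1, b1, W2, b2))
```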

Backward Propagation
Backpropagation is the process of calculating the partial derivatives of the loss function back from the output to the inputs, and using them to update the values of w and b so that they move toward the minimum.
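A hedged sketch of backpropagation for a single sigmoid neuron with cross-entropy loss (the inputs and learning rate are made up; with this pairing the derivative of the loss with respect to the pre-activation simplifies to y_pred − y):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([1.5, -0.7])
y = 1.0
w, b, alpha = np.zeros(2), 0.0, 0.1

for _ in range(200):
    y_pred = sigmoid(np.dot(w, x) + b)   # forward pass
    error = y_pred - y                   # dLoss/dz for sigmoid + cross-entropy
    dw = error * x                       # chain rule back to the weights
    db = error                           # chain rule back to the bias
    w -= alpha * dw                      # update toward the minimum
    b -= alpha * db
print(y_pred)  # approaches 1.0, the true label
```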


Final Design


Accuracy
We will apply a threshold value to the output, 0.5 for instance, so that probability values of 0.5 or above result in a predicted output value of 1, whereas probability values below 0.5 result in a predicted output value of 0.
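A small illustrative snippet of this thresholding (the probabilities and labels are made up):

```python
import numpy as np

probs = np.array([0.92, 0.31, 0.56, 0.08])   # model outputs (probabilities)
y_true = np.array([1, 0, 1, 1])              # true labels

y_pred = (probs >= 0.5).astype(int)          # 0.5 or above -> 1, below 0.5 -> 0
accuracy = np.mean(y_pred == y_true)
print(y_pred, accuracy)                      # [1 0 1 0] 0.75
```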
Case Study
Predicting the Probability of Credit Card Approval

We are going to discuss a credit card approval decision using a neural network. The factors that will be used to determine the output are: age, salary, education, city, and company.

Manipulating Deep Neural Networks


Hidden Layers
The blue layers (the layers between the input and the final output neuron) are called the hidden layers. These are what make the model "deep". The hidden-layer neurons try to capture the latent patterns within the data that will help the model predict the creditworthiness of the individual.

Neural networks are also referred to as black-box models because, even though the model may predict with high accuracy, the results are not usually interpretable. The number of layers and the number of neurons in each layer are called hyperparameters; their desired values are found by the data scientist through trial and error, unlike the weights (also called parameters), whose optimal values are found by calculus or other mathematical methods.
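As an illustration of such an architecture, here is a hedged Keras sketch for the credit-approval case study; the layer sizes, activations, and placeholder data are assumptions, not the model used in the session:

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

n_features = 5  # age, salary, education, city, company (after preprocessing)
model = keras.Sequential([
    keras.Input(shape=(n_features,)),
    layers.Dense(8, activation="relu"),    # hidden layer 1 (a hyperparameter)
    layers.Dense(4, activation="relu"),    # hidden layer 2 (a hyperparameter)
    layers.Dense(1, activation="sigmoid"), # probability of approval
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# Random placeholder data, only so the snippet runs end to end.
X_train = np.random.rand(100, n_features)
y_train = np.random.randint(0, 2, size=100)
model.fit(X_train, y_train, epochs=5, batch_size=16, verbose=0)
```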

Convex vs Non-Convex Function
• A non-convex function "curves up and down": it is wavy, with some 'valleys' (local minima) that are not as deep as the overall deepest 'valley' (the global minimum).
• Optimization algorithms can get stuck in a local minimum.

Convergence

Non-Convex Optimization
• In deep learning, non-convex optimization may converge at a bad local minimum. In such a case, we can re-optimize the system with a different initialization and/or add extra noise to the gradient updates.
• We may face convergence to a saddle point, which can be tackled by finding the Hessian and computing a descent direction.
• Getting stuck in a region of low gradient magnitude can be addressed using batch normalization or by designing networks with the rectified linear unit (ReLU) activation function.
• We may take huge steps and diverge because of high curvature. In that case, we can use an adaptive step size or, more simply, limit the size of the gradient step (see the sketch after this list).

• If we end up with a wrong setting of hyperparameters, we can go for hyperparameter optimization methods.
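A hedged Keras sketch of two of the remedies above, batch normalization with ReLU and an adaptive optimizer (Adam) with the gradient norm clipped; the layer sizes are arbitrary and the library choice is an assumption, not from the slides:

```python
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    keras.Input(shape=(20,)),
    layers.Dense(64),
    layers.BatchNormalization(),   # batchnorm helps in low-gradient regions
    layers.Activation("relu"),     # ReLU keeps gradients from vanishing
    layers.Dense(1, activation="sigmoid"),
])
optimizer = keras.optimizers.Adam(learning_rate=1e-3, clipnorm=1.0)  # adaptive step, limited size
model.compile(optimizer=optimizer, loss="binary_crossentropy")
```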

[Figures: saddle point; convergence in Adam optimization]
Transfer Learning
• Transfer learning generally refers to a process where a model trained on one problem is used in some
way on a second related problem.
• Transfer learning has the benefit of decreasing the training time for a neural network model and can
result in lower generalization error.


How Transfer Learning works
• A model can be downloaded and used as-is.
• Models can be downloaded and used as feature-extraction models. Here, the output of a layer prior to the model's output layer is used as the input to a new classifier model.
• The pre-trained model can be used as a separate feature-extraction program, in which case input can be pre-processed by the model, or by a portion of the model, to give an output (e.g. a vector of numbers) for each input image, which can then be used as input when training a new model.
• The pre-trained model, or a desired portion of it, can be integrated directly into a new neural network model. In this usage, the weights of the pre-trained model can be frozen so that they are not updated as the new model is trained. Alternately, the weights may be updated during the training of the new model, perhaps with a lower learning rate, allowing the pre-trained model to act like a weight-initialization scheme when training the new model.
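A hedged Keras sketch of the last option: a pre-trained model (MobileNetV2 here, chosen only as an example) integrated into a new network with its weights frozen; the dataset and classifier head are illustrative assumptions:

```python
from tensorflow import keras
from tensorflow.keras import layers

base = keras.applications.MobileNetV2(
    input_shape=(224, 224, 3), include_top=False, weights="imagenet")
base.trainable = False                       # freeze the pre-trained weights

model = keras.Sequential([
    base,
    layers.GlobalAveragePooling2D(),         # pooled features from the base model
    layers.Dense(1, activation="sigmoid"),   # new classifier head for the new task
])
model.compile(optimizer=keras.optimizers.Adam(1e-4),  # lower learning rate
              loss="binary_crossentropy", metrics=["accuracy"])
```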


ANY QUESTIONS
