Deep Learning
Assignment- Week 9
TYPE OF QUESTION: MCQ/MSQ
Number of questions: 10; Total marks: 10 × 1 = 10
______________________________________________________________________________
QUESTION 1:
What can be a possible consequence of choosing a very small learning rate?
a. Slow convergence
b. Overshooting minima
c. Oscillations around the minima
d. All of the above
Correct Answer: a
Detailed Solution:
A very small learning rate leads to slow convergence; overshooting and oscillations around the minima are consequences of a learning rate that is too large. Thus option (a) is correct.
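A minimal sketch in plain Python (the function $f(x) = x^2$, the learning rates, and the step count are illustrative assumptions, not from the question) showing how a very small learning rate leaves the iterate far from the minimum:

    # Gradient descent on f(x) = x^2, whose gradient is f'(x) = 2x.
    def gradient_descent(lr, steps=100, x=5.0):
        for _ in range(steps):
            x = x - lr * 2 * x  # update: x <- x - lr * f'(x)
        return x

    print(gradient_descent(lr=0.001))  # very small lr: ~4.09, still far from 0
    print(gradient_descent(lr=0.1))    # moderate lr: ~1e-9, essentially converged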
______________________________________________________________________________
QUESTION 2:
The following is the equation of the update vector for the momentum optimizer:
$v_t = \gamma v_{t-1} + \eta \nabla_\theta J(\theta)$
Which of the following is true for $\gamma$?
Correct Answer: a
Detailed Solution:
A fraction of the update vector of the past time step is added to the current update vector. $\gamma$ is that fraction; it indicates how much acceleration you want, and its value lies between 0 and 1.
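A minimal sketch of this update in plain Python (the quadratic objective and the values of $\gamma$ and $\eta$ are illustrative assumptions):

    # Momentum: v_t = gamma * v_{t-1} + eta * grad, then theta <- theta - v_t.
    gamma, eta = 0.9, 0.1   # gamma is the fraction in [0, 1)
    theta, v = 5.0, 0.0
    for _ in range(200):
        grad = 2 * theta            # gradient of f(theta) = theta^2
        v = gamma * v + eta * grad  # add a fraction of the past update vector
        theta = theta - v
    print(theta)                    # close to the minimum at 0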
______________________________________________________________________________
QUESTION 3:
Which of the following is true about momentum optimizer?
Correct Answer: d
Detailed Solution:
Options (a), (b), and (c) are all true for the momentum optimizer. Thus, option (d) is correct.
______________________________________________________________________________
QUESTION 4:
Let $J(\theta)$ be the cost function, and let the gradient descent update rule for $\theta$ be $\theta \leftarrow \theta - \eta \nabla_\theta J(\theta)$. Which of the following is true for $\eta$?
a.
b.
c.
d.
Correct Answer: a
Detailed Solution:
The gradient descent update rule for $\theta$ is
$\theta \leftarrow \theta - \eta \nabla_\theta J(\theta)$, where $\eta$ is the learning rate.
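As a one-step worked example with illustrative numbers: take $J(\theta) = \theta^2$, $\theta = 3$, and $\eta = 0.1$. Then $\nabla_\theta J(\theta) = 2\theta = 6$, and the update gives $\theta \leftarrow 3 - 0.1 \times 6 = 2.4$, a step downhill toward the minimum at $\theta = 0$.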
______________________________________________________________________________
QUESTION 5:
What is the parameter update rule for gradient descent optimization at step $t+1$? Consider $\eta$ to be the learning rate.
a.
b.
c.
d.
Correct Answer: a
Detailed Solution:
______________________________________________________________________________
QUESTION 6:
If the first few iterations of gradient descent cause the function $f(\theta_0, \theta_1)$ to increase rather than decrease, then what could be the most likely cause?
Correct Answer: a
Detailed Solution:
If the learning rate were small enough, gradient descent would successfully take a tiny downhill step and decrease $f(\theta_0, \theta_1)$ at least a little. If gradient descent instead increases the objective value, the learning rate is too high.
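A minimal sketch of this failure mode in plain Python (the function and the oversized learning rate are illustrative assumptions):

    # Gradient descent on f(x) = x^2 with too large a learning rate.
    x, lr = 1.0, 1.5
    for step in range(5):
        x = x - lr * 2 * x     # x is multiplied by (1 - 2*lr) = -2 each step
        print(step, x, x * x)  # f(x) increases: 4, 16, 64, 256, 1024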
______________________________________________________________________________
QUESTION 7:
For a function $f(\theta_0, \theta_1)$ in which $\theta_0$ and $\theta_1$ are initialized at a global minimum, what should $\theta_0$ and $\theta_1$ be after a single iteration of gradient descent?
Correct Answer: b
Detailed Solution:
At a minimum (global or local), the derivative (gradient) is zero, so a gradient descent step leaves the parameters unchanged.
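A minimal sketch in plain Python (the function and learning rate are illustrative assumptions) showing that a step taken from the minimum changes nothing:

    # At the minimum of f(x) = x^2 the gradient 2x is 0, so the update is a no-op.
    x, lr = 0.0, 0.1
    x_new = x - lr * 2 * x
    print(x_new == x)  # True: the parameters stay at the minimum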
______________________________________________________________________________
QUESTION 8:
What can be one of the practical problems of exploding gradient?
a. Too large update of weight values leading to unstable network
b. Too small update of weight values inhibiting the network to learn
c. Too large update of weight values leading to faster convergence
d. Too small update of weight values leading to slower convergence
Correct Answer: a
Detailed Solution:
Exploding gradients are a problem where large error gradients accumulate and result in very large updates to the neural network's weights during training. This makes the model unstable and unable to learn from the training data.
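A minimal sketch of how such gradients accumulate, in plain Python (the scalar "network" and the per-layer scale factor are illustrative assumptions):

    # If back-propagation multiplies the gradient by a factor > 1 at each of
    # many layers, the gradient grows exponentially.
    grad = 1.0
    for layer in range(50):
        grad *= 1.5  # each layer scales the gradient by 1.5
    print(grad)      # ~6.4e8: the next weight update would be enormous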
______________________________________________________________________________
QUESTION 9:
What are the steps for using a gradient descent algorithm?
1. Calculate error between the actual value and the predicted value
2. Update the weights and biases using gradient descent formula
3. Pass an input through the network and get values from output layer
4. Initialize weights and biases of the network with random values
5. Calculate gradient value corresponding to each weight and bias
a. 1, 2, 3, 4, 5
b. 5, 4, 3, 2, 1
c. 3, 2, 1, 5, 4
d. 4, 3, 1, 5, 2
Correct Answer: d
Detailed Solution:
Initialize the weights and biases randomly, pass input instances through the network, compute the error at the output layer, and back-propagate the error through each preceding layer. Then update the weights and biases using the learning rate and the gradient of the error. Please refer to the lectures of week 4.
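A minimal sketch of the loop in that order, for a single linear neuron with squared-error loss (the tiny dataset, learning rate, and epoch count are illustrative assumptions):

    import random

    data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]  # toy data for y = 2x
    lr = 0.05

    # Step 4: initialize weight and bias with random values.
    w, b = random.uniform(-1, 1), random.uniform(-1, 1)

    for epoch in range(200):
        for x, y in data:
            y_hat = w * x + b   # Step 3: pass input through, get output
            error = y_hat - y   # Step 1: error vs. the actual value
            grad_w = error * x  # Step 5: gradient w.r.t. the weight
            grad_b = error      # Step 5: gradient w.r.t. the bias
            w -= lr * grad_w    # Step 2: gradient descent update
            b -= lr * grad_b

    print(w, b)  # w approaches 2 and b approaches 0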
______________________________________________________________________________
QUESTION 10:
You run gradient descent for 15 iterations with learning rate $\eta$ and compute the error after each iteration. You find that the value of the error decreases very slowly. Based on this, which of the following conclusions seems most plausible?
Correct Answer: a
Detailed Solution:
The error is decreasing very slowly; therefore, increasing the learning rate is the most plausible remedy.
______________________________________________________________________________
************END*******