Sheet 3 Sol 3


1. What is the purpose of gradient descent optimization in machine learning?

- It is used to minimize a loss function by iteratively moving towards the minimum of the function.
- It calculates the gradient (or slope) of the function at the current point and updates the parameters in the opposite direction to find the optimal values (see the sketch below).
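A minimal Python sketch of this loop on an assumed toy function f(w) = (w − 3)², whose minimum is at w = 3. The function, starting point, and learning rate are illustrative choices, not from the sheet:

def grad(w):
    # derivative of f(w) = (w - 3)**2
    return 2 * (w - 3)

w = 0.0        # initial guess (assumed)
alpha = 0.1    # learning rate (assumed)
for step in range(50):
    w = w - alpha * grad(w)   # step against the gradient

print(w)  # approaches 3.0, the minimizer of f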

2. What is the learning rate in gradient descent, and why is it important?

- The learning rate controls the size of the steps taken during the gradient descent updates.
- It is important because it determines how fast or slow the optimization algorithm converges:

 A small learning rate makes the convergence slow but precise.
 A large learning rate might speed up the process but can cause overshooting, making it harder to converge to the minimum. The sketch after this list illustrates both regimes.
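A small sketch on an assumed toy function f(w) = w², minimized at w = 0, with derivative f'(w) = 2w; the learning rates below are illustrative:

def run(alpha, steps=10, w=1.0):
    # repeatedly apply w <- w - alpha * f'(w), with f'(w) = 2w
    for _ in range(steps):
        w -= alpha * 2 * w
    return w

print(run(alpha=0.01))  # small rate: slow but steady progress toward 0
print(run(alpha=0.4))   # moderate rate: converges quickly
print(run(alpha=1.1))   # too large: each step overshoots and |w| grows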

3. What is the role of batch size in gradient descent optimization?

- The batch size determines how many training examples are used to compute the gradient.
- Small batch sizes give faster but noisier, less stable updates.
- Large batch sizes converge more slowly per update but give more accurate, stable gradient estimates.
- There are three common variants (compared in the sketch after this list):

 Batch Gradient Descent: uses the entire dataset at each step.
 Stochastic Gradient Descent (SGD): uses one training example at each step.
 Mini-batch Gradient Descent: uses a small subset (mini-batch) of the dataset at each step.
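A hedged sketch of the three variants on an assumed toy least-squares problem; the dataset, model, and hyperparameters are illustrative, and the point is that only batch_size changes between the variants:

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 1))
y = 3.0 * X[:, 0] + rng.normal(scale=0.1, size=100)  # true slope is 3

def fit(batch_size, alpha=0.1, epochs=20):
    w = 0.0
    for _ in range(epochs):
        idx = rng.permutation(len(X))             # reshuffle each epoch
        for start in range(0, len(X), batch_size):
            b = idx[start:start + batch_size]
            # gradient of the mean squared error on this batch
            g = 2 * np.mean((w * X[b, 0] - y[b]) * X[b, 0])
            w -= alpha * g
    return w

print(fit(batch_size=len(X)))  # batch gradient descent
print(fit(batch_size=1))       # stochastic gradient descent (SGD)
print(fit(batch_size=16))      # mini-batch gradient descent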

4. Consider the quadratic function f(x, y) = x² + 3y² + 2xy − 6x − 9y + 5. Use gradient descent to find the minimum value of the function. Start with an initial guess for (x, y) and update it iteratively using the gradient descent algorithm.

Given conditions:
- Initial guess: (1, 1)
- Learning rate: α = 0.1
- Number of iterations: 3
- Partial derivative of f(x, y) with respect to x: 2x + 2y − 6
- Partial derivative of f(x, y) with respect to y: 2x + 6y − 9

The gradient vector is ∇f(x, y) = (2x + 2y − 6, 2x + 6y − 9).

Gradient descent updates (x, y) ← (x, y) − α∇f(x, y), starting at (x, y) = (1, 1) with α = 0.1 for 3 iterations:

            Iteration 1            Iteration 2                Iteration 3
∇f(x, y)    ∇f(1, 1) = (−2, −1)    ∇f(1.2, 1.1) = (−1.4, 0)   ∇f(1.34, 1.1) = (−1.12, 0.28)
x_new       1 − 0.1(−2) = 1.2      1.2 − 0.1(−1.4) = 1.34     1.34 − 0.1(−1.12) = 1.452
y_new       1 − 0.1(−1) = 1.1      1.1 − 0.1(0) = 1.1         1.1 − 0.1(0.28) = 1.072

After three iterations, (x, y) = (1.452, 1.072), where ∇f(1.452, 1.072) = (−0.952, 0.336). The gradient is shrinking towards (0, 0): setting ∇f(x, y) = 0 gives the exact minimizer (x, y) = (2.25, 0.75), where f(2.25, 0.75) = −5.125, and further iterations would approach it.
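A short Python script to reproduce the three iterations above and the gradient at the final point; everything here comes from the problem statement:

def grad(x, y):
    # gradient of f(x, y) = x^2 + 3y^2 + 2xy - 6x - 9y + 5
    return (2 * x + 2 * y - 6, 2 * x + 6 * y - 9)

x, y = 1.0, 1.0   # initial guess
alpha = 0.1       # learning rate

for i in range(1, 4):
    gx, gy = grad(x, y)
    x, y = x - alpha * gx, y - alpha * gy
    print(f"iter {i}: grad = ({gx:.3f}, {gy:.3f}) -> (x, y) = ({x:.3f}, {y:.3f})")

gx, gy = grad(x, y)
print(f"final gradient: ({gx:.3f}, {gy:.3f})")  # shrinking toward (0, 0)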
