MLFA Spring 2024

The document outlines the structure and content of a class test for the Machine Learning Foundations and Applications course at IIT Kharagpur, scheduled for February 1, 2024. It includes various topics such as supervised and unsupervised learning, K-Nearest Neighbors, linear models, and Naive Bayes, along with specific questions and instructions for the test. Additionally, it provides details on a mid-semester examination and a subsequent class test, including guidelines on allowed materials and the format of questions.


Indian Institute of Technology Kharagpur

Machine Learning Foundations and Applications


(AI42001)
Class Test-1, Date: Feb 1, 2024

Timing: 2:10 to 3:40 PM (# Qns: 4) Spring 2023-24 Max marks: 35

Attempt all questions


1. Intro to Machine Learning
(a) Explain the differences and similarities between (a) Supervised Learning; (b)
Unsupervised Learning, and (c) Reinforcement Learning. (3)
(b) In the context of Supervised learning, explain the concepts of (a) Labelled Data,
(b) Model, (c) Loss, and (d) Parameter Optimization. (5)
(c) In the context of Unsupervised learning, explain the difference(s) between Clustering
and Association using a practical example. (2)

2. K-Nearest Neighbor
(a) In certain situations K in a KNN cannot be too small or too large. Explain
what those situations are. (2)
(b) Explain the difference between KNN and Weighted KNN. State whether
weighted KNN solves the problem of too small K or too large K. Justify. (2)
(c) In the context of KNN, explain (a) why we need to normalize the input features
before KNN classification, and (b) what the curse of dimensionality is. (2)
(d) A KNN classifier assigns a test instance the majority class associated with its
K nearest training instances. Distance between instances is measured using
Euclidean distance. Suppose we have the following training set of positive (+)
and negative (-) instances and a single test instance (o). All instances are
projected onto a vector space of two real-valued features (X and Y). Answer
the following questions. Assume "unweighted" KNN (every nearest neighbor
contributes equally to the final vote).
Figure 1: Input distribution (training instances labelled + and -, and a single
test instance o, plotted on the two features X and Y).


(i) What would be the class assigned to this test instance for K=1?
(ii) What would be the class assigned to this test instance for K=3?
(iii) What would be the class assigned to this test instance for K=5?
(iv) Setting K to a large value seems like a good idea. We get more votes!
Given this particular training set, would you recommend setting K = 11?
Why or why not?
(4)
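The vote-counting in parts (i)-(iv) can be sketched in a few lines. Since the figure's actual coordinates are not recoverable, the training points below are assumptions chosen only to show how the assigned class can flip as K grows:

```python
import math
from collections import Counter

def knn_predict(train, test_point, k):
    """Unweighted KNN: majority class among the k nearest training instances
    under Euclidean distance."""
    by_dist = sorted(train, key=lambda item: math.dist(item[0], test_point))
    votes = Counter(label for _, label in by_dist[:k])
    return votes.most_common(1)[0][0]

# Hypothetical training set: one isolated + near the test instance,
# a cluster of - a bit farther away, more + far off.
train = [((0, 0), '+'), ((2, 2), '-'), ((2, 3), '-'),
         ((3, 2), '-'), ((5, 5), '+'), ((6, 5), '+')]
test_point = (0.5, 0.5)

for k in (1, 3, 5):
    print(k, knn_predict(train, test_point, k))  # k=1: '+', k=3: '-', k=5: '-'
```

On this toy layout the nearest single neighbor is a +, but enlarging K pulls in the - cluster, which is exactly the trade-off part (iv) asks about.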
3. Linear Models
(a) For a binary classification problem, explain (a) the pre-inner product and post-
inner product interpretations. What are the advantages of one over the other? (b)
What is the importance of bias under the post-inner product interpretation? (3)
(b) Can a linear regression classifier achieve zero training error on any of the
datasets in Fig. 2? Provide justification for your answer.

Figure 2: Six 2-dimensional labelled training sets, (A)-(F), each with two classes.

(2)
(c) A random sample of eight drivers insured with a company and having similar
auto insurance policies was selected. The following table lists their driving
experiences (in years) and monthly auto insurance premiums.

Driving Experience (years) | Monthly Auto Insurance Premium (USD)
 5 | 64
 2 | 87
12 | 50
 9 | 71
15 | 44
 6 | 56
25 | 42
16 | 60

(i) Does the insurance premium depend on the driving experience? Do you
expect a positive or a negative relationship between these two variables?
(ii) Find the least squares regression line by choosing appropriate dependent
and independent variables based on your answer in part (i).
(iii) Interpret the meaning of the values of a and b calculated in part (ii).
(iv) Predict the monthly auto insurance premium for a driver with 10 years of
driving experience.
(5)
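Parts (ii) and (iv) can be checked numerically. A minimal sketch of the closed-form least squares fit, premium = a + b * experience, using the table above:

```python
# Closed-form simple linear regression:
# b = (n*Sxy - Sx*Sy) / (n*Sxx - Sx^2),  a = mean(y) - b * mean(x).
x = [5, 2, 12, 9, 15, 6, 25, 16]       # driving experience (years)
y = [64, 87, 50, 71, 44, 56, 42, 60]   # monthly premium (USD)
n = len(x)

sx, sy = sum(x), sum(y)
sxy = sum(xi * yi for xi, yi in zip(x, y))
sxx = sum(xi * xi for xi in x)

b = (n * sxy - sx * sy) / (n * sxx - sx ** 2)  # slope
a = sy / n - b * sx / n                        # intercept

print(round(a, 2), round(b, 4))   # 76.66 -1.5476
print(round(a + b * 10, 2))       # predicted premium at 10 years: 61.18
```

The negative slope confirms the expected relationship in part (i): premiums fall as experience grows.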

4. Naive Bayes
(a) Here's a naive Bayes model with the following conditional probability table and
the following prior probabilities over classes.

Word type    |  a   |  b   |  c
P(w | y = 1) | 5/10 | 3/10 | 2/10
P(w | y = 0) | 2/10 | 2/10 | 6/10

P(y = 1) = 8/10, P(y = 0) = 2/10

Consider a binary classification problem for whether a document is about
Chandrayaan-3 (class y = 1) or is not about Chandrayaan-3 (y = 0).
Consider a document consisting of 2 a's and 1 c.
What is the probability that it is about the Chandrayaan?
(b) What is the probability that it is not about the Chandrayaan?
(5)

Best wishes

INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR
Mid Spring Semester Examination 2023-24
Date of Examination: 20-02-2024  Session: FN  Duration: 2 Hrs  Full Marks: 40
Subject No.: AI42001  Subject: MACHINE LEARNING FOUNDATIONS AND APPLICATIONS
Department/Center/School: Artificial Intelligence
Specific charts, graph paper, log book etc., required: No (Ensure question paper has 10 questions)
Special Instructions (if any): Calculators are allowed. Rough work must be present in the answer script itself.

Short Answer Questions (Answer this as a Separate Section)


1. Explain the principle of the gradient descent algorithm. Accompany your explanation with a diagram.
Explain the use of all the terms and constants. [2]
2. Derive the gradient descent training rule assuming that the target function representation is
o_d = w_0 + w_1*x_1d + ... + w_n*x_nd.
Define explicitly the cost/error function E, assuming that a set of training examples D is provided, where
each training example d in D is associated with the target output t_d. [2]
3. Which of the following statements are true for k-NN classifiers (provide all answers that are correct)? [1]
a) The classification accuracy is better with larger values of k.

b) The decision boundary is smoother with smaller values of k.


c) k-NN is a type of instance-based learning.

d) k-NN does not require an explicit training step.


e) The decision boundary is linear.

4. Give a one-sentence reason why: [1+1+1+1+1]

A. Though both Supervised and Reinforcement learning use supervision, the two supervisions are different.
B. The two unsupervised learning problems, Association and Clustering, are different.
C. We might prefer Decision Tree learning over a K-NN classifier.
D. We choose parameters that minimize the sum of squared training errors in Linear Regression.
E. LASSO Regression enforces more sparsity in weights as compared to Ridge Regression.


5. Suppose that, among the attributes used to represent the instances, a subset may be irrelevant to the
classification problem being solved. Given this situation, which one among K Nearest Neighbor and Decision Tree
would be the better modelling choice? Provide justification. [1]
Long Answer Questions (Answer this as a Separate Section)

6. In the context of Linear Regression, answer the following questions: [4+2+2]
A. We are given a set of two-dimensional inputs and their corresponding output pairs: {x_i1, x_i2, y_i}. We
would like to use the following regression model to predict y:
y = w_1*x_1 + w_2*x_2.
Derive the optimal value for w_1 when using least squares as the target minimization function (w_2 may
appear in your resulting equation). Note that there may be more than one possible value for w_1.

B. Now assume we only observe a single input for each output (that is, a set of {x, y} pairs). We would like
to compare the following two models on our input dataset (for each one we split into training and testing
sets to evaluate the learned model). Assume we have an unlimited amount of data:
Model A: y = w*x,
Model B: y = w*x^2.
Which of the following is correct (choose the answer that best describes the outcome)? Justify.
a. There are datasets for which A would perform better than B
b. There are datasets for which B would perform better than A
c. Both a and b are correct.
d. They would perform equally well on all datasets

C. For the data above we are now comparing the following two models:
Model A: y = w_1*x + w_2*x,
Model B: y = w*x.
Note that model A now uses two parameters (though both multiply the same input value, x). Again, we
assume unlimited data. Which of the following is correct (choose the answer that best describes the
outcome)? Justify your answer.
a. There are datasets for which A would perform better than B
b. There are datasets for which B would perform better than A
c. Both a and b are correct.
d. They would perform equally well on all datasets.
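One way to reason about part C: since w_1*x + w_2*x = (w_1 + w_2)*x, the two models describe exactly the same set of functions. A tiny numerical check of that identity on assumed values:

```python
import math

# Any prediction model A (y = w1*x + w2*x) can make, model B (y = w*x)
# can make with the single parameter w = w1 + w2, and vice versa.
xs = [0.5, 1.0, 2.0, 3.5]

w1, w2 = 1.3, 0.7   # arbitrary parameters for model A
w = w1 + w2         # equivalent single parameter for model B

pred_a = [w1 * x + w2 * x for x in xs]
pred_b = [w * x for x in xs]
print(all(math.isclose(a, b) for a, b in zip(pred_a, pred_b)))  # True
```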

7. Suppose you are given a Linear Classification problem with the dataset in Fig. 1. [3]

Fig. 1

We would like to use the following classification model: y = w_0 + w_1*x_1 + w_2*x_2. Illustrate, using suitable
decision boundaries, the impact of regularization on weights: (a) No regularization, (b) L1
regularization, and (c) L2 regularization. We must aim for the least loss value and assume that we can
neglect at most 1 misclassified datapoint, if needed.
8. Suppose we have the following table, which has attributes of different fruits. Apply Naïve Bayes and predict:
if a fruit has the following properties {Yellow, Sweet, Long}, then which type of fruit it is. [3]

Frequency Table:
Fruit  | Yellow | Sweet | Long | Total
Mango  |   350  |  450  |    0 |  650
Banana |   400  |  300  |  350 |  400
Others |    50  |  100  |   50 |  150
Total  |   800  |  850  |  400 | 1200

9. ISRO intends to include a module in Pragyan, the lunar probe of Chandrayaan-3, that will discriminate
between igneous rocks found on the Moon (M) and igneous rocks found on Earth (E) based on the following
characteristics (attributes): Water content ∈ {N, Y}, Number of distinct textures ∈ {> 10, < 10}, Size ∈
{S, L}, Smelly ∈ {N, Y}. Available training data is as follows. [5+3+2]

Index | Type | Water | No. of Textures | Size | Smelly
 (1)  |      |   Y   |      > 10       |      |   Y
 (2)  |  M   |   N   |      < 10       |  L   |   N
 (3)  |      |   N   |      > 10       |      |   N
 (4)  |  M   |       |      < 10       |  S   |
 (5)  |      |   Y   |      < 10       |      |
 (6)  |  E   |   Y   |      < 10       |  S   |
 (7)  |      |   Y   |      < 10       |      |
 (8)  |  E   |   N   |      < 10       |  S   |   N

(a) Train a decision tree using the above data and draw the tree (provide all your calculations).
(b) Write the learned concept for an igneous rock found on the Moon as a set of conjunctive rules (using
AND and OR operators).
(c) Figure 2 shows a decision tree with depth two. Show that this decision tree perfectly classifies the
given data. Though this decision tree gives a simpler hypothesis with zero error, why does the
approach employed in question (a) fail to output this kind of simpler decision tree?

Fig. 2: A depth-two decision tree with Size at the root and Smelly and Water at the second level.
10. Explain the concept of Bias-variance trade-off in Machine Learning. You should mathematically derive
the bias-variance relation. Also explain how we use bagging and boosting methods to counteract the bias-
variance trade-off, and which part of the bias-variance trade-off they will individually help. [5]
Good luck!
AI42001 - Machine Learning Foundations and
Applications
Class Test 2

Instructions: Please answer all questions. The maximum points of this test is 15,
and you are allowed 30 minutes to complete the test. This is a closed book test, and
the use of electronic devices other than non-programmable calculators is not
permitted during the duration of the test. Good luck!

Question 1

Pick the correct option in each of the following questions. Some questions may
have more than one correct option, and you need to identify all of them to
receive full credit.

1. A single image of size 27 x 27 x 3 is passed through a convolutional layer
having 16 convolutional filters of spatial dimension 3 x 3, each having 3
channels corresponding to the 3 channels of the input image. The padding size
is 2 and the stride of the convolutional filters is also 2. What would be the size of
the feature map produced by this convolutional layer? [1 point]
A. 9 x 9 x 16
B. 15 x 15 x 16
C. 14 x 14 x 3
D. None of the above

2. Suppose we use a polynomial kernel of degree 4 and a RBF (Gaussian) kernel to
implicitly map a 7-dimensional feature vector to feature spaces of dimensions
k_p and k_r for the polynomial and RBF kernels, respectively. What are the values of
k_p and k_r? [1 point]
A. k_p = 4, k_r = ∞
B. k_p = 28, k_r = 7

C. k_p = 2401, k_r = 49
D. k_p = 330, k_r = ∞

3. The figure shows two decision boundaries obtained using soft-margin SVM
classifiers A and B, obtained using soft-margin SVM as discussed in class:

min_{w,b,ξ}  (1/2)||w||^2 + C Σ_{i=1}^{M} ξ_i
s.t.  y_i(w^T x_i + b) ≥ 1 - ξ_i,  ∀i
      ξ_i ≥ 0,  ∀i

(Figure: the two learned decision boundaries, A and B)

The values of the hyperparameter C are C_A and C_B for the learned classifiers A
and B. What is the relationship between C_A and C_B? [1 point]
A. C_A < C_B
B. C_A > C_B
C. C_A = C_B
D. Cannot be determined from available information

4. Suppose you have a deep CNN model having several layers that performs an
image classification task by learning on a dataset of 256 x 256 images, and a
logistic regression model operating on 10 features extracted from the same
dataset of images to perform the same classification task. Which ensemble
learning technique would you apply to improve the bias-variance tradeoff of
each learner? [1 point]
A. Boosting for deep CNN, bagging for logistic regression
B. Boosting for both learners
C. Bagging for both learners
D. Bagging for deep CNN, boosting for logistic regression

5. Which solutions could be effective in mitigating the vanishing/exploding gradient
problems in RNNs?
A. Gradient clipping
B. Making the RNN bidirectional by adding backward layers.
C. Using gated RNNs such as LSTMs
D. All of the above.
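Of the options in question 5, gradient clipping is the simplest to sketch. A minimal global-norm clipping function (one common formulation; framework implementations differ in details):

```python
import math

def clip_by_global_norm(grads, max_norm):
    """Rescale a flat list of gradient values so their global L2 norm
    is at most max_norm; leave them unchanged otherwise."""
    norm = math.sqrt(sum(g * g for g in grads))
    if norm <= max_norm:
        return grads
    scale = max_norm / norm
    return [g * scale for g in grads]

g = clip_by_global_norm([3.0, 4.0], max_norm=1.0)  # original norm was 5.0
print(g)  # direction preserved, global norm now 1.0
```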

Question 2

Design a 2-input XOR gate using 3 units (artificial neurons). You may assume that
the inputs x_1 and x_2, as well as the output y, take binary values, i.e.
x_1, x_2, y ∈ {-1, 1}, while the weights and biases can take integer values (positive or
negative). Clearly show the structure of the network and specify the weights and
biases of each unit. [5 points]

Question 3

The receptive field of a layer with respect to the input is defined as the number of
pixels of the input that influence each element of the feature map produced by the
corresponding layer. Assume that a 64 x 64 x 3 image I is passed through two
convolutional layers followed by a max pooling layer to produce a feature map F as
shown in the figure below.

C1 (5 x 5 x 8) -> C2 (3 x 3 x 16) -> Max pool (4 x 4) -> F
Stride = 2        Stride = 1         Stride = 4

(i) What is the receptive field of the first convolutional layer C1? [1 point]
(ii) What is the receptive field of the second convolutional layer C2? [2 points]
(iii) Calculate the receptive field of the max pooling layer. [2 points]

