Assessment - Google Forms
1. Email *
kernel size 3, stride 2, and padding 1. What is the final output size?
4. If your CNN model takes input 224x224x3 and passes it through a 7x7 conv 1 point
5. A GRU cell has input size 20 and hidden size 10. How many parameters
does it have? 1 point
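One way to reason about this count is sketched below. It assumes the common textbook formulation in which each of the three GRU gates (update, reset, candidate) has one weight matrix over the concatenated input and hidden state plus one bias vector. The function name `gru_param_count` and the `bias_vectors_per_gate` argument are illustrative, not part of the quiz. Some libraries keep separate input and hidden biases, which gives a larger total, so the intended answer depends on the convention used in the course.

```python
def gru_param_count(input_size, hidden_size, bias_vectors_per_gate=1):
    """Count GRU cell parameters under a stated bias convention (assumption)."""
    gates = 3  # update gate, reset gate, candidate state
    # Each gate maps [input, hidden] -> hidden.
    weights = hidden_size * (input_size + hidden_size)
    biases = bias_vectors_per_gate * hidden_size
    return gates * (weights + biases)


# Textbook convention: one bias vector per gate.
print(gru_param_count(20, 10))  # 930
# Convention with separate input and hidden biases (two per gate).
print(gru_param_count(20, 10, bias_vectors_per_gate=2))  # 960
```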
6. A model predicts [0.8, 0.1, 0.1] for class labels, but the true label is [1, 0, 0]. 1 point
https://docs.google.com/forms/d/1JlAOVX2js6Eq8iGeuFsP8shFQ7fhMElASeNewQ_vLik/edit 1/7
4/9/25, 12:59 PM Quiz Test (10 Marks): Deep Learning and Applications CSBB 424
7. An LSTM has input size 100 and hidden size 50. How many parameters are
in the LSTM cell? 1 point
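A minimal sketch of the usual counting argument follows. It assumes a standard LSTM cell with four gate-like blocks (input, forget, output, candidate), each with one weight matrix over the concatenated input and hidden state and one bias vector. The name `lstm_param_count` and the `bias_vectors_per_gate` argument are hypothetical helpers for illustration. Implementations that store two bias vectors per gate report a slightly larger number.

```python
def lstm_param_count(input_size, hidden_size, bias_vectors_per_gate=1):
    """Count LSTM cell parameters under a stated bias convention (assumption)."""
    gates = 4  # input gate, forget gate, output gate, candidate cell state
    # Each gate maps [input, hidden] -> hidden.
    weights = hidden_size * (input_size + hidden_size)
    biases = bias_vectors_per_gate * hidden_size
    return gates * (weights + biases)


# Textbook convention: one bias vector per gate.
print(lstm_param_count(100, 50))  # 30200
# Convention with separate input and hidden biases (two per gate).
print(lstm_param_count(100, 50, bias_vectors_per_gate=2))  # 30400
```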
8. A ReLU activation function outputs 0 for how many inputs out of these:
[-5, 0, 3, -2, 7]? 1 point
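The check can be sketched in a few lines, assuming the standard definition ReLU(x) = max(0, x). Note that an input of exactly 0 also produces an output of 0. The variable names are illustrative.

```python
def relu(x):
    """Standard ReLU: pass positive values through, clamp the rest to 0."""
    return max(0, x)


inputs = [-5, 0, 3, -2, 7]
outputs = [relu(x) for x in inputs]
zero_count = sum(1 for y in outputs if y == 0)

print(outputs)     # [0, 0, 3, 0, 7]
print(zero_count)  # 3
```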
9. A model has 2 hidden layers with 128 and 64 neurons respectively. Each
layer has a bias and ReLU activation. If the input vector is of size 256,
calculate the total number of parameters (excluding output layer). 1 point
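The arithmetic can be sketched as follows, assuming both hidden layers are fully connected, so each contributes (inputs × outputs) weights plus one bias per neuron. ReLU adds no parameters. The helper name `dense_param_count` is hypothetical.

```python
def dense_param_count(layer_sizes):
    """Sum weights and biases over consecutive fully connected layers."""
    return sum(
        n_in * n_out + n_out  # weight matrix plus one bias per output neuron
        for n_in, n_out in zip(layer_sizes, layer_sizes[1:])
    )


# Input of size 256, then hidden layers of 128 and 64 neurons.
print(dense_param_count([256, 128]))      # 32896 (first hidden layer)
print(dense_param_count([128, 64]))       # 8256  (second hidden layer)
print(dense_param_count([256, 128, 64]))  # 41152 (total)
```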
Speed up inference
Prevent overfitting
11. In an LSTM cell, what does the forget gate do? 1 point
12. Which optimizer adapts learning rates based on the first and second
moments of the gradients? 1 point
SGD
Adagrad
Adam
Momentum
13. What is the primary benefit of using batch normalization in deep
networks? 1 point
Prevent overfitting
Convolutions
Attention mechanisms
Recurrent layers
Embeddings
GANs
17. A model’s training loss keeps decreasing, but validation loss starts
increasing after epoch 15. What does this indicate and what should you
do? 1 point
The Batch Normalization layer computes the normalized values using the
batch mean and standard deviation. Then, it applies learnable parameters
γ=2 and β=5 as follows: y = γ·x̂ + β, where x̂ is the normalized value.
What is the output of the Batch Normalization layer after applying γ and β?
19. A deep learning model is used for semantic segmentation on medical
imaging data. The ground truth and prediction for a particular image are
binary masks (1 for foreground, 0 for background). 1 point
What is the Intersection over Union (IoU) for the foreground class?
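For reference, foreground IoU is the number of pixels that are 1 in both masks divided by the number that are 1 in at least one mask. The sketch below illustrates the computation. The masks for this question are not present in this copy, so `example_truth` and `example_pred` are made-up placeholders, not the quiz data, and the function name `binary_iou` is hypothetical.

```python
def binary_iou(ground_truth, prediction):
    """Foreground IoU for two equal-length binary masks (1 = foreground)."""
    pairs = list(zip(ground_truth, prediction))
    intersection = sum(1 for g, p in pairs if g == 1 and p == 1)
    union = sum(1 for g, p in pairs if g == 1 or p == 1)
    # Returning 1.0 when both masks are empty is a convention chosen here (assumption).
    return intersection / union if union else 1.0


# Placeholder masks for illustration only; NOT taken from the quiz.
example_truth = [1, 1, 0, 0]
example_pred = [1, 0, 1, 0]

# Intersection = 1 pixel, union = 3 pixels.
print(binary_iou(example_truth, example_pred))
```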
20. A deep learning model is used for semantic segmentation on medical
imaging data. The ground truth and prediction for a particular image are
binary masks (1 for foreground, 0 for background). 1 point
Predicted mask:
[1,0,0,1,0,1,1,0]
21. Suppose your model gives the raw logits for 3 classes at a pixel: 1 point
Logits=[2.0,1.0,0.1]
What is the SoftMax probability for each class (to 2 decimal places)?
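A short sketch of the computation, assuming the standard softmax definition p_i = exp(z_i) / Σ_j exp(z_j). Subtracting the maximum logit first is a common numerical-stability step that does not change the result. The function name `softmax` here is a local helper, not a library call.

```python
import math


def softmax(logits):
    """Softmax over a list of logits, shifted by the max for stability."""
    shift = max(logits)
    exps = [math.exp(z - shift) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


probs = softmax([2.0, 1.0, 0.1])
print([round(p, 2) for p in probs])  # [0.66, 0.24, 0.1]
```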