AssignmentQuestion4Bigdata_2025
AssignmentQuestion4Bigdata_2025
3. a) What is machine learning? How is machine learning different from the traditional
programming approach?
b) Explain the features of Machine learning.
c) Describe the different phases of Machine learning with a neat diagram
d) Explain the steps involved in Machine Learning to develop a model.
4. a) Explain the differences between supervised learning and unsupervised learning with
suitable example
b) Explain the major regression techniques with suitable example
c) Explain cross-validation and why it’s essential.
d) Which metrics would you use to evaluate a regression model, and why?
6. a) What is a Confusion Matrix? Explain how accuracy, sensitivity and specificity can be
calculated using it.
b) Explain the significance and relationship between dependent and independent variables
in model development
c) Describe how decision trees split nodes based on purity measures (e.g., Gini index,
entropy).
d) How do you handle hyperparameter tuning?
7. a) Explain the differences between Lasso, Ridge and Elastic net regression techniques
b) Explain the differences between different cluster classification algorithms
c) Discuss the impact of tree depth on bias and variance.
d) What are the practical trade-offs between using random forests and gradient boosting
machines?
Actual Predicted
Value Value
True False
False True
True True
False False
True False
False True
True True
False False
True True
False False
False False
True False
True False
False True
True True
False False
False True
True True
False False
True True
a) i. Accuracy ii. Misclassification iii. Precision iv. Recall v. Sensitivity vi. Specificity vii.
TRUE Positive Rate(TPR) viii. False Positive Rate(FPR)
b) How do you interpret an ROC curve and an AUC score?
c) Explain the differences between Decision Trees and Random forest
d) Describe the differences between bagging and boosting
10. a) Define a multilayer perceptron. Why is at least one hidden layer necessary for learning
more complex functions?
b) Compare and contrast convolutional layers with fully connected layers. Why do
convolutional layers generally require fewer parameters?
c) Describe the components of a single artificial neuron (perceptron). How do they relate
to biological neurons?
d) List common activation functions (e.g., sigmoid, tanh, ReLU). Explain their
mathematical form and discuss their advantages and disadvantages.