
Final Homework Assignment

Implement and Evaluate a CNN Pipeline with MLOps Part II

Objective:

This homework is designed to help you consolidate and apply the MLOps principles
covered in previous sessions. You will extend the steps from those sessions to implement a
CNN pipeline using tools such as DVC, GitHub Actions, Docker, and MLflow. You will also
document your process and demonstrate your understanding of these tools through
hands-on implementation and theoretical questions.

Tasks:
1. Task 1: CNN Practical Implementation in ML Workflow
1.1 Data Versioning Using DVC
● Description: Set up DVC to manage and version your dataset (e.g., sea and forest
data). Add your training dataset to DVC and ensure it's tracked properly. Integrate
GitHub Actions to automate DVC processes (e.g., pulling datasets and ensuring the
pipeline is up-to-date in CI/CD workflows).
● Steps:
○ Initialise DVC.
○ Add your dataset to DVC and configure a remote (e.g., Google Drive).
○ Track the dataset changes using Git and DVC.
○ Set up a GitHub Actions workflow to automate the integration of DVC in your
project, including pulling data and verifying pipeline integrity during CI runs
(see the example workflow sketch at the end of this subsection).
● Deliverable:
○ A brief description of how you set up DVC and integrated GitHub Actions.
○ Screenshots of your terminal commands (e.g., dvc init, dvc add, dvc pull).
○ Output showing tracked files.
○ GitHub Actions YAML configuration file and relevant logs/screenshots.
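
For orientation, one possible shape of the requested workflow file is sketched below. It is a sketch only, not a model answer: it assumes a Google Drive remote configured as in the steps above and a repository secret named GDRIVE_CREDENTIALS_DATA holding the service-account credentials JSON; adapt names and versions to your own setup.

```yaml
# .github/workflows/dvc-ci.yml -- illustrative sketch only
name: dvc-ci
on: [push, pull_request]

jobs:
  data-check:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.10"
      - name: Install DVC with Google Drive support
        run: pip install "dvc[gdrive]"
      - name: Pull the versioned dataset
        run: dvc pull
        env:
          # Service-account JSON stored as a repository secret (assumed name)
          GDRIVE_CREDENTIALS_DATA: ${{ secrets.GDRIVE_CREDENTIALS_DATA }}
      - name: Verify pipeline integrity
        run: dvc status
```

Running dvc status at the end is one simple way to confirm that the pipeline definition and the pulled data are consistent during CI runs.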

1.2 CNN Model Setup and Initial Training


● Description: Train a CNN model on the initial dataset, limited to 20 epochs. Analyse
the confusion matrix and the training/validation loss curves.
● Steps:
○ Use a basic CNN architecture (an illustrative sketch is given at the end of this subsection).
○ Train for 20 epochs and evaluate the model.
○ Visualise the confusion matrix and loss curves.
● Deliverable:
○ A description of the initial training results.
○ Screenshots of the confusion matrix and loss curves.
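
As a starting point, the sketch below shows one minimal Keras implementation consistent with these steps. The directory layout (class subfolders under data/train and data/val), image size, and layer widths are illustrative assumptions, not requirements of the brief.

```python
# Minimal illustrative CNN for the sea/forest classification task.
# Paths, image size, and layer widths are assumptions, not part of the brief.
import numpy as np
import matplotlib.pyplot as plt
import tensorflow as tf
from tensorflow.keras import layers, models
from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay

IMG_SIZE = (128, 128)
train_ds = tf.keras.utils.image_dataset_from_directory(
    "data/train", image_size=IMG_SIZE, batch_size=8)
val_ds = tf.keras.utils.image_dataset_from_directory(
    "data/val", image_size=IMG_SIZE, batch_size=8, shuffle=False)

model = models.Sequential([
    layers.Input(shape=IMG_SIZE + (3,)),
    layers.Rescaling(1.0 / 255),
    layers.Conv2D(16, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(32, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(64, activation="relu"),
    layers.Dense(2, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
history = model.fit(train_ds, validation_data=val_ds, epochs=20)

# Confusion matrix on the (unshuffled) validation set.
y_true = np.concatenate([y.numpy() for _, y in val_ds])
y_pred = np.argmax(model.predict(val_ds), axis=1)
ConfusionMatrixDisplay(confusion_matrix(y_true, y_pred)).plot()
plt.savefig("confusion_matrix.png")

# Training/validation loss curves.
plt.figure()
plt.plot(history.history["loss"], label="train loss")
plt.plot(history.history["val_loss"], label="val loss")
plt.legend()
plt.savefig("loss_curves.png")
```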

1.3 Model Fine-Tuning


● Description: Perform hyperparameter tuning by adjusting parameters like batch size
and learning rate. Run three experiments to compare performance:
○ Experiment 1: Epochs = 20, Batch size = 8.
○ Experiment 2: Epochs = 20, Batch size = 16.
○ Experiment 3: Epochs = 25, Batch size = 16.
● Steps:
○ Run the above experiments using MLflow for tracking (a driver-loop sketch is given at the end of this subsection).
○ Compare metrics such as accuracy and loss across the three runs.
● Deliverable:
○ Explanation of changes made for fine-tuning.
○ Screenshots of performance metrics and comparisons.
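
One way to drive the three runs programmatically is sketched below. Here train_and_evaluate is a hypothetical helper wrapping the training code from Task 1.2, assumed to return the Keras history object and a test accuracy; it is not defined in the brief.

```python
# Illustrative driver for the three fine-tuning experiments in the brief.
# train_and_evaluate() is a hypothetical wrapper around the Task 1.2 code.
import mlflow

EXPERIMENTS = [
    {"epochs": 20, "batch_size": 8},
    {"epochs": 20, "batch_size": 16},
    {"epochs": 25, "batch_size": 16},
]

for cfg in EXPERIMENTS:
    run_name = f"e{cfg['epochs']}_b{cfg['batch_size']}"
    with mlflow.start_run(run_name=run_name):
        mlflow.log_params(cfg)
        history, test_accuracy = train_and_evaluate(**cfg)
        mlflow.log_metric("val_loss", history.history["val_loss"][-1])
        mlflow.log_metric("val_accuracy", history.history["val_accuracy"][-1])
        mlflow.log_metric("test_accuracy", test_accuracy)
```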

1.4 Model Monitoring with MLflow


● Description: Integrate MLflow to log experiments, hyperparameters, metrics, and
model artifacts for the three models.
● Steps:
○ Set up MLflow for local tracking (see the setup sketch at the end of this subsection).
○ Log metrics, hyperparameters, and artifacts.
○ Compare results in the MLflow UI.
● Deliverable:
○ Screenshots of MLflow UI with logged experiments.
○ Short description of MLflow setup.
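
A minimal local-tracking setup might look like the sketch below; the experiment name, run name, parameter values, and artifact file name are assumptions for illustration only.

```python
# Minimal local MLflow tracking setup; names and values are illustrative assumptions.
import mlflow

mlflow.set_tracking_uri("file:./mlruns")   # store runs locally under ./mlruns
mlflow.set_experiment("cnn-sea-forest")    # hypothetical experiment name

with mlflow.start_run(run_name="baseline"):
    mlflow.log_params({"epochs": 20, "batch_size": 8, "learning_rate": 1e-3})
    mlflow.log_metric("val_accuracy", 0.0)        # replace with the real value
    mlflow.log_artifact("confusion_matrix.png")   # assumes this plot was saved to disk
```

Launching mlflow ui from the project root then serves the comparison UI (by default at http://127.0.0.1:5000), where the logged runs can be compared side by side for the screenshots.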

2. Task 2: Docker Packaging and MLOps Questions

2.1 Build and Run a Docker Image


● Description:
○ Create a Docker image for your CNN classifier application (a sample Dockerfile
sketch is given at the end of this subsection).
○ Run the image locally in a Docker container to confirm it executes as
expected.
● Deliverable:
○ Dockerfile used to build the image.
○ Screenshot of Docker container execution.
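
As a reference point, one common Dockerfile layout for a Python application is sketched below. The entry-point script name (app.py), the base image, and the exposed port are assumptions; adjust them to how your classifier is actually served.

```dockerfile
# Illustrative Dockerfile sketch; file names, base image, and port are assumptions.
FROM python:3.10-slim
WORKDIR /app

# Install dependencies first so Docker can cache this layer between builds.
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Copy the application code (model, inference script, etc.).
COPY . .

EXPOSE 5000
CMD ["python", "app.py"]
```

A typical local check is docker build -t cnn-classifier . followed by docker run --rm -p 5000:5000 cnn-classifier, with a screenshot of the running container as the deliverable.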

2.2 MLOps Questions


MLOps Concepts

1. How does MLOps improve the scalability of machine learning workflows?


2. What challenges do teams face when implementing MLOps in large organisations?
3. Explain the concept of feature stores and their role in the MLOps pipeline.
4. What are some strategies for ensuring data quality in MLOps pipelines?

Tool-Specific Questions

DVC:

1. How does DVC integrate with cloud storage providers, and why is this useful?
2. What role does the dvc.lock file play in maintaining pipeline integrity?
3. Discuss how DVC pipelines can be automated using CI/CD tools.
4. What is the significance of checkpoints in DVC pipelines?

MLflow:

1. How can MLflow's model serving feature simplify deployment?


2. Discuss how MLflow handles experiment reproducibility across environments.
3. What are the advantages of MLflow's integration with platforms like Kubernetes?
4. Explain how MLflow's artifact tracking supports auditability in machine learning
workflows.

General Questions

1. How can teams balance the trade-offs between automation and flexibility in MLOps
workflows?
2. Discuss the importance of explainability in models deployed via an MLOps pipeline.
3. How do MLOps practices align with ethical AI considerations?
4. What future trends do you foresee in the adoption and evolution of MLOps tools and
frameworks?

Submission Requirements:

● Submit a comprehensive report that includes:


○ Screenshots of each step, with detailed descriptions explaining the actions taken.
● Summarise any issues faced and how you resolved them.
● For Task 2, provide concise and clear answers. Where applicable, reference practical
examples from Task 1.

Deadline: Friday, 5th January, 2025 (11:59 PM).

Good luck with your assignment! Be sure to include detailed documentation and clear
interpretations of your results to support the evaluation of your work.
