Final Homework Assignment
Final Homework Assignment
Objective:
This homework is designed to help you evaluate your knowledge and apply the MLOps
principles covered in previous sessions. You will extend the steps to implement a CNN
pipeline using tools like DVC, GitHub Actions, Docker, and MLflow. Additionally, you will
document your process and demonstrate your understanding of these tools through
hands-on implementation and theoretical questions.
Tasks:
1. Task 1: CNN Practical Implementation in ML Workflow
1.1 Data Versioning Using DVC
● Description: Set up DVC to manage and version your dataset (e.g., sea and forest
data). Add your training dataset to DVC and ensure it's tracked properly. Integrate
GitHub Actions to automate DVC processes (e.g., pulling datasets and ensuring the
pipeline is up-to-date in CI/CD workflows).
● Steps:
○ Initialise DVC.
○ Add your dataset to DVC and configure a remote (e.g., Google Drive).
○ Track the dataset changes using Git and DVC.
○ Set up a GitHub Actions workflow to automate the integration of DVC in your
project, including pulling data and verifying pipeline integrity during CI runs.
● Deliverable:
○ A brief description of how you set up DVC and integrated GitHub Actions.
○ Screenshots of your terminal commands (e.g., dvc init, dvc add, dvc pull).
○ Output showing tracked files.
○ GitHub Actions YAML configuration file and relevant logs/screenshots.
Tool-Specific Questions
DVC:
1. How does DVC integrate with cloud storage providers, and why is this useful?
2. What role does the dvc.lock file play in maintaining pipeline integrity?
3. Discuss how DVC pipelines can be automated using CI/CD tools.
4. What is the significance of checkpoints in DVC pipelines?
MLflow:
General Questions
1. How can teams balance the trade-offs between automation and flexibility in MLOps
workflows?
2. Discuss the importance of explainability in models deployed via an MLOps pipeline.
3. How do MLOps practices align with ethical AI considerations?
4. What future trends do you foresee in the adoption and evolution of MLOps tools and
frameworks?
Submission Requirements:
Good luck with your assignment! Make sure to include detailed documentation and clear
interpretations of the results for better evaluation.