Visualizing COVID-19
NAYANA B H (20232MCA0224)
MUKTHASHREE D G (20232MCA0230)
RAHUL PARAMAGOND (20232MCA0247)
INTRODUCTION
This project uses data analytics to study and predict COVID-19 cases. Data analytics involves
examining large datasets to uncover patterns, trends, and useful information. This project aims to
compare algorithms such as Logistic Regression and Random Forest with respect to their
accuracy and F1-score.
Algorithms Used:
• Logistic Regression
Logistic Regression is a supervised
learning algorithm used for binary
classification tasks. It is a simple and
interpretable model that helps
understand the relationship between
features and the target variable.
• Random Forest
Random Forest is an ensemble
learning algorithm that uses multiple
decision trees to make predictions.
The final output is based on the
majority vote of the individual trees.
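The two ideas above can be sketched in a few lines of plain Python. This is only an illustration, not the project's code: the function names, weights, and inputs are hypothetical. `logistic_predict` shows how a logistic model turns a weighted sum into a probability and a binary label, and `majority_vote` shows how a forest could combine the labels predicted by its individual trees.

```python
import math
from collections import Counter


def logistic_predict(features, weights, bias, threshold=0.5):
    """Return (probability, label) for one sample under a logistic model."""
    # Linear score: weighted sum of the features plus the bias term.
    score = sum(w * x for w, x in zip(weights, features)) + bias
    # The sigmoid maps the score to a probability between 0 and 1.
    probability = 1.0 / (1.0 + math.exp(-score))
    # Predict the positive class when the probability reaches the threshold.
    return probability, int(probability >= threshold)


def majority_vote(tree_predictions):
    """Return the label predicted by the largest number of trees."""
    # most_common(1) yields [(label, count)]; ties go to the label seen first.
    return Counter(tree_predictions).most_common(1)[0][0]


# Hypothetical weights and inputs, chosen only for illustration.
prob, label = logistic_predict([1.0, 2.0], weights=[0.5, -0.25], bias=0.0)
forest_label = majority_vote([1, 0, 1, 1, 0])
print(prob, label, forest_label)
```

In practice a library such as scikit-learn would learn the weights and build the trees from the training data; the sketch only shows how a prediction is formed once they exist.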
METHODOLOGY
• Random Forest was trained on the training set, and its performance was
evaluated using accuracy and F1-score.
• Feature importance from the model helped highlight critical factors like age
and symptoms.
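A minimal sketch of this step, assuming scikit-learn is available. The data here is synthetic and the feature names are hypothetical stand-ins, not the project's dataset. It trains a Random Forest on a training split and ranks the features by the model's `feature_importances_` attribute.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in data: 500 samples, 5 features (not the project's dataset).
X, y = make_classification(
    n_samples=500, n_features=5, n_informative=3, n_redundant=0, random_state=0
)
# Hypothetical feature names, used only to label the output.
feature_names = ["age", "fever", "cough", "fatigue", "travel"]

# Hold out 20% of the samples for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)

# Mean accuracy on the held-out split.
test_accuracy = model.score(X_test, y_test)
print(f"test accuracy: {test_accuracy:.3f}")

# Impurity-based importances sum to 1; larger means more influential.
ranked = sorted(
    zip(feature_names, model.feature_importances_),
    key=lambda pair: pair[1],
    reverse=True,
)
for name, importance in ranked:
    print(f"{name}: {importance:.3f}")
```

Impurity-based importances can favour features with many distinct values, so they are best read as a rough ranking rather than a precise measure of effect.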
FUTURE SCOPE
• Improved predictions
• Enhanced accuracy
REPORT ON OUTPUTS
Heatmap:
Logistic Regression:
⚬ Accuracy: 85%
⚬ F1-score: 82%
Random Forest:
⚬ Accuracy: 90%
⚬ F1-score: 88%
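For reference, accuracy and F1-score for a binary classifier can be computed from true and predicted labels as sketched below. The labels are made-up illustrative values, not the project's results, and the function name is hypothetical.

```python
def accuracy_and_f1(y_true, y_pred, positive=1):
    """Return (accuracy, F1-score) for binary labels."""
    pairs = list(zip(y_true, y_pred))
    # Count true positives, false positives, and false negatives.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    # Accuracy: share of all predictions that match the true label.
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)

    # F1: harmonic mean of precision and recall (0.0 when undefined).
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return accuracy, f1


# Made-up labels for illustration only.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
acc, f1 = accuracy_and_f1(y_true, y_pred)
print(f"accuracy={acc:.2f}, f1={f1:.2f}")
```

Accuracy counts every correct prediction equally, while F1 focuses on the positive class, which is why the two figures can differ when the classes are imbalanced.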
• The project successfully applied data analytics and machine learning to
study and predict COVID-19 cases.