0% found this document useful (0 votes)
29 views

Maths Project

This document analyzes the spread of COVID-19 cases in India in 2020 using R. It conducts statistical analysis on COVID-19 cases in various regions of India, including Southern India where it finds the highest numbers of cases were reported in Tamil Nadu and Andhra Pradesh in December 2020. The analysis also looks at COVID-19 deaths in Southern India, the gender-wise age profile of cases, long-term trends, cases in Northern India, and cases across all states. The analysis aims to provide insights for policymakers to better respond to the pandemic.

Uploaded by

Chiku Man
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
29 views

Maths Project

This document analyzes the spread of COVID-19 cases in India in 2020 using R. It conducts statistical analysis on COVID-19 cases in various regions of India, including Southern India where it finds the highest numbers of cases were reported in Tamil Nadu and Andhra Pradesh in December 2020. The analysis also looks at COVID-19 deaths in Southern India, the gender-wise age profile of cases, long-term trends, cases in Northern India, and cases across all states. The analysis aims to provide insights for policymakers to better respond to the pandemic.

Uploaded by

Chiku Man
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 30

BMAT202L/P

PROBABILITY AND STATISTICS

MATHS PROJECT

Analysing the Spread of COVID-19 Cases in India 2020

SUBMITTED TO: Dr Sandeep Dalui


TEAM MEMBERS
Analysing the Spread of COVID-
19 Cases in India 2020

The COVID-19 Pandemic has affected Countries across the globe, and
an understanding of the spread of cases is crucial for managing the
spread of the virus. In the Project, We used R to analyse the distribution
of COVID-19 Cases in India.
INDEX

S. No Topic Page No

1 Abstract 3

2 Introduction 3-4

3 Statement of Problem 4

4 Need and Importance 5

Experiment Analysis 6-9


5 i.Covid-19 Cases in Southern
India
10-11
ii.COVID-19 Deaths in South
6
India
11-13
iii. Gender Wise age profile
7
of Covid Cases during 2020

13-15
iv. Long Run Trend by Covid
8
Cases
16-17
v. Covid Cases in Northern
9
India
18-20
vi. Covid-19 Cases and
10
Deaths Across all the states

Future Enhancement - 20-22


11
Findings and Conclusion
12 References 23

13 Appendix 24

ABSTRACT:

Using R, a widely used programming language for statistical analysis and data visualization,
this project conducts an analysis of the COVID-19 situation in India. To investigate the
patterns and trends of the pandemic in India, the study employs a variety of techniques,
such as statistical modelling, statistical analysis, data analysis, and data visualization, and
gathers information on COVID-19 cases in India from credible sources.

The analysis reveals that the number of COVID-19 cases in India has been on the rise since
the pandemic began, with varying rates of spread across different regions. The research
identifies a range of factors that may be associated with the number of COVID-19 cases in
different regions of India, such as population density, demographic makeup, and healthcare
infrastructure. According to the statistical models, the number of COVID-19 cases in India
will continue to rise, albeit at a slower pace, in the coming days.

This project provides critical insights into the impact of the COVID-19 pandemic in India and
underscores the importance of data-driven approaches to combat the virus's spread. The
project's findings and conclusions can help policymakers and healthcare professionals in
India and other nations plan and respond to the ongoing pandemic.

INTRODUCTION:

COVID-19 is an infectious disease caused by the SARS-CoV-2 virus that was first identified in
Wuhan, China, in December 2019. The pandemic has spread rapidly since then, with over
6.98 million cases reported in 188 countries and territories as of June 8, 2020, resulting in
more than 401,000 deaths and over 3.13 million recoveries. The pandemic has had a
significant impact on the world, affecting people's lives and livelihoods in every country.
India has been particularly hard hit by the pandemic, with millions of confirmed cases and
thousands of deaths, which has impacted the country's economy, social structure, and
political stability.
This project aims to analyse the COVID-19 cases in India using R, a widely used
programming language for statistical analysis and data visualization. By collecting reliable
data on COVID-19 cases in India and using techniques such as data visualization,
exploratory data analysis, statistical modelling, and statistical analysis, this project aims to
gain insights into the patterns and trends of the pandemic in India.

The project involves several steps, including collecting and cleaning the latest data on
COVID-19 cases in India, creating visualizations to understand the trends in COVID-19 cases,
conducting exploratory data analysis to identify any patterns or anomalies, performing
statistical analysis to understand the trends and patterns in COVID-19 cases over time, and
building statistical models to predict the future course of the pandemic.

The project will include the following steps: • First, Collect the latest data on COVID-19
cases in India and clean the data to remove any missing values or duplicate records. • Then,
creating visualizations to understand the trends in COVID-19 cases in India. Next, conduct
exploratory data analysis to gain insights into the data and identify any patterns or
anomalies. • Then performing statistical analysis to understand the trends and patterns in
COVID-19 cases in India over time. • Finally, building statistical models to understand the
relationship between various factors and COVID-19 cases in India and make predictions
about the future course of the pandemic.

The insights and conclusions from this project can help policymakers and healthcare
professionals understand the impact of the COVID-19 pandemic in India and develop
strategies to mitigate the virus's spread and minimize its impact on the population.

Statement of Problem:

The COVID-19 pandemic has had a profound impact on India, affecting public health, the
economy, and social stability. As policymakers and healthcare professionals work to
address this challenge, it is essential to gain a clear understanding of the patterns and
trends of COVID-19 cases in India. However, the data on COVID-19 cases is complex, with
variations in reporting methods, geographical distribution, and demographic composition of
the affected population.

To gain valuable insights into the pandemic's impact in India, this project utilizes R, a
popular programming language for statistical analysis and data visualization. The primary
objective of this project is to analyse COVID-19 cases in India, identify patterns and trends,
and explore the factors that contribute to its spread. The findings of this analysis can
provide insights to policymakers and healthcare professionals, informing strategies to
mitigate the pandemic's impact on the population.

This analysis focuses on several key questions (like “What is the current COVID-19 situation
in India?”, “What could be the reasons behind cases clusters found in India.”, and the “Is
lockdown in India successful or not?”) including the current COVID-19 situation in India, a
state-wise comparison of cases, the factors behind the occurrence of case clusters in India,
and the effectiveness of the lockdown measures implemented in the country. By exploring
these questions and drawing meaningful insights from the available data, this project can
help in developing informed policies and strategies to manage the impact of the pandemic
on India.

Need and Importance:

Understanding the patterns and trends of COVID-19 cases in India is critical in developing
strategies to mitigate the pandemic's impact on the population. However, the data on
COVID-19 cases in India are complex, making it essential to have effective data analysis
tools and techniques to draw meaningful insights.

The use of R, a popular programming language for statistical analysis and data visualization,
can play a crucial role in gaining insights into the patterns and trends of the pandemic. This
project's importance lies in its use of R to analyse the COVID-19 cases in India and provide
valuable insights into the pandemic's dynamics.

By using R's capabilities for data cleaning, visualization, exploratory data analysis, time
series analysis, and statistical modelling, we can gain a comprehensive understanding of the
data. The results of this analysis can inform policies and strategies to mitigate the
pandemic's impact on the population and contribute to the scientific community's
understanding of the pandemic's dynamics in India.

The findings of this project can also help identify regions and demographics most affected
by the pandemic, enabling policymakers to allocate resources more effectively and
prioritize vaccination efforts. Sharing our findings with other researchers and experts can
help build on their work and collaborate on developing solutions to the ongoing crisis.

In summary, the need and importance of this project lie in its potential to provide valuable
insights into the COVID-19 pandemic's impact in India and help mitigate its consequences.
By using R to analyse the data, we can contribute to the ongoing efforts to understand and
address the pandemic's challenges.
EXPERIMENT ANALYSIS:

TOPIC: COVID-19 Cases in Southern India during 2020

Statistics and Visualizing data set using tabulation and graphical


representation

This dataset shows the number of COVID-19 cases in five Indian states - Andhra Pradesh,
Karnataka, Kerala, Tamil Nadu, and Telangana - during the period from January 2020 to
December 2020. The data is presented as the number of confirmed cases in each state for
each month.

The highest number of cases were reported in Tamil Nadu, with a peak of 792,788 cases in
December 2020. This was followed by Andhra Pradesh, which reported a maximum of
879,718 cases in December. Karnataka, Kerala, and Telangana reported lower numbers of
cases, with Karnataka reporting the highest number of cases (920,422) in November 2020.

The data shows that COVID-19 cases started to increase significantly from March 2020 and
peaked in August and September for most of the states. Kerala was an exception, where
the peak was reached in October. The number of cases started to decline in most of the
states from November 2020. Overall, this dataset provides insights into the spread of
COVID-19 in India and highlights the need for measures to control the spread of the virus.

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
State 2020 2020 2020 2020 2020 2020 2020 2020 2020 2020 2020 2020

Andhra
Pradesh 0 0 3 757 1735 9852 86170 382469 617776 789553 830925 879718

Karnataka 0 0 71 700 2511 12667 107001 352295 558911 764317 842132 920422
Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
State 2020 2020 2020 2020 2020 2020 2020 2020 2020 2020 2020 2020

Kerala 0 3 301 499 1739 4379 16831 71166 192114 361841 528823 704399

Tamil Nadu 0 1 1242 23280 224212 98898 257613 357703 522398 704682 760495 792788

Telangana 0 0 74 1004 2720 11460 50564 157096 205002 232671 256243 289861

Source: https://prsindia.org/covid-19/cases

ANALYSIS BASED ON CASES IN SOUTH INDIA DURING 2020


Plotting the Graph:
Code:
table_data <- data.frame(State = c("Andhra Pradesh", "Karnataka", "Kerala", "Tamil Nadu",
"Telangana"),
Jan.20 = c(0, 0, 0, 0, 0),
Feb.20 = c(0, 0, 3, 1, 0),
Mar.20 = c(3, 71, 301, 1242, 74),
Apr.20 = c(757, 700, 499, 23280, 1004),
May.20 = c(1735, 2511, 1739, 224212, 2720),
Jun.20 = c(9852, 12667, 4379, 98898, 11460),
Jul.20 = c(86170, 107001, 16831, 257613, 50564),
Aug.20 = c(382469, 352295, 71166, 357703, 157096),
Sep.20 = c(617776, 558911, 192114, 522398, 205002),
Oct.20 = c(789553, 764317, 361841, 704682, 232671),
Nov.20 = c(830925, 842132, 528823, 760495, 256243),
Dec.20 = c(879718, 920422, 704399, 792788, 289861))

Graph:
Pie(total)
Code:
plot(table_data$Jan.20, type = "l", col = "red", xlab = "Month", ylab = "Cases", ylim = c(0,
1000000), main = "COVID-19 Cases in India by State")
lines(table_data$Feb.20, type = "l", col = "blue")
lines(table_data$Mar.20, type = "l", col = "green")
lines(table_data$Apr.20, type = "l", col = "purple")
lines(table_data$May.20, type = "l", col = "orange")
lines(table_data$Jun.20, type = "l", col = "black")
lines(table_data$Jul.20, type = "l", col = "brown")
lines(table_data$Aug.20, type = "l", col = "pink")
lines(table_data$Sep.20, type = "l", col = "grey")
lines(table_data$Oct.20, type = "l", col = "yellow")
lines(table_data$Nov.20, type = "l", col = "violet")
lines(table_data$Dec.20, type = "l", col = "cyan")
legend("topright", inset = 0.05, legend = table_data$State, col = c("red", "blue", "green",
"purple", "orange", "black", "brown", "pink", "grey", "yellow", "violet", "cyan"), lty = 1, cex
= 0.8)

Pie(total)
Pie(Summary(total)):
Code:
table_data <- read.csv("E:/VIT CHENNAI/4th /Probability and Statistics_Lab/Project/South
Cases/Southern States data_csv.csv", header = TRUE)
print(table_data)
summary(table_data)
total = c(0, 4, 1691, 26240, 232917, 137256, 518179, 1320729, 2096201, 2853064,
3218618, 3587188)
pie(total)
pie(summary(total))

Pie(Summary(total)):
Question for Z-Test:
On Looking the stats of May-2020 COVID Cases, Which was around 2 Lakh 33 Thousand,
Government Authority took Certain Measures to reduce the COVID Cases per Month which
on average is 183215. The government claims that those measures brought a reduction of
8% in Covid cases. Check whether the government is claim is true or false.

CODE:
TOPIC: COVID-19 Deaths in Southern India during 2020
Bar Graphs
The table shows the total number of COVID-19 deaths in 2020 in the southern states of India. The
data is broken down by state and month, and includes the months of January through December.
The southern states included in the table are Andhra Pradesh, Karnataka, Kerala, Tamil Nadu,
and Telangana.
The table provides a snapshot of the toll that COVID-19 took on the region during the year 2020,
which was marked by a global pandemic. The data shows that Tamil Nadu recorded the highest
number of deaths among the southern states, with a total of 134,920 deaths, followed by
Karnataka with 33,904 deaths, and Kerala with 16,929 deaths. Puducherry recorded the lowest
number of deaths with 1,820 deaths.
The data also reveals how the number of COVID-19 deaths increased steadily over the course of
the year, with peaks in August and September in some states, as the pandemic ravaged the
region. The table serves as a reminder of the devastating impact that the pandemic had on
communities across India, and the need for continued efforts to combat its spread.

State January February March April May June July August September October November December

Andhra
Pradesh 0 0 0 33 57 434 1,157 2,142 2,987 4,034 5,300 6,128

Karnataka 0 0 2 121 603 782 1,679 3,216 4,503 6,055 8,140 8,383

Kerala 0 0 0 4 52 79 149 635 2,175 4,000 4,858 4,977

Tamil Nadu 0 0 3 107 492 2,478 7,743 16,163 26,355 31,687 32,529 27,333

Telangana 0 0 0 28 71 192 496 1,086 1,617 2,094 2,314 2,108

RESULT:
Code:
FOR BAR GRAPH:
table_data <- read.csv("D:/VIT CHENNAI/4th Semister/Probability and
Statistics/Project/South Deaths/Southern States deaths.csv", header = TRUE)
table_data
Regions = c("Andhra Pradesh", "Karnataka", "Kerala", "Tamil Nadu", "Telangana")
data <- data.frame(
State = c("Andhra Pradesh", "Karnataka", "Kerala", "Tamil Nadu", "Telangana"),
January = c(0, 0, 0, 0, 0),
February = c(0, 0, 0, 0, 0),
March = c(0, 2, 0, 3, 0),
April = c(33, 121, 4, 107, 28),
May = c(57, 603, 52, 492, 71),
June = c(434, 782, 79, 2478, 192),
July = c(1157, 1679, 149, 7743, 496),
August = c(2142, 3216, 635, 16163, 1086),
September = c(2987, 4503, 2175, 26355, 1617),
October = c(4034, 6055, 4000, 31687, 2094),
November = c(5300, 8140, 4858, 32529, 2314),
December = c(6128, 8383, 4977, 27333, 2108)
)

barplot(as.matrix(data[,2:13]), beside = TRUE, col = rainbow(5), main = "No. of cases deaths


in Each Month", xlab = "Month", ylab = "No. of cases deaths", legend = Regions)
TOPIC: Gender Wise age profile of Covid Cases during 2020
Variance, Correlation and Regression
This table shows the gender and age-wise profile of Covid cases in India for the year 2020. The
data is broken down by male and female cases in each age group.
The age groups are listed in the first column and include 0-10 (where data is not available), 11-20,
21-30, 31-40, 41-50, 51-60, 61-70, 71-80, above 80, and "Not known".
The second column shows the number of Covid cases in males for each age group, and the third
column shows the number of cases in females for each age group.

Age Group Male Cases Female Cases

0-10 0 0

11-20 108,094 58,296

21-30 408,711 205,572

31-40 471,074 243,076

41-50 329,347 165,784

51-60 215,600 106,878

61-70 112,267 51,080

71-80 48,644 19,013

Above 80 15,704 6,216

Not known 14,398 6,357

Code:
library(readxl)
Cases = read_excel("E:/VIT CHENNAI/4th/Probability and Statistics_Lab/Project/Age and Gender
Wise/Age_and_gender.xlsx")
View(Cases)
data = Cases
data
M = Cases$`Male Cases`
M
F = Cases$`Female Cases`
F
summary(Cases)
summary(data)
v1 = var(data$`Male Cases`)
v1
v2 = var(data$`Female Cases`)
v2
covariance = cov(data$`Male Cases`, data$`Female Cases`)
covariance
correlation = covariance/(sd(data$`Male Cases`)*sd(data$`Female Cases`))
correlation
cor.test(data$`Male Cases`, data$`Female Cases`)
plot(data$`Male Cases`, data$`Female Cases`)
regression1 = lm(data$`Male Cases` - data$`Female Cases`)
regression1
abline(regression1)
summary(regression1)
regression2 = lm(data$`Male Cases` - data$`Female Cases`)
regression2
abline(regression2)
summary(regression2)
RESULT:

TOPIC: Long Run Trend by Covid Cases


Table: Covid-19 Impact in India: Total Cases, Recoveries, and Deaths from 2020 to
Present

Year Total Confirmed Cases Total Recoveries Total Deaths

2020 10,413,417 10,029,471 150,570

2021 34,522,694 33,674,281 466,852

2022 13,191,373 12,882,826 180,097


Year Total Confirmed Cases Total Recoveries Total Deaths

2023 2,013,849 1,772,768 24,920

Note: The data for the year 2023 is available up to March 19.

This table shows the total number of confirmed Covid-19 cases, recoveries, and deaths in India for
each year since 2020. As the table shows, India saw a significant increase in Covid-19 cases in
2021, with over three times as many cases as in 2020. The number of cases decreased
significantly in 2022, but the total number of deaths remained high. The data for 2023 suggests a
lower number of cases and deaths compared to previous years, but it is too early to draw any
conclusions about the long-term trend.

Graphs:
Code:
table_data <- read.csv("E:/VIT CHENNAI/4th /Probability and Statistics_Lab/Project/Long
Run Trend/long_run_trend.csv", header = TRUE)
table_data
# Create a data frame
df <- data.frame(
Year = c(2020, 2021, 2022, 2023),
Total_Confirmed_Cases = c(10413417, 34522694, 13191373, 2013849),
Total_Recoveries = c(10029471, 33674281, 12882826, 1772768),
Total_Deaths = c(150570, 466852, 180097, 24920)
)

# Plot the graph for Total Confirmed Cases


plot(df$Year, df$Total_Confirmed_Cases, type = "l", xlab = "Year", ylab = "Total Confirmed
Cases", main = "COVID-19 Statistics: Total Confirmed Cases", col = "blue")

# Plot the graph for Total Recoveries


plot(df$Year, df$Total_Recoveries, type = "l", xlab = "Year", ylab = "Total Recoveries", main
= "COVID-19 Statistics: Total Recoveries", col = "red")

# Plot the graph for Total Deaths


plot(df$Year, df$Total_Deaths, type = "l", xlab = "Year", ylab = "Total Deaths", main =
"COVID-19 Statistics: Total Deaths", col = "cyan")
Total Confirmed Cases:
Total Recoveries:

Total Deaths:
TOPIC: Covid Cases in Northern India in 2020

Januar Februar Marc Augus Septembe Octobe Novembe Decembe


State y y h April May June July t r r r r

15,25 24,98 85,16 132,27 160,84


Delhi 5 2 1209 7 8 1 5 7 227,273 366,946 523,831 626,448

Uttar 11,48 11,24 18,89 141,60


Pradesh 13 3 359 7 1 3 59,335 6 309,238 471,413 527,983 594,497

Haryana 13 14 78 1,129 1,618 5,235 20,582 62,298 137,969 200,728 222,152 249,502

Punjab 4 2 101 1,962 2,206 3,680 8,725 27,936 89,331 132,723 147,774 162,936

13,10
Rajasthan 19 7 200 2,085 7,028 8 39,935 93,715 146,276 179,172 196,496 250,690

Jammu
and
Kashmir 2 3 62 710 2,446 6,114 12,156 23,341 65,507 93,367 98,496 117,114
Januar Februar Marc Augus Septembe Octobe Novembe Decembe
State y y h April May June July t r r r r

Uttarakhan
d 5 5 26 696 1,242 2,636 4,263 10,311 31,840 56,444 66,976 90,339

Himachal
Pradesh 1 1 3 40 310 991 2,028 5,448 12,065 18,183 21,205 28,255

Chandigar
h 7 2 18 169 354 557 1,082 2,399 6,859 13,756 16,835 20,266

Ladakh 0 0 13 18 52 837 1,180 2,559 3,499 5,238 5,743 7,944

This table shows the total number of confirmed COVID-19 cases reported each month in the
Northern states of India in the year 2020. The states included in this table are Delhi, Uttar
Pradesh, Haryana, Punjab, Rajasthan, Jammu and Kashmir, Uttarakhand, Himachal Pradesh,
Chandigarh, and Ladakh. The numbers in the table represent the cumulative number of cases
reported in each state for each month.

RESULT:
Code:
table_data <- read.csv("E:/VIT CHENNAI/4th /Probability and Statistics_Lab/Project/North
Cases/North_cases.csv", header = TRUE)
table_data

# create a data frame with the total cases for each state
total_cases <- data.frame(
State = c("Delhi", "Uttar Pradesh", "Haryana", "Punjab", "Rajasthan", "Jammu and
Kashmir", "Uttarakhand", "Himachal Pradesh", "Chandigarh", "Ladakh"),
Cases = c(627149, 1708803, 79491, 60198, 951917, 339414, 39745, 25366, 6186, 2346)
)

summary(total_cases)

# create a pie chart of the total cases by state


pie(total_cases$Cases, labels = total_cases$State, main = "Total COVID-19 cases by state")
TOPIC: Covid-19 Cases and Deaths Across all the states

State Total Cases Total Recoveries Total Deaths


Andaman and Nicobar
Islands 4,932 4,639 62
Andhra Pradesh 8,86,710 8,44,719 7,136
Arunachal Pradesh 16,838 14,953 56
Assam 2,17,269 2,10,481 1,082
Bihar 2,68,655 2,59,484 1,510
Chandigarh 20,623 18,463 330
Chhattisgarh 2,84,123 2,45,327 3,444
Dadra and Nagar Haveli
and Daman and Diu 3,391 3,332 2
Delhi 6,27,149 5,83,058 10,585
Goa 49,058 45,981 706
Gujarat 2,45,922 2,31,702 4,218
Haryana 2,66,926 2,42,150 2,978
Himachal Pradesh 42,770 29,473 675
Jammu and Kashmir 1,20,527 1,10,692 1,862
Jharkhand 1,14,180 1,08,931 1,017
Karnataka 9,16,909 8,68,738 11,996
Kerala 6,04,174 4,90,372 2,266
Ladakh 9,010 7,732 124
Madhya Pradesh 2,25,665 2,05,089 3,501
Maharashtra 17,31,825 15,92,523 45,325
Manipur 28,780 25,895 362
Meghalaya 12,272 10,564 118
Mizoram 4,317 3,870 9
Nagaland 11,609 10,884 68
Odisha 3,29,057 3,20,526 1,826
Puducherry 38,983 35,859 643
Punjab 1,67,619 1,55,133 5,441
Rajasthan 9,51,917 8,31,888 7,070
Sikkim 3,986 3,719 64
Tamil Nadu 8,19,845 7,90,309 12,167
Telangana 2,72,719 2,63,526 1,471
Tripura 33,426 31,535 386
Uttar Pradesh 5,17,838 4,87,395 7,508
Uttarakhand 70,231 61,875 1,146
West Bengal 4,83,484 4,50,355 8,407

The table shows the total number of COVID-19 cases, recoveries, and deaths in India during
the year 2020, broken down by state/union territory. The data is sourced from the Ministry
of Health and Family Welfare, Government of India.
Each row represents a state/union territory and the columns provide the following
information:
State/Union Territory: The name of the state or union territory in India.
Total Cases: The total number of confirmed COVID-19 cases reported in the state/union
territory during the year 2020.
Total Recoveries: The total number of COVID-19 patients who recovered and were
discharged from hospitals or declared cured in the state/union territory during the year
2020.
Total Deaths: The total number of COVID-19 related deaths reported in the state/union
territory during the year 2020.

RESULT:
Code:

# create a data frame for the total cases, recoveries, and deaths in each state of India in
2020
india_states <- data.frame(
State = c("Andhra Pradesh", "Arunachal Pradesh", "Assam", "Bihar", "Chhattisgarh", "Goa",
"Gujarat", "Haryana", "Himachal Pradesh", "Jharkhand", "Karnataka", "Kerala", "Madhya
Pradesh", "Maharashtra", "Manipur", "Meghalaya", "Mizoram", "Nagaland", "Odisha",
"Punjab", "Rajasthan", "Sikkim", "Tamil Nadu", "Telangana", "Tripura", "Uttar Pradesh",
"Uttarakhand", "West Bengal"),
Total_Cases = c(886418, 16836, 216919, 239771, 319057, 52322, 252911, 277455, 34534,
113177, 942229, 981487, 240368, 1917918, 29076, 13409, 4258, 11929, 336056, 164025,
331702, 6048, 905024, 301186, 33398, 594850, 94225, 580606),
Total_Recoveries = c(875996, 16772, 211303, 236739, 312262, 51204, 236060, 269456,
32986, 111196, 912237, 888313, 229612, 1710054, 28525, 13065, 3997, 11431, 331859,
150930, 317111, 5769, 877988, 285436, 32917, 568132, 89052, 545221),
Total_Deaths = c(7169, 56, 989, 1327, 3876, 756, 4304, 3043, 576, 1039, 12270, 3921,
3654, 49917, 425, 145, 24, 77, 1859, 5583, 2920, 231, 12466, 1747, 382, 8826, 1746, 10585)
)

# create a bar chart for total cases, recoveries, and deaths for each state using base R
barplot(as.matrix(india_states[, 2:4]),
beside = TRUE,
main = "Total Covid-19 Cases, Recoveries, and Deaths by State in India in 2020",
xlab = "State",
ylab = "Count",
col = c("#FFA07A", "#7FFF7F", "#FF7F50"),
legend = rownames(india_states),
args.legend = list(x = "topright", bty = "n"))
FINDINGS AND CONCLUSION:

Future Enhancement:
1.Prediction Model India has not reached the peak yet, once it reaches the peak the
prediction model can be built to show that how much time it will take to get things back to
the normal.
2.Sentiment Analysis India has never experienced such pandemic in last 100 years so what
do people think about this pandemic, lockdown, government approach/policies etc can be
studied to have sentiment insight of this pandemic.

Findings and Conclusion:


The COVID-19 pandemic has been an unprecedented global crisis that has affected almost
every aspect of our lives. India, with a population of over 1.3 billion people, has been hit
hard by the pandemic. As of December 2020, India had the second-highest number of
confirmed cases in the world, with over 10 million people infected and over 145,000
deaths.

Geographic and demographic disparities: Our analysis shows that the COVID-19 pandemic
in India has affected different regions and demographic groups differently. For example,
while Maharashtra was the state with the highest number of cases, states such as Manipur
and Mizoram had significantly lower numbers. Similarly, the virus disproportionately
affected men and elderly individuals, who were more likely to experience severe illness or
death.

Impact of interventions: The Indian government's interventions, such as the nationwide


lockdown, helped slow down the spread of the virus but also had adverse economic and
social effects. Future analysis could assess the effectiveness of different interventions and
how they impacted the spread of the virus.

Testing rates: Our analysis reveals that testing rates increased over time but were initially
low. This highlights the need for more widespread and accessible testing to identify and
isolate infected individuals and track the spread of the virus. We can draw several
conclusions. Firstly, we found that the pandemic had a significant impact on the Indian
population, with cases and deaths increasing rapidly during the initial months of the
outbreak. The country recorded its first case of COVID-19 in January 2020, and by the end
of March 2020, the number of cases had surpassed 1,000. The number of cases continued
to rise, and by September 2020, India had recorded over 5 million cases.
Case fatality rate (CFR): Sikkim’s CFR, while relatively low compared to other States, does
not reflect the actual mortality rate of the virus. Other factors, such as the quality of
healthcare, the age distribution of the population, and the number of cases, could influence
the CFR.

Importance of global cooperation: The COVID-19 pandemic has emphasized the importance
of global cooperation and solidarity in addressing a common threat that affects all
humanity. India, as a major player in global health, has a significant role to play in the global
response to the pandemic. Through our analysis of the COVID-19 cases in India in 2020,
We also observed that the Indian government implemented several measures to curb the
spread of the virus, including a nationwide lockdown in March 2020. While the lockdown
helped slow down the spread of the virus, it also had several adverse effects, such as job
losses, economic slowdown, and social disruptions.

Moreover, our analysis showed that the testing rate in India was initially low but increased
over time. Testing is crucial for identifying and isolating infected individuals and tracking the
spread of the virus. Hence, increased testing is an essential tool for controlling the spread
of the virus.

Additionally, we found that the case fatality rate in Southern States was relatively low
compared to other States. However, it only represents the proportion of reported cases
that result in death and may not reflect the actual mortality rate. Factors such as the
number of cases, the quality of healthcare, and the age distribution of the population can
also influence the actual case fatality rate.

In conclusion, the COVID-19 pandemic had a significant impact on India in 2020. It


highlighted the need for effective measures to control the spread of the virus, including
increased testing, contact tracing, and isolation of infected individuals. The pandemic also
exposed the vulnerabilities of the healthcare system and the need for better infrastructure
and resources to manage such crises. Finally, the pandemic emphasized the importance of
global cooperation and solidarity in addressing a common threat that affects all humanity.
REFERENCES

FOR DATA:
 https://api.covid19india.org/
 https://prsindia.org/covid-19/cases
 https://www.mohfw.gov.in/
 https://www.who.int/india/emergencies/coronavirus-disease-(covid-19)
FOR PROGRAMMING:
 https://www.w3schools.com/
 https://stackoverflow.com/

You might also like