
Descriptive

Descriptive statistics involves summarizing data using methods like charts and tables to make it easily understandable. Key concepts include measures of central tendency (mean, median, mode) and measures of variability (range, variance, standard deviation) that describe data distribution. The document also outlines how to compute these measures in R.

Uploaded by

rithikaoct08
Copyright
© © All Rights Reserved

DESCRIPTIVE STATISTICS
DEFINITION:

• Describing our data with the help of representative methods such as charts, graphs, tables, Excel files, etc.
• In descriptive statistics, we describe our data and present it in a meaningful way so that it can be easily understood.
PROCESS OF DESCRIPTIVE STATISTICS IN R
MEASURE OF CENTRAL TENDENCY
• It represents the whole data set by a single value that locates the center of the data. There are three main measures of central tendency:
Mean
Mode
Median
MEAN:
• It is the sum of the observations divided by the total number of observations. It is also called the average.
$$\bar{x} = \frac{\sum x}{n}$$
where,
x = Observations
n = number of terms
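
As a minimal sketch of the mean in R, the snippet below computes it once from the definition and once with the built-in `mean()`. The vector `x` is hypothetical sample data chosen only for illustration.

```r
# Hypothetical sample data, chosen only for illustration
x <- c(2, 4, 4, 5, 7, 8)

# Mean from the definition: sum of observations divided by their count
manual_mean <- sum(x) / length(x)

# Built-in equivalent
builtin_mean <- mean(x)

print(manual_mean)
print(builtin_mean)
```

Both values should agree, since `mean()` applies the same definition to a numeric vector.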
MEDIAN:
• It is the middle value of the data set. It splits the data into two halves.
• If the number of elements in the data set is odd, the median is the center element of the sorted data. If it is even, the median is the average of the two central elements.
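
The odd and even cases can be sketched in R as follows. The helper `manual_median` and the two vectors are hypothetical names and data introduced for illustration; the built-in `median()` is shown for comparison.

```r
# Hypothetical sample data: one odd-length and one even-length vector
odd_x  <- c(7, 1, 3, 9, 5)
even_x <- c(7, 1, 3, 9, 5, 11)

# Illustrative helper (not a built-in): median from the definition
manual_median <- function(v) {
  s <- sort(v)          # the median is defined on sorted data
  n <- length(s)
  if (n %% 2 == 1) {
    s[(n + 1) / 2]      # odd count: the single center element
  } else {
    (s[n / 2] + s[n / 2 + 1]) / 2   # even count: average of the two central elements
  }
}

print(manual_median(odd_x))
print(manual_median(even_x))

# Built-in equivalent
print(median(odd_x))
print(median(even_x))
```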
MODE:
• It is the value that has the highest frequency in the given data set.
• The data set may have no mode if the frequency of all data points is the
same.
• Also, we can have more than one mode if two or more values share the same highest frequency.
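
Base R has no built-in function for the statistical mode (its `mode()` function reports an object's storage mode instead), so the sketch below defines a hypothetical helper, `stat_modes`. It follows the conventions stated above: it returns every value that shares the highest frequency, and an empty vector when all values occur equally often.

```r
# Illustrative helper (not part of base R): statistical mode(s) of a vector
stat_modes <- function(v) {
  if (length(v) == 0) {
    return(v)                          # nothing to count
  }
  values <- unique(v)                  # distinct values, in order of first appearance
  counts <- tabulate(match(v, values)) # frequency of each distinct value
  # Convention from the text: equal frequencies everywhere means no mode
  if (length(values) > 1 && all(counts == counts[1])) {
    return(v[0])
  }
  values[counts == max(counts)]        # all values with the highest frequency
}

# Hypothetical sample data
print(stat_modes(c(4, 4, 1)))           # one mode
print(stat_modes(c(1, 2, 2, 3, 3, 4)))  # two modes
print(stat_modes(c(1, 2, 3)))           # no mode
```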
MEASURE OF VARIABILITY
• A measure of variability is a statistic that describes how spread out a set of data is. Variability is also known as spread, scatter, or dispersion.
• In descriptive statistics in R, a measure of variability describes the spread of the data, that is, how widely the values are distributed.

Range
Variance
Standard deviation
RANGE:
The range describes the difference between the largest and smallest data point in our data set.
Range = Largest data value – smallest data value
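
A short sketch of the range in R, using hypothetical sample data. Note that R's built-in `range()` returns the smallest and largest values as a pair rather than their difference, so `diff()` is applied to get the single number defined above.

```r
# Hypothetical sample data, chosen only for illustration
x <- c(12, 7, 3, 18, 10)

# Range from the definition: largest value minus smallest value
data_range <- max(x) - min(x)

# range() returns c(min, max); diff() subtracts the first from the second
same_range <- diff(range(x))

print(range(x))
print(data_range)
print(same_range)
```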
VARIANCE:
• The average of the squared distances from the mean.
• It is calculated by finding the difference between each data point and the mean (the average), squaring each difference, adding the squares, and then dividing the sum by the number of data points in the data set.
$$\sigma^2 = \frac{\sum (x - \mu)^2}{N}$$
where,
x -> Observation under consideration
N -> number of terms
μ -> Mean
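
The formula above divides by N, which gives the population variance. R's built-in `var()` divides by N − 1 instead (the sample variance), so the two results differ slightly. The sketch below, on hypothetical data, computes both and shows how they relate.

```r
# Hypothetical sample data, chosen only for illustration
x <- c(2, 4, 4, 4, 5, 5, 7, 9)
n <- length(x)
mu <- mean(x)

# Population variance, as in the formula above: divide by N
pop_var <- sum((x - mu)^2) / n

# Built-in var() computes the sample variance: divide by N - 1
sample_var <- var(x)

print(pop_var)
print(sample_var)

# Rescaling the sample variance recovers the population variance
print(sample_var * (n - 1) / n)
```

Which denominator is appropriate depends on whether the data are the whole population or a sample drawn from it.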
STANDARD DEVIATION:
• It is defined as the square root of the variance.
• It is calculated by finding the mean (the average), subtracting the mean from each number, and squaring each result.
• The squared values are then added, the sum is divided by the number of terms, and the square root of that quotient is taken.
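
The steps above can be sketched in R as follows, on hypothetical data. As with the variance, the steps describe the population standard deviation (dividing by the number of terms), while R's built-in `sd()` divides by one less than the number of terms.

```r
# Hypothetical sample data, chosen only for illustration
x <- c(2, 4, 4, 4, 5, 5, 7, 9)
n <- length(x)

# Step 1: find the mean
mu <- mean(x)

# Step 2: subtract the mean from each number and square the result
squared_dev <- (x - mu)^2

# Step 3: add the squares, divide by the number of terms, take the square root
pop_sd <- sqrt(sum(squared_dev) / n)

# Built-in sd() uses the N - 1 denominator (sample standard deviation)
sample_sd <- sd(x)

print(pop_sd)
print(sample_sd)
```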
THANK YOU
