Quartiles

The document explains quartiles, quartile deviation, and the coefficient of quartile deviation as measures of data dispersion. It details how to calculate quartiles for both ungrouped and grouped data, and provides formulas for quartile deviation and its coefficient. Additionally, it includes solved examples to illustrate the application of these concepts in statistical analysis.

Uploaded by

McAnthony Olisah
Copyright
© All Rights Reserved

Quartiles, Quartile Deviation and Coefficient of Quartile Deviation

The Quartile Deviation is a simple way to estimate the spread of a distribution about a measure of its
central tendency (usually the median). It gives you an idea of the range within which the central
50% of your sample data lies. Based on the quartile deviation, the Coefficient of Quartile
Deviation can then be defined, which makes it easy to compare the spread of two or more different
distributions. Since both of these topics are based on the concept of quartiles, we'll first understand how
to calculate the quartiles of a dataset before working with the direct formulae.

Quartiles

A median divides a given dataset (which is already sorted) into two equal halves. Similarly, the quartiles
divide a given dataset into four equal parts. Logically, there are therefore three quartiles for a
given distribution, and the second quartile is equal to the median itself. We'll deal with the other
two quartiles in this section.

• The first quartile, also called the lower quartile or the 25th percentile and denoted by Q1, is
the value that lies halfway between the lowest value in the distribution and the median (when the
data is already sorted in ascending order). Hence, 25% of the data lies at or below it.

• Similarly, the third quartile, also called the upper quartile or the 75th percentile and denoted
by Q3, is the value that lies halfway between the median and the highest value in the distribution
(when the data is already sorted in ascending order). Hence, 75% of the data lies at or below it,
and the remaining 25% lies at or above it.

For a better understanding, consider how the quartiles divide a Gaussian distribution:

[Figure: quartiles of a Gaussian distribution]

The Quartile Deviation


Formally, the Quartile Deviation is equal to half of the Inter-Quartile Range, and thus we can write it
as:

$$Q_d = \frac{Q_3 - Q_1}{2}$$

Therefore, we also call it the Semi Inter-Quartile Range.

• The Quartile Deviation doesn't take into account the extreme points of the distribution. Thus,
only the dispersion or spread of the central 50% of the data is considered.

• If the scale of the data is changed, Qd also changes in the same ratio.

• It is the best measure of dispersion for open-ended distributions (those whose first or last
class has no stated limit).

• It is less affected by sampling fluctuations in the dataset than the range (another
measure of dispersion).

• Since it depends solely on the central values of the distribution, the result is affected
drastically if these values are abnormal or inaccurate in an experiment.

Quartile Deviation Formula

$$\text{Quartile Deviation} = \frac{Q_3 - Q_1}{2}$$

Q1 = lower quartile

Q3 = upper quartile

Q2 is also known as the median.

Quartile Deviation for Ungrouped Data

For ungrouped data, the formulae to calculate the quartiles are:

$Q_1 = \left(\frac{n+1}{4}\right)^{\text{th}}$ item

$Q_2 = \left(\frac{n+1}{2}\right)^{\text{th}}$ item

$Q_3 = \left(\frac{3(n+1)}{4}\right)^{\text{th}}$ item

Here, n is the total number of observations.

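It is important to note that the given data values must be arranged in ascending order
before estimating the quartiles. When a position is not a whole number, the quartile is found by
interpolating linearly between the two neighbouring items.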
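
The positional rule above can be sketched in Python as follows. This is a minimal illustration rather than a definitive implementation: the function names and the sample list are made up for the example, and the linear interpolation between neighbouring items is an assumption that matches the convention used in the solved examples of this document.

```python
def value_at_position(sorted_values, position):
    """Return the value at a 1-based, possibly fractional, position."""
    n = len(sorted_values)
    # Keep the position inside the data (matters only for very small n).
    position = min(max(position, 1), n)
    lower_index = int(position)        # whole part of the position
    fraction = position - lower_index  # fractional part of the position
    lower_value = sorted_values[lower_index - 1]
    if fraction == 0:
        return lower_value
    upper_value = sorted_values[lower_index]
    # Interpolate linearly between the two neighbouring items.
    return lower_value + fraction * (upper_value - lower_value)


def quartiles_ungrouped(values):
    """Return (Q1, Q2, Q3) using the r(n+1)/4 positional rule."""
    data = sorted(values)  # the rule only works on sorted data
    n = len(data)
    return tuple(value_at_position(data, r * (n + 1) / 4) for r in (1, 2, 3))


# Hypothetical sample, chosen only for illustration.
sample = [7, 1, 5, 3, 9, 11, 13]
q1, q2, q3 = quartiles_ungrouped(sample)
print(q1, q2, q3)
```

With seven observations the positions are the 2nd, 4th and 6th items, so no interpolation is needed for this sample.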

Quartile Deviation for Grouped Data

For grouped data, the quartiles can be calculated using the following formula:

$$Q_r = l_1 + \frac{r\left(\frac{N}{4}\right) - c}{f}\,(l_2 - l_1)$$

Here,

$Q_r$ = the rth quartile (r = 1, 2, 3)

$l_1$ = the lower limit of the quartile class

$l_2$ = the upper limit of the quartile class

$f$ = the frequency of the quartile class

$c$ = the cumulative frequency of the class preceding the quartile class

$N$ = the number of observations in the given data set

The quartile class is the first class whose cumulative frequency reaches $r\left(\frac{N}{4}\right)$.
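
As a rough sketch, the grouped-data formula can be written in Python as shown below. The function name, the `(lower, upper, frequency)` tuple layout and the small class table are assumptions made for this illustration only. The quartile class is taken to be the first class whose cumulative frequency reaches r(N/4).

```python
def grouped_quartile(classes, r):
    """Return the r-th quartile (r = 1, 2, 3) of grouped data.

    classes: list of (lower_limit, upper_limit, frequency) tuples,
             listed in ascending order of the class limits.
    """
    total = sum(freq for _, _, freq in classes)  # N
    target = r * total / 4                       # r(N/4)
    cumulative = 0                               # c, frequency before the class
    for lower, upper, freq in classes:
        if freq > 0 and cumulative + freq >= target:
            # Q_r = l1 + (r(N/4) - c) / f * (l2 - l1)
            return lower + (target - cumulative) / freq * (upper - lower)
        cumulative += freq
    raise ValueError("no quartile class found; check the frequencies")


# Hypothetical class table, chosen only for illustration.
example_classes = [(0, 10, 4), (10, 20, 8), (20, 30, 4)]
print([grouped_quartile(example_classes, r) for r in (1, 2, 3)])
```

Passing the class limits explicitly, rather than a single class width, lets the same sketch handle classes of unequal width.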

The Coefficient of Quartile Deviation

Based on the quartiles, a relative measure of dispersion, known as the Coefficient of Quartile Deviation,
can be defined for any distribution. It is formally defined as:

$$\text{Coefficient of Quartile Deviation} = \frac{Q_3 - Q_1}{Q_3 + Q_1} \times 100$$

Since it involves a ratio of two quantities of the same dimensions, it is unitless. Thus, it can act as a
suitable parameter for comparing two or more different datasets which may or may not involve
quantities with the same dimensions.
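
To illustrate why a unitless coefficient helps with comparison, here is a small Python sketch. The two pairs of quartiles (heights in centimetres and weights in kilograms) are invented for the example and are not taken from any dataset in this document. The function names are likewise hypothetical.

```python
def quartile_deviation(q1, q3):
    """Half of the interquartile range; carries the unit of the data."""
    return (q3 - q1) / 2


def coefficient_of_quartile_deviation(q1, q3):
    """Relative (unitless) dispersion, expressed as a percentage."""
    if q1 + q3 == 0:
        raise ValueError("Q1 + Q3 must be non-zero")
    return (q3 - q1) / (q3 + q1) * 100


# Hypothetical quartiles (Q1, Q3) for two quantities with different units.
heights_cm = (160.0, 180.0)
weights_kg = (55.0, 85.0)

for name, (q1, q3) in (("heights", heights_cm), ("weights", weights_kg)):
    qd = quartile_deviation(q1, q3)
    coeff = coefficient_of_quartile_deviation(q1, q3)
    print(f"{name}: QD = {qd}, coefficient = {coeff:.2f}")
```

The two quartile deviations are in different units and cannot be compared directly, whereas the two coefficients can.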

So, now let’s go through the solved examples below to get a better idea of how to apply these concepts
to various distributions.

Importance of Quartile Deviation

Statistics is a tool that helps us understand data, its frequency, and the distribution of its trends.
The difference between the third quartile and the first quartile of a frequency distribution is known
as the interquartile range. It is important because it describes the spread of the central 50% of the
data without being influenced by extreme values, which helps to assess the characteristics of the data.
When we divide the interquartile range by two, we get the quartile deviation, or semi-interquartile range.

Solved Examples on Quartile Deviation

Question 1: The number of vehicles sold by a major Toyota Showroom in a day was recorded for 10
working days. The data is given as –

Day    Frequency (vehicles sold)
1      20
2      15
3      18
4      5
5      10
6      17
7      21
8      19
9      25
10     28

Find the Quartile Deviation and its coefficient for the given discrete distribution case.

Solution: We first need to sort the frequency data given to us before proceeding with the quartiles
calculation –

Sorted Data – 5, 10, 15, 17, 18, 19, 20, 21, 25, 28
n(number of data points) = 10

Now, to find the quartiles, we use the logic that the first quartile lies halfway between the lowest value
and the median; and the third quartile lies halfway between the median and the largest value.

First Quartile $Q_1 = \left(\frac{n+1}{4}\right)^{\text{th}}$ term

= $\left(\frac{10+1}{4}\right)^{\text{th}}$ term = 2.75th term
= 2nd term + 0.75 × (3rd term − 2nd term)
= 10 + 0.75 × (15 − 10)
= 10 + 3.75
= 13.75

Third Quartile $Q_3 = \left(\frac{3(n+1)}{4}\right)^{\text{th}}$ term

= $\left(\frac{3(10+1)}{4}\right)^{\text{th}}$ term = 8.25th term
= 8th term + 0.25 × (9th term − 8th term)
= 21 + 0.25 × (25 − 21)
= 21 + 1
= 22

Using the values for Q1 and Q3, we can now calculate the Quartile Deviation and its coefficient as follows:

Quartile Deviation = Semi-Inter Quartile Range

= $\frac{Q_3 - Q_1}{2}$
= $\frac{22 - 13.75}{2}$
= $\frac{8.25}{2}$
= 4.125

Coefficient of Quartile Deviation

= $\frac{Q_3 - Q_1}{Q_3 + Q_1} \times 100$
= $\frac{22 - 13.75}{22 + 13.75} \times 100$
= $\frac{8.25}{35.75} \times 100$
≈ 23.08
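
The arithmetic in this solution can be cross-checked with Python's standard library. This is only a verification sketch: `statistics.quantiles` with `method="exclusive"` places the quartiles at the r(n+1)/4 positions and interpolates linearly, which matches the convention used above. The variable names are chosen for this example.

```python
import statistics

# Daily sales from Question 1 (unsorted; quantiles() sorts internally).
vehicles_sold = [20, 15, 18, 5, 10, 17, 21, 19, 25, 28]

# n=4 asks for the three cut points that split the data into quarters.
q1, q2, q3 = statistics.quantiles(vehicles_sold, n=4, method="exclusive")

quartile_deviation = (q3 - q1) / 2
coefficient = (q3 - q1) / (q3 + q1) * 100

print(q1, q3, quartile_deviation, round(coefficient, 2))
# prints: 13.75 22.0 4.125 23.08
```

Other tools may use a different positional convention and return slightly different quartiles for the same data.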

Question 2:

For the following grouped data, calculate the Quartile Deviation and its coefficient.

Marks    No. of Students
0-10     10
10-20    20
20-30    30
30-40    50
40-50    40
50-60    30

Solution: For the case of a grouped-data distribution, we can find the quartiles through the following
steps:

⇒ Construct a cumulative frequency table for the given data alongside the given distribution
⇒ From the total number of data values, estimate the groups/classes of the Lower and Upper Quartiles
⇒ Use the following formulae to then calculate the quartiles:

The Lower Quartile $Q_1 = LB + w \times \frac{\frac{1}{4}n - f_c}{f}$

The Upper Quartile $Q_3 = LB + w \times \frac{\frac{3}{4}n - f_c}{f}$

where LB = the lower bound of the class in which the respective quartile lies
w = the class width
f_c = the cumulative frequency of the classes preceding that class
f = the frequency corresponding to that particular class
n = the total number of observations

For the given data, we can form the required table with the cumulative frequency as:

Marks    Frequency    Cumulative Frequency
0-10     10           10
10-20    20           30
20-30    30           60
30-40    50           110
40-50    40           150
50-60    30           180

Since the total number of students is 180, the first quartile must lie at the position of 180/4 = 45th
student. Similarly, the third quartile must lie at the position of 180×3/4 = 135th student. By the
distribution of our data into groups, we can note that the first quartile will lie in the 20-30 marks range.
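
The step of locating the quartile classes from the cumulative frequencies can be sketched in Python as below. The helper name `quartile_class` and the variable names are assumptions for this illustration. The sketch uses the convention that the quartile class is the first class whose cumulative frequency reaches r(n/4).

```python
from bisect import bisect_left
from itertools import accumulate

class_labels = ["0-10", "10-20", "20-30", "30-40", "40-50", "50-60"]
frequencies = [10, 20, 30, 50, 40, 30]

# Running totals: [10, 30, 60, 110, 150, 180]
cumulative = list(accumulate(frequencies))
total = cumulative[-1]  # n = 180


def quartile_class(r):
    """Return the label of the class containing the r-th quartile."""
    position = r * total / 4
    # First index whose cumulative frequency is >= position.
    index = bisect_left(cumulative, position)
    return class_labels[index]


print(quartile_class(1), quartile_class(3))
# prints: 20-30 40-50
```

Because the cumulative frequencies are non-decreasing, a binary search is enough to find the class.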

Calculation:

$Q_1 = LB + w \times \frac{\frac{1}{4}n - f_c}{f}$
Here, LB = 20; w = 10
f_c = 30; f = 30; n = 180
Thus, $Q_1 = 20 + 10 \times \frac{\frac{1}{4} \times 180 - 30}{30}$
= $20 + \frac{15}{30} \times 10$
= 25

Similarly, the third quartile will lie in the 40-50 marks range. Calculation:

$Q_3 = LB + w \times \frac{\frac{3}{4}n - f_c}{f}$
Here, LB = 40; w = 10
f_c = 110; f = 40; n = 180
Thus, $Q_3 = 40 + 10 \times \frac{\frac{3}{4} \times 180 - 110}{40}$
= $40 + \frac{25}{40} \times 10$
= 46.25

Using the values for Q1 and Q3, we can now calculate the Quartile Deviation and its coefficient as
follows:

Quartile Deviation = Semi-Inter Quartile Range

= $\frac{Q_3 - Q_1}{2}$
= $\frac{46.25 - 25}{2}$
= $\frac{21.25}{2}$
= 10.625

Coefficient of Quartile Deviation

= $\frac{Q_3 - Q_1}{Q_3 + Q_1} \times 100$
= $\frac{46.25 - 25}{46.25 + 25} \times 100$
= $\frac{21.25}{71.25} \times 100$
≈ 29.82
