Chapter 4 - Measures of Central Tendency
Chapter 4 - Measures of Central Tendency
A measure of central tendency is a single value that attempts to describe a set of data by identifying
the central position within that set of data. The mean (often called the average) is most likely the
measure of central tendency that you are most familiar with, but there are others, such as, the
median and the mode.
The mean, median and mode are all valid measures of central tendency but, under different
conditions, some measures of central tendency become more appropriate to use than others. In the
following sections we will look at the mean, mode and median and learn how to calculate them
and under what conditions they are most appropriate to be used.
The mean, median and mode can also be computed when the data is either an ungrouped or
grouped. Ungrouped is raw data. This means that it has just been collected but not sorted into
any group or classes. On the other hand, grouped data is data that has been organized into groups
from the raw data. Grouped data is data given in intervals.
∑𝒏
𝒊=𝟏 𝒙𝒊 ∑𝑿
X= or can be simplified by X =
𝒏 𝒏
Wherein X stands for the values or items and n is the number of measures in distribution.
∑𝒏
𝒊=𝟏 𝒙𝒊 𝒇𝒊
X=
𝒏
Wherein Xi fi is the product of each score (Xi) and its (fi ) and n is the number of items or cases.
Example 2: The mean of 80, 80, 75, 76, 74, 74, 74 is
2 (80)+75+76+3(74)
X= = 76. 14
7
∑𝑛
𝑖=1 𝑋𝑖 𝑓𝑖
X= where Xi stands for the class mark and fi is the corresponding frequency.
𝑛
To illustrate, please see the table below.
Table 4.1. Statistics Test Score
Class Interval Class Mark Frequency Xi fi
90 – 94 92 2 184
85 – 89 87 6 522
80 – 84 82 3 246
75 – 79 77 8 616
70 – 74 72 5 360
65 – 69 67 2 134
60 – 64 62 10 620
55 – 59 57 3 171
50 – 54 52 4 208
45 – 49 47 3 141
40 - 44 42 4 168
∑ 𝑓𝑖 = 50 ∑ 𝑋𝑖 𝑓𝑖 = 3,370
∑𝑛
𝑖=1 𝑋𝑖 𝑓𝑖 3,370
Hence, X= = = 67. 4
𝑛 50
4.2. The Weighted Mean
The weighted mean is a type of mean that is calculated by multiplying the weight (or probability)
associated with a particular event or outcome with its associated quantitative outcome and then
summing all the products together. It is very useful when calculating a theoretically expected
outcome where each outcome has a different probability of occurring, which is the key feature that
distinguishes the weighted mean from the arithmetic mean.
Weighted means are useful in a wide variety of scenarios. For example, a student may use a
weighted mean in order to calculate his/her percentage grade in a course. In such an example, the
student would multiply the weighing of all assessment items in the course (e.g., assignments,
exams, projects, etc.) by the respective grade that was obtained in each of the categories. Consider
a student with the following grades:
Example: Let us determine the weighted mean if 500 bags were sold at P250 each, 350 bags at
P200 each, 200 bags at P150 each, 150 bags at P100 each, and 50 bags at P80.
Example:
Table 4.3. Computations of the Median
∑ 𝑓𝑖 = N = 176
This formula is used only if the modal class and the two adjoining class intervals have the same
width. The modal class is the class interval with the highest frequency.
Thus from the previous example (Table 4. 3)
u = 409.5 ; d1 = 20-13 = 7 ; d1 = 20 – 15 = 5 ; i = 10
𝒅𝟏 7
Mo = u + (𝒅 ) i = 409.5 + ( 7+5 ) 10 = 415.33
𝟏 𝒅𝟐
4.5 Uses of the Measures of Central Tendency
4.5.1. The Mean is used
for interval and ratio measurements
if higher statistical computations are wanted
if there are no extreme values in a distribution since it is easily affected by the
extremely high or extremely low scores. Thus, the distribution is approximately
normal
When the greatest reliability of the measure of central tendency is wanted since its
computations include all the given values.
4.5.2 The Median is Used
It is the most widely used average, because it is the most familiar. It is often, however
misused. It cannot be used if the clustering of values or items is not substantial. An
example is when representing the scores or values, 10 and 100 since they are far apart.
When the given values do not tend to cluster around a central value, the mean is a poor
measure of central location.
It is easily affected by extremely large or small values. One small value can easily pull
down the mean.
The mean cannot be utilized to compare distributions since the means of two or more
distributions may be the same but their other characteristics maybe entirely different.
The means of distribution A whose values are 80, 85, and 90 and distribution B whose
values are 86, 85, and 84 are both 85. However, we cannot imply that both distributions
possess the same characteristics since their patterns of dispersion or variations are
markedly different despite having the same mean.
4.6.2 The Limitations of the Median:
It is easily affected by the number of items in a distribution
It cannot be determined if the given values are not arranged according to magnitude.
If several values are contained in distribution, it becomes laborious task to arrange them
according to magnitude.
Its value is not as accurate as the mean because it is just an ordinal statistic.
4.7 Quantities
The idea of the median can be extended to the discussion of quantities. Quantiles refers to
values which divide the distribution into a given number of equal parts. The median divides the
distribution into two equal parts. There are other types of quantiles which we shall discuss in this
section these are the quantiles, deciles, and the percentiles or centiles.
The quantiles divide the distribution into four equal parts. The quantiles are Q1 (first
quantile), Q2 (second quantile or median), Q3 (third quantile) and Q4 (fourth quantile).
The deciles divide the distribution into ten equal parts. The decile are D1 (first decile), D2
(second decile), D3 (third decile), up to D10 (tenth decile).
The percentiles or centiles divide the distribution into one-hundred equal parts. They are
P1, (first percentile), P2 (second percentile), and so on up to P100. Notice that P50 = Q2 = Median,
P20 = D2; C90 = D9, etc.
Quantiles are determined when the values in the distribution are already arranged according
to magnitude for ungrouped data. The computation for grouped data of the quantiles are similar to
that of the median.
To understand the concept of quantiles, let us take the percentileor or centiles. A percentile
is a point in a distribution below which a given percent of cases lie. For example, the 70th
percentile or P70 is the point or score in a distribution below which 70% of the cases lie. If the 70th
percentile is equal to 80, that is P70 in a distribution scores given to freshmen entrants in a certain
college, a student who got 80 in the admission exam surpassed 70% of the cases with only 30%
examinees higher than his grade.
4.7.1 Computations of the Quantiles for Ungrouped Data
To determine any quantile, change it first to percentile and follow the steps below:
Step 1. Arrange first the scores according to magnitude or size
Step 2. Find the position of the given percentile in the distribution using the formula P(n+1)/100
where P is the given percent and n is the number of cases.
Step 3. Locate the score corresponding to the obtained position in the distributions starting from
the lowest score.
Step 4. Interpolate to get the score if the obtained position from step 2 is not exact.
Example: Find the 20th percentile or P20 of the following scores: 25, 22, 20, 16, 17, 12, 8, 6, 5
Steps:
𝑃 (𝑛+1)
1. Locate the position of the score corresponding to the 20th percentile using 100
20 (9+1)
Solution: =2
100
Example: Find the 60th percentile or P60 of the following scores: 99, 95, 80, 75, 70, 60, 40
Steps:
1. Compute for the position of the 60th percentile.
𝑃 (𝑛+1) 60 (7+1)
= = 4.8
100 100
2. Since 4.8 is between the 4th and 5th score from the bottom, we have to interpolate to find
the answer. Take the 4th score from the bottom which is 75 and the 5th score which is 80.
3. Solve the difference between these two scores, 80 – 75, which is 5.
4. Multiply the difference 5 by the decimal part obtained in step 1 which is 0.8. The product
is 4.
5. Finally add this product to the lower score, 75.
Thus P60 = 75 + 4 = 79
This means that 60% of the cases fall below the score 79.
4.7.2 Computations of the Quantiles for Grouped Data
The computation of any quantile for grouped data is similar to that of the median.
The formula is:
𝒏𝒑−𝒄𝒇
Pp = u + ( )i
𝒇𝒑
Step 3: Find the exact lower limit of 52 which is 51.5. So, u = 51.5.
Step 4: Find the cf value immediately below the class interval, 52-55. Thus, cf = 65.
Step 5: Find the fp value or frequency of the class interval 52-55. So, fp = 5.
Step 7: Compute for P75 by substituting all the needed values in the formula.
Solution:
𝑛𝑝−𝑐𝑓 67.5−65
Pp = u + ( )i = 51.5 + ( ) 4 = 53.5
𝑓𝑝 5
This means that 75% of the cases lie below the score 53.5
References:
Nocon, F.P, Torrecampo, J.T, Balacua, M.P, Dagua, W.B. General Statistics Made Simple for
Filipinos
https://centergrove.instructure.com/courses/1823759/pages/module-7-measures-of-central-
tendency
https://corporatefinanceinstitute.com/resources/knowledge/other/weighted-mean/