0% found this document useful (0 votes)
98 views125 pages

Stat Unit 1

The document provides an introduction to probability and statistics. It defines key statistical concepts such as frequency distributions, measures of central tendency, measures of dispersion, skewness, and kurtosis. It discusses descriptive statistics and their use in summarizing and analyzing data. Descriptive statistics help study averages, compare groups, and examine relationships in data. The document also defines important statistical terms including variable, attribute, data series, frequency distribution, and cumulative frequency distribution.

Uploaded by

Reayan Banday
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
98 views125 pages

Stat Unit 1

The document provides an introduction to probability and statistics. It defines key statistical concepts such as frequency distributions, measures of central tendency, measures of dispersion, skewness, and kurtosis. It discusses descriptive statistics and their use in summarizing and analyzing data. Descriptive statistics help study averages, compare groups, and examine relationships in data. The document also defines important statistical terms including variable, attribute, data series, frequency distribution, and cumulative frequency distribution.

Uploaded by

Reayan Banday
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 125

BCA204A34: PROBABILITY AND

STATISTICS

INTRODUCTION
Dr. Soya Mathew
Unit – 1 Portions
Need for quantifying data; Quantitative and qualitative data;
Frequency distributions – discrete and continuous; Histogram,
Frequency curve, Cumulative frequency curve; Measures of central
tendency – Mean, Median and Mode; Measures of Dispersion –
quartile deviation, standard deviation, coefficient of variation;
Skewness – Karl Pearson and Bowley’s formula;
Kurtosis(definition).
What is Statistics ?

The word statistics has two meanings:

In the more common usage, statistics refers to numerical


facts or statistical data. For example, the numbers that represent the
income of a family, the age of a student, the starting salary of a
typical college graduate etc. are examples of statistics in this sense
of the word.
The second meaning of statistics refers to the field or
discipline of study. It is the science of analysing and interpreting
data and making decisions under uncertainties.

According to Croxten and Cowden

“Statistics is the science which deals with collection,


presentation, analysis and interpretation of numerical data.”
Application of Statistics in Computer Science
 Machine learning

 Data Science

 Data mining

 Big data

 Artificial intelligence

 Network and Traffic Modelling

 Image Analysis
Descriptive Statistics
Descriptive statistics include graphical and numerical procedures
that are used to summarize data and to transform data into information.

For example,

• to study the average height of students in a class room

• to compare the performance of 2 group of students on the final math


exam

• to study the intake of nutrition in women aged 25-50 years in a country

etc…
Variable:

In statistics, the term ‘variable’ is used, only if, the changing


characteristics can be numerically measured.

Examples: heights and weights of individuals, price of commodities, income


of individuals, household expenditures, age etc... are variables, as they can be
measured in numerical terms.
Attribute: Attribute is defined as a characteristic or quality of an object or
individual.

The looks of people, their intelligence and aptitude for art and music change
from one individual to the other, they cannot be measured numerically but is
only expressed qualitatively as good, excellent or average etc…

For example,

Height: Tall, Short

Weight: Fat, Medium, Thin


What is Data ?
Data can be defined as a representation of facts, concepts, or
instructions in a formalized manner, which should be suitable for
communication, interpretation, or processing by human or
electronic machine. It is represented with the help of characters such
as alphabets (A – Z, a – z), digits (0 – 9) or special characters (+, -,
/, *, <, >, = etc.). It is the raw list of facts that are processed to gain
information.
Qualitative data is the data in which classification of objects
is based on attributes. For example, the students of a class are
‘genius’, ‘intelligent’, ‘average’, ‘dull’ according to their intelligence
or their performance as Excellent, Good, bad etc…

Quantitative data is the type of data which can be measured


and expressed numerically. Data relating to height, weight,
temperature etc. are examples of quantitative data. A quantity that
varies from individual to another is called variable.
Types of Series
Individual series: The data as observed by an investigator is known as individual
series. It is a series of observations of the form
𝑥: 𝑥1 , 𝑥2 , 𝑥3 ,…𝑥𝑛
For example:
Marks scored by 10 students(𝑥): 100, 90, 72, 59, 40, 20, 23, 75, 61,43.
Discrete Series: When the data collected are large in number and they are distinct
then we can form a frequency distribution (the number of units associated with
each value of the variable). The number of times a data occurs in a data set (A
data set is a collection of observations on one or more variables) is known as the
frequency of data.
It is a series of observation of the form
𝒙: 𝑥1 𝑥2 𝑥3 ….. 𝑥𝑛

𝑓 𝑓1 𝑓2 𝑓3 ….. 𝑓𝑛

Continuous Series: It is a series of observation of the form

𝒄𝒍𝒂𝒔𝒔 𝒊𝒏𝒕𝒆𝒓𝒗𝒂𝒍: 𝑎1 − 𝑎2 𝑎2 − 𝑎3 ….. 𝑎𝑛 − 𝑎𝑛+1

𝑓 𝑓1 𝑓2 ….. 𝑓𝑛
Frequency Distribution
Quantitative Data collected from surveys, reports, interviews
etc… are summarized using Frequency distribution. A systematic
presentation of the values taken by a variable and the corresponding
frequency is called frequency distribution.
A tabular presentation of the frequency distribution is called
the frequency table.
There are two types of frequency distribution:
Discrete (or Ungrouped) Frequency Distribution and
Continuous (or Grouped) Frequency Distribution.
Example for Discrete (or Ungrouped) Frequency Distribution:
For example: In a quiz, the marks obtained by 20 students out of
30 are given as: 12, 15, 15, 29, 30, 21, 30, 30, 15, 17, 19, 15, 20, 20,
16, 21, 23, 24, 23, 21
But if the range of values assumed by the variable is vast, then continuous
frequency distribution is more convenient.
For example: distribution of marks in a class of 40 students
The first column of the table represents the Marks (class No. of students
interval) (Frequency)

class interval with a class width of 10 units.


40 – 50 5
In each class, the lowest number denotes the 50 – 60 7
60 – 70 8
lower class limit and the higher number
70 – 80 12
indicates the upper-class limit. For the class 80 – 90 5

interval 60 – 70, the lower class limit is 60 and 90 – 100 3


40
the upper-class limit is 70.
Now in a frequency distribution, the class intervals may be
inclusive or exclusive. If a class interval is such that the lower as well
as the upper class limits are included in the same class interval it is
inclusive class interval. If a class interval is such that the lower limit
is included in the first class interval whereas the upper class limit is
included in the successive class interval then it is exclusive class
interval. This method is widely followed in practice.
For example: 10, 25, 20, 12, 30, 46, 19, 15, 17, 20, 25.

In inclusive class interval 20 will be included in 20 – 29 and in


exclusive class interval 20 will be included in 20 – 30 instead of 10 – 20
interval. Inclusive Class Interval Exclusive Class Interval

10 – 19 5 10 – 20 5

20 – 29 4 20 – 30 4

30 – 39 1 30 – 40 1

40 – 49 1 40 – 50 1
To ensure the continuity and to get correct class limits, exclusive
method of classification should be adopted. To convert inclusive class
intervals to exclusive, we need to do some adjustment:
Adjusted Class Interval
In the above example,
9.5 – 19.5 5
20−19
Adjustment Factor is = 0.5 19.5 – 29.5
2 4

Then the frequency distribution 29.5 – 39.5 1

with adjusted class is 39.5 – 49.5 1


Cumulative Frequency Distribution
Cumulative frequency of a given variable or class represents the
total frequency of all previous variables including the variable or the
class. i.e:- the added–up frequencies are called Cumulative
Frequency. For example
Class Frequency Cumulative Frequency

10 – 14 6 6
15 – 19 11 17
20 – 24 12 29
25 – 29 10 39
Note:
Frequency distribution of a single variable is called Univariate
Frequency Distribution.

Example: frequency distribution of height of a set of students.

Frequency distribution of more than one variable is called


Multivariate Frequency Distribution.

Example: Frequency distribution of height and weight of a set of


students.
Frequency distribution of height and weight of a set of students:
Weight Height (cms) Total
(Kgs.)

140 – 150 150 – 160 160 – 170 170 - 180

40 - 44 2 1 - - 3

45 - 49 2 4 2 1 9

50 - 54 - 3 5 1 9

55 - 59 - 2 5 4 11

60 - 64 - 1 3 4 8

Total 4 11 15 10 40
In a bivariate distribution, if one of the variables is ignored and the distribution
of other variable alone is considered, then it is called Marginal Frequency
Distribution
Marginal Frequency Distribution Marginal Frequency Distribution
of Weight of Height
Weight (Kgs.) Number of Students Height(cms) Number of Students

140 - 150 4
40 - 44 3

45 - 49 9 150 - 160 11

50 - 54 9 160 - 170 15
55 - 59 11
170 - 180 10
60 - 64 8
Total 40
Total 40
In a bivariate distribution, the distribution of one of the variables subject to
a condition imposed on the other variable is considered is called
Conditional Frequency Distribution.
Conditional Frequency Distribution of weight given that height is in between
150 and 160 cms.
Weight (Kgs.) Number of Students

40 - 44 1

45 - 49 4

50 - 54 3

55 - 59 2

60 - 64 1

Total 11
The following data give the total number of iPods sold by a mail order company on
each of 30 days:

8, 25, 11, 15, 29, 22, 10, 5, 17, 21, 22, 13, 26, 16, 18, 12, 9, 26, 20, 16, 23, 14, 19, 23,
20, 16, 27, 16, 21, 14
iPods Sold Tally Marks Frequency

Construct a frequency distribution table.


5 – 10
Here Min. Value is 5 and Max. value is 29
10 – 15

So 𝑅𝑎𝑛𝑔𝑒 = 29 − 5 = 24 15 – 20

Let 𝑤𝑖𝑑𝑡ℎ = 5 𝑢𝑛𝑖𝑡𝑠 20 – 25

25 – 30
The Frequency Distribution Table for the Data on iPods Sold
MEASURES
OF
CENTRAL TENDENCIES
CENTRAL TENDENCY:
In a frequency distribution the values cluster around a central
values. This property of concentration of the values around a central
value is called central tendency.

The measure of Central Tendency is also known as the measure of


Location or averages.

 It is a single value which represents the entire data.

 It facilitates comparison between two or more data set


Types of Central Tendency:

 Arithmetic Mean (AM)

 Median

 Mode
Arithmetic Mean (A.M.):
The ‘Arithmetic Mean’ (or simply ‘mean’) represented by
𝑥ഥ is a most common measure of central tendency. The mean is a
common measure in which all the values play an equal role and is
used to obtain summary measure.

The A.M of a set of observations is the quotient that results


when the sum of these observations is divided by the number of
observations.
Arithmetic Mean (A.M) for Individual Series (raw data)

Let 𝑥1 , 𝑥2 , 𝑥3 ,…𝑥𝑛 are n variates or observations. Then


the Arithmetic Mean is denoted and defined as
𝑥1 +𝑥2 +…+𝑥𝑛
𝑥ҧ =
𝑛

σ 𝑥𝑖
⟹ 𝑥ҧ =
𝑛

Where σ 𝑥𝑖 is the sum of the observations.


Problems:
1. Calculate the arithmetic mean for the following data:
Marks 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 – 80
Marks Mid Point No. of 𝑓𝑥
(Class 𝑥 Students
No of
Interval) (𝑓)
Student 5 10 25 30 20 10 5 5
s 0 – 10 5 5 25

10 – 20 15 10 150
Solution:
20 – 30 25 25 625

σ 𝑓𝑖 𝑥𝑖 30 – 40 35 30 1050
𝑥ҧ =
𝑁 40 – 50 45 20 900

50 – 60 55 10 550
4000
⟹ 𝑥ҧ = 60 – 70 65 5 325
110
70 – 80 75 5 375
⟹ 𝑥ҧ = 36.36 marks Total N = 110 4000
2. The table below show the age of 55 patients selected to study the effectiveness of a particular
medicine. Calculate the mean age of the patients.

Age 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70

No of Age Mid Point No. of 𝑓𝑥


Patients 5 7 17 12 5 2 7 (Class 𝑥 Patients
Interval) (𝑓)

Solution: 0 – 10 5 5 25

10 – 20 15 7 105
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ = 20 – 30
𝑁 25 17 425

1765 30 – 40 35 12 420
⟹ 𝑥ҧ =
55 40 – 50 45 5 225

⟹ 𝑥ҧ = 32.09 50 – 60 55 2 110

60 – 70 65 7 455

Total N = 55 1765
3. Calculate the arithmetic mean for the following data:
Temperature (°𝑪) −𝟒𝟎 − (−𝟑𝟎) −𝟑𝟎 − (−𝟐𝟎) −𝟐𝟎 − (−𝟏𝟎) −𝟏𝟎 − 𝟎 0 – 10 10 – 20 20 – 30

No of Days 10 28 30 42 65 180 10

Temperature Mid No. of Patients 𝑓𝑥


(°𝑪) Point (𝑓)
Solution: (Class Interval) 𝑥
−𝟒𝟎 − (−𝟑𝟎) −𝟑5 10 −𝟑50

σ 𝑓𝑖 𝑥𝑖 −𝟑𝟎 − (−𝟐𝟎) −𝟐5 28 −𝟕𝟎𝟎


𝑥ҧ =
𝑁
−𝟐𝟎 − (−𝟏𝟎) −𝟏5 30 −𝟒50

1565
⟹ 𝑥ҧ = −𝟏𝟎 − 𝟎 −5 42 −𝟐𝟏𝟎
365
0 – 10 5 65 325
⟹ 𝑥ҧ = 4.29 °𝐶
10 – 20 15 180 2700

20 – 30 25 10 250

Total N = 365 1565


4. Calculate the arithmetic mean for the following data:
Marks More More than More than More than More than More than More than More than
than 0 10 20 30 40 50 60 70
No of
Students 150 140 100 80 80 70 30 14

Marks Mid Point No. of Students 𝑓𝑥


Solution: (Class Interval) 𝑥 (𝑓)

0 – 10 5 150 – 140 = 10 50
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ = 10 – 20 15 140 – 100 = 40 600
𝑁
20 – 30 25 100 – 80 = 20 500
5890
⟹ 𝑥ҧ = 30 – 40 35 80 – 80 = 0 0
150
40 – 50 45 80 – 70 = 10 450
⟹ 𝑥ҧ = 39.266
50 – 60 55 70 – 30 = 40 2200

Thus the mean mark of the 60 – 70 65 30 – 14 = 16 1040

70 – 80 75 14 – 0 = 14 1050
continuous series is 39.27
Total N = 150 5890
5. The following table shows the age of workers in a factory. Find out the average age
of workers.
Age 20 – 29 30 – 39 40 – 49 50 – 59 60 – 69

No of
Workers 10 8 6 4 2
Age Mid Point No. of 𝑓𝑥
(Class 𝑥 Workers (𝑓)
Solution: Interval)

20 – 29 24.5 10 245
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ = 30 – 39 34.5 8 276
𝑁

1135 40 – 49 44.5 6 267


⟹ 𝑥ҧ =
30
50 – 59 54.4 4 218
⟹ 𝑥ҧ = 37.83
60 – 69 64.5 2 129

Average age of workers is 37.83 years Total 30 1135


6. Find the missing frequency of the following data if the arithmetic mean is

25.4 Class Interval 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60

Frequency 20 15 10 ? 2

Solution: Class Mid Point Frequency 𝑓𝑥


Interval 𝑥 (𝑓)

Let 𝑚 be the unknown 10 – 20 20

20 – 30 15
frequency 30 – 40 10

40 – 50 𝑚
Given 𝑥ҧ = 25.4
50 – 60 2

Total
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ =
𝑁 Class Mid Point Frequency 𝑓𝑥
Interval 𝑥 (𝑓)

1135+45𝑚
⟹ 25.4 = 10 – 20 15 20 300
47+𝑚
20 – 30 25 15 375
⟹ 25.4 47 + 𝑚 = 1135 + 45𝑚
30 – 40 35 10 350
⟹ 1193.8 + 25.4𝑚 = 1135 + 45𝑚
40 – 50 45 𝑚 45𝑚

⟹ 19.6 𝑚 = 58.8 50 – 60 55 2 110

⟹ 𝑚=3 Total 47 + 𝑚 1135 + 45𝑚


7. Find the missing frequencies 𝑚 and 𝑛 in the table given below, it is being
given that the mean of the given frequency distribution is 50.
Class 0 – 20 20 – 40 40 – 60 60 – 80 80 – 100 Total

Frequency 17 𝑚 32 𝑛 19 120

Class Mid Point Frequency 𝑓𝑥


Solution: Interval 𝑥 (𝑓)

0 – 20 17

Given 𝑥ҧ = 50 20 – 40 𝑚

40 – 60 32
We have
60 – 80 𝑛

80 – 100 19
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ = Total 120
𝑁
Class Mid Point Frequency (𝑓) 𝑓𝑥
3480+ 30𝑚 + 70𝑛
⟹ 50 = Interval 𝑥
68 + 𝑚 + 𝑛
⟹ 50(68 + 𝑚 + 𝑛) = 3480+ 30𝑚 + 70𝑛 0 – 20 10 17 170

⟹ 3400 + 50𝑚 + 50𝑛 = 3480 + 30𝑚 + 70𝑛


20 – 40 30 𝑚 30 𝑚
⟹ 20𝑚 − 20𝑛 = 80

⟹ 𝑚 − 𝑛 = 4 … … (1) 40 – 60 50 32 1600

Also, 68+𝑚+𝑛=120
60 – 80 70 𝑛 70 𝑛
⟹ 𝑚 + 𝑛 = 52 … … (2)

Solving (1) and (2), we get 80 – 100 90 19 1710

𝑚 = 28
Total 68 + 𝑚 + 𝑛 = 120 3480 + 30𝑚 + 70𝑛
𝑛 = 24
8. The average age of 20 students in a class is 16 years. One student whose age is 18 years has left the class.
Find out average age of rest of the students.

Solution:

Given, 𝑥ҧ = 16 and 𝑁 = 20

σ 𝑥𝑖
We have 𝑥ҧ =
𝑁

σ 𝑥𝑖
⟹ 16 =
20

⟹ σ 𝑥𝑖 = 320

⟹ Total age of 20 students = 320

Total age of 19 students = 320 − 18 = 302

302
∴ average age of 19 students = = 15.89 years
19
9. The mean marks got by 300 students in statistics was 45. The mean of the top 100 of them was found to be 70 and the
mean of the last 100 was known to be 20 . Find the mean of the remaining 100 students.

Solution:

σ 𝑥𝑖
We have 𝑥ҧ = 𝑁

Mean mark of 300 students in statistics, 𝑥ҧ = 45

Total marks of 300 students = 45 × 300 = 13500

Mean mark of top 100 students in statistics, 𝑥ҧ = 70

Total marks of top 100 students = 70 × 100 = 7000

Mean mark of last 100 students in statistics, 𝑥ҧ = 20

Total marks of last 100 students = 20 × 100 = 2000

Total marks of remaining 100 students = 13500 − (7000 + 2000) = 4500

4500
∴ Mean mark of remaining 100 students = = 45
100
10. The mean of a group of 100 observations is known to be 50. Later it was discovered that
two observations were misread as 92 and 8 instead of 192 and 88. Find the correct mean.

Solution:

Given, 𝑥ҧ = 50 and 𝑁 = 100

σ 𝑥𝑖
We have 𝑥ҧ =
𝑁

Sum of 100 observations = 50 × 100 = 5000


Given that the observations 192 and 88 were misread as 92 and 8.
∴ the corrected sum = 5000 + 192 + 88 − 92 − 8 = 5180

5180
∴ Corrected mean = = 51.80
100
Weighted Arithmetic Mean:
Usually in computing Arithmetic Mean, equal importance is given to
all the observations of the data. However there are cases where all the items
are not of equal importance. In other words some items of a series are more
important as compared to the other items in the same series.

Def: Weighted mean is the mean of a set of values wherein each value or
measurement has a different weight or degree of importance. The following
is its formula:
σ 𝑥𝑊
𝑥ҧ = σ𝑊

Where, 𝑥ҧ is mean, W=number of measurements


1. A student’s final scores in Mathematics, Physics, Chemistry and English are respectively 82, 86, 90 and 70.
If the respective credits received for these courses are 3, 5, 3, and 1, determine the average score.

Solution:

Here the weights associated to the observations 82, 86, 90 and 70 are 3, 5, 3 and 1

𝑥 : 82 86 90 70

𝑤:3 5 3 1

σ 𝑥𝑊
Average Score, 𝑥ҧ = σ𝑊

(82) (3)+(86)(5)+(90)(3)+(70)(1)
⟹ 𝑥ҧ =
3+5+3+1

246+430+270+70
⟹ 𝑥ҧ =
12

1016
⟹ 𝑥ҧ =
12

⟹ 𝑥ҧ = 84.67
2. A contractor employs three types of workers, male, female and children. To a male he
pays Rs.40 per days. To a female worker Rs.32 per day and to a child worker Rs.15 per
day. The number of male, female and children workers employed is 20, 15 and 15
respectively. Find out the average amount.

Solution:
Daily Wages in No: of Workers (𝑊) 𝑥𝑾
Rs. (𝑥)
Calculation of Weighted A.M:
40 20 800
σ 𝑥𝑊
𝑥ҧ = σ𝑊 32 15 480

1505 15 15 225
⟹ 𝑥ҧ =
50

Total 50 1505
⟹ 𝑥ҧ = 30.10 Rs.
Combined Arithmetic Mean:
If we have arithmetic mean, and the number of items of two or
more than two related groups , we can calculate the combined average of
these groups by applying the formula.

𝑛1 𝑥ҧ 1 +𝑛2 𝑥ҧ 2 +⋯+𝑛𝑛 𝑥ҧ 𝑛
𝑥ҧ 1,2…,𝑛 =
𝑛1 +𝑛2 +⋯+𝑛𝑛

Where 𝑥ҧ 1 = A.M of the first group, 𝑥ҧ 2 = A.M of the second group, …

𝑛1 = Number of items in the first group,

𝑛2 = Number of items in the second group, …


Example:
1. The arithmetic mean age of the first group of 80 boys is 10 years, and that of the second
group of 20 boys is 15 years. Find the arithmetic mean of the two groups taken together.

Solution:
Given 𝑛1 =80, 𝑥ҧ 1 =10, 𝑛2 =20, 𝑥ҧ 2 =15
𝑛1 𝑥ҧ 1 +𝑛2 𝑥ҧ 2
We have, 𝑥ҧ 1,2 =
𝑛1 +𝑛2

80)(10 + 20)(15
⟹ 𝑥ҧ 1,2 =
80+20

800+300
⟹ 𝑥ҧ 1,2 =
100

1100
⟹ 𝑥ҧ 1,2 = = 11 years
100

.
2. The mean marks got by 300 students in the subject of statistics was 45. The mean of the top 100 of them
was found to be 70 and the mean of the last 100 was known to be 20. What is the mean marks of the
remaining students?

Solution:

Given 𝑛1 =100, 𝑥ҧ 1 =70, 𝑛2 =100, 𝑥ҧ 2 = 𝑥, 𝑛3 =100, 𝑥ҧ 3 = 20,

𝑥ҧ 1,2,3 = 45, σ 𝑛𝑖 = 300

𝑛1 𝑥ҧ 1 + 𝑛2 𝑥ҧ 2 + 𝑛3 𝑥ҧ 3
We have, 𝑥ҧ 1,2,3 =
𝑛1 + 𝑛2 + 𝑛2

100)(70 + 100)(𝑥 + (100)(20)


⟹ 45=
300

⟹ 13500=7000 + 100 𝑥 + 2000

⟹ 13500 = 9000 + 100 𝑥

⟹ 100 𝑥 = 4500

⟹ 𝑥 = 45
Merits of Arithmetic Mean:
1. It is rigidly defined.
2. It is easy to understand and easy to calculate.
3. It is based upon all the observations.
4. It is capable of further statistical analysis.
5. It is more stable than any other average.
Demerits of Arithmetic Mean:
1. It cannot be determined by inspection nor it can be located graphically.
2. Arithmetic mean cannot be used if we are dealing with qualitative
characteristics(such as, intelligence, honesty, beauty, etc.)
3. Arithmetic mean cannot be obtained if a single observation is missing
or lost or is illegible unless we drop it out and compute the arithmetic
mean of the remaining values.
4. Arithmetic mean is affected very much by extreme values.
MEDIAN:
Median is known as the positional average. Median divides a series
of ordered observations (ascending or descending) so that the number of
observations above is equal to the number of observations below it. The
median is used when an exact midpoint is desired.
• When the number of items in the series is odd then the median is the
mid-point of the ordered values.
• When the number of items in the series is even then the median is
average of two middle numbers of the ordered values.
Example

Odd number of items : 4, 5, 3, 8, 6, 10, 8


Ascending order: 3, 4, 5, 6, 8, 8, 10
Median = 6
Even number of items: 4, 5, 3, 8, 6, 8
Ascending order: 3, 4, 5, 6, 8, 8
5+6
Median = =5.5
2
Median in Individual Series

Let N be the number of values of a variate (i.e. total of all


frequencies). First of all we write the values of the variate (i.e., the
terms) in ascending or descending order of magnitudes.
𝑁+1 th
Case 1: If N is odd then value of 2
term gives the median.

Case 2: If N is even then there are two central terms and the mean of
these two values gives the median.
Problems:
1. Obtain the value of median from the following data: 273, 115, 83, 415, 565, 382, 31, 45, 182.

Solution:
Arranging the data in the ascending order: 31, 45, 83, 115, 182, 273, 382, 415, 565.

Here 𝑁 = 9, odd

𝑁+1 th
∴ Median = term
2

9+1 th
= term
2

= 5th term

= 182
2. Find the value of the median from the following 8, 20, 50, 25, 15, 30

Solution:
Arranging the data in the ascending order : 8, 15, 20, 25, 30, 50

Here 𝑁 = 6, even

20+25
∴ Median =
2

45
=
2

= 22.5
3. Rahna’s maths quiz scores in 9 competitions were 88, 97, 87, 92, 90, 88, 93, 98, 95.
What was her median quiz score ?

Solution:
Arranging the data in the ascending order : 87, 88, 88, 90, 92, 93, 95, 97, 98

Here 𝑁 = 9, odd

𝑁+1 th
∴ Median = term
2

9+1 th
= term = 5th term
2

= 92

∴ Median quiz score is 92


4. Anand’s family plans a trip from Thiruvananthapuram to Wayanad on their summer
vacation. They drove through 8 districts. The following are the petrol prices in the 8
districts on those days; Rs.71.9, Rs.72.3, Rs.72.4, Rs.72.32, Rs.73, Rs.73.1, Rs.72.2
and Rs.72.48 What is the median petrol price ?

Solution:
Arranging the data in the ascending order :

71.9, 72.2, 72.3, 72.32, 72.4, 72.48, 73, 73.1

Here 𝑁 = 8, even

72.32 + 72.4 144.72


∴ Median = = = 72.36
2 2

∴ Median petrol price is Rs.72.76


Median in Discrete Series

1.Arrange the data in the ascending or descending order of


magnitude.
2. Find out the cumulative frequency for each observation.
3. Find 𝑁 = σ 𝑓, the total number of observations.
4. The median is the observation for which the cumulative frequency is
𝑁
just greater than or equal to
2
Example:
1. From the following find out the value of median.

Wages in Rs.: 50 100 150 175 200 250

No. of Persons: 16 24 26 30 20 6

Solution: Wages in Rs. No. of Persons Cumulative Frequency

Here N=122
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟 50 16 16
𝑁
𝑤ℎ𝑖𝑐ℎ 𝑐𝑓 𝑖𝑠 ≥ 100 24 40
2
150 26 66
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟
122 175 30 96
𝑤ℎ𝑖𝑐ℎ 𝑐𝑓 𝑖𝑠 ≥
2
200 20 116
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟
250 6 122
𝑤ℎ𝑖𝑐ℎ 𝑐𝑓 𝑖𝑠 ≥ 61
𝑁 = 122
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 150
2. Calculate the value of median from the following data:
Income
1,200 1,800 5,000 2,500 3,000 1,600 3,500
(₹)
No. of
12 16 2 10 3 15 7
Persons

Income (₹) No. of Persons Cumulative Frequency


Solution:
Here 𝑁 = 65 1200 12 12

1600 15 27
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟
1800 16 43
𝑁
𝑤ℎ𝑖𝑐ℎ 𝑐𝑓 𝑖𝑠 ≥ 2500 10 53
2
3000 3 56
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟
3500 7 63
65
𝑤ℎ𝑖𝑐ℎ 𝑐𝑓 𝑖𝑠 ≥ 5000 2 65
2
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 1800 𝑁 = 65
Median in Continuous Series
1. Find the cumulative frequency for each class.
𝑁
2. Find the median class. It is the class interval for which cumulative frequency is ≥ .
2
3. To calculate median , the following formula is used
𝒉 𝑵
𝑴𝒆𝒅𝒊𝒂𝒏 = 𝑳 + −𝒄
𝒇 𝟐

where 𝑁 = σ 𝑓,
𝐿 = lower limit of median class,
h = width of the median class,
𝑓 = frequency of the median class,
𝑐 = cumulative frequency of the class preceeding the median class
Problems:
1. Calculate median from the following information.
Marks: 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50

No. of Students: 44 60 36 44 8

Solution:
Marks No. of Students Cumulative Frequency

𝑁 192
Here = = 96𝑡ℎ 𝑡𝑒𝑟𝑚
2 2
0 – 10 44 44

The cumulative frequency value 10 – 20 60 104

20 – 30 36 140
96 falls in the class interval 10 – 20.
30 – 40 44 184
Therefore 10 – 20 is the median 40 – 50 8 192

𝑁 = 192
class.
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐 Marks No. of Cumulative Frequency
𝑓 2
Students

10
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 10 + 96 − 44
60 0 – 10 44 44

52 10 – 20 60 104
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 10 +
6
20 – 30 36 140
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 10 + 8.67
30 – 40 44 184

⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 18.67 marks 40 – 50 8 192

𝑁 = 192
2. Calculate median from the following data:
Group 60 – 64 65 – 69 70 – 74 75 – 79 80 – 84 85 – 89

Frequency 1 5 9 12 7 2

Solution:
Group Frequency Adjusted Class Cumulative
Interval Frequency
𝑁 36
Here = = 18𝑡ℎ 𝑡𝑒𝑟𝑚
2 2 60 – 64 1 59.5 – 64.5 1

65 – 69 5 64.5 – 69.5 6
The cumulative frequency value
70 – 74 9 69.5 – 74.5 15
18 falls in the class interval
75 – 79 12 74.5 – 79.5 27

74.5 – 79.5 80 – 84 7 79.5 – 84.5 34

85 – 89 2 84.5 – 89.5 36
∴ 74.5 – 79.5 is the median class.
ℎ 𝑁 Group Frequency Adjusted Class Cumulative
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐 Interval Frequency
𝑓 2

5
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 74.5 + 18 − 15 60 – 64 1 59.5 – 64.5 1
12

5 65 – 69 5 64.5 – 69.5 6
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 74.5 +
4
70 – 74 9 69.5 – 74.5 15
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 74.5 + 1.25
75 – 79 12 74.5 – 79.5 27

⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 75.75 80 – 84 7 79.5 – 84.5 34

85 – 89 2 84.5 – 89.5 36
3. From the following data find out missing frequency if the median is 50.
Marks: 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70

No. of 2 8 6 ? 15 10
Students:

Solution:
Marks No. of Students Cumulative Frequency

Let 𝑚 be the missing frequency. 10 – 20 2 2

20 – 30 8 10
Median = 50 30 – 40 6 16

40 – 50 𝑚 16 + 𝑚
∴ Median Class = 50 – 60
50 – 60 15 31 + 𝑚

60 – 70 10 41 + 𝑚
ℎ 𝑁 Marks No. of Students Cumulative Frequency
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐
𝑓 2

10 – 20 2 2
10 41+𝑚
⟹ 50 = 50 + − 16 + 𝑚
15 2 20 – 30 8 10

10 41 + 𝑚−32−2𝑚
⟹ 0= ( ) 30 – 40 6 16
15 2

9−𝑚 40 – 50 𝑚 16 + 𝑚
⟹ 0=
2
50 – 60 15 31 + 𝑚
⟹ 9 − 𝑚 =0
60 – 70 10 41 + 𝑚
⟹ 𝑚=9
4. Calculate the missing frequency from the following distribution, it is
being given that median of the distribution is 24.
Age in years 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50
No. of persons 5 25 ? 18 7

Solution:
Age in No. of Persons Cumulative Frequency
years
Let 𝑚 be the missing frequency. 0 – 10 5 5

10 – 20 25 30
Median = 24
20 – 30 𝑚 30 + 𝑚

∴ Median Class = 20 – 30 30 – 40 18 48 + 𝑚

40 – 50 7 55 + 𝑚
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐 Age in No. of Persons Cumulative Frequency
𝑓 2 years

10 55+𝑚 0 – 10
⟹ 24 = 20 + − 30 5 5
𝑚 2

10 55 + 𝑚 − 60 10 – 20 25 30
⟹ 4=
𝑚 2
20 – 30 𝑚 30 + 𝑚
⟹ 8𝑚 = 10 𝑚 − 5

⟹ 8𝑚 = 10𝑚 − 50 30 – 40 18 48 + 𝑚

⟹ 2m = 50
40 – 50 7 55 + 𝑚

⟹ m = 25
5. The median of the following frequency distribution is 56. Find the
missing frequencies.
Class Interval 0 – 20 20 – 40 40 – 60 60 – 80 80 – 100 100 – 120 Total

Frequency 16 𝒎 25 16 𝒏 10 90

Solution: Class Frequency Cumulative Frequency


Interval
0 – 20 16 16
Median = 56
20 – 40 𝑚 16 + 𝑚

∴ Median Class = 40 – 60 40 – 60 25 41 + 𝑚

60 – 80 16 57 + 𝑚

80 – 100 𝑛 57 + 𝑚 + 𝑛

100 – 120 10 67 + 𝑚 + 𝑛
ℎ 𝑁 Class Frequency Cumulative Frequency
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐
𝑓 2 Interval
20 67+𝑚+𝑛
⟹ 56 = 40 + − 16 + 𝑚 0 – 20 16 16
25 2

4 67 + 𝑚 + 𝑛 − 32 − 2𝑚
⟹ 16 = 20 – 40 𝑚 16 + 𝑚
5 2

⟹ 40 = 35 − 𝑚 + 𝑛 40 – 60 25 41 + 𝑚

⟹ 𝑛 − 𝑚 = 5 … … (1)
60 – 80 16 57 + 𝑚
Also 67 + 𝑚 + 𝑛 = 90
80 – 100 𝑛 57 + 𝑚 + 𝑛
⟹ 𝑚 + 𝑛 = 23 … … (2)

Solving (1) and (2), we get 100 – 120 10 67 + 𝑚 + 𝑛

𝑚=9
𝑛 = 14
6. Find the missing frequencies in the following distribution. It is given that the
median of the distribution is 41 and the total number of observations is 82.
Class Interval 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 Total

Frequency 10 𝒎 15 20 𝒏 11 82

Solution: Class Frequency Cumulative Frequency


Interval
10 – 20 10 10
Median = 41
20 – 30 𝑚 10 + 𝑚

∴ Median Class = 40 – 50 30 – 40 15 25 + 𝑚

40 – 50 20 45 + 𝑚

50 – 60 𝑛 45 + 𝑚 + 𝑛

60 – 70 11 56 + 𝑚 + 𝑛
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐 Class Frequency Cumulative Frequency
𝑓 2 Interval
10 56+𝑚+𝑛
⟹ 41 = 40 + − 25 + 𝑚 10 – 20 10 10
20 2

1 56 + 𝑚 + 𝑛 −50 − 2𝑚
⟹ 1= 20 – 30 𝑚 10 + 𝑚
2 2

⟹ 4=6−𝑚+𝑛 30 – 40 15 25 + 𝑚
⟹ 𝑚 − 𝑛 = 2 … … (1)
40 – 50 20 45 + 𝑚
Also 56 + 𝑚 + 𝑛 = 82

⟹ 𝑚 + 𝑛 = 26 … … (2) 50 – 60 𝑛 45 + 𝑚 + 𝑛

Solving (1) and (2), we get


60 – 70 11 56 + 𝑚 + 𝑛
𝑚 = 14
𝑛 = 12
Locate Median Graphically:
Ogives are used to locate the Partition values (Median) of the distribution.
Ogives are also known as frequency distribution curve. It is the curve obtained
by plotting cumulative frequency. There are two types of ogives:
• Less than ogive:
↠ Plot the points with upper limits of the classes as abscissae
(x – axis) and the corresponding less than cumulative frequency
as ordinate (y – axis)
↠ Join the points by a free hand smooth curve to get the less than
ogive. It is a rising curve.
• Greater than ogive:
↠ Plot the points with lower limits of the classes as
abscissae (x – axis) and the corresponding more than
cumulative frequency as ordinate (y – axis)
↠ Join the points by a free hand smooth curve to get the
more than ogive. It is a falling curve.
Problems:

1. Determine the value of median from the following data with the help of
'less than' and 'More than' Ogive Method. (2021)

Profits (in 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 – 80 80 – 90 90 – 100


Lakhs):
No. of 6 8 12 18 25 16 8 5 2
Shops:
Profit No. of Upper Less Lower Greater
Shops Limit than Limit than
Ogive Ogive

10 – 20 6 20 8 10 100

20 – 30 8 30 14 20 94

30 – 40 12 40 26 30 86

40 – 50 18 50 44 40 74

50 – 60 25 60 69 50 56

60 – 70 16 70 85 60 31

70 – 80 8 80 93 70 15

80 – 90 5 90 98 80 7

90 – 2 100 100 90 2
100
2. Determine the value of median from the following data with the help of
'less than' and 'More than' Ogive Method.

Marks: 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60

No. of 10 15 25 30 10 10
Students:
Marks No. of Upper Less Lowe Greater
Students Limit than r than
Ogive Limit Ogive

0 – 10 10 10 10 0 100

10 – 20 15 20 25 10 90

20 – 30 25 30 50 20 75

30 – 40 30 40 80 30 50

40 – 50 10 50 90 40 20

50 – 60 10 60 100 50 10
3. From the following data prepare frequency distribution table with class
intervals 2 – 2.5, 2.5 – 3 , 3 – 3.5, 3.5 – 4, 4 – 4.5, 4.5 – 5. Also locate the
median graphically.

3.7, 3.4, 4.1, 4.0, 3.7, 4.7, 3.3, 2.4, 3.1, 4.2, 3.8, 3.6, 4.2, 4.3, 2.9, 3.6,
3.3, 4.8, 4.0, 3.9, 3.5, 3.8, 3.8, 4.2, 3.9, 4.9, 3.2, 4.0, 3.8, 3.2, 2.7, 3.4,
3.3, 3.0, 3.1, 3.5, 3.7, 3.9, 4.3, 3.8, 3.7, 3.0, 4.4, 4.1, 3.6, 3.7, 3.3, 3.5,
3.7, 3.0, 2.9, 3.1, 3.3, 4.2.
Class Frequ Upper Less Lowe Greater
Interv ency Limit than r than
al Ogive Limit Ogive

2 – 2.5 1 2.5 1 2 54

2.5 – 3 3 3 4 2.5 53

3 – 3.5 15 3.5 19 3 50

3.5 – 4 20 4 39 3.5 35

4 – 4.5 12 4.5 51 4 15

4.5 – 5 3 5 54 4.5 3
𝑁 54
Here = = 27𝑡ℎ 𝑡𝑒𝑟𝑚
2 2

The cumulative frequency value 27 falls in the class interval 3.5 – 4

∴ 3.5 – 4 is the median class.

ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + −𝑐
𝑓 2

0.5
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 3.5 + 27 − 19
20

4
⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 3.5 +
20

⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 3.5 + 0.2

⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 3.7
Merits and Demerits of Median:

 Merits of Median:

(i) It is rigidly defined.

(ii) It is easily understood and is easy to calculate. In some

cases it can be located merely by inspection.

(iii) It is not at all affected by extreme values


Merits and Demerits of Median:

 Demerits of Median:

(i) In case of even number of observations median cannot be

determined exactly. We merely estimate it by taking the mean of

two middle terms.

(ii) For calculating median it is necessary to arrange the data,

whereas other averages do not need any arrangement.

(iii) It is not based on all the observations.


MODE:
In everyday life we often apply the concept of majority. Mode
is the measure which express the concept of majority.

Mode (𝑍) of a set of numbers is the value that occurs with


greatest frequency. For a raw data, mode is the value of the variate
which appears maximum number of times. The Mode is used when a
quick and approximate measure of the central tendency is desired.
‘The value of the variable which occurs most frequently in the
distribution is called the mode.’
Unimodal Data

A series of numbers is known to be unimodal if it


has only one Mode.

Example: 3, 3, 4, 5, 5, 5, 7

Mode = 5
Bimodal Data
A series of numbers is known to be bimodal if it has two
Modes.
Example: 105, 105, 105, 107, 108, 109, 109, 109, 110, 112

It has two Modes 105 and 109.


Multimodal Data
A series of numbers is known to be multimodal if it has more
than two Modes.
Example: 105, 105, 105, 107, 108, 109, 109, 109, 110, 112, 115,
115, 115.

It has three Modes 105, 109, 115.


Mode in Individual Series
For Individual Series, the value of mode is the item with the highest
frequency.
Example 1: Find the mode from the following data:
10, 20, 10, 30, 20, 10, 40, 40, 20, 20, 30, 10, 40, 10, 50.
Solution:
Here the observation 10 appears five times, 20 appears four times, 40
appears three times, 30 appears two times and 50 appear in a single time. So
the value which appears a maximum number of times is 10.
∴ Mode = 10
Example 2:

The ages of six persons who participated in an interview were 20, 21, 21,
24, 25, 24, 21 and 27 years. Find the mode of the data.

Solution:

Here the observation 21 appears three times, 24 appears two times


and all the others appear in a single time. So the value which appears a
maximum number of times is 21.

∴ Mode = 21 years.
Example 3:

Mr. Vijaya Kumar, the physical education teacher of a school is trying to


determine the average height of students in the cricket team of the school.
The height of the players in inches are 70, 72, 72, 74, 74, 74, 75, 76, 76, 76
and 77.Calculate the mode of the heights.

Solution:
Here the data has two values, 74 and 76, which appears 3 times.
All the other values appear less than 3 times. So the data set has two distinct
modes 74 and 76.
Mode in Discrete Series

The mode of a discrete frequency is the observation which appears a


maximum number of times. ie, the observation having the highest frequency.

Example: The following distribution shows the sizes of shirts sold on a textile
shop in Bangalore on a month. Calculate the mode.

Size (in inches) : 36 38 40 42 44

No of shirts sold : 15 22 31 30 20

Solution: In the given frequency distribution, the observation having the


maximum frequency is 40. So Mode is 40.
Mode in Continuous Series
As in the case of median, here also we have to locate a class called modal class. Modal class is
the class having highest frequency. Mode can be determined by the formula:

𝒇𝟏 − 𝒇𝟎
𝑴𝒐𝒅𝒆 = 𝑳 + 𝒉
𝟐𝒇𝟏 − 𝒇𝟎 − 𝒇𝟐

where,
𝐿 = lower limit of modal class,
𝑓1 = frequency of the modal class,
𝑓0 = frequency of the class preceeding the modal class
𝑓2 = frequency of the class succeeding the modal class
h = width of the modal class,
Problems:
1. Compute the mode of the following distribution.
Class: 0–7 7 – 14 14 – 21 21 – 28 28 – 35 35 – 42 42 – 49

Frequency: 19 25 36 72 51 43 28

Solution: Class Frequency

Here maximum frequency 72 0–7 19

7 – 14 25
lies in the class-interval 21 – 28.
14 – 21 36

Therefore 21 – 28 is the modal class. 21 – 28 72

28 – 35 51

35 – 42 73

42 – 49 28
𝑓1 − 𝑓0
𝑀𝑜𝑑𝑒 = 𝐿 + ℎ Class Frequency
2𝑓1 − 𝑓0 − 𝑓2

72 −36 0–7 19
⟹ 𝑀𝑜𝑑𝑒 = 21 + 7
2 72 −36 − 51
7 – 14 25

36
⟹ 𝑀𝑜𝑑𝑒 = 21 + 7 14 – 21 36
144 − 87
21 – 28 72
36
⟹ 𝑀𝑜𝑑𝑒 = 21 + 7 28 – 35
57 51

⟹ 𝑀𝑜𝑑𝑒 = 21 + 4.42 35 – 42 73

42 – 49 28
⟹ 𝑀𝑜𝑑𝑒 = 25.42
2. The production per day of a company (in Tons) on 60 days are given

below. Calculate the mode.

Production per day: 21 – 22 23 – 24 25 – 26 27 – 28 29 – 30

Number of days: 7 13 22 10 8

Solution:

Here the classes are of inclusive type. We have to convert it into


exclusive class before determining the mode.
Class Adjusted Frequency
Class
Interval
Here maximum frequency 22
21 – 22 20.5 – 22.5 7
lies in the class-interval 24.5 – 26.5
23 – 24 22.5 – 24.5 13
Therefore 24.5 – 26.5 is the modal class.
25 – 26 24.5 – 26.5 22

27 – 28 26.5 – 28.5 10

29 – 30 28.5 – 30.5 8
𝑓1 − 𝑓0
Mode = 𝐿 + ℎ Class Adjusted Frequency
2𝑓1 − 𝑓0 − 𝑓2
Class
Interval
22 −13
⟹ Mode = 24.5 + 2 21 – 22 20.5 – 22.5 7
2 22 −13 − 10

9 23 – 24 22.5 – 24.5 13
⟹ Mode = 24.5 + 2
44 −23
25 – 26 24.5 – 26.5 22
9
⟹ Mode = 24.5 + 2
21
27 – 28 26.5 – 28.5 10

⟹ Mode = 24.5 + 0.857


29 – 30 28.5 – 30.5 8

⟹ Mode = 25.36 tons


3. Calculate the missing frequency from the following data, if mode is
given as 26.
Marks: 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50

No. of Students: 3 5 8 ? 3

Marks No. of Students


Solution:
0 – 10 3
Mode = 26 10 – 20 5
20 – 30 8
∴ Modal Class = 20 – 30 30 – 40 𝑚

40 – 50 3
𝑓1 − 𝑓0 Marks No. of Students
Mode = 𝐿 + ℎ
2𝑓1 − 𝑓0 − 𝑓2

0 – 10 3
8 −5
⟹ 26 = 20 + 10
2 8 −5−𝑚 10 – 20 5

3 20 – 30 8
⟹ 6= 10
11 − 𝑚
30 – 40 𝑚
⟹ 11 − 𝑚 = 5
40 – 50 3
⟹ 𝑚=6
4. Calculate the missing frequencies from the following data, if mode is
given as 24.
Class Interval: 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 Total

Frequency: 14 𝒎 27 𝒏 15 100

Marks No. of Students


Solution:
0 – 10 14
Mode = 24 10 – 20 𝑚
20 – 30 27
∴ Modal Class = 20 – 30 30 – 40 𝑛
𝑓1 − 𝑓0 40 – 50
Mode = 𝐿 + ℎ 15
2𝑓1 − 𝑓0 − 𝑓2
27 − 𝑚
⟹ 24 = 20 + 10 Marks No. of Students Cumulative
2 27 − 𝑚 − 𝑛 Frequency
27 − 𝑚
⟹ 4= 10
54 − 𝑚 − 𝑛
0 – 10 14 14
⟹ 2 54 − 𝑚 − 𝑛 = 5(27 − 𝑚)
⟹ 108 − 2𝑚 − 2𝑛 = 135 − 5𝑚 10 – 20 𝑚 14 + 𝑚

⟹ 3𝑚 − 2𝑛 = 27
20 – 30 27 41 + 𝑚
Also 56 + 𝑚 + 𝑛 = 100
30 – 40 𝑛 41 + 𝑚 + 𝑛
⟹ 𝑚 + 𝑛 = 44
Solving (1) and (2), we get
40 – 50 15 56 + 𝑚 + 𝑛
𝑚 = 23
𝑛 = 21
Merits and Demerits of Mode:

 Merits of Mode:

(i) Mode is readily comprehensible and easy to calculate.

(ii) It is not at all affected by extreme values

 Demerits of Mode:

(i) Mode is ill-defined. It is not always possible to find a clearly defined mode.

(ii) It is not based upon all the observations.


Empirical relationship among Mean, Median and Mode:

 𝑴𝒐𝒅𝒆 = 𝟑 𝑴𝒆𝒅𝒊𝒂𝒏 − 𝟐 𝑴𝒆𝒂𝒏

The importance of this relation is that we can estimate the value of any

one of them by knowing the values of the other two. However, this relationship

is true if the distribution is moderately asymmetric. In the asymmetrical curve

the area on the left of mode is greater than area on the right then

Mean < median < mode

But for symmetrical distribution of mean, median and mode are identical.
Example:
1. If Z=400, and 𝑥=300
ҧ find median?

Solution:

We have 𝑴𝒐𝒅𝒆 = 𝟑 𝑴𝒆𝒅𝒊𝒂𝒏 − 𝟐 𝑴𝒆𝒂𝒏

⟹ 400 = 3𝑀𝑒𝑑𝑖𝑎𝑛 − 2(300)

⟹ 400 = 3𝑀𝑒𝑑𝑖𝑎𝑛 − 600

⟹ 3𝑀𝑒𝑑𝑖𝑎𝑛 = 1000

⟹ 𝑀𝑒𝑑𝑖𝑎𝑛 = 333.33
Example:
2. For a given data the mean= 36.67 and median = 39.23 . Hence find mode.
Solution:

We have 𝑴𝒐𝒅𝒆 = 𝟑 𝑴𝒆𝒅𝒊𝒂𝒏 − 𝟐 𝑴𝒆𝒂𝒏

⟹ 𝑀𝑜𝑑𝑒 = 3(39.23) − 2(36.67)

⟹ 𝑀𝑜𝑑𝑒 = 117.69 − 73.34

⟹ 𝑀𝑜𝑑𝑒 = 44.35
Locate Mode Graphically:

Mode of the distribution can be obtained from Histogram.

In histogram, the frequency distribution is represented by a set

of rectangular bars with area of the rectangles proportional to class

frequency.

Total area of the rectangles in a histogram represents the

frequency.
 Class Intervals of Equal Width:

The variables is taken along the x – axis and class frequency along y –axis.

With class intervals as bases, rectangles with height proportional to class frequency

are drawn.

 Class Intervals of Unequal Width:


The variables is taken along the x – axis and class frequency along y – axis. In

this case, the rectangles are drawn with height proportional to frequency density

(f/w), so that area is proportional to class frequency.


 Notes:

1. If the frequency distribution have inclusive class intervals, they should

be converted into exclusive class intervals and then the histogram

should be drawn.

2. The frequency polygon is obtained by joining the mid – points of the

upper sides of the rectangles


Mode from Histogram:
Step 1: Draw a histogram to the given data.

Step 2: Locate modal class (highest bar of histogram).

Step 3: Join diagonally the upper end points of the highest bar to the end points
of the adjacent bars.

Step 4: Mark the point of intersection of the diagonals.

Step 5: Draw perpendicular from this point of intersection to the 𝑥 − 𝑎𝑥𝑖𝑠.

Step 6: The point where the perpendicular meets the 𝑥 − 𝑎𝑥𝑖𝑠 gives the modal value.
Problems:

1. Locate Mode Graphically

Age 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60

No. of People 12 18 27 20 17 6
𝑓1 − 𝑓0
Mode = 𝐿 + ℎ Class Frequency
2𝑓1 − 𝑓0 − 𝑓2

27 −18 0 – 10 12
⟹ Mode = 20 + 10
2 27 −18−20
10 – 20 18
9
⟹ Mode = 200 + 10 20 – 30 27
54 −38

9 30 – 40 20
⟹ Mode = 20 + 10
16
40 – 50 17

⟹ Mode = 20 + 5.625
50 – 60 6

⟹ Mode = 25.63
Solution:

Mode = 25.6
Problems:

2. Locate Mode graphically for the following distribution

Class 10-19 20-29 30-39 40-49 50-59 60-69 70-79


Interval
Frequency 10 12 18 30 16 6 8
Solution: Class Adjusted Frequency
Class
𝑓1 − 𝑓0 Interval
Mode = 𝐿 + ℎ
2𝑓1 − 𝑓0 − 𝑓2 10 – 19 9.5 – 19.5 10

30 −18 20 – 29 19.5 – 29.5 12


⟹ Mode = 39.5 + 10
2 30 −18− 16
30 – 39 29.5 – 39.5 18
12
⟹ Mode = 39.5 + 10 40 – 49 39.5 – 49.5
60 −34 30

12 50 – 59 49.5 – 59.5 16
⟹ Mode = 39.5 + 10
26
60 – 69 59.5 – 69.5 6
⟹ Mode = 39.5 + 4.62
70 – 79 69.5 – 79.5 8
⟹ Mode = 44.12
Mode = 44.12
Problems:
3. The weights in grams of 50 apples picked at random from a market are as follows:

131, 113, 82, 75, 204, 81, 84, 118, 104, 110, 80, 107, 111, 141, 136, 123, 90,
78, 90, 115, 110, 98, 106, 99, 107, 84, 76, 186, 82, 100, 109, 128, 115, 107, 115,
119, 93, 187, 139, 129, 130, 68, 195, 123, 125, 111, 92, 86, 70, 126.

Form the grouped frequency table by dividing the variate range into intervals

of equal width, each corresponding to 20 gms in such a way that the mid-value

of the first class corresponding to 70 gms. Hence locate mode graphically.


Solution:
Given, Class Frequency

Mid-value of first class = 70,


60 – 80 5
Width of each class = 20 gms
𝑓1 − 𝑓0 80 – 100 13
Mode = 𝐿 + ℎ
2𝑓1 − 𝑓0 − 𝑓2 100 – 120 17
17 −13 120 – 140
⟹ Mode = 100 + 20 10
2 17 −13− 10
140 – 160 1
4
⟹ Mode = 100 + 20 160 – 180
34 −23 0
4 180 – 200
⟹ Mode = 100 + 20 3
11
200 – 220 1
⟹ Mode = 100 + 7.27
Total 50
⟹ Mode = 107.27
Mode = 107.27
Problems:

4. A psychologist estimates the I.Q. of 60 children. The values are as follows:


103, 98, 87, 85, 67, 96, 115, 109, 127, 103, 95, 123, 94, 88, 102, 76, 73, 80, 84, 102, 115, 93,
76, 81, 132, 90, 119, 84, 97, 120, 114, 101, 153, 98, 99, 105, 110, 107, 110, 128, 89, 112,
118, 101, 122, 146, 96, 109, 72, 97, 94, 94, 79, 79, 100, 54, 102, 89, 43, 111.

Form a frequency distribution having class intervals with width 15.

Hence locate mode graphically.


Solution:
Class Frequency
Given,
30 – 45 1
Width of each class = 15
45 – 60 1
𝑓1 − 𝑓0
Mode = 𝐿 + ℎ 60 – 75 3
2𝑓1 − 𝑓0 − 𝑓2
21 −13 75 – 90 13
⟹ Mode = 90 + 15
2 21 −13− 13 90 – 105 21
8
⟹ Mode = 90 + 15 105 – 120 13
42 −26
120 – 135 6
8
⟹ Mode = 90 + 15
16 135 – 150 1

⟹ Mode = 90 + 7.5 150 – 165 1

⟹ Mode = 97.5 Total 60


Mode = 97.5
Problems:

5. From the following data prepare frequency distribution table and hence
determine the mean , median and mode.
0, 1, 2, 3, 4, 5, 6, 6, 5, 4, 4, 5, 5, 4, 4, 3, 3, 3, 3, 2, 2, 2, 3, 2, 3, 2, 2, 2, 1, 1, 1, 0, 0, 1, 0.
Solution: 𝒙 𝒇 𝒇𝒙
σ 𝑓𝑖 𝑥𝑖
𝑥ҧ = 0 4 0
𝑁
94 1 5 5
⟹ 𝑥ҧ = 2 8 16
35
⟹ 𝑥ҧ = 2.686 3 7 21
4 5 20
⟹ 𝑀𝑒𝑎𝑛 = 2.686 5 4 20
Also, 𝑀𝑒𝑑𝑖𝑎𝑛 = 3 6 2 12

𝑀𝑜𝑑𝑒 = 2 Total 35 94

You might also like