02 Chapter 2 021-054
02 Chapter 2 021-054
My average GCSE
result was a grade C. On average, it takes
me 20 minutes to get
the bus to school.
An average of a set of data gives you a typical value for that set of data.
Averages can be useful when you want to compare two sets of data –
for example, to compare the wages of male and female employees
at a company.
There are three different averages that you need to be able to use.
1 Mode This is the most frequent item of data in a set.
2 Median Put all the items of data in order of size and find the middle value.
When there is an even number of data values there will
be two values in the middle.
You need to find the value halfway between these two
values by:
● adding them together ● dividing the answer by 2
3 Mean To calculate the mean of a set of data:
● add up all the numbers
● divide by how many numbers there are.
This is more formally written as:
(iii)13 is the only data value that appears more than once.
The mode is 13 minutes.
Simon has used the mean to work out his average journey time.
However, Simon took less than 20 minutes on most days to get to
school.
So the mean does not give a good typical value of the data set.
When most data values are close together and only one or two are
much higher or lower than the rest, we say the data is skewed.
When data is skewed, the mean does not give a good typical value.
In the above example the mode (13 minutes) doesn’t give a good
typical value either.
13 minutes was Simon’s shortest journey time and only happened
twice!
1 Find (i) the mean, (ii) the median, (iii) the mode of the following sets of data.
a) 2 2 2 3 4 7 8
b) 7 12 14 16 16
c) 7 12 12 13 13 13 14
2 Igor surveys his friends to find out about their favourite sport.
Here are his results.
football tennis hockey football
netball football rugby football
tennis netball football rugby
Which sport is the mode?
4 Find (i) the mean, (ii) the median, (iii) the mode of the following sets of data.
a) 2 4 3 3 7 5 1 0 5 3 0
b) 41 45 52 45 43 45 45 57 50
c) 19 17 12 14 18 18 12 18 17 18 13
5 Find (i) the mean, (ii) the median, (iii) the mode of the following sets of data.
a) 112 110 110 112 111 110
b) 52 49 51 46 51 47 48 51 55 57
c) 17 20 22 24 20 18 28 32
7 Joanne has recorded the temperature at midnight for ten nights during January.
Here are her results.
4 °C 3 °C 1 °C 1 °C 0 °C 0 °C 3 °C 0 °C 1 °C 1 °C
Find a) the mean temperature, b) the median temperature, c) the modal temperature.
2.2 Range
Here are the numbers of goals scored by two football teams over their last ten games:
Ready Rovers 0 3 2 1 0 6 1 2 1 1
Uptown United 2 2 3 3 1 2 1 2 0 1
0321061211 17
The mean for Ready Rovers is:
10 10
mean 1.7 goals
2233121201 17
The mean for Uptown United is:
10 10
mean 1.7 goals
Both teams have the same mean.
To compare their results we can work out the spread of the data.
One measure of spread is the range:
2 The data set below gives the total number of goals scored by each of 15 soccer clubs
during a season.
27 8 33 11 27 40 18 24 28 30 31 34 36 14 23
a) Work out the range of the data. b) Write down the mode.
c) Calculate the value of the mean. d) Find the value of the median.
a) Work out (i) the mean, (ii) the median, (iii) the range for Jay’s results.
b) Work out (i) the mean, (ii) the median, (iii) the range for Felicity’s results.
c) Who did better in their mock exams?
Give a reason for your answer.
23 0 14 28 27 41 22 6 11 18
45 25 15 0 13 29 12 21 40 20
SOLUTION
a) Begin by listing the stems:
0, 10, 20… are represented with stems of 0, 1, 2…
Continue all the way up to 4, as there is a ‘45’ in the data set.
Remember to leave space for a 0’s stem too.
0
1
2 This is the stem.
3
4
0 0
1 Key:
2 | 3 23 runs
2 3
This is a leaf.
3
4
The data is sorted into layers, called branches, but the numbers are still jumbled along
each branch – this is an unsorted stem and leaf diagram.
0 0 6 0
1 4 1 8 5 3 2 Key:
2 | 3 23 runs
2 3 8 7 2 5 9 1 0
3
This is a branch.
4 1 5 0
0 0 0 6 Key:
1 1 2 3 4 5 8 2 | 3 23 runs
2 0 1 2 3 5 7 8 9
3
4 0 1 5
b) The mode is 0
c) 0
0 0 6 Key:
1
1 2 3 4 5 8 2 | 3 23 runs
2 0 1 2 3 5 7 8 9
3
4 0 1 5
By counting in from both ends (crossing out as you count inwards)
the middle two values are 20 and 21
So the median is 20.5
So far you have found the median by ‘counting in’ from both ends of a list of data.
When you have a large amount of data it is easier to use a formula.
n1
The median is the th item of data where n is the number of items of data.
2
99 1
For example, when you have 99 items of data the median will be the th
2
(i.e. the 50th) item of data, when the list is in order.
20 1
In this example you have 20 items and the median is the 10.5 or the ‘10.5th’ item.
2
This means that there are two numbers in the middle and that the median is halfway
between them – in this case the 10th and 11th items.
EXERCISE 2.3
1 This stem and leaf diagram shows the ages of the members of a local golf club.
2 1 4 9
3 0
4 1 7 8
5 0 5 6 7 9 2
6 2
2 Penny has made an unsorted stem and leaf diagram to show the times taken by members
of her class to travel to school each morning.
The times are in minutes.
0 5 1 6
1 4 8 3 1 0
2 4 5 2 1
3 6 9 8
a) Redraw the diagram so that it is fully sorted. b) Add a key to the new diagram.
c) Find the median. d) Find the range.
36 41 29 32 48 19 36 30
50 35 25 44 31 47 27
72 79 68 85 61 88 74 81 73
92 84 49 63 91 86 76 88
You can find averages and the range from a frequency table.
Make sure you understand this method. It is examined regularly in
GCSE papers.
SOLUTION
So these 7 boxes contain
a) Step 1 Make a new column and call it x f 350 matches altogether.
Step 2 Multiply x and f together:
48 1 48 Number of matches Frequency, f xf
49 5 245 48 1 48
etc. 49 5 245
50 7 350
… …
Step 3 Add up the numbers in each of the last two columns.
Number of matches Frequency, f xf
48 1 48
49 5 245
50 7 350
51 0 0
52 5 260
53 1 53
54 1 54
20 1010
There are 20
1010 matchboxes containing
Mean
20 1010 matches altogether.
The mean is 50.5 matches.
m
5
h
ce
sa
5
t0
.mT
sh
eie
a
n
2 Emily surveys cars driving past her school into town in the morning.
She counts the number of occupants of each car.
The frequency table shows her results.
Number of occcupants Frequency
1 7
2 14
3 10
4 9
5 or more 0
Total
EXAMPLE
The marks for Year 10’s end-of-term mathematics test are shown in the table.
Mark, m Frequency, f
This is grouped
0 to 4 0
discrete data.
5 to 9 1
10 to 15 5
16 to 19 23
20 to 24 24
25 to 29 18
a) Calculate an estimate of the value of the mean.
30 to 34 15
b) Write down the modal class.
35 to 39 14 c) Work out an estimate for the range of the data.
40 or more 0 d) In which class does the median lie?
SOLUTION
a) Mark, m Frequency, f Midpoint, x xf
0 to 4 0 2 0
5 to 9 1 7 7
10 to 14 5 12 60
… …
7m
2
4
.im
e
td
saT
th
e
ia
n
b) The modal class is found by looking for the highest frequency:
Mark, m Frequency, f
0 to 4 0
5 to 9 1
10 to 15 5
The data is grouped so
you can’t work out a 16 to 19 23 24 is the highest
single value as the mode 20 to 24 24 frequency, so the
– so the modal group is modal class is 20
25 to 29 18
used instead. to 24
30 to 34 15
35 to 39 14
40 or more 0
However, the notation used for writing the class intervals is different.
EXAMPLE
The resistances of a sample of 50 electronic components are measured.
The table shows the results.
Resistance, R Frequency, f
80 R 90 10
90 R 100 23
100 R 110 11
110 R 130 6
SOLUTION
a) Resistance, R Frequency, f Midpoint, x xf
80 R 90 10 85 850
90 R 100 23 95 2185
100 R 110 11 105 1155
110 R 130 6 120 720
50 4910
4910
Estimated mean
50
98.2
The estimated mean is 98.2
b) The median will be in the 25th/26th position, and from the table
ht R 100
the median lies ine90
2 Maurizio records the number of people at work in his department each day.
The frequency table shows his results.
a) Copy the table, and fill in the midpoint values in the third column.
b) Use the table to help you calculate an estimate of the mean number of people at
work each day.
c) State the modal class.
d) Find the class interval which contains the median.
e) Benoit says, ‘The range is 20.’
Explain why Benoit must be wrong.
3 A registrar records the ages of men who married Age (A) in years Frequency
in his office one week. 20 A 30 4
The frequency table shows her results.
30 A 40 7
a) State the modal class.
b) Find the class interval which contains the median. 40 A 50 6
c) Work out an estimate of the mean age of the 50 A 60 3
men who married that week. 60 A 70 1
Give your answer correct to 3 significant
Total
figures.
Maths mark
Winter temperature, °C 2 3 6 7 8 8 12 13 13 14
Summer temperature, °C 15 17 20 17 25 21 23 26 25 24
SOLUTION
a)
28
Summer temperature (°C)
26
24
22
20
18
16
14
12
0
0 2 4 6 8 10 12 14 16
Winter temperature (°C)
c)
28
Summer temperature (°C)
26
24
22
20
18
16
14
12
0
0 2 4 6 8 10 12 14 16
Winter temperature (°C)
A IQ against weight
B Value of car against age of car
C Height of oak tree against age of oak tree
2 The scatter graph shows some information about the marks of six students in two
examinations – Paper 1 and Paper 2.
40
30
Paper 2 marks
20
10
O 10 20 30 40 50 60
Paper 1 marks
The table shows the marks for Paper 1 and Paper 2 for two more students, A and B.
Student A Student B
Paper 1 mark 20 50
Paper 2 mark 20 35
a) On a copy of the scatter graph, plot the information from the table.
b) Describe the correlation between the marks on Paper 1 and Paper 2.
c) Draw a line of best fit on the diagram.
Another student has a Paper 2 mark of 30
d) Use your line of best fit to estimate the Paper 1 mark for this student. [Edexcel]
a) Complete a copy of the scatter graph to show the information in the table.
The first six points have been plotted for you.
45
40
Units of electricity used
35
30
25
20
0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Average temperature (°C)
b) Describe the correlation between the number of units of electricity used and the
average temperature.
c) Draw a line of best fit on your scatter graph.
d) Use your line of best fit to estimate:
(i) the average temperature if 35 units of electricity are used
(ii) the units of electricity used if the average temperature is 7 °C [Edexcel]
80
Life expectancy (years)
70
60
50
40
0
0 10 20 30 40 50 60 70
Birth rate
70
60
50
Time to melt (minutes)
40
30
20
10
O 5 10 15 20 25 30 35
Temperature (°C)
Number of pages 80 130 100 140 115 90 160 140 105 150
Weight 160 270 180 290 230 180 320 270 210 300
a) Complete a copy of the scatter graph to show the information in the table.
The first six points in the table have been plotted for you.
320
300
280
260
Weight of book (g)
240
220
200
180
160
0
60 80 100 120 140 160 180
Number of pages
REVIEW EXERCISE 2
Don’t use your calculator for Questions 1–5.
2 Shirin recorded the number of students late for school each day for 21 days.
The stem and leaf diagram shows this information.
Number of students late
0 4 5 7 8 8 9 Key:
1 2 2 5 6 6 7 7 9 9 9 1 | 4 means 14 students late
2 0 1 3 4 6
Review exercise 2 47
4 The scatter graph shows some information about six new-born baby apes.
For each baby ape, it shows the mother’s leg length and the baby ape’s birth weight.
1.8
1.7
Baby ape’s birth weight (kg)
1.6
1.5
1.4
0
40 50 60 70
Mother’s leg length (cm)
The table shows the mother’s leg length and the birth weight of two more baby apes.
a) On a copy of the scatter graph, plot the information from the table.
b) Describe the correlation between a mother’s leg length and her baby ape’s birth
weight.
c) Draw a line of best fit on the graph.
A mother’s leg length is 55 cm
d) Use your line of best fit to estimate the birth weight of her baby ape. [Edexcel]
7 Andy did a survey of the number of cups of coffee some pupils in his school
had drunk yesterday.
The frequency table shows his results.
Review exercise 2 49
8 The scatter graph shows information about the number of donkey
rides on Blackpool beach, and the number of hours of sunshine on
each day.
10
Number of hours of sunshine
0
0 50 100 150 200 250 300 350
Number of donkey rides
The table shows the number of donkey rides and the number of hours of sunshine on
four other days.
a) On a copy of the scatter graph, plot the information from the table.
b) Describe the relationship between the number of donkey rides and the number of
hours of sunshine.
c) Draw a line of best fit on the scatter graph.
d) Use your line of best fit to estimate:
(i) the number of donkey rides when there are 7 hours of sunshine
(ii) the number of hours of sunshine on a day when there were 200 donkey rides.
[Edexcel]
11 The table shows information about the number of hours that 120 children used a
computer last week.
Work out an estimate for the mean number of hours that the children used a computer.
Give your answer correct to two decimal places. [Edexcel]
Review exercise 2 51
12 A garage keeps records of the costs of repairs to its customers’ cars.
The table gives information about the costs of repairs which were less than £250 in
one week.
13 35 students with Saturday jobs took part in Hourly rate of pay (£x) Frequency
a survey. 3.00 x 3.50 1
They were asked the hourly rate of pay for
3.50 x 4.00 2
their jobs.
This information is shown in the grouped 4.00 x 4.50 4
frequency table. 4.50 x 5.00 7
Work out an estimate for the mean hourly 5.00 x 5.50 19
rate of pay.
5.50 x 6.00 2
Give your answer to the nearest penny.
[Edexcel]
Key points 2 53
Internet Challenge 2
Populations
Here is a list of the twelve most populous countries in the world, in decreasing order.