2 Representing and Interpreting Data
2 Representing and Interpreting Data
1 The weights, in kg, of the 11 members of the Dolphins swimming team and the 11 members of the
Sharks swimming team are shown below.
Dolphins 62 75 69 82 63 80 65 65 73 82 72
Sharks 68 84 59 70 71 64 77 80 66 74 72
(i) Draw a back-to-back stem-and-leaf diagram to represent this information, with Dolphins on the
left-hand side of the diagram and Sharks on the right-hand side. [4]
(ii) Find the median and interquartile range for the Dolphins. [3]
9709/61/M/J/19/Q4
2 The Mathematics and English A-level marks of 1400 pupils all taking the same examinations are
shown in the cumulative frequency graphs below. Both examinations are marked out of 100.
1500
1400
1300
1200
1100 English
Mathematics
1000
Cumulative frequency
900
800
700
600
500
400
300
200
100
0
0 10 20 30 40 50 60 70 80 90 100
Marks
Use suitable data from these graphs to compare the central tendency and spread of the marks in
Mathematics and English. [6]
9709/62/M/J/19/Q6
3 (i) Give one advantage and one disadvantage of using a box-and-whisker plot to represent a set of
data. [2]
(ii) The times in minutes taken to run a marathon were recorded for a group of 13 marathon runners
and were found to be as follows.
180 275 235 242 311 194 246 229 238 768 332 227 228
State which of the mean, mode or median is most suitable as a measure of central tendency for
these times. Explain why the other measures are less suitable. [3]
(iii) Another group of 33 people ran the same marathon and their times in minutes were as follows.
190 203 215 246 249 253 255 254 258 260 261
263 267 269 274 276 280 288 283 287 294 300
307 318 327 331 336 345 351 353 360 368 375
(a) On the grid below, draw a box-and-whisker plot to illustrate the times for these 33 people.
[4]
(b) Find the interquartile range of these times. [1]
9709/63/M/J/19/Q7
4 The times in minutes taken by 13 pupils at each of two schools in a cross-country race are recorded
in the table below.
Thaters School 38 43 48 52 54 56 57 58 58 61 62 66 75
Whitefay Park School 45 47 53 56 56 61 64 66 69 73 75 78 83
(i) Draw a back-to-back stem-and-leaf diagram to illustrate these times with Thaters School on the
left. [4]
(ii) Find the interquartile range of the times for pupils at Thaters School. [2]
The times taken by pupils at Whitefay Park School are denoted by x minutes.
(iv) It is given that Σ x − 60 = 46. Use this result, together with your answer to part (iii), to find the
variance of x. [2]
9709/61/O/N/19/Q5
5 Ransha measured the lengths, in centimetres, of 160 palm leaves. His results are illustrated in the
cumulative frequency graph below.
180
160
140
Cumulative frequency
120
100
80
60
40
20
0
0 5 10 15 20 25 30
Length in centimetres
(i) Estimate how many leaves have a length between 14 and 24 centimetres. [1]
(ii) 10% of the leaves have a length of L centimetres or more. Estimate the value of L. [2]
(iii) Estimate the median and the interquartile range of the lengths. [3]
Sharim measured the lengths, in centimetres, of 160 palm leaves of a different type. He drew a
box-and-whisker plot for the data, as shown on the grid below.
0 5 10 15 20 25 30
Length in centimetres
(iv) Compare the central tendency and the spread of the two sets of data. [2]
9709/62/O/N/19/Q1
6 Twelve tourists were asked to estimate the height, in metres, of a new building. Their estimates were
as follows.
50 45 62 30 40 55 110 38 52 60 55 40
(i) Find the median and the interquartile range for the data. [3]
(ii) Give a disadvantage of using the mean as a measure of the central tendency in this case. [1]
9709/62/O/N/19/Q3
7 The speeds, in km h−1 , of 90 cars as they passed a certain marker on a road were recorded, correct to
the nearest km h−1 . The results are summarised in the following table.
(i) On the grid, draw a histogram to illustrate the data in the table. [4]
(ii) Calculate an estimate for the mean speed of these 90 cars as they pass the marker. [2]
9709/63/O/N/19/Q5
8 Last Saturday, 200 drivers entering a car park were asked the time, in minutes, that it had taken them
to travel from home to the car park. The results are summarised in the following cumulative frequency
table.
Time (t minutes) t ≤ 10 t ≤ 20 t ≤ 30 t ≤ 50 t ≤ 70 t ≤ 90
Cumulative frequency 16 50 106 146 176 200
(i) On the grid, draw a cumulative frequency graph to illustrate the data. [2]
(ii) Use your graph to estimate the median of the data. [1]
(iii) For 80 of the drivers, the time taken was at least T minutes. Use your graph to estimate the value
of T . [2]
(iv) Calculate an estimate of the mean time taken by all 200 drivers to travel to the car park. [4]
9709/52/F/M/20/Q7
9 Helen measures the lengths of 150 fish of a certain species in a large pond. These lengths, correct to
the nearest centimetre, are summarised in the following table.
(c) Calculate an estimate for the variance of the lengths of the fish. [3]
9709/51/M/J/20/Q7
10 The numbers of chocolate bars sold per day in a cinema over a period of 100 days are summarised in
the following table.
A B
21 1 2 4
7 6 3 0 22 2 4 5 5 6
8 7 4 3 1 1 23 0 2 6 8 9 9
5 5 5 3 2 24 3 3 4 6
4 3 1 0 25 6
(a) Find the median and the interquartile range for machine A. [3]
It is given that for machine B the median is 0.232 m, the lower quartile is 0.224 m and the upper
quartile is 0.243 m.
9709/53/M/J/20/Q6
12 The annual salaries, in thousands of dollars, for 11 employees at each of two companies A and B are
shown below.
Company A 30 32 35 41 41 42 47 49 52 53 64
Company B 26 47 30 52 41 38 35 42 49 31 42
(a) Represent the data by drawing a back-to-back stem-and-leaf diagram with company A on the
left-hand side of the diagram. [4]
(b) Find the median and the interquartile range of the salaries of the employees in company A. [3]
A new employee joins company B. The mean salary of the 12 employees is now $38 500.
Dados 6 8 12 15 10 36 42 28 10 22 16
Linva 2 11 15 16 0 32 36 40 10 12 9
9709/53/O/N/20/Q7
15 A particular piece of music was played by 91 pianists and for each pianist, the number of incorrect
notes was recorded. The results are summarised in the table.
Hence find the greatest possible value of the interquartile range. [2]
(c) Calculate an estimate for the mean number of incorrect notes. [3]
9709/52/F/M/21/Q5
16 A driver records the distance travelled in each of 150 journeys. These distances, correct to the nearest
km, are summarised in the following table.
9709/52/M/J/21/Q7
18 The heights, in cm, of the 11 basketball players in each of two clubs, the Amazons and the Giants, are
shown below.
Amazons 205 198 181 182 190 215 201 178 202 196 184
Giants 175 182 184 187 189 192 193 195 195 195 204
(b) Represent the data by drawing a back-to-back stem-and-leaf diagram with Amazons on the
left-hand side of the diagram. [4]
(c) Find the interquartile range of the heights of the players in the Amazons. [2]
Four new players join the Amazons. The mean height of the 15 players in the Amazons is now
191.2 cm. The heights of three of the new players are 180 cm, 185 cm and 190 cm.
160
140
120
Cumulative frequency
100
80
60
40
20
0
0 40 80 120 160 200 240
Height (cm)
(a) Use the graph to estimate the number of plants with heights less than 100 cm. [1]
(b) Use the graph to estimate the 65th percentile of the distribution. [2]
(c) Use the graph to estimate the interquartile range of the heights of these plants. [2]
9709/51/O/N/21/Q6
20 The weights, in kg, of 15 rugby players in the Rebels club and 15 soccer players in the Sharks club
are shown below.
Rebels 75 78 79 80 82 82 83 84 85 86 89 93 95 99 102
Sharks 66 68 71 72 74 75 75 76 78 83 83 84 85 86 92
(a) Represent the data by drawing a back-to-back stem-and-leaf diagram with Rebels on the left-hand
side of the diagram. [4]
(b) Find the median and the interquartile range for the Rebels. [3]
A box-and-whisker plot for the Sharks is shown below.
Sharks
60 70 80 90 100 110
Weight (kg)
(c) On the same diagram, draw a box-and-whisker plot for the Rebels. [2]
(d) Make one comparison between the weights of the players in the Rebels club and the weights of
the players in the Sharks club. [1]
9709/52/O/N/21/Q7
21 The distances, x m, travelled to school by 140 children were recorded. The results are summarised in
the table below.
(a) On the grid, draw a cumulative frequency graph to represent these results. [2]
(b) Use your graph to estimate the interquartile range of the distances. [2]
(c) Calculate estimates of the mean and standard deviation of the distances. [6]
9709/53/O/N/21/Q2
22 Lakeview and Riverside are two schools. The pupils at both schools took part in a competition to see
how far they could throw a ball. The distances thrown, to the nearest metre, by 11 pupils from each
school are shown in the following table.
Lakeview 10 14 19 22 26 27 28 30 32 33 41
Riverside 23 36 21 18 37 25 18 20 24 30 25
(a) Draw a back-to-back stem-and-leaf diagram to represent this information, with Lakeview on the
left-hand side. [4]
(b) Find the interquartile range of the distances thrown by the 11 pupils at Lakeview school. [2]
9709/53/O/N/21/Q3
23 The times taken, in minutes, by 360 employees at a large company to travel from home to work are
summarised in the following table.
(b) Calculate an estimate of the standard deviation of the times taken to travel to college. [3]
(c) In which class interval does the upper quartile lie? [1]
It was later discovered that the times taken to travel to college by two students were incorrectly
recorded. One student’s time was recorded as 15 instead of 5 and the other’s time was recorded as 65
instead of 75.
(d) Without doing any further calculations, state with a reason whether the estimate of the standard
deviation in part (b) would be increased, decreased or stay the same. [1]
9709/52/M/J/22/Q3
26 The back-to-back stem-and-leaf diagram shows the diameters, in cm, of 19 cylindrical pipes produced
by each of two companies, A and B.
Company A Company B
4 33 1 2 8
9 8 3 2 0 34 1 6 8 9 9
8 7 5 4 1 1 35 1 2 2 3
9 6 5 2 36 5 6
4 3 1 37 0 3 4
38 2 8
Key: 1 35 3 means the pipe diameter from company A is 0.351 cm and from company B is 0.353 cm.
(a) Find the median and interquartile range of the pipes produced by company A. [3]
It is given that for the pipes produced by company B the lower quartile, median and upper quartile are
0.346 cm, 0.352 cm and 0.370 cm respectively.
(b) Draw box-and-whisker plots for companies A and B on the grid below. [3]
(c) Make one comparison between the diameters of the pipes produced by companies A and B. [1]
9709/53/M/J/22/Q1
27 The time taken, t minutes, to complete a puzzle was recorded for each of 150 students. These times
are summarised in the table.
9709/51/O/N/22/Q3
28 The Lions and the Tigers are two basketball clubs. The heights, in cm, of the 11 players in each of
their first team squads are given in the table.
Lions 178 186 181 187 179 190 189 190 180 169 196
Tigers 194 179 187 190 183 201 184 180 195 191 197
(a) Draw a back-to-back stem-and-leaf diagram to represent this information, with the Lions on the
left. [4]
(b) Find the median and the interquartile range of the heights of the Lions first team squad. [3]
It is given that for the Tigers, the lower quartile is 183 cm, the median is 190 cm and the upper quartile
is 195 cm.
(c) Make two comparisons between the heights of the players in the Lions first team squad and the
heights of the players in the Tigers first team squad. [2]
9709/52/O/N/22/Q4
29 The times taken, in minutes, to complete a word processing task by 250 employees at a particular
company are summarised in the table.
(b) Calculate an estimate for the standard deviation of these times. [3]
9709/53/O/N/22/Q3
30 The times, t minutes, taken to complete a walking challenge by 250 members of a club are summarised
in the table.
(c) Calculate an estimate for the standard deviation of the times taken to complete the challenge by
these 250 members. [4]
9709/52/F/M/23/Q1
31 Each year the total number of hours, x, of sunshine in Kintoo is recorded during the month of June.
The results for the last 60 years are summarised in the table.
x 30 ≤ x < 60 60 ≤ x < 90 90 ≤ x < 110 110 ≤ x < 140 140 ≤ x < 180 180 ≤ x ≤ 240
Number
4 8 14 25 7 2
of years
Population 100 − 800 900 − 1200 1300 − 2000 2100 − 3200 3300 − 4800
Number of villages 8 12 50 48 32
Company A Company B
5 4 1 1 0 25 4 4 5 6 6 7
9 9 8 7 2 1 0 26 0 1 3 5 5 7 9 9
8 6 4 2 1 0 27 1 3 4 6 6 8 8
6 5 4 2 0 28 0 1 2 2 2
9 8 5 29
1 30 9
. .
Key: 1 27 6 means $2710 for company A and $2760 for company B
(a) Find the median and the interquartile range of the monthly salaries of employees in company A.
[3]
The lower quartile, median and upper quartile for company B are $2600, $2690 and $2780 respectively.
(b) Draw two box-and-whisker plots in a single diagram to represent the information for the salaries
of employees at companies A and B. [3]
9709/53/M/J/23/Q4
34 The times taken, in minutes, to complete a cycle race by 19 cyclists from each of two clubs, the
Cheetahs and the Panthers, are represented in the following back-to-back stem-and-leaf diagram.
Cheetahs Panthers
9 8 7 4
8 7 3 2 0 8 6 8
9 8 7 9 1 7 8 9 9
6 5 3 3 1 10 2 3 4 4 5 6
9 8 2 11 1 2 8
4 12 0 6
. .
Key: 7 9 1 means 97 minutes for Cheetahs and 91 minutes for Panthers
(a) Find the median and the interquartile range of the times of the Cheetahs. [3]
The median and interquartile range for the Panthers are 103 minutes and 14 minutes respectively.
(b) Make two comparisons between the times taken by the Cheetahs and the times taken by the
Panthers. [2]
Another cyclist, Kenny, from the Cheetahs also took part in the race. The mean time taken by the
20 cyclists from the Cheetahs was 99 minutes.
(c) Find the time taken by Kenny to complete the race. [3]
35 9709/51/O/N/23/Q1
120
100
Cumulative frequency 80
60
40
20
0
0 10 20 30 40 50
Time (seconds)
The times taken by 120 children to complete a particular puzzle are represented in the cumulative
frequency graph.
(a) Use the graph to estimate the interquartile range of the data. [2]
35% of the children took longer than T seconds to complete the puzzle.
9709/51/O/N/23/Q4
36 The times, to the nearest minute, of 150 athletes taking part in a charity run are recorded. The results
are summarised in the table.
Time in minutes 101 − 120 121 − 130 131 − 135 136 − 145 146 − 160
Frequency 18 48 34 32 18
Aces 180 174 169 182 181 166 173 182 168 171 164
Jets 175 174 188 168 166 174 181 181 170 188 190
(a) Draw a back-to-back stem-and-leaf diagram to represent this information with the Aces on the
left-hand side of the diagram. [4]
(b) Find the median and the interquartile range of the heights of the players in the Aces. [3]
(c) Give one comment comparing the spread of the heights of the Aces with the spread of the heights
of the Jets. [1]
9709/53/O/N/23/Q4
38 The weights, x kg, of 120 students in a sports college are recorded. The results are summarised in the
following table.
IQR = 80 – 65 M1 nfww
3
2 Median Maths = 40 M1 Indication of finding medians, such as mark on graph or reference
marks to 700 pupils, condone poor terminology such as ‘mean’
Median English = 55 A1 Both values correct, condone 54<English<56 but 54, 56 get A0
Median of English is larger than median of Maths B1 Correct statement, median must be referenced within answer.
No credit if statement references ‘means’
Range Maths is 100 or IQ range Maths = 80 – 12 = 68 M1 Evidence of finding either both ranges or both IQ ranges i.e. see a
minus
Maths marks have more spread then English marks B1 Correct conclusion. Accept standard deviation but must see some
figures
3(i) Advantage: comment referring to spread or range or shape B1 Comments referring to quartiles, IQR, Range, median, shape, skewness,
data distribution, spread score B1
Any comments with reference to mean or standard deviation or any other
‘disadvantage’ will score B0
Comments referring to ‘5-value plot’, comparison with another data set,
overview or ease of drawing/plotting/reading require an appropriate
advantage statement.
Disadvantage: comment referring to limited data information B1 Comments referring to no individual data, no information about the
provided number of values, unable to calculate mean, standard deviation, variance
and mode score B1
Any comments with reference to median, shape or any other ‘advantage’
will score B0
Comments referring to ‘size of data set’ or ‘average’ require an
appropriate disadvantage statement.
Comments referring to outliers are ignored in all cases (as outliers are not
in the syllabus content) unless supported by an appropriate advantage /
disadvantage statement.
2
(ii) Not mean as data skewed by one large value B1 Comment which identifies 768 (or ‘a very large number’) as the problem.
Condone the use of ‘outlier’
Not mode as frequencies all the same B1 Comment which indicates that no mode exists
(e.g. all the data is different, there is no repeated number, all the values are
different)
Median B1 Median identified as choice, dependent upon statements for mean and
mode being given, even if incorrect or very general.
Not mode as frequencies all the same SCB1 Comment which indicates that no mode exists
Not median as not all values used SCB1 Comment which indicates limitation of median e.g. median is not in
middle of range.
(iii)(a) LQ = 256 or 256.5 Med = 280 UQ = 329 B1 Median, UQ and LQ values seen, may not be identified or identified
Min 190 max 375 correctly. (Not read from box plot unless value stated)
150 200 250 300 350 400 B1 Correct end points, whiskers from ends of box but not through box, not at
time minutes top or bottom of box
B1 Uniform scale from 190 to 375 (need at least 3 linear identified points
min) and labelled ‘time’ and ‘minutes’ (can be in title)
(iii)(b) IQR
= their 329 – their 256 = 73 or 72.5 B1 FT Must follow through only from their stated values (condone if correct
quartiles stated here), not reading from graph.
1
4(i) Thaters School Whitefay Park School B1 Correct stem can be upside down, ignore extra values,
8 3
8 3 4 5 7 B1 Correct Thaters School labelled on left, leaves in order from right to
8 8 7 6 4 2 5 3 6 6 left and lined up vertically, no commas
6 2 1 6 1 4 6 9
5 7 3 5 8 B1 Correct Whitefay Park School labelled on same diagram on right
8 3 hand side in order from left to right and lined up vertically, no
commas
Key 8 | 4 | 5 represents 48 minutes for Thaters School and 45 B1 FT Correct key for their diagram, need both teams identified and
minutes for Whitefay Park School. ‘minutes’ stated at least once here or in leaf headings or title.
(iii) Σ(x – 60)2 = (–15)2 + (–13)2 + (–7)2 + (–4)2 + (–4)2 + 12 + 42 + 62 + M1 Summing squares with at least 5 correct unsimplified terms
92 + 132 + 232 + 152 + 182
= −
13 13 Using two coded values in correct formula (variance or sd)
their 1856 46
2 A1 Correct answer
Var = −
13 13 SC if correct variance obtained by another method give SCB1
= 130
2
5(i) 156 – 55 = 99 B1 98 ⩽ answer < 100
(L =) 22 A1
(iv) The Median higher for Ransha (1st set of data) B1 Any correct comparison of central tendency, must mention
median
IQR lower for Ransha (1st set of data) B1 Any correct comparison of spread, must refer to IQR
6(i) Median = 51 B1
UQ = 57.5, LQ = 40
IQR = UQ – LQ M1 55 ⩽ UQ ⩽ 62 – 38 ⩽ LQ ⩽ 45
17.5 A1 NFWW
1
7(i) 0.5 2.4 3 1.4 0.4 M1 At least 3 frequency densities calculated (frequency ÷ class
width)
10 10 10
e.g. , or may be read from graph using their scale,
20 19 19.5
3SF or exact
(ii) 19.5 ×10 + 34.5 × 24 + 44.5 × 30 + 54.5 × 14 + 74.5 × 12 M1 Uses at least 4 midpoint attempts (e.g. 19.5 ± 0.5).
their 90 Allow unsimplified expression.
195 + 828 + 1335 + 763 + 894
=
90
4015 803
= or
90 18
2
8(i) Correct labels and scales B1 Axes labelled ‘cumulative frequency’ (or cf) and ‘time (or t)
[in] min(utes)’, linear scales from 0 to 90 and 0 to 200 with
at least 3 values marked on each axis.
7 correctly plotted points above upper boundaries joined in a curve or line B1 (0, 0); (10, 16); (20, 50); (30, 106); (50, 146); (70,176);
segments (90,200)
(ii) B1 28 ⩽ median ⩽ 30
5×16 + 15 × 34 + 25 × 56 + 40 × 40 + 60 × 30 + 80 × 24 M1 At least 4 correct midpoints (5, 15, 25, 40, 60, 80) used in a
Est. Mean = calculation
200
4
9(a) 15, 63, 129, 150 B1 Correct cumulative frequencies seen (may be on graph)
A1 Linear vertical scale, all points at correct upper end points (9.5
etc.), curve drawn accurately, joined to (0,0)
(condone (–0.5, 0))
4.752 × 15 + 122 × 48 +17 2 × 66 + 252 × 21 M1 Using midpoints ±0.5 in correct var formula, including
Var = − 15.2952 subtraction of their µ2.
150
= 29.1 A1
10(a) Class widths: 10, 5, 15, 20, 10 M1
(b) 11 – 15 and 31 – 50 B1
Greatest IQR = 50 – 11 = 39 B1
77917.5 A1
− mean 2 = 224.57
100
4
11(a) Median = 0.238 B1
UQ = 0.245, LQ = 0.231, M1
So IQR = 0.245 – 0.231
0.014 A1
(b)
LQ M UQ
2
12(a)
A B
2 6
5 2 0 3 0 1 5 8
9 7 1 4 1 2 2 7 9
3 2 5 2
4 6
Correct stem B1
Correct A on LHS B1
Correct key for their diagram, both companies identified and correct units B1
LQ = [$]35 000 B1
UQ = [$]52 000
3
13(a) M1 At least 4 points plotted at upper end points, with both scales
linear with at least 3 values indicated
Correct cumulative frequency curve A1 All plotted correctly with curve drawn joined to (0, 0), axes
labelled cumulative frequency, time, minutes
k = 45 (mins) A1 FT Clear indication that their graph has been used, tolerance ±1mm
2
(c) Frequencies: 12 36 58 28 16 B1 Correct frequencies seen
120 + 900 + 2030 + 1400 + 1280 M1 Correct formula with their midpoints (not upper boundary,
150 lower boundary, class width or frequency density).
1 A1
38.2, 38
5
12 × 102 + 36 × 252 + 58 × 352 + 28 × 502 + 16 × 802 M1 Substitute their midpoints and frequencies (condone use of
Variance = − mean 2 cumulative frequency) in correct variance formula, must have
150
‘– their mean2’
1200 + 22500 + 71050 + 70000 + 102400
= − mean 2
150
6
14(a) B1 Correct stem can be upside down, ignore extra values
Dados Linva
B1 Correct Dados labelled, leaves in order and lined up vertically
8 6 0 0 2 9
(less than midway to next column), no commas etc, no extra
6 5 2 0 1 0 1 2 6 terms
2 4 0 B1 Correct single key for their diagram, need both resorts identified
and ‘cm’ stated at least once here or in leaf headings or title.
KEY 6| 3| 2 means 36 cm (snow) in Dados SC If 2 separate diagrams drawn, SCB1 if both keys meet these
and 32 cm (snow) in Linva criteria B0B1B0SCB1 max.
UQ or Q3 = 28 cm, LQ or Q1 = 10 cm M1 22 ⩽ UQ ⩽ 36 – 8 ⩽ LQ ⩽ 10
IQR = 28 – 10
18 (cm) A1 WWW
(c) On average the snowfall in Davos is higher B1 FT FT from their 5(b) values for Dados.
Statement comparing central tendency in context
The amount of snowfall in Linva varies more than in Davos B1 FT Statement comparing spread in context
Note: simply stating and comparing the values is not sufficient.
2
15(a) Class widths: 5, 5, 10, 20, 30 M1 At least 3 class widths correct and used in a calculation
Frequency density: 2, 1, 2.6, 1.6, 0.6
M1 At least 3 correct frequency densities unsimplified – FT their
class widths
B1 Correct upper bar ends (5.5, 10.5, 20.5, 40.5, 70.5) and 4 correct
lower bar ends of 5.5, 10.5, 20.5, 40.5. Condone 0 or 1.
Greatest IQR = 40 – 11 = 29 B1 FT Subtract lower end of their LQ interval from upper end of their
UQ interval
(c) Midpoints: 3 8 15.5 30.5 55.5 M1 At least 4 midpoints correct and used
3 × 10 + 8 × 5 +15.5× 26 + 30.5 × 32 + 55.5 × 18 M1 Correct formula with their midpoints (not upper boundary,
Mean = lower boundary, class width, frequency density, frequency or
91
30 + 40 + 403 + 976 + 999 cumulative frequency)
=
91
2448
=
91
82 A1 Accept 26 or 27
26.9, 26
91
16(a) B1 Correct cumulative frequencies seen (may be by table or
Distance 0-4 5-10 11-20 21-30 31-40 41-60 plotted accurately on graph), condone 12 not stated.
Upper
4∙5 10∙5 20∙5 30∙5 40∙5 60∙5
boundary
Cumulative
12 28 60 126 146 150
frequency
2
17(a) M1 At least 4 frequency densities calculated, accept
unsimplified.
Class width 10 10 20 20 40
May be read from graph using their scale, 3SF or correct
Frequency Density 1.6 5.4 3.9 1.6 0.5
A1 All heights correct on graph
(b) 16 × 5 + 54 ×15 + 78× 30 + 32 × 50 + 20 × 80 M1 Uses at least 4 midpoint attempts (e.g. 5 ± 0.5).
Mean = Accept unsimplified expression, denominator either
200
correct or their Σfrequencies
80 + 810 + 2340 +1600 + 1600
=
200
5 21 B1 Correct single key for their diagram, need both teams identified
and ‘cm’ stated at least once here or in leaf headings or title.
Key: 1|18|2 means 181 cm for Amazons and 182 cm for Giants SC for if 2 separate diagrams drawn, award SCB1 if both keys
meet these criteria (Max B1, B0, B0, B1)
(c) [UQ = 202 (cm), LQ = 182 (cm)] M1 201 ⩽ UQ ⩽ 205 – 181 ⩽ LQ ⩽ 184
[IQR =] 202 – 182 = 20 (cm)
A1 WW
2
(d) [Σ11 = 2132 B1 Bot Σ11 and Σ15 found. Accept unevaluated.
Σ15 = 191.2 ×15 = 2868 ]
their 2868 = their 2132 + (180 + 185 + 190) + h M1 Forming an equation for the height using their Σ11 and Σ15.
181 (cm) A1
[Σ15 = 191.2 ×15 = 2868 B1 Σ15 found using the mean and raw data methods. Accept
Σ15 = 2687 + h ] unevaluated.
their 2868 = their 2687 + h M1 Forming an equation for the height using their Σ15 expressions.
181 (cm) A1
[Σ15 = 2687 + h B1 Σ15 found using raw data method and statement on calculating
Σ15 new mean. Accept unevaluated.
= 191.2 ]
15
their 2687 + h M1 Forming an equation for the height using their Σ15 expressions
= 191.2
15
181 (cm) A1
(b) 65% of 160 = 104 M1 0.65 × 160 (=104) seen unsimplified or implied by use
on graph
20(a)
(a) Re Sharks B1 Correct stem, ignore extra values (not in reverse).
6 6 8
9 8 5 7 1 2 4 5 5 6 8 B1 Correct Rebels labelled on left, leaves in order from right
9 6 5 4 3 2 2 0 8 3 3 4 5 6 to left and lined up vertically, no commas.
9 5 3 9 2
2 10 B1 Correct Sharks labelled on same diagram, leaves in order
and lined up vertically, no commas.
Key: 8 | 7 | 2 means 78 kg for Rebels and 72 kg for Sharks
B1 Correct key for their diagram, need both teams identified
and ‘kg’ stated at least once here or in leaf headings or
title.
4
(b) Median = 84 (kg) B1
(c) Box and whisker with end points 75 and 102 B1 Whiskers drawn to correct end points not through box,
not joining at top or bottom of box.
Median and quartiles plotted as found in (b) B1 FT Quartiles and median plotted as box graph.
(d) e.g. Average weight of Rebels is higher than average weight of Sharks B1 Acceptable answers refer to: Range, skew, central
tendency within context.
E.g. range of Rebels is greater B0.
Range of weights of the rebels is greater B1.
Simple value comparison insufficient.
21(a) Cumulative frequency graph drawn B1 Axes labelled ‘cumulative frequency’ (or cf) from 0 to at least 140 and
‘distance (or d) [in] m’ from 0 to at least 1600, linear scales with at
least 3 values stated.
B1 All plotted correctly at correct upper end points (200 etc.) curve drawn
accurately joined to (0, 0) (straight line segments B0) but no daylight
above 140.
Cf scale no less than 2 cm = 20 children .
(b) [UQ at 75% of 140 = 105, LQ at 25% of 140 = 35] M1 Accept 660 ⩽ UQ ⩽ 720 – 240 ⩽ LQ ⩽ 290.
[IQR:] 700 – 260 If values are outside our range, FT providing scales linear and
increasing cf drawn.
2
(c) [M =] B1
Frequencies 16 30 42 34 12 6
16 × 100 + 30 × 250 + 42 × 400 + 34 × 700 + 12 × 1050 + 6 × 1400
140 Mid-points 100 250 400 700 1050 1400
5 or 6 correct frequency values seen.
M1 Values substituted into mean formula using their midpoints which must
be in the class – condone 1 data error.
1600 + 7500 + 16 800 + 23 800 + 12 600 + 8400 70 700
Accept or .
140 140
70 770
Condone for M1.
140
505 A1 WWW
Variance = M1 Values substituted into variance formula using (their mean)2 and their
16 × 1002 + 30 × 2502 + 42 × 4002 + 34 × 7002 + 12 ×10502 + 6 × 14002 midpoints and their frequencies (including for denominator). Accept
unsimplified. Condone 1 data error.
140
2 Accept:
−505 160 000 + 1875 000 + 6 720 000 + 16 660 000 + 13 230 000 + 11 760 000
[
140
50 405 000
or or 360 035.7143] − 5052 or 255 025
140
If formula stated accept 105 010 or 105 011 WWW.
6
22(a)
(a) Lake Riverside B1 Correct stem, ignore extra values.
9 4 1 8 8
B1 Correct Lakeview labelled on left, leaves in order from
8 7 6 2 0 1 4 5 5 right to left and lined up vertically, no commas.
3 2 3 0 6
1 4 B1 Correct Riverside labelled on same diagram, leaves in
order and lined up vertically, no commas.
Key: 6|2|3 means 26m for Lakeview and 23m for Riverside B1 Correct key for their diagram, need both teams identified
and ‘m’ stated at least once here or in leaf headings or
title.
IQR = 32 – 19 = 13 A1 WWW
2
23(a) M1 At least 4 frequency densities calculated
Class
30 15 20 10 25 A1 All heights correct on graph
Width
Frequency
0.7 2 3.4 8.6 1.8 B1 Bar ends at 0∙5, 30∙5, 45∙5, 65.5, 75.5, 100.5
Density
(at axis), 5 bars drawn, condone 0 in first bar
0.5 ⩽ time axis ⩽ 100.5, linear scale with at least 3 values
indicated.
B1 Axes labelled: Frequency density (fd), time (t) and mins (or
appropriate title). Linear fd scale, with at least 3 values
indicated 0 ⩽ fd axis ⩽ 8∙6
1
24(a) M1 At least 4 frequency densities calculated
Class
30 15 20 10 25 A1 All heights correct on graph
Width
Frequency
0.7 2 3.4 8.6 1.8 B1 Bar ends at 0∙5, 30∙5, 45∙5, 65.5, 75.5, 100.5
Density
(at axis), 5 bars drawn, condone 0 in first bar
0.5 ⩽ time axis ⩽ 100.5, linear scale with at least 3 values
indicated.
B1 Axes labelled: Frequency density (fd), time (t) and mins (or
appropriate title). Linear fd scale, with at least 3 values
indicated 0 ⩽ fd axis ⩽ 8∙6
1
25(a) M1 At least 4 frequency densities calculated
Class 20 10 10 20 30
(Frequency ÷ class width, e.g.
width
440 440 440
condone ,
Frequency 22 72 92 15 4 20 19.5 20.5
density Accept unsimplified, may be read from graph using
their scale
4
(b) B1 At least 4 correct midpoints seen
Midpoints 10 25 35 50 75
Variance =
440(10 31.44) 2 720(25 31.44) 2 920(35 31.44) 2 300(50 31.44) 2 120(75 31.44) 2
2500
202 256 29860 11659 103342 227 697 574814
229.9264
2500 2500
(c) 30‒40 B1
(d) Stays the same, data still in same intervals B1 Frequencies unchanged
1
26(a) Median = 0.355 B1 Identified condone Q2.
(b) Box-and-whisker plot on provided grid B1 All 5 key values for B plotted accurately in standard format
using their scale. Labelled B.
Check accuracy in the middle of vertical line.
1
27(a) Cumulative frequency (cf) graph M1 At least 3 points plotted accurately at class upper
end points (25,16) (50,44) (75,86) (100,104)
(150, 132) (200, 150).
Linear cf scale 0 ⩽ cf ⩽ 150 and linear time scale
0 ⩽ time(mins) ⩽ 200 with at least 3 values
identified on each axis.
(b) Line from cumulative frequency = 30 to meet graph at t is between 37.5 and B1 FT Not from wrong working. Must be an increasing
42 cumulative frequency graph.
1
28(a) Lions Tigers B1 Correct stem can be upside down, ignore extra values (not in
9 16 reverse).
9 8 17 9
9 7 6 1 0 18 0 3 4 7 B1 Correct Lions labelled on left, leaves in order from right to left
6 0 0 19 0 1 4 5 7 and lined up vertically, no commas or other punctuation.
20 1
B1 Correct Tigers labelled on same diagram, leaves in order and lined
up vertically, no commas or other punctuation.
Key 1|18|3 means 181 cm for Lions and 183 cm for Tigers B1 Correct single key for their diagram, need both teams identified
and ‘cm’ stated at least once here or in leaf headings or title.
11[cm] A1 WWW
(c) Tigers are (generally) taller B1 Comparison about central tendency in context.
Heights of Tigers are slightly less consistent than heights of Lions B1 Comparison about spread in context. (Condone ‘similar spread’ in
context.)
2
29(a) Cw 20 20 10 10 40 M1 At least 4 frequency densities calculated
Fd 1.6 2.3 9.6 5.2 0.6 f 32 f
eg condone if unsimplified , accept unsimplified,
cw 20 cw 0.5
may be read from graph using their scale no lower than 1 cm = fd 1
A1 All bar heights correct on graph, using their suitable linear scale
with at least 3 values indicated, no lower than 1 cm = fd 2.
B1 Bar ends at [0,] 20, 40, 50, 60, 100 (at axis), 5 bars drawn 0 ⩽ time
axis ⩽ 100, linear scale with at least 3 values indicated.
4
(b) Midpoints 10 30 45 55 80 B1 At least 4 correct midpoints seen (check data table).
[Mean = 43.2 given] M1 Appropriate variance formula with their 5 midpoints (not upper
32 102 + 46 302 + 96 452 + 52 552 + 24 802 bound, lower bound, class width, frequency density, frequency or
[Var =] − 43.22 cumulative frequency).
250
Or Condone 1 frequency error.
If correct midpoints seen accept
32 (10 − 43.2 ) + 46 ( 30 − 43.2 ) + 96 ( 45 − 43.2 )
2 2 2
3200 + 41400 + 194400 +157300 + 153600 549900
or
+52 ( 55 − 43.2 ) + 24 (80 − 43.2 )
2 2
250 250
250 −{43.22 or 1866.24} .
3
30(a) Cumulative frequency graph M1 At least 3 points plotted accurately at class upper end points:
(20,32), (30, 66), (35, 112), (40, 178), (50, 228),
(60, 250).
Linear cf scale 0 ⩽ cf ⩽ 250 and linear time scale 0 ⩽ time
⩽ 60 with at least 3 values identified on each.
(b) Line drawn from 150 on cf axis to meet graph at about B1 FT Must be an increasing cf graph with correct upper bounds.
t =38 minutes Use of graph must be seen.
Expect an answer in range 37 ⩽ t ⩽ 39 for a correct graph
1
(c) [Frequencies] [32] 34 46 66 50 22 B1 May be unsimplified and/or in variance calculation.
4
31(a) B1 All cumulative frequencies stated.
Upper value 60 90 110 140 180 240 May be under data table, condone omission of 4.
May be read accurately from graph, must include 4.
cf 4 12 26 51 58 60
2
(c) Midpoints: 45, 75, 100, 125, 160, 210 B1 At least 5 correct mid-points seen, check by data table or used
in formula.
4 45 + 8 75 +14100 + 25 125 + 7 160 + 2 210 M1 Correct mean formula using their 6 midpoints (must be within
[Mean =] class, not upper bound, lower bound), condone 1 data error
60
If correct midpoints seen accept
6845 180 + 600 + 1400 + 3125 + 1120 + 420
= 60 .
60
3
32(a) cw 800
00 00 00 1200 1600
M1 At least 4 frequency densities calculated (F/cw, e.g.
8 8
fd 0.01 0.03 0.0625 0.04 0.02 condone , 799 ⩽ n ⩽ 801 ) Accept unsimplified,
800 n
may be read from graph using their scale.
B1 Bar ends at 50, 850, 1250, 2050, 3250, 4850 read at the
axis with a horizontal linear scale with at least 3 values
indicated.
50 ⩽ horizontal scale ⩽ 4850.
(b) 2100 – 3200 B1 Accept 2050 – 3250 OE. Condone ‘4th interval’.
2
33(a) Median = 2710 B1 Must be identified, condone Q2.
Ignore units throughout.
230 A1 www
If M0 scored SC B1 for 230 www.
If key ignored consistently:
B0 Median = 271
SC M1 282 ⩽ UQ ⩽ 285 – 260 ⩽ LQ ⩽ 262
SC A1 23.
(b) Box-and-whisker plot on provided grid. B1 All 5 key values for B plotted accurately in standard
format using a linear scale with 3 identified values.
Labelled B.
Scale at least 1 cm = $100.
B: 2540 2600 2690 2780 3090 B1FT All 5 key values for A, FT from (a), plotted accurately in
A: 25 2610 2710 2840 3010 standard format using a linear scale with 3 identified
values. Labelled A.
Scale at least 1cm = $100
3
(c) Examples: B1 Must refer to company B, may be implied by appropriate
Mean less appropriate than median because of extreme value for company B use of $3090.
[at $3090]. Must include an indication that the mean is not
No, extreme value in company B. appropriate.
No, $3090 is an anomaly. No contradictory statements can be present, e.g.
acceptable comment with ‘but mean could be used for
company A’.
Condone reference to $309.
23 [minutes] A1 www.
If M0 scored SC B1 for 23 www.
(b) The times for the Cheetahs are faster than the times for the B1 Correct statement comparing central tendency in context.
Panthers
The times for the Cheetahs are more spread than the times for the B1 Correct statement comparing range/IQR in context.
Panthers
3
35(a) [IQR =] 31 – 23.7 M1 30.5 < UQ < 31.25 – 23.25 < LQ ⩽ 24
Evidence of graph use must be seen at least once.
36(a) M1 f 18
Class 20 10 5 10 15 At least 4 frequency densities calculated by e.g. (condone
width cw 20
f
if unsimplified).
Frequency 0.9 4.8 6.8 3.2 1.2 cw 0.5
density
Accept unsimplified, may be read from graph using their scale, no
lower than 1cm = 1 fd.
A1 All bar heights correct on graph (no FT), using their suitable linear
scale with at least 3 values indicated, no lower than 1cm = 1 fd.
B1 Bar ends at 120.5, 130.5, 135.5, 145.5, 160.5. 5 bars drawn with a
horizontal linear scale, no lower than 1 cm = 10 min, with at least 3
values indicated. 100 ⩽ horizontal scale ⩽ 160.
B1 Axes labelled frequency density (fd), time (t) and minutes (min, m)
oe, or an appropriate title.
(Axes may be reversed).
4
(b) [Midpoints ] 110.5 125.5 133 140.5 153 B1 At least 4 correct mid-points seen, may be by data table or used in
formula.
18 110.5 + 48 125.5 + 34 133 + 32 140.5 +18 153 M1 Correct formula for mean using midpoints ±0.5, condone 1 midpoint
Mean = error within class.
150
1989 + 6024 + 4522 + 4496 + 2754
=
150
= 131.9 A1 9 1319
Accept 132, 131 , or . Must be identified.
10 10
5
37(a) Aces Jets B1 Correct stem, ignore extra values (not in
9 8 6 4 16 6 8 reverse, not split).
4 3 1 17 0 4 4 5
2 2 1 0 18 1 1 8 8 If a split stem-and-leaf plot is used (i.e., stem
19 0 values are repeated), the remaining B marks
are available.
Key: 1 | 17 | 0 means 171 cm for the Aces and 170 cm for the Jets B1 Correct Jets labelled on same diagram, leaves
in order and lined up vertically, no commas
or other punctuation.
13 [cm] A1 www
If M0 scored SC B1 for 13 www.
(c) Jets have a greater variety of heights. B1 [Jets IQR = 18 cm, Range = 24 cm
Jets have a wider range of height. Aces IQR = their 4(b), Range = 18 cm]
Jets have a greater/larger/bigger/wider/’more’ spread of heights. Comment about spread in context, must
Aces have a smaller variety of height etc… include height.
Comparison of values does not score until a
comment in context is made.
If values for range or IQR are stated, they
must be correct or FT from 4(b).
If more than one comment about spread,
mark the final comment.
Additional comments about central tendency
score B0.
1
38(a) M1 At least 3 points plotted accurately at class upper end points (40,0)
(60,14) (65,38) (70,60) (85,106) (100,120).
Linear cumulative frequency scale 0 ⩽ cf ⩽ 120 and linear weight
scale 40 ⩽ weight(kg) ⩽ 100 with at least 3 values identified on
each axis.
Condone scale reversed.
2
(c) Frequencies: [0] 14 24 22 46 14 B1 At least 5 correct frequencies seen (condone omission of 0).
Midpoints: 20 50 62.5 67.5 77.5 92.5 B1 At least 5 correct midpoints seen (condone omission of 20).
Mean = M1 Correct formula for mean using their midpoints and their
0 20 +14 50 + 24 62.5 + 22 67.5 + 46 77.5 +14 92.5 8545
frequencies, implied by if correct midpoints & frequencies
120 120
[0] + 700 +1500 +1485 + 3565 +12950 8545 seen.
= = 120
120 May be gained in variance calculation.
If midpoints not clearly identified, condone midpoints ± 0.5.
= 71.2 A1 1709 5
Accept , 71 or 71.208333 to at least 3SF.
24 24
1709 5
If M0 scored, SC B1 for , 71 or 71.208333 to at least 3SF
24 24
www.
−
120 120
120 120 frequencies seen.
[=138.23]