0% found this document useful (0 votes)
121 views34 pages

Basic Statistics Answer Key

Uploaded by

beautysarah1000
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
121 views34 pages

Basic Statistics Answer Key

Uploaded by

beautysarah1000
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 34

Basic Statistics [180 marks]

1. [Maximum mark: 6] 20N.2.SL.TZ0.S_2


(a.i) [3]

Markscheme

valid approach (M1)

eg correct value for a or b (or for r or r 2 = 0. 962839 seen in (ii))

a = −9. 84636, b = 221. 592

a = −9. 85, b = 222 A1A1 N3

[3 marks]

(a.ii) [1]

Markscheme

−0. 981244

r = −0. 981 A1 N1

[1 mark]

(b) [2]

Markscheme

correct substitution into their equation (A1)

eg −9. 85 × 12 + 222

103. 435 (103. 8 from 3 sf )

103 (hot chocolates) A1 N2

[2 marks]
2. [Maximum mark: 6] 20N.1.SL.TZ0.T_3
(a) [1]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences
in marking or structure. It appeared in a paper that permitted the use of a calculator, and
so might not be suitable for all forms of practice.

400 ≤ w < 500 (A1) (C1)

Note: Accept alternative notation [400, 500) or [400, 500[.

Do not accept "400-500".

[1 mark]

(b) [2]

Markscheme

115 (115. 265 … (g)) (A2) (C2)

Note: Award (A1)(A0) for an answer of 116 (116. 459 …).

[2 marks]

(c) [3]

Markscheme
(A2)(A1) (C3)

Note: Award (A2) for all correct heights of bars or (A1) for three or four correct heights of
bars.
Award (A1) for rectangular bars all with correct left and right end points (
100, 200, 300, 400, 500 and 600) and for no gaps; the bars do not have to be

shaded.
Award at most (A2)(A0) if a ruler is not used for all lines.

[3 marks]

3. [Maximum mark: 15] 20N.1.SL.TZ0.S_8


(a) [2]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences
in marking or structure.

valid approach (M1)

eg Q 3 − Q 1 , Q 3 − 1. 1 , 4. 5 − a = 1. 1
a = 3. 4 A1 N2

[2 marks]

(b) [1]

Markscheme

(= 6. 4) (km) A1 N1
32

[1 mark]

(c) [4]

Markscheme

METHOD 1 (standard deviation first)

valid approach (M1)

eg
16
standard deviation = √ variance , √
9

standard deviation= 4

3
(km) (A1)

valid approach to convert their standard deviation (M1)

eg
4 5 16 8
× , √ = M
3 8 9 5

20

24
(miles) (=
5

6
) A1 N3

Note: If no working shown, award M1A1M0A0 for the value 4

3
.
If working shown, and candidate’s final answer is 4

3
, award M1A1M0A0.

METHOD 2 (variance first)

valid approach to convert variance (M1)

2 2
eg (
5

8
) ,
64

25
,
16

9
× (
5

8
)
variance = 25

36
(A1)

valid approach (M1)

2
eg
25 16 5
standard deviation = √ variance , √ , √ × ( )
36 9 8

20 5

24
(miles) (=
6
) A1 N3

[4 marks]

(d) [3]

Markscheme

correct frequency for 22 minutes (A1)

eg 20

adding their frequency (do not accept 22 + 400) (M1)

eg 20 + 400 , 420 athletes

m = 30 (minutes) A1 N3

[3 marks]

(e) [5]

Markscheme

27 (minutes) (A1)

correct working (A1)

150−20 13
eg 130 athletes between 22 and 27 minutes, P(22 < t < 27) =
600
,
60

evidence of conditional probability or reduced sample space (M1)

P(22<t<27) 150
eg P(A B) , P(t < 27 22 < t < 30) ,
P(22<t<m)
,
400

correct working (A1)


130

eg 600
400
,
150−20

400
600

130

400
(
13

40
=
78000

240000
=
390

1200
= 0. 325) A1 N5

Note: If no other working is shown, award A0A0M1A0A0 for answer of 150

400
.
Award N0 for answer of 3

8
with no other working shown.

[5 marks]

4. [Maximum mark: 6] 21M.2.SL.TZ2.1


(a.i) [2]

Markscheme

a = 0. 805084 … and b = 2. 88135 …

a = 0. 805 and b = 2. 88 A1A1

[2 marks]

(a.ii) [1]

Markscheme

r = 0. 97777 …

r = 0. 978 A1

[1 mark]

(b) [1]
Markscheme

a represents the (average) increase in waiting time (0. 805 mins) per additional customer
(waiting to receive their coffee) R1

[1 mark]

(c) [2]

Markscheme

attempt to substitute x = 7 into their equation (M1)

8. 51693 …

8. 52 (mins) A1

[2 marks]

5. [Maximum mark: 7] 21M.2.SL.TZ1.2


(a) [2]

Markscheme

a = 0. 433156 … , b = 4. 50265 …

a = 0. 433, b = 4. 50 A1A1

[2 marks]

(b) [2]

Markscheme
attempt to substitute x = 18 into their equation (M1)

y = 0. 433 × 18 + 4. 50

= 12. 2994 …

= 12. 3 A1

[2 marks]

(c) [1]

Markscheme

¯
¯
x = 15, y = 11 A1

[1 mark]

(d) [2]

Markscheme

A1A1
Note: Award marks as follows:

A1 for a straight line going through (15, 11)

A1 for intercepting the y-axis between their b ± 1. 5 (when their line is extended), which
includes all the data for 3. 3 ≤ x ≤ 25. 3.

If the candidate does not use a ruler, award A0A1 where appropriate.

[2 marks]

6. [Maximum mark: 14] 21M.1.SL.TZ2.7


(a) [2]

Markscheme

evidence of median position (M1)

40 students

median = 14 (hours) A1

[2 marks]

(b) [3]

Markscheme

recognizing there are 8 students in the top 10% (M1)

72 students spent less than k hours (A1)

k = 18 (hours) A1

[3 marks]
(c) [4]

Markscheme

15 hours is 60 students OR p = 60 − 4 (M1)

p = 56 A1

21 hours is 76 students OR q = 80 − 76 OR q = 80 − 4 − 56 − 16 (A1)

q = 4 A1

[4 marks]

(d) [3]

Markscheme

20 of the 80 students OR spend more than 15 hours doing homework (A1)


1

20 x 1

80
=
320
OR 4
× 320 OR 4 × 20 (A1)

80 (students) A1

[3 marks]

(e.i) [1]

Markscheme

only year 12 students surveyed OR amount of homework might be different for different
year levels R1

[1 mark]

(e.ii) [1]
Markscheme

stratified sampling OR survey students in all years R1

[1 mark]

7. [Maximum mark: 5] 21M.1.SL.TZ1.4


(a) [3]

Markscheme

attempt to use definition of outlier

1. 5 × 20 + Q 3 (M1)

1. 5 × 20 + U ≥ 75 ( ⇒ U ≥ 45, accept U > 45) OR 1. 5 × 20 + Q 3 = 75

A1

minimum value of U = 45 A1

[3 marks]

(b) [2]

Markscheme

attempt to use interquartile range (M1)

U − L = 20 (may be seen in part (a)) OR L ≥ 25 (accept L > 25)

minimum value of L = 25 A1

[2 marks]
8. [Maximum mark: 5] 21N.2.SL.TZ0.1
(a) [2]

Markscheme

use of GDC to give (M1)

r = 0. 883529 …

r = 0. 884 A1

Note: Award the (M1) for any correct value of r, a, b or r 2 = 0. 780624 … seen in part
(a) or part (b).

[2 marks]

(b) [1]

Markscheme

a = 1. 36609 … , b = 64. 5171 …

a = 1. 37 , b = 64. 5 A1

[1 mark]

(c) [2]

Markscheme

attempt to find their difference (M1)

5 × 1. 36609 … OR
1. 36609 … (h + 5) + 64. 5171 … − (1. 36609 … h + 64. 5171 …)

6. 83045 …

= 6. 83 (6. 85 f rom 1. 37)


the student could have expected her score to increase by 7 marks. A1

Note: Accept an increase of 6, 6. 83 or 6. 85.

[2 marks]

9. [Maximum mark: 6] 22M.2.SL.TZ2.5


(a) [1]

Markscheme

0. 28 (s) A1

[1 mark]

(b) [3]

Markscheme

IQR = 0. 35 − 0. 27 (= 0. 08) (s) (A1)

substituting their IQR into correct expression for upper fence (A1)

0. 35 + 1. 5 × 0. 08 (= 0. 47) (s)

0. 46 < 0. 47 R1

so 0. 46 (s) is not an outlier AG

[3 marks]

(c) [1]

Markscheme
EITHER

the median is closer to the lower quartile (positively skewed) R1

OR

The distribution is positively skewed R1

OR

the range of reaction times below the median is smaller than the range of reaction times
above the median R1

Note: These are sample answers from a range of acceptable correct answers. Award R1 for
any correct statement that explains this.
Do not award R1 if there is also an incorrect statement, even if another statement in the
answer is correct. Accept a correctly and clearly labelled diagram.

[1 mark]

(d) [1]

Markscheme

EITHER

the distribution for ‘not sleeping well’ is centred at a higher reaction time R1

OR

The median reaction time after not sleeping well is equal to the upper quartile reaction
time after sleeping well R1

OR
75% of reaction times are < 0. 35 seconds after sleeping well, compared with 50%
after not sleeping well R1

OR

the sample size of 9 is too small to draw any conclusions R1

Note: These are sample answers from a range of acceptable correct answers. Accept any
relevant correct statement that relates to the median and/or quartiles shown in the box
plots. Do not accept a comparison of means. Do not award R1 if there is also an incorrect
statement, even if another statement in the answer is correct.

Award R0 to “correlation does not imply causation”.

[1 mark]

10. [Maximum mark: 5] 22N.2.SL.TZ0.1


(a) [2]

Markscheme

1. 01206 … , 2. 45230 …

a = 1. 01, b = 2. 45 (1. 01x + 2. 45) A1A1

[2 marks]

(b) [1]

Markscheme

0. 981464 …

r = 0. 981 A1
Note: A common error is to enter the data incorrectly into the GDC, and obtain the
answers a = 1. 01700 … , b = 2. 09814 … and r = 0. 980888 … Some
candidates may write the 3 sf answers, ie. a = 1. 02, b = 2. 10 and r = 0. 981 or 2
sf answers, ie. a = 1. 0, b = 2. 1 and r = 0. 98. In these cases award A0A0 for part (a)
and A0 for part (b). Even though some values round to an accepted answer, they come
from incorrect working.

[1 mark]

(c) [2]

Markscheme

correct substitution of 78 into their regression equation (M1)

81. 3930 … 81. 23 from 3 sf answer

81 A1

[2 marks]

11. [Maximum mark: 7] 22M.1.SL.TZ1.3


(a) [3]

Markscheme

IQR = 10 − 6(= 4) (A1)

attempt to find Q 3 + 1. 5 × IQR (M1)

10 + 6

16 A1
[3 marks]

(b.i) [2]

Markscheme

choosing c =
1

2
a − 9 (M1)

1
× 42 − 9
2

= 12 (years old) A1

[2 marks]

(b.ii) [2]

Markscheme

attempt to solve system by substitution or elimination (M1)

34 (years old) A1

[2 marks]

12. [Maximum mark: 4] 22M.2.SL.TZ1.2


(a) [2]

Markscheme

EITHER

recognising that half the total frequency is 10 (may be seen in an ordered list or indicated
on the frequency table) (A1)

OR
5 + 1 + 4 = 3 + x (A1)

OR

∑ f = 20 (A1)

THEN

x = 7 A1

[2 marks]

(b) [2]

Markscheme

METHOD 1

1. 58429 …

1. 58 A2

METHOD 2

EITHER
2 2 2 2 2
2 5×(2−4.3) +1×(3−4.3) +4×(4−4.3) +3×(5−4.3) +7×(6−4.3)
σ = (= 2. 51)
20

(A1)

OR
2 2 2 2 2

(A1)
2 5×2 +1×3 +4×4 +3×5 +7×6 2
σ = − 4. 3 (= 2. 51)
20

THEN

σ = √ 2. 51 = 1. 58429 …
= 1. 58 A1

[2 marks]

13. [Maximum mark: 6] 23M.1.SL.TZ1.3


(a)
(a.i) [2]

Markscheme

summing frequencies of riders or finding complement (M1)

probability= 34

40
A1

[2 marks]

(a.ii) [2]

Markscheme

attempt to find expected value

16

40
+ (2 ×
13

40
) + (3 ×
2

40
) + (4 ×
3

40
) (M1)

60

40
(= 1. 5) A1

[2 marks]

(b) [2]

Markscheme

evidence of their rides/visitor ×1000 or ÷10 (M1)

1500 OR 0. 15
150 (times) A1

[2 marks]

14. [Maximum mark: 5] 24M.2.AHL.TZ2.2


(a) [1]

Markscheme

EITHER

y = 2. 1875 × 7 + 0. 6875
¯ A1

OR

¯
y = 15. 3125 + 0. 6875 A1

THEN

y = 16̄ AG

[1 mark]

(b) [4]

Markscheme

Σy
attempts to use 16 =
n
to form a linear equation in p and q (M1)

9+13+p+q+21
16 =
5
(80 = p + q + 43 ⇒ p + q = 37) (A1)

attempts to solve two linear equations simultaneously for p and q (one of which is
q = p + 3) (M1)

9+13+p+p+3+21
16 = (80 = 2p + 46)
5

p = 17 and q = 20 A1
[4 marks]

15. [Maximum mark: 17] 23N.1.SL.TZ2.7


(a.i) [1]

Markscheme

p = 9 A1

[1 mark]

(a.ii) [1]

Markscheme

600 < n ≤ 800 A1

Note: Award A0 if candidate answers 700.

[1 marks]

(b)
(b.i) [1]

Markscheme

median= 600 A1

[1 marks]

(b.ii) [3]

Markscheme

80 % of 800 = 640 (A1)

40 (performances less than 80 % of tickets sold) (A1)

20 (performances) A1
[3 marks]

(c.i) [1]

Markscheme

any reasonable answer which suggests a biased sample (must include reason, do not
accept reasons such as “sample size is too small”, or answers that simply say “not
representative of entire audience” without a valid reason) A1

eg likely to come from the same part of the theatre OR be part of same group OR be from
priority seating OR it is convenience sampling

[1 mark]

(c.ii) [2]

Markscheme

every 20th person A1A1

Note: Award A1 for recognizing that sampling occurs at regular intervals eg “every”.

Award A1 for interval length is 20.

[2 marks]

(c.iii) [1]

Markscheme

quota (sampling method) A1

[1 mark]

(d)
(d.i) [2]

Markscheme

75 % (of 36 000 spent between $3 and $25) (M1)


= 27000 A1

[2 marks]

(d.ii) [1]

Markscheme

a = 7 A1

[1 marks]

(e)
(e.i) [3]

Markscheme

METHOD 1

old mean is 600 (tickets) (A1)

recognising new mean is old mean + 17 (M1)

600 + 17

= 617 (tickets) A1

METHOD 2

new total number of tickets = 36000 + 60 × 17(= 37020) (A1)

new mean = (M1)


36000+60×17 37020
(= )
60 60

= 617 (tickets) A1

[3 marks]

(e.ii) [1]

Markscheme

no effect on the variance A1


[1 marks]

16. [Maximum mark: 5] 24M.2.SL.TZ2.3


(a) [1]

Markscheme

EITHER

¯
y = 2. 1875 × 7 + 0. 6875 A1

OR

y = 15. 3125 + 0. 6875


¯ A1

THEN

y = 16̄ AG

[1 mark]

(b) [4]

Markscheme

Σy
attempts to use 16 =
n
to form a linear equation in p and q (M1)

9+13+p+q+21
16 =
5
(80 = p + q + 43 ⇒ p + q = 37) (A1)

attempts to solve two linear equations simultaneously for p and q (one of which is
q = p + 3) (M1)

9+13+p+p+3+21
16 = (80 = 2p + 46)
5

p = 17 and q = 20 A1

[4 marks]

17. [Maximum mark: 6] 24M.2.SL.TZ2.1


(a)
(a.i) [1]

Markscheme

4. 45 (hours) A1

[1 mark]

(a.ii) [2]

Markscheme

one correct quartile either Q 1 = 1. 9 or Q 3 = 5. 7 (A1)

IQR = 5. 7 − 1. 9

= 3. 8 (hours) A1

[2 marks]

(b) [3]

Markscheme

attempts to find the upper fence value (M1)

upper fence = 11. 4 (A1)

11. 7 > 11. 4 R1

⇒ 11. 7 is an outlier AG

[3 marks]

18. [Maximum mark: 7] 24M.2.SL.TZ1.6


(a) [2]

Markscheme
r = 0. 901017 …

r = 0. 901 A2

[2 marks]

(b) [1]

Markscheme

Student 11 Test B: should not extrapolate R1

[1 mark]

(c.i) [1]

Markscheme

Student 12 Test A: should not use line of y on x to predict x from y (or equivalent) R1

[1 mark]

(c.ii) [3]

Markscheme

attempt to find the equation of the regression line of x on y (M1)

(x =)0. 987124 … y − 3. 21970 … ((x =) 0. 987y − 3. 22) A1

(x =)0. 987124 … (90) − 3. 21970 … (= 85. 6214 …) A1

= 86 to nearest integer. AG

Note: Condone notation for x and y switched if values are correct.

[3 marks]
19. [Maximum mark: 7] 24M.2.SL.TZ1.1
(a) [3]

Markscheme

median = 75 A1

(upper quartile =) 84 OR (lower quartile =) 68 (A1)

Interquartile range = 16 A1

[3 marks]

(b)
(b.i) [2]

Markscheme

Note: In this part, their reasoning and answer must be consistent with their values in part
(a).

In both part (i) and part (ii), award R0A1 for a correct answer with no reasoning.

80 > 75 OR Australia has a higher median OR Spain’s median is lower R1

Note: Award R1 for correct reasoning based on a comparison of their medians.

in general (on average), rabbits in Australia have longer ears (than rabbits in Spain

OR

in the top 50 % of each distribution, some rabbits in Spain have smaller ears than those in
Australia

OR

in the bottom 50 % of each distribution, some rabbits in Australia have longer ears than
those in Spain A1
[2 marks]

(b.ii) [2]

Markscheme

Note: In this part, their reasoning and answer must be consistent with their values in part
(a).

In both part (i) and part (ii), award R0A1 for a correct answer with no reasoning.

16 > 11 OR higher IQR in Spain OR lower IQR in Australia R1

Note: Award R1 for correct reasoning based on a comparison of their IQR’s.

(the IQR’s suggest that) there is more variation/spread in (the middle 50 % of ) rabbit ears
in Spain (than those in Australia) (or equivalent) A1

Note: Award A1 for any correct answer which uses the IQR’s to compare each distribution.

[2 marks]

20. [Maximum mark: 5] SPM.2.SL.TZ0.5


(a) [2]

Markscheme

a = 1.29 and b = −10.4 A1A1

[2 marks]

(b) [3]
Markscheme

recognising both lines pass through the mean point (M1)

p = 28.7, q = 30.3 A2

[3 marks]

21. [Maximum mark: 15] EXM.2.SL.TZ0.1


(a) [1]

Markscheme

Stratified sampling A1

[1 mark]

(b.i) [3]

Markscheme

There are 260 students in total A1

84

260
× 9 = 2.91 M1A1

So 3 students will be selected. AG

[3 marks]

(b.ii) [2]

Markscheme

grade 9 = 60

260
× 9 ≈ 2, grade 10 = 83

260
× 9 ≈ 3, grade 11 = 33

260
× 9 ≈ 1

A2

[2 marks]

(c) [1]
Markscheme

Systematic sampling A1

[1 mark]

(d.i) [2]

Markscheme

r = −0.901 A2

[2 marks]

(d.ii) [1]

Markscheme

The negative value of r indicates that more time spent on social media leads to lower self-
esteem, supporting the principal’s concerns. R1

[1 mark]

(d.iii) [1]

Markscheme

r being close to –1 indicates there is strong correlation, so a regression line is appropriate.


R1

[1 mark]

(e) [4]

Markscheme

Find the regression line of t on s. M1

t = −0.281s + 9.74 A1
t = (−0.2807 …) (29) + 9.739 … = 1.60 hours M1A1

[4 marks]

22. [Maximum mark: 8] EXN.2.SL.TZ0.4


(a.i) [2]

Markscheme

* This sample question was produced by experienced DP mathematics senior examiners to


aid teachers in preparing for external assessment in the new MAA course. There may be
minor differences in formatting compared to formal exam papers.

r = 0. 946 A2

[2 marks]

(a.ii) [1]

Markscheme

the value of r shows a (very) strong positive correlation between age and (systolic) blood
pressure A1

[1 mark]

(b) [2]

Markscheme

p = 1. 05t + 69. 3 A1A1

Note: Only award marks for an equation. Award A1 for a = 1. 05 and A1 for b = 69. 3

. Award A1A0 for y = 1. 05x + 69. 3.


[2 marks]

(c) [2]

Markscheme

122 (mmHg) (M1)A1

[2 marks]

(d) [1]

Markscheme

the regression equation should not be used because it involves extrapolation A1

[1 mark]

23. [Maximum mark: 9] EXM.1.SL.TZ0.2


(a) [2]

Markscheme

x 1 +x 2 x 4 +x 5 x 4 +x 5 −x 1 −x 2
LQ =
2
, UQ =
2
, I QR =
2
M1A1

[2 marks]

(b) [5]

Markscheme

U Q + 1.5I QR = 1.25x 4 + 1.25x 5 − 0.75x 1 − 0.75x 2 ⩾ x 5 M1A1


Since 1.25x 4 + 0.25x 5 ⩾ 0.75x 1 + 0.75x 2 due to the ascending order. R1

Similarly LQ − 1.5I QR = 1.25x 1 + 1.25x 2 − 0.75x 4 − 0.75x 5 ⩽ x 1

M1A1

Since 0.25x 1 + 1.25x 2 ⩽ 0.75x 3 + 0.75x 4 due to the ascending order.

So there are no outliers for a data set of 5 numbers. AG

[5 marks]

(c) [2]

Markscheme

For example 1, 2, 3, 4, 5, 6, 100 where I QR = 4 A1A1

[2 marks]

24. [Maximum mark: 4] EXN.2.SL.TZ0.1


[4]

Markscheme

* This sample question was produced by experienced DP mathematics senior examiners to


aid teachers in preparing for external assessment in the new MAA course. There may be
minor differences in formatting compared to formal exam papers.

16

Σ xi
i=1

16
= 14. 5 (M1)

Σ xi

Note: Award M1 for use of x =


¯ . i=1

16

⇒ Σ x i = 232 (A1)
i=1
new x =
¯ (A1) 232−9

15

223
¯
= 14. 9 (= 14. 86, = ) A1 15

Note: Do not accept 15.

[4 marks]

© International Baccalaureate Organization, 2024

You might also like