0% found this document useful (0 votes)
2K views19 pages

HW Lesson 5 - MAA HL - Statistics 1 DP1

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2K views19 pages

HW Lesson 5 - MAA HL - Statistics 1 DP1

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 19

MAA HL - Statistics 1 DP1 [60 marks]

1. [Maximum mark: 15] SPM.1.SL.TZ0.7


A large company surveyed 160 of its employees to find out how much time they
spend traveling to work on a given day. The results of the survey are shown in
the following cumulative frequency diagram.

(a) Find the median number of minutes spent traveling to work. [2]

Markscheme

evidence of median position (M1)


80th employee

40 minutes A1

[2 marks]

(b) Find the number of employees whose travelling time is within


15 minutes of the median. [3]

Markscheme

valid attempt to find interval (25–55) (M1)

18 (employees), 142 (employees) A1

124 A1

[3 marks]

Only 10% of the employees spent more than k minutes traveling to work.

(c) Find the value of k. [3]

Markscheme

recognising that there are 16 employees in the top 10% (M1)

144 employees travelled more than k minutes (A1)

k= 56 A1

[3 marks]

The results of the survey can also be displayed on the following box-and-
whisker diagram.
(d) Write down the value of b. [1]

Markscheme

b= 70 A1

[1 mark]

(e.i) Find the value of a. [2]

Markscheme

recognizing a is first quartile value (M1)

40 employees

a= 33 A1

[2 marks]

(e.ii) Hence, find the interquartile range. [2]

Markscheme

47 − 33 (M1)

IQR = 14 A1

[2 marks]

(f ) Travelling times of less than p minutes are considered outliers.

Find the value of p.


[2]

Markscheme

attempt to find 1.5 × their IQR (M1)

33 − 21

12 (A1)

[2 marks]
2. [Maximum mark: 4] EXN.2.SL.TZ0.1
A data set consisting of 16 test scores has mean 14. 5 . One test score
of 9 requires a second marking and is removed from the data set.

Find the mean of the remaining 15 test scores. [4]

Markscheme

* This sample question was produced by experienced DP mathematics


senior examiners to aid teachers in preparing for external assessment in the
new MAA course. There may be minor differences in formatting compared
to formal exam papers.

16

Σ xi
i=1

16
= 14. 5 (M1)

Σ xi

Note: Award M1 for use of x =


¯ . i=1

16

⇒ Σ x i = 232 (A1)
i=1

232−9
new x =
¯ (A1) 15

223
¯
= 14. 9 (= 14. 86, = ) A1 15

Note: Do not accept 15.

[4 marks]
3. [Maximum mark: 9] EXM.1.SL.TZ0.2
A set of data comprises of five numbers x 1 , x2 , x3 , x4 , x 5 which have been

placed in ascending order.

(a) Recalling definitions, such as the Lower Quartile is the n+1


th
4

piece of data with the data placed in order, find an expression


for the Interquartile Range. [2]

Markscheme

x 1 +x 2 x 4 +x 5 x 4 +x 5 −x 1 −x 2
LQ =
2
, UQ =
2
, I QR =
2
M1A1

[2 marks]

(b) Hence, show that a data set with only 5 numbers in it cannot
have any outliers. [5]

Markscheme

U Q + 1.5I QR = 1.25x 4 + 1.25x 5 − 0.75x 1 − 0.75x 2 ⩾ x 5

M1A1

Since 1.25x 4 + 0.25x 5 ⩾ 0.75x 1 + 0.75x 2 due to the ascending


order. R1

Similarly
LQ − 1.5I QR = 1.25x 1 + 1.25x 2 − 0.75x 4 − 0.75x 5 ⩽ x 1

M1A1

Since 0.25x 1 + 1.25x 2 ⩽ 0.75x 3 + 0.75x 4 due to the ascending


order.

So there are no outliers for a data set of 5 numbers. AG

[5 marks]
(c) Give an example of a set of data with 7 numbers in it that does
have an outlier, justify this fact by stating the Interquartile
Range. [2]

Markscheme

For example 1, 2, 3, 4, 5, 6, 100 where I QR = 4 A1A1

[2 marks]
4. [Maximum mark: 15] EXM.2.SL.TZ0.1
The principal of a high school is concerned about the effect social media use
might be having on the self-esteem of her students. She decides to survey a
random sample of 9 students to gather some data. She wants the number of
students in each grade in the sample to be, as far as possible, in the same
proportion as the number of students in each grade in the school.

(a) State the name for this type of sampling technique. [1]

Markscheme

Stratified sampling A1

[1 mark]

The number of students in each grade in the school is shown in table.

(b.i) Show that 3 students will be selected from grade 12. [3]

Markscheme

There are 260 students in total A1

84

260
× 9 = 2.91 M1A1

So 3 students will be selected. AG

[3 marks]

(b.ii) Calculate the number of students in each grade in the sample. [2]
Markscheme

grade 9 = 60

260
× 9 ≈ 2, grade 10 = 83

260
× 9 ≈ 3, grade 11
=
33

260
× 9 ≈ 1 A2

[2 marks]

In order to select the 3 students from grade 12, the principal lists their names in
alphabetical order and selects the 28th, 56th and 84th student on the list.

(c) State the name for this type of sampling technique. [1]

Markscheme

Systematic sampling A1

[1 mark]

Once the principal has obtained the names of the 9 students in the random
sample, she surveys each student to find out how long they used social media
the previous day and measures their self-esteem using the Rosenberg scale. The
Rosenberg scale is a number between 10 and 40, where a high number
represents high self-esteem.

(d.i) Calculate Pearson’s product moment correlation coefficient, r. [2]

Markscheme

r = −0.901 A2
[2 marks]

(d.ii) Interpret the meaning of the value of r in the context of the


principal’s concerns. [1]

Markscheme

The negative value of r indicates that more time spent on social media
leads to lower self-esteem, supporting the principal’s concerns. R1

[1 mark]

(d.iii) Explain why the value of r makes it appropriate to find the


equation of a regression line. [1]

Markscheme

r being close to –1 indicates there is strong correlation, so a regression line


is appropriate. R1

[1 mark]

(e) Another student at the school, Jasmine, has a self-esteem value


of 29.

By finding the equation of an appropriate regression line,


estimate the time Jasmine spent on social media the previous
day. [4]

Markscheme

Find the regression line of t on s. M1

t = −0.281s + 9.74 A1

t = (−0.2807 …) (29) + 9.739 … = 1.60 hours M1A1


[4 marks]
5. [Maximum mark: 17] 23N.1.SL.TZ1.7
A ballet company performs The Sleeping Beauty every year. Last year they gave a total
of 60 performances at their theatre which has a maximum capacity of 800. The
number of tickets sold, n, at each performance is shown in the following
frequency table.

(a.i) Find the value of p. [1]

Markscheme

p = 9 A1

[1 marks]

(a.ii) Write down the modal class. [1]

Markscheme

600 < n ≤ 800 A1

Note: Award A0 if candidate answers 700.

[1 marks]

The following cumulative frequency diagram also displays these data.


(b) Use the cumulative frequency curve to estimate

(b.i) the median number of tickets sold. [1]

Markscheme

median = 600 A1

[1 marks]

(b.ii) the number of performances where at least 80 % of the tickets


were sold. [3]

Markscheme

80 % of 800 = 640 (A1)

40 (performances less than 80 % of tickets sold) (A1)


20 (performances) A1

[3 marks]

After a performance, the company decides to conduct a survey to obtain


feedback from the audience.

(c.i) State one disadvantage of the company surveying only the first
5 % of the audience as they leave the theatre. [1]

Markscheme

any reasonable answer which suggests a biased sample (must include


reason, do not accept reasons such as “sample size is too small”, or answers
that simply say “not representative of entire audience” without a valid
reason) A1

e.g. likely to come from the same part of the theatre OR be part of same
group OR be from priority seating OR it is convenience sampling

[1 marks]

(c.ii) Describe briefly how the company could collect feedback from
5 % of the audience using the systematic sampling method. [2]

Markscheme

every 20th person A1A1

Note: Award A1 for recognizing that sampling occurs at regular intervals


e.g. “every”.

Award A1 for interval length is 20.

[2 marks]
(c.iii) State the sampling method which should be used if the survey
is to be representative of the number of children and the
number of adults in the audience. [1]

Markscheme

quota (sampling method) A1

[1 marks]

Last year 36 000 tickets were sold to The Sleeping Beauty.

(d) The following box and whisker diagram displays the amount
spent by the audience at the souvenir shop when they
attended the performance.

(d.i) Estimate the number of people who spent between $3 and


$25. [2]

Markscheme

75 % (of 36000 spent between $3 and $25) (M1)

= 27000 A1

[2 marks]

(d.ii) Half the audience spent less than $a. Estimate the value of a. [1]
Markscheme

a = 7 A1

[1 marks]

This year the company will again give 60 performances and expects to sell 18
additional tickets for each performance.

(e.i) Calculate the mean number of tickets the company expects to


sell this year for each performance. [3]

Markscheme

METHOD 1

old mean is 600 (tickets) (A1)

recognising new mean is old mean +18 (M1)

600 + 18

= 618 (tickets) A1

METHOD 2

new total number of tickets = 36000 + 60 × 18(= 37080)

(A1)

new mean =
36000+60×18

60
(=
37080

60
) (M1)

= 618 (tickets) A1

[3 marks]

(e.ii) State what effect, if any, this increase in ticket sales would have
on the variance of the number of tickets sold for each
performance. [1]

Markscheme

no effect on the variance A1

[1 marks]

© International Baccalaureate Organization, 2024

You might also like