0% found this document useful (0 votes)
82 views24 pages

Data Presentation and Interpretation

This document contains sample exam questions related to statistics. It includes questions about sampling techniques, interpreting scatter plots and regression lines, identifying outliers, calculating measures of center and spread from data sets, and using box plots to compare data across groups. The questions refer to a large data set on weather patterns and ask students to analyze subsets of the data and draw conclusions.

Uploaded by

kienkien
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
82 views24 pages

Data Presentation and Interpretation

This document contains sample exam questions related to statistics. It includes questions about sampling techniques, interpreting scatter plots and regression lines, identifying outliers, calculating measures of center and spread from data sets, and using box plots to compare data across groups. The questions refer to a large data set on weather patterns and ask students to analyze subsets of the data and draw conclusions.

Uploaded by

kienkien
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 24

Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.

com

Questions

Q1.

Sara is investigating the variation in daily maximum gust, t kn, for Camborne in June and
July 1987.

She used the large data set to select a sample of size 20 from the June and July data for
1987. Sara selected the first value using a random number from 1 to 4 and then selected
every third value after that.

(a) State the sampling technique Sara used.


(1)
(b) From your knowledge of the large data set, explain why this process may not generate
a sample of size 20.
(1)
The data Sara collected are summarised as follows

(c) Calculate the standard deviation.


(2)

(Total for question = 4 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q2.

The partially completed histogram and the partially completed table show the time, to the
nearest minute, that a random sample of motorists were delayed by roadworks on a stretch
of motorway.

Estimate the percentage of these motorists who were delayed by the roadworks for between
8.5 and 13.5 minutes.
(5)

(Total for question = 5 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q3.
Sara was studying the relationship between rainfall, r mm, and humidity, h %, in the UK. She takes a
random sample of 11 days from May 1987 for Leuchars from the large data set.

She obtained the following results.

Sara examined the rainfall figures and found

A value that is more than 1.5 times the interquartile range (IQR) above Q3 is called an outlier.

(a) Show that r = 20.6 is an outlier.


(1)
(b) Give a reason why Sara might
(i) include
(ii) exclude
this day's reading.
(2)
Sara decided to exclude this day's reading and drew the following scatter diagram for the
remaining 10 days' values of r and h.

(c) Give an interpretation of the correlation between rainfall and humidity.


(1)
The equation of the regression line of r on h for these 10 days is r = –12.8 + 0.15h

(d) Give an interpretation of the gradient of this regression line.


(1)
(e) (i) Comment on the suitability of Sara's sampling method for this study.
(ii) Suggest how Sara could make better use of the large data set for her study.
(2)
(Total for question = 7 marks)
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q4.

Helen is studying the daily mean wind speed for Camborne using the large data set from
1987.
The data for one month are summarised in Table 1 below.

Table 1

(a) Calculate the mean for these data.


(1)
(b) Calculate the standard deviation for these data and state the units.
(2)
The means and standard deviations of the daily mean wind speed for the other months from
the large data set for Camborne in 1987 are given in Table 2 below. The data are not in
month order.

Table 2

(c) Using your knowledge of the large data set, suggest, giving a reason, which month had a
mean of 11.57
(2)
The data for these months are summarised in the box plots on the opposite page.
They are not in month order or the same order as in Table 2.

(d) (i) State the meaning of the * symbol on some of the box plots.
(ii) Suggest, giving your reasons, which of the months in Table 2 is most likely to be
summarised in the box plot marked Y.
(3)
.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

.............................................................................................................................................

(Total for question = 8 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q5.

Joshua is investigating the daily total rainfall in Hurn for May to October 2015

Using the information from the large data set, Joshua wishes to calculate the mean of the
daily total rainfall in Hurn for May to October 2015

(a) Using your knowledge of the large data set, explain why Joshua needs to clean the data
before calculating the mean.
(1)
Using the information from the large data set, he produces the grouped frequency table
below.

(b) Use linear interpolation to calculate an estimate for the upper quartile of the daily total
rainfall.
(2)
(c) Calculate an estimate for the standard deviation of the daily total rainfall in Hurn for May
to October 2015
(2)
(d) (i) State the assumption involved with using class midpoints to calculate an estimate of
a mean from a grouped frequency table.
(ii) Using your knowledge of the large data set, explain why this assumption does not
hold in this case.
(iii) State, giving a reason, whether you would expect the actual mean daily total
rainfall in Hurn for May to October 2015 to be larger than, smaller than or the
same as an estimate based on the grouped frequency table.
(3)

(Total for question = 8 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q6.

The histogram in Figure 1 shows the times taken to complete a crossword by a random
sample of students.

The number of students who completed the crossword in more than 15 minutes is 78.

Estimate the percentage of students who took less than 11 minutes to complete the
crossword.

(Total for question = 4 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q7.

Jerry is studying visibility for Camborne using the large data set June 1987.

The table below contains two extracts from the large data set.

It shows the daily maximum relative humidity and the daily mean visibility.

Given that daily mean visibility is given to the nearest 100,

(a) write down the range of distances in metres that corresponds to the recorded value 0 for
the daily mean visibility.
(1)
Jerry drew the following scatter diagram, Figure 2, and calculated some statistics using the
June 1987 data for Camborne from the large data set.

Jerry defines an outlier as a value that is more than 1.5 times the interquartile range above
Q3 or more than 1.5 times the interquartile range below Q1.

(b) Show that the point circled on the scatter diagram is an outlier for visibility.
(2)
(c) Interpret the correlation between the daily mean visibility and the daily maximum relative
humidity.
(1)
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Jerry drew the following scatter diagram, Figure 3, using the June 1987 data for Camborne
from the large data set, but forgot to label the x–axis.

(d) Using your knowledge of the large data set, suggest which variable the x-axis on this
scatter diagram represents.
(1)

(Total for question = 5 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q8.

The partially completed table and partially completed histogram give information about the
ages of passengers on an airline.

There were no passengers aged 90 or over.

(a) Complete the histogram.


(3)
(b) Use linear interpolation to estimate the median age.
(4)
An outlier is defined as a value greater than Q3 + 1.5 × interquartile range.

Given that Q1 = 27.3 and Q3 = 58.9

(c) determine, giving a reason, whether or not the oldest passenger could be considered as
an outlier.
(2)

(Total for question = 9 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q9.

Each member of a group of 27 people was timed when completing a puzzle.

The time taken, x minutes, for each member of the group was recorded.

These times are summarised in the following box and whisker plot.

(a) Find the range of the times.


(1)
(b) Find the interquartile range of the times.
(1)

For these 27 people

(c) calculate the mean time taken to complete the puzzle,


(1)
(d) calculate the standard deviation of the times taken to complete the puzzle.
(2)
Taruni defines an outlier as a value more than 3 standard deviations above the mean.

(e) State how many outliers Taruni would say there are in these data, giving a reason for
your answer.
(1)
Adam and Beth also completed the puzzle in a minutes and b minutes respectively, where a
> b.

When their times are included with the data of the other 27 people
• the median time increases
• the mean time does not change

(f) Suggest a possible value for a and a possible value for b, explaining how your values
satisfy the above conditions.
(3)
(g) Without carrying out any further calculations, explain why the standard deviation of all 29
times will be lower
than your answer to part (d).
(1)

(Total for question = 10 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q10.

Stav is studying the large data set for September 2015

He codes the variable Daily Mean Pressure, x, using the formula y = x − 1010

The data for all 30 days from Hurn are summarised by

(a) State the units of the variable x


(1)
(b) Find the mean Daily Mean Pressure for these 30 days.
(2)
(c) Find the standard deviation of Daily Mean Pressure for these 30 days.
(3)
Stav knows that, in the UK, winds circulate

• in a clockwise direction around a region of high pressure


• in an anticlockwise direction around a region of low pressure

The table gives the Daily Mean Pressure for 3 locations from the large data set on
26/09/2015

The Cardinal Wind Directions for these 3 locations on 26/09/2015 were, in random order,

W NE E

You may assume that these 3 locations were under a single region of pressure.

(d) Using your knowledge of the large data set, place each of these Cardinal Wind
Directions in the correct location in the table.
Give a reason for your answer.
(2)

(Total for question = 8 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q11.
Charlie is studying the time it takes members of his company to travel to the office.
He stands by the door to the office from 08 40 to 08 50 one morning and asks workers, as
they arrive, how long their journey was.

(a) State the sampling method Charlie used.


(1)
(b) State and briefly describe an alternative method of non-random sampling Charlie could
have used to obtain a sample of 40 workers.
(2)
Taruni decided to ask every member of the company the time, x minutes, it takes them to
travel to the office.

(c) State the data selection process Taruni used.


(1)
Taruni's results are summarised by the box plot and summary statistics below.

(d) Write down the interquartile range for these data.


(1)
(e) Calculate the mean and the standard deviation for these data.
(3)
(f) State, giving a reason, whether you would recommend using the mean and standard
deviation or the median and interquartile range to describe these data.
(2)
Rana and David both work for the company and have both moved house since Taruni
collected her data.

Rana's journey to work has changed from 75 minutes to 35 minutes and David's journey to
work has changed from 60 minutes to 33 minutes.

Taruni drew her box plot again and only had to change two values.

(g) Explain which two values Taruni must have changed and whether each of these values
has increased or decreased.
(3)

(Total for question = 13 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q12.

A lake contains three different types of carp.

There are an estimated 450 mirror carp, 300 leather carp and 850 common carp.

Tim wishes to investigate the health of the fish in the lake.

He decides to take a sample of 160 fish.

(a) Give a reason why stratified random sampling cannot be used.


(1)
(b) Explain how a sample of size 160 could be taken to ensure that the estimated
populations of each
type of carp are fairly represented.
You should state the name of the sampling method used.
(2)
As part of the health check, Tim weighed the fish.

His results are given in the table below.

(c) Calculate an estimate for the standard deviation of the weight of the carp.
(2)
Tim realised that he had transposed the figures for 2 of the weights of the fish.

He had recorded in the table 2.3 instead of 3.2 and 4.6 instead of 6.4.

(d) Without calculating a new estimate for the standard deviation, state what effect
(i) using the correct figure of 3.2 instead of 2.3
(ii) using the correct figure of 6.4 instead of 4.6
would have on your estimated standard deviation.
Give a reason for each of your answers.
(2)

(Total for question = 7 marks)


Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Mark Scheme
Q1.

Q2.
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q3.
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q4.
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q5.
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q6.

Q7.
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q8.
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q9.
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q10.
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q11.
Data Presentation and Interpretation - Year 1 Statistics PhysicsAndMathsTutor.com

Q12.

You might also like