0% found this document useful (0 votes)
13 views

Exercise Sheet 1

The document contains exercises about introductory statistics concepts including classifying variables, describing sampling schemes, interpreting histograms and scatter plots, calculating measures of center and spread, and stem and leaf plots. The exercises cover a range of foundational statistical topics.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views

Exercise Sheet 1

The document contains exercises about introductory statistics concepts including classifying variables, describing sampling schemes, interpreting histograms and scatter plots, calculating measures of center and spread, and stem and leaf plots. The exercises cover a range of foundational statistical topics.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

MAT1114 Introductory Statistics

Exercise Sheet 1

1. Classify the following variables as either:


a. Quantitative or Qualitative;
b. Nominal, Ordinal, Interval, or Ratio;
c. Discrete or Continuous (Not applicable if Qualitative).

i. Marital status of teachers at a school.


ii. Time it takes to run 100 m.
iii. Colours of pencils in a pencil case.
iv. Capacity of a passenger train.
v. Academic employment level (i.e. Associate Lecturer, Lecturer, Senior Lecturer,
Associate Professor, Professor).
vi. Volume of liquid in a soft-drink can.

2. What sampling schemes are used in the following situations:

a. A sample of Airline passengers is obtained by randomly selecting 10 flights leaving


Perth Domestic airport and obtaining details from all passengers who board those
flights.
b. A sample of West Australian families is selected from the white pages, with each
phone number having an equal probability of being selected.
c. ECU students are divided into groups according to the school in which they are
enrolled. A random sample of 10 students is chosen from each group, and asked to
rate the quality of computing facilities provided by the university.
d. Every 500th pair of jeans manufactured by a company is tested for quality.

3. A random sample of families from the Perth Metro area is to be selected. The home
number of each randomly chosen family is phoned between the hours of 9:00am and
5:30pm on Wednesday. If no one answers the phone, then there is no attempt to contact the
family at a later time, and so they are not included in the sample. For each of the following
characteristics, indicate whether you think the sample results are likely to overestimate,
underestimate, or give correct values.

a. The proportion of families with a retired person.


b. The proportion of families with at least one child below the age of 1 year.
c. The proportion of families in which both partners work.

1|P age
4. Consider the following histogram displaying body mass index (BMI) data for a sample of
40 women.

Using the histogram, determine:

a. The number of women in the sample with a BMI of between 25 and 30.
b. The BMI interval with the most women.
c. The number of women who are classified as morbidly obese (i.e. BMI above 35).
d. The middle of the BMI distribution (approximate).
e. The spread (range) of the BMI distribution (approximate).
f. The shape of the distribution (i.e. symmetric of positively/negatively skewed).

5. Consider the following histogram displaying height (cm) data for a sample of 40 women.

2|P age
Using the histogram, determine:

a. The number of women in the sample with a height of below 155 cm.
b. The height interval with the second largest number of women.
c. The number of women who are between 160 cm and 170 cm tall.
d. The middle of the height distribution (approximate).
e. The spread (range) of the height distribution (approximate).
f. The shape of the distribution (i.e. symmetric of positively/negatively skewed).

6. This scatter plot below shows the relationship between the BMI and waist circumference of
a sample of 40 women.

Using the scatter plot:


a. Specify whether the relationship between BMI and waist circumference is
direct/positive or indirect/negative.
b. Approximate the BMI of the woman in the sample who has the smallest waist
circumference.
c. Approximate the waist circumference of the woman in the sample with the largest
BMI.
d. By first sketching a trend line, estimate the BMI of a woman (not necessarily from
the sample) with a waist of 100 cm.

3|P age
7. Draw a stem and leaf plot for each of the following sets.

a. The masses (in kg to the nearest kg) of a group of 20 people were as follows:

45 58 77 60 60
56 60 55 46 56
62 58 48 52 64
60 54 55 52 60

b. The times (in minutes to the nearest tenth) taken by 40 children to complete a
puzzle were as follows:

2.1 2.5 2.9 3.4 2.4 3.6 3.1 3.0


3.0 2.7 3.2 3.9 3.9 2.5 2.9 3.4
3.3 3.2 2.9 2.5 3.0 2.0 2.8 3.2
2.7 2.7 3.1 3.6 3.0 3.8 3.1 2.7
3.6 3.4 2.9 2.4 3.0 2.3 2.9 3.4

8. Find the mean, median and mode of each of the following data samples:

a. 7 6 8 4 5 3

b. 5 6 4 4 6 7 8

c. 10 12 9 8 10 11 1 7

9. Determine the mean of x for each of the following cases:

a. 𝑛 = 10, ∑ 𝑥 = 84;

b. 𝑛 = 15, ∑ 𝑥 = 63;

c. 𝑛 = 25, ∑ 𝑥 = 84;

d. 𝑛 = 14, ∑ 𝑥 = 126.

4|P age
10. For each of the following stem and leaf plots:

i. Determine the sample size n;


ii. Determine the third largest score;
iii. Determine the mode;
iv. Determine the median.

(a) 6 ¦ 5 8 (b) 1 ¦ 4 5 9
7 ¦ 4 6 7 2 ¦ 1 2 2 7
8 ¦ 0 3 8 3 ¦ 0 2 4 4 8
9 ¦ 2 4 ¦ 1 2

(c) 2 ¦ 2 4 (d) 6 ¦ 1 2 2 4
¦ 6 7 7 9 ¦ 5 6 6 6 7 9 9
3 ¦ 1 2 2 3 4 7 ¦ 1 2 3 3 3 3 4
¦ 5 6 6 7 9 9 ¦ 6 7 8 8 8 8
4 ¦ 0 1 1 2 3 8 ¦ 0 1 1 2 4
¦ 6 ¦ 6 7 8

5|P age
11. Given that the mean weight of men who were sampled is higher than the mean weight
of women who were sampled, which of the histograms below is displaying the men’s
weights? Why?

6|P age

You might also like