0% found this document useful (0 votes)
1K views

Answer Key - Worksheet 3

1. The document provides definitions and examples of measures of central tendency (mean, median, mode) and dispersion (range, variance, standard deviation). 2. It includes sample problems calculating and comparing these measures for different data sets, and commenting on things like skewness. 3. Key points covered are how to calculate the mean, median, mode, range, variance and standard deviation for both population and sample data. Comparisons are made between the measures and what they indicate about the shape of the distribution.

Uploaded by

ali
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
1K views

Answer Key - Worksheet 3

1. The document provides definitions and examples of measures of central tendency (mean, median, mode) and dispersion (range, variance, standard deviation). 2. It includes sample problems calculating and comparing these measures for different data sets, and commenting on things like skewness. 3. Key points covered are how to calculate the mean, median, mode, range, variance and standard deviation for both population and sample data. Comparisons are made between the measures and what they indicate about the shape of the distribution.

Uploaded by

ali
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 11

School of Business Department of Economics

BSTA205
Answer Key of Work Sheet 3

1. Measures of Central Tendency (Location):

 A parameter is a specific characteristic of a population.


 A statistic is a specific characteristic of a sample.

a) Arithmetic Mean:

- Population Mean: the population mean is the sum of all the population values divided
by the total number of population values.
ΣX
μ=
N

- Sample mean: the sample mean is the sum of all the sample values divided by the
number of sample values.
ΣX
X́ =
n

- Properties of the arithmetic mean:


 Every set of interval-level and ratio-level data has a mean.
 All the values are included in computing the mean.
 A set of data has a unique mean. The mean is affected by unusually large or small data
values.
 The arithmetic mean is the only measure of central tendency where the sum of the
deviations of each value from the mean is zero is zero.
Σ ¿)=0

b) Median:
The Median is the midpoint of the values after they have been ordered from the smallest
to the largest.

 If the number of observations n is odd :

Me= X n+1
2

 If the number of observations n is even :

Me= 0.5 (X n + X n +1)


2 2

1
School of Business Department of Economics

- Properties of the median:


 There is a unique median for each data set.
 It is not affected by extremely large or small values and is therefore a valuable measure
of central tendency when such values occur.

c) Mode:
The mode is the value of the observation that appears most frequently.

d) The relative positions of the mean, median and mode:


 Mean=Median=Mode => Symmetric (Zero skewness).
 Mean>Median>Mode (positive skewness).
 Mean<Median<Mode (negative skewness).

2. Measures of Dispersion:

 Why study dispersion?


- A measure of location, such as the mean or the median, only describes the center of the
data but it does not tell us anything about the spread of the data.
- A second reason for studying the dispersion in a set of data is to compare the spread in
two or more distributions.

a) Range = Maximum value – Minimum value.

b) Variance:
(X −μ)2
Population variance: σ 2=
N

2
School of Business Department of Economics

( X− X́)2
Sample variance: S2=
n−1

c) Standard deviation:
2
Population standard deviation: σ = ( X−μ)
N√
2
Sample standard deviation: s= ( X− X́)
n−1√

3
School of Business Department of Economics

Problems

Problem 1
The following values represent the hours per day that a sample of 11 bank employees spend
working on the computers.

4 5 7 4 5 6 3 8 7 5 9
1. Calculate the mean.
Mean: X́ = ∑ X = 4+5+7+4+5+6+3+8+7+5+9 / 11 = 5.72
n
2. What do we call the value computed in part 1? Statistic or parameter?
if the values are sample, statistic mean but if the values are population , parameter mean .
Statistic. It is a sample mean.
3. Calculate the median and mode.
Median: order :3,4,4,5,5,5,6,7,7,8,9 Odd  Median= 5
Mode: 5 (occurs most frequently : 3 times)

Problem 2
The following values represent a population:

3.0 3.4 2.1 3.1 3.7 3.5 3.7 3.1 3.2 3.7

a. Find the arithmetic mean, median and mode.mean= 3+3.4+ 2.1+


3.1+3.7+3.5+3.7+3.1+3.2+3.7/10=3.25
Mean=3.25 Mean= 3+3-4+2-1+……. 3.7/ 10 = 3.25
Median= 3.2+3.4/2= 3.3 2.1, 3.0, 3.1, 3.1, 3.2, 3.4, 3.5 , 3.7, 3.7, 3.7
Mode=3.7

b. Comment on the skewness of the data.


mean< median < mode (data is negatively skewed)

Problem 3
A random sample of students in BSTA205 were selected and their grades (in points) on Exam 1
were reported in the following table.

95 87 65 76 64 59 89 91 76 79
For the distribution above: 59, 64, 65, 76, 76, 79, 87, 89, 91, 95
a- Calculate the mean, median and the mode. Mean= 95+87+etc…./10= 78.1
Median=middle observation=( 76 +79)/2 =77.5 points
59 64 65 76 76 79 87 89 91 95
Mean= sum(X)/n = 78.1 points= 95+87+etc / 10

4
School of Business Department of Economics

Mode= most repeated= 76


b- Compare the mean, median and mode then comment on skewness of the distribution.
Mean>median>mode (therefore distribution skewed to the right or positively skewed)
c- Calculate the Range. Range = max – min = 95 – 59 = 36
Range = 95-59 =36 points
d- Can we consider the range as the best measure of dispersion? Why?
No, because the Range does not take into account all the values of the distribution and it is
extremely affected by outliers.
e- Calculate the variance and standard deviation. (Show your calculations)
X X- X́ (X- X́ )²
95-78.1=
95 16,9. 16.9 =285.61
16.9
87-78.1=
87 = 8.9 .8.9 = 79.21
8.9
=64-
= (-14.1).(-14.1)=
64 78.1=
198.81
-14.1
76 -2.1 4.41
65 -13.1 171.61
59 -19.1 364.81
89 10.9 118.81
91 12.9 166.41
76 -2.1 4.41
79 0.9 0.81
Total   1394.9

Variance = 1394.9/9 =154.98


Standard deviation= √ variance =√ 154.98=12.4491

Problem 4

Given a sample data set showing the student ages in an economics class:
Section A: 18, 18, 21, 17, 19, 20, 22, 19, 18, 30

a- Calculate the mean.


18+18+21+17+19+20+ 22+19+ 18+30
Mean = = 20.2
10
b- What is the median?
19+19
Median: 17, 18, 18, 18, 19, 19, 20, 21, 22, 30  median is = 19
2
c- Calculate the mode and the range.
Mode =18.
Range = max – min = 30 -17 = 13

5
School of Business Department of Economics

d- Comment on the skewness.


Mean >Median>Mode => positively skewed.
e- Calculate the variance and standard deviation.
Section A: variance = 14.1 and standard deviation = 3.75.

X x-xbar x-xbar square


18 18-20.2= -2.2 4. 84
18 -2.2 4.84
21 21-20.2= 0.8 0.64
17 17-20.2= -3.2 10.24
19 19-20.2= -1.2 1.44
20 20-20.2= -0.2 0.04
21 21-20.2= 0.8 0.64
22 22-20.2= 1.8 3.24
30 30-20.2= 9.8 96.04
18 -2.2 4.84
Total 126.8

Variance = 126.8 / n-1= 126.8 /9 = 14. 1


Standard deviation = √ variance=3.75

Problem 5
Consider the two following samples:
Data Set 1 represents the number of professional in 6 accounting firms in Beirut.
Data Set 2 represents the number of professionals in 5 accounting firms in Tripoli.
Data Set 1 Data Set 2
7 6
15 8
12 9
5 13
13 9
18
Compute then compare the following measures for each data set: Mean, Median, Mode, Range,
Variance, Standard Deviation.
Data set 1: data set 2
Mean = 11.67 9 mean = 9
Median = 12+13/2= 12.5 9 median = 9
Mode= no mode 9 mode= 0
Range= 18-5= 13 7 range = 7
Variance = 23.71 6.5
Standard deviation = 4.88 2.54

6
School of Business Department of Economics

Mean = 7+15+12+5+13+18/6= 11.67


Median : 5,7,12,13,15,18
Median= 12+13/2= 12.5
Mode = no mode
Range =max-min = 18- 5 = 13
X x-xbar x-xbar square
7 7-11.67= -4.67 21.80
5 5-11.67=-6.67 44.48
12 12-11.67= 0.33 0.108
13 13-11.67= 1.33 1.76
15 15-11.67= 3.33 11.08
18 18-11.67= 6.33 40.06
Total 118.568
Variance = 118.568 / 5 = 23.71
Standard deviation = 4.87

(table)
Data Set 1 Comparariso Data Set 2
n
Mean 11.67 > 9

Arranged Data 5, 7, 12, 13, 15,18 6, 8, 9, 9, 13


Median 12.5 > 9
Mode No mode 9
Range 13 > 7
Variance 23.8668 > 6.5
Stand. Deviat. 4.885 > 2.5
Data Set 1 has basically higher measures in almost all measures computed. Data Set 1
seems to be more dispersed than Data Set 2 as seen with a greater standard deviation.

7
School of Business Department of Economics

Problem 6

The average of hourly wages of 6 employees is 77.5. One value is missing.


The wages are as follows:
66, 90, 85, 70, 74, and X x=?
. Find the value of X. 77.5 = 66+90+85+70+74+x / 6
77.5 times 6 = 385+x
465=385+x
X= 465-385 = 80

66+90+ 85+70+74+ X
Mean = = 77.5  385 + X = 465  X = 465- 385= 80
6

Problem 7

In BSTA 205 class, the grades of a sample of 9 students are:


40, 82, 91, 100, 92, 70, 62, 70 and 23
a- What is the average grade? (Mean)
40+82+ 91+100+ 92+70+62+70+23
Average grade = = 70
9
b- Calculate the mode, median and range.
Mode is the most frequent observation and it is 70
Median is the middle observation after arranging them23,40,62,70,70,91,92,100
Median is 70
Range = Highest observation – Lowest observation = 100 – 23 = 77
c- Calculate the variance and standard deviation.
∑ ( Xi− X́ )2
Variance = = (900 + 144 + 441 + 900 + 484 + 0 + 64 + 0 + 2209)/8 =
n−1
642.75
Standard deviation = √ variance = √ 639 = 25.35
d- Can we consider the standard deviation as the best measure of dispersion? Why?
Yes, because the SD takes into account all the values of the distribution and it is not affected by
outliers.
e- Compare the mean, median and mode of the grades and comment on the shape of their
distribution.
Mean = Mode = Median  grades are normally distributed ( symmetric )

Problem 8

8
School of Business Department of Economics

In a course entitled “experimental psychology” students train white rats to do various tasks based
on performance/reward theory. One event is the hurdles. Times in seconds for seven rats to run
the hurdles were:
5.2, 3.3, 3.3, 3.1, 2.8, 2.3, 1.6
Is the data discrete or continuous?
Data is continuous because the time is measured

Problem 9
If the average of 7 numbers is 9 and the average of 3 of these numbers is 5, what is the average
of the other 4 numbers?
a+b+ c+ d+ e+ f + g
Assume that the numbers are a, b, c, d, e, f, and g  =9
7
 a + b + c + d + e + f + g = 7*9 = 63
a+b+ c
And = 5  a + b + c = 3*5 = 15
3
d +e+ f + g 48
Thus d + e + f + g = 63 – 15 = 48  = = 12
4 4
Problem 10
The average arithmetic mean of a set of 6 numbers is 28, if a certain number Y is removed from
the set; the average of the remaining numbers in the set is 24. What is the value of Y?
Assume that the numbers are as follows a, b, c, d, e and y
a+b+ c+ d+ e+ y
Mean is = 28  a+ b+c +d +e + y = 168
6
168− y
If y is removed then the mean of the remaining 5 numbers is 24  = 24 thus Y = 48
5

Problem 11

The time (in seconds) that it took for each of a sample of 4 vehicles to exit a parking lot in
downtown Cincinnati is:
106 51 72 139

1) Calculate the mean, median, and mode. Mean = 106+51+72+139 /4 = 92

X=
∑ X =368 =92
n 4
Median : 51 72 106 139 median = 72+106/2=
89

The median is the middle number when the data are ordered.
51 72 106 139

9
School of Business Department of Economics

72+106 178
= =89
Median = 2 2
No mode.

2) Comment on the skewness.


Since median < mean the distribution is positively skewed (skewed to the right).
89 < 92
3) Calculate the variance and standard deviation for the sample data (sample 1) using the
below table.

X x bar x-x bar square


51 51-92= -41 1681
72 72-92= -20 400
106 106-92= 14 196
139 139-92=47 2209
Total 4,486

Variance = 4,486/ 3 = 1495 .3

X X −X ( X −X )2
51 -41= 51-92 1681
72 -20 400
106 14 196
139 47 2209
Total 0 4486

Variance of sample 1
2 ∑ ( X− X )2 4486 4486
S= = = =1495. 33
n−1 4−1 3

Standard deviation of sample 1


S= √1495 .33=38 .67
4) Suppose the time to exit a parking lot in St. Louis ( 2) has a standard deviation 45.
Compare the dispersion in both parking lot.
Since the standard deviation of sample 1 is less than the standard deviation in sample 2
because 38.67 < 45, the dispersion in sample 1 is less than the dispersion in sample 2.

10
School of Business Department of Economics

11

You might also like