Sta 221_229 Tutorial Note
Sta 221_229 Tutorial Note
3
651
01
STA
34
80
201/209/221/229
(0
Y
TUTORIAL NOTE
U
G
TS
A
Agricultural Sciences)
A
TH
INTRODUCTION
)
3
I've worked hard to make this guide as accessible as possible.
6 51
You're embarking on a journey toward academic success, and I'm here
01
cheering you on every step of the way. Congratulations on taking this
first step—let's achieve greatness together!
34
That Stats Guy ( 2024 )
80
(0
Need a Personal Statistics Tutor? Text me on WhatsApp :
08034016513
Y
U
G
OR SCAN ME
TS
A
ST
T
A
TH
1
STA 201/209/221/229 TUTORIAL NOTE
COURSE OUTLINES
1. Scope of Statistical Methods
2. Measure of Location, Partition and Dispersion :
3. Measure of Symmetry and Peakedness : Skewness and Kurtosis
4. Elements of Probability
5. Probability Distributions : Bernoulli Distribution, Binomial
Distribution, Poisson Distribution.
)
3
6. Point and Interval Estimation: Confidence Intervals
51
7. Test of Hypothesis for Mean and Proportion of both Large and
Small Samples
6
01
34
80
(0
Y
U
G
ANY REASON
A
ST
T
A
TH
2
STA 201/209/221/229 TUTORIAL NOTE
WHAT’S STATISTICS?
Statistics can be described as a branch of Sciences which deals with the
Collection, Organization, Sorting, Presentation and Analysis of Data. You
can take statistics as a branch of Science that help describe and make
)
conclusions about life situations that can be evaluated numerically.
3
51
Statistical methods are tools that help us understand and make sense
6
of data. They involve collecting, organizing, analyzing, and interpreting
01
information so that we can draw conclusions and make decisions based
on that information. Imagine having a lot of numbers and trying to
34
figure out what they mean—that’s where statistical methods come in.
80
They help us summarize the data, see patterns, and understand the
bigger picture.
(0
Statistics has its application in all disciplines from Aeronautics,
Anatomy, Quality Control, Zoology e.t.c
Y
U
they’re important:
ST
3
STA 201/209/221/229 TUTORIAL NOTE
)
more leads to better grades. They show us how strong that
3
connection is and whether it's likely to be real or just by chance.
51
5. Making Decisions: Whether it’s in business, healthcare, or
6
government, decisions often rely on data. Statistical methods
01
provide the tools to analyze that data so that decisions are based
34
on solid evidence rather than guesswork.
6. Ensuring Accuracy: Statistical methods help ensure that the
80
conclusions we draw from data are accurate. They help us check
for errors, biases, or mistakes in the data so that our findings are
(0
trustworthy.
7. Being Fair and Ethical: Statistical methods guide us in using data
Y
privacy, being honest about the results, and not twisting the data
G
4
STA 201/209/221/229 TUTORIAL NOTE
MEASURES OF LOCATION
)
statistical tools used to identify the central value within a data set. These
3
measures help us understand where the "center" of the data lies, giving
51
us a summary point around which the data points are distributed. The
6
most common measures of location are the mean, median, and mode,
01
each offering a different perspective on the data
34
THE MEAN
80
The mean, often referred to as the average, is the sum of all the values in
(0
a data set divided by the number of values. It’s a commonly used
measure because it considers all data points, making it a good indicator
Y
The Mean can be evaluated for both Grouped and Ungrouped Data.
G
Ungrouped data is raw data that has been collected but not yet
TS
points and is often listed in the order in which it was collected. For
example, if you collect the ages of 10 people, you might just list them as
ST
21, 25, 19, 22, 20, 23, 24, 26, 21, 22.
T
A
Grouped data is data that has been organized into groups or classes.
Instead of listing every individual data point, the data is sorted into
TH
intervals, and the frequency (the number of times data points fall within
each interval) is recorded. This method is particularly useful when
dealing with large data sets. For example, ages might be grouped into
intervals like 18-20, 21-23, and 24-26, with the frequency of ages falling
within each range recorded.
5
STA 201/209/221/229 TUTORIAL NOTE
Σ𝑥𝑖
Mean = 𝑁
)
Example 1.11 : Calculate the mean for the set of data :
3
13,12,24,7,9,10,13,13,18,8
6 51
Solution:
01
Σ𝑥𝑖
Mean = 𝑁
34
13+12+24+7+9+10+13+13+18+8 127
= 10
= 10
= 12. 7
Σ𝑓𝑖𝑥𝑖
U
𝑀𝑒𝑎𝑛 = Σ𝑓
G
Lower Class Limit : The lower class limit is the smallest value that can
A
Upper Class Limit : The upper class limit is the largest value that can
A
Weight 0-9 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-99
6
STA 201/209/221/229 TUTORIAL NOTE
Solution
Class Interval f x fx
0-9 20 4.5 90
)
3
10-19 41 14.5 594.5
51
20-29 98 24.5 2401
6
01
30-39 149 34.5 5140.5
34
8544
80-89 44 84.5
Y
3718
U
Σ𝑓𝑖𝑥𝑖
A
𝑀𝑒𝑎𝑛 = Σ𝑓
ST
49200
= 1000
= 49. 2
T
A
THE MEDIAN
TH
The median is the middle value in a data set when the values are
arranged in ascending or descending order. If the number of
observations is odd, the median is the middle value. If even, it's the
average of the two middle values.
7
STA 201/209/221/229 TUTORIAL NOTE
𝑁+1 𝑡ℎ
𝑀𝑒𝑑𝑖𝑎𝑛 = ( 2
) 𝑖𝑡𝑒𝑚
)
13,12,7,9,10,13,13,18,8
3
Solution
51
Arranging the data in ascending order we have ; 7,8,9,10,12,13,13,13,18
6
Here, N = 9 (Odd Value)
01
𝑁+1 𝑡ℎ
𝑀𝑒𝑑𝑖𝑎𝑛 = ( 2
) 𝑖𝑡𝑒𝑚
34
9+1 𝑡ℎ
=( 2
) 𝑖𝑡𝑒𝑚
80
10 𝑡ℎ
=( 2
) 𝑖𝑡𝑒𝑚
(0
= 5𝑡ℎ 𝑖𝑡𝑒𝑚
From the data above we can see that the fifth item is 12. Therefore, our
Y
median is 12
U
13,12,7,9,10,13,13,18,8,24
TS
Solution
Arranging the data in ascending order we have ; 7,8,9,10,12,13,13,13,18,24
A
𝑁+1 𝑡ℎ
𝑀𝑒𝑑𝑖𝑎𝑛 = ( 2
) 𝑖𝑡𝑒𝑚
T
10+1 𝑡ℎ
=( ) 𝑖𝑡𝑒𝑚
A
2
TH
11 𝑡ℎ
=( 2
) 𝑖𝑡𝑒𝑚
= 5. 5𝑡ℎ 𝑖𝑡𝑒𝑚
From the data above we can see that the 5. 5𝑡ℎ 𝑖𝑡𝑒𝑚 is between 12 and
12+13 25
13. Therefore, our median is 2
= 2
= 12. 5
8
STA 201/209/221/229 TUTORIAL NOTE
𝑁
2
− 𝐶𝑓𝑏𝑚
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿𝑖 + ( 𝑓𝑚
)𝑐
where , 𝐿𝑖 = 𝐿𝑜𝑤𝑒𝑟 𝑐𝑙𝑎𝑠𝑠 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑒𝑑𝑖𝑎𝑛 𝐶𝑙𝑎𝑠𝑠
𝑁 = 𝑇𝑜𝑡𝑎𝑙 𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑑𝑎𝑡𝑎
)
𝐶𝑓𝑏𝑚 = 𝐶𝑢𝑚𝑢𝑙𝑎𝑡𝑖𝑣𝑒 𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑒𝑑𝑖𝑎𝑛 𝐶𝑙𝑎𝑠𝑠
3
51
𝑓𝑚 = 𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑒𝑑𝑖𝑎𝑛 𝐶𝑙𝑎𝑠𝑠
𝑐 = 𝐶𝑙𝑎𝑠𝑠 𝑤𝑖𝑑𝑡ℎ
6
01
Lower Class Boundary ( LCB ): The lower class boundary is the smallest
34
value that can still be considered part of a particular class or interval
when considering the data's continuous nature. It is typically found by
80
subtracting 0.5 from the lower class limit. This adjustment accounts for
the possibility of data points being right on the edge of the class
(0
intervals. For a class interval with a lower class limit of 10, the lower
class boundary would be 9.5
Y
U
Higher Class Boundary ( HCB ): The upper class boundary is the largest
G
ensures that any value just below the next class's lower limit is included
in the current class. For a class interval with an upper class limit of 19,
A
of data points that fall within or below a certain class interval. By adding
TH
the frequency of each class to the sum of the frequencies of all previous
classes, cumulative frequency helps in understanding how data
accumulates across intervals. If the frequency of the first class is 5, the
second is 7, and the third is 3, the cumulative frequency for the third
class would be 5+7+3=15
9
STA 201/209/221/229 TUTORIAL NOTE
Class Width : Class Width is the difference between the Lower Class
Boundary and Higher Class Boundary of a class. If the lower class
boundary is 9.5 and the upper class boundary is 19.5, the class width
would be 19.5−9.5=10
)
3
51
Weig 0-9 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-99
ht
6
01
Freq 20 41 98 149 192 197 152 97 44 10
34
Solution:
Class
Interval
Frequency
80
LCB - HCB CF
(0
0-9 20 -0.5 - 9.5 20
Y
Σ𝑓 = 1000
10
STA 201/209/221/229 TUTORIAL NOTE
𝑁 𝑡ℎ
The median class is the class where the ( 2
) 𝑖𝑡𝑒𝑚 is. Here, N = 1000
1000 𝑡ℎ
Median Class = ( 2
) 𝑖𝑡𝑒𝑚 = 500𝑡ℎ 𝑖𝑡𝑒𝑚
Checking the Cumulative Frequency ( CF ) we can see that the 500th
item is in the class 40 - 49
Median Class = 40 - 49
)
In order to calculate for the Median Value we make use of the formula
3
𝑁
2
− 𝐶𝑓𝑏𝑚
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿𝑖 + ( )𝑐
51
𝑓𝑚
𝐿𝑖 = 39. 5 𝐶𝑓𝑏𝑚 = 308 𝑓𝑚 = 192 𝑁 = 1000 𝑐 = 49. 5 − 39. 5 = 10
6
01
500 − 308
𝑀𝑒𝑑𝑖𝑎𝑛 = 39. 5 + ( 192
)10
34
192
= 39. 5 + ( 192
)10
= 39. 5 + (1)10
80
= 39. 5 + 10 = 49. 5
(0
THE MODE
The mode is the value that appears most frequently in a data set. A data
Y
set may have one mode (unimodal), more than one mode (bimodal or
U
frequency.
TS
11
STA 201/209/221/229 TUTORIAL NOTE
)
3
Weig 0-9 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-99
51
ht
6
Freq 20 41 98 149 192 197 152 97 44 10
01
34
Solution:
80
Class Frequency LCB - HCB
Interval
(0
0-9 20 -0.5 - 9.5
Y
Σ𝑓 = 1000
12
STA 201/209/221/229 TUTORIAL NOTE
By looking at this we can see that the class with the highest frequency
50-59 with a frequency of 197
Modal Class = 50-59
To calculate the Modal value we make use of the Formula
1
𝐷
𝑀𝑜𝑑𝑒 = 𝐿𝑖 + ( 𝐷 +𝐷 )𝐶
1 2
)
3
5
𝑀𝑜𝑑𝑒 = 49. 5 + ( 5+45 )10
51
5
= 49. 5 + ( 50 )10
6
= 49. 5 + (0. 1)10
01
= 49. 5 + 1 = 59. 5
34
THE GEOMETRIC MEAN
80
The geometric mean is the nth root of the product of all values, typically
used for data that is multiplicative or for growth rates.
(0
𝑁
𝐺𝑒𝑜𝑚𝑒𝑡𝑟𝑖𝑐 𝑀𝑒𝑎𝑛 = 𝑥1𝑥2... 𝑥𝑛
Y
U
Example 1.41 : Calculate the Geometric mean for the set of data :
G
13,12,24,7,9,10,13,13,18,8
TS
Solution:
A
10
𝐺𝑒𝑜𝑚𝑒𝑡𝑟𝑖𝑐 𝑀𝑒𝑎𝑛 = (13)(12)(24)(7)(9)(10)(13)(13)(18)(8)
ST
10 10
= 5. 7401𝑥10
T
= 11. 91
A
𝑁
𝐻𝑎𝑟𝑚𝑜𝑛𝑖𝑐 𝑀𝑒𝑎𝑛 = 1 1 1
𝑛1
+ 𝑛2
+...+ 𝑛
𝑛
13
STA 201/209/221/229 TUTORIAL NOTE
Example 1.51 : Calculate the Harmonic mean for the set of data :
13,12,24,7,9,10,13,13,18,8
𝑁
𝐻𝑎𝑟𝑚𝑜𝑛𝑖𝑐 𝑀𝑒𝑎𝑛 = 1 1 1
𝑛1
+ 𝑛2
+...+ 𝑛
𝑛
10
𝐻𝑎𝑟𝑚𝑜𝑛𝑖𝑐 𝑀𝑒𝑎𝑛 = 1 1 1 1 1 1 1 1 1 1
13
+ 12 + 24 + 7 + 9 + 10 + 13 + 13 + 18 + 8
10
= = 11. 23
)
4861
5460
3
51
MEASURE OF PARTITION
6
Measures of partition, also known as quantiles, divide a data set into
01
equal parts, each representing a fraction of the data. The most common
measures of partition are quartiles, percentiles, and deciles. These
34
measures help to understand the distribution and spread of data,
80
indicating where a particular value lies within the overall distribution.
They can be applied to both grouped and ungrouped data, but the
(0
methods for calculation differ slightly.
QUARTILE
Y
U
Quartiles divide the data into four equal parts. The first quartile (Q1)
represents the 25th percentile, the second quartile (Q2) represents the
G
50th percentile or the median, and the third quartile (Q3) represents
TS
To calculate the Quartile for ungrouped data you'll need to arrange the
data in Ascending data
T
𝑛+1 𝑡ℎ
A
2(𝑛+1) 𝑡ℎ
For the Second Quartile (Q2) : ( 4
) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛
3(𝑛+1) 𝑡ℎ
For the Third Quartile (Q3) : ( 4
) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛
Example 1.61 : Evaluate Q1, Q2 and Q3 for the set of data : 13,12,7,9,10,13,8
Solution
Arranging the data in ascending order we have ; 7,8,9,10,12,13,13
14
STA 201/209/221/229 TUTORIAL NOTE
𝑛+1 𝑡ℎ
𝑄1 = ( 4
) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛
7+1 𝑡ℎ
𝑄1 = ( 4
) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛
8 𝑡ℎ
𝑄1 = ( 4 ) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛
𝑛𝑑
𝑄1 = 2 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛
The value in the second position is 8. Therefore Q1 is 8
)
Follow the same steps to evaluate Q2 and Q3
3
51
QUARTILE FOR GROUPED DATA
6
Quartile for Ungrouped data is similar to that of Grouped data. Here is
01
the formula to evaluate the nth quartile (1st, 2nd, 3rd..)
34
𝑛𝑁
− 𝐶𝑓𝑏𝑄𝑛
80
4
𝑄𝑛 = 𝐿𝑄 + ( 𝑓𝑄𝑛
)𝑐
𝑛
𝑐 = 𝐶𝑙𝑎𝑠𝑠 𝑤𝑖𝑑𝑡ℎ
TS
𝑁
4
− 𝐶𝑓𝑏𝑄1
𝑄1 = 𝐿𝑄 + ( )𝑐
ST
𝑓𝑄1
1
Weig 0-9 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-99
ht
15
STA 201/209/221/229 TUTORIAL NOTE
Solution:
)
3
20-29 98 19.5 - 29.5 159
51
30-39 149 29.5 - 39.5 308
6
01
40-49 192 39.5 - 49.5 500
34
50-59 197 49.5 - 59.5 697
70-79 97 80
69.5 - 79.5 946
(0
80-89 44 79.5 - 89.5 990
Y
Σ𝑓 = 1000
G
TS
3𝑁
4
− 𝐶𝑓𝑏𝑄3
𝑄3 = 𝐿𝑄 + ( 𝑓𝑄3
)𝑐
A
3𝑁 𝑡ℎ
ST
4
The 750th item will be in class 60-69. Therefore
A
3
750 − 697
𝑄3 = 59. 5 + ( 152
)10
53
𝑄3 = 59. 5 + ( 152 )10
265
𝑄3 = 59. 5 + ( 76
)
𝑄3 = 59. 5 + 3. 49
16
STA 201/209/221/229 TUTORIAL NOTE
𝑄3 = 62. 99
Evaluate Q1 by yourself using these same step
PERCENTILE
Percentiles divide the data into hundred equal parts.
)
PERCENTILE FOR UNGROUPED DATA
3
To calculate the Percentile for ungrouped data you'll need to arrange
51
the data in Ascending data
6
𝑛+1 𝑡ℎ
For the First Percentile (P1) : ( ) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛
01
100
2(𝑛+1) 𝑡ℎ
For the Second Percentile (P2) : ( ) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛
34
100
3(𝑛+1) 𝑡ℎ
For the Third Percentile (P3) : ( ) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛 𝑎𝑛𝑑 𝑠𝑜 𝑜𝑛..
80
100
𝑛𝑁
100
− 𝐶𝑓𝑏𝑃𝑛
G
𝑃𝑛 = 𝐿𝑃 + ( 𝑓𝑃𝑛
)𝑐
𝑛
𝑐 = 𝐶𝑙𝑎𝑠𝑠 𝑤𝑖𝑑𝑡ℎ
A
TH
17
STA 201/209/221/229 TUTORIAL NOTE
DECILE
Deciles divide the data into ten equal parts.
)
For the First Decile (D1) : ( 10
) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛
3
2(𝑛+1) 𝑡ℎ
51
For the Second Decile (D2) : ( 10
) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛
6
3(𝑛+1) 𝑡ℎ
For the Third Decile (D3) : ( ) 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛 𝑎𝑛𝑑 𝑠𝑜 𝑜𝑛..
01
10
34
PERCENTILE FOR GROUPED DATA
Percentile for Ungrouped data is similar to that of Grouped data. Here is
80
the formula to evaluate the nth quartile (1st, 2nd, 3rd…, 9th)
(0
𝑛𝑁
10
− 𝐶𝑓𝑏𝐷𝑛
𝐷𝑛 = 𝐿𝐷 + ( 𝑓𝐷𝑛
)𝑐
𝑛
Y
𝑐 = 𝐶𝑙𝑎𝑠𝑠 𝑤𝑖𝑑𝑡ℎ
ST
10
− 𝐶𝑓𝑏𝐷1
𝐷1 = 𝐿𝐷 + ( )𝑐
A
𝑓𝐷1
1
Getting the Decile for Grouped data is the same method as quartile, the
only difference is that you divide by 10 instead of 4.
18
STA 201/209/221/229 TUTORIAL NOTE
THE RANGE
It is the difference between the maximum and minimum values in a
data set.
)
𝑅𝑎𝑛𝑔𝑒 = 𝑀𝑎𝑥𝑖𝑚𝑢𝑚 𝑣𝑎𝑙𝑢𝑒 − 𝑀𝑖𝑛𝑖𝑚𝑢𝑚 𝑉𝑎𝑙𝑢𝑒
3
Example 1.71 :Calculate the Range for the set of data :
51
13,12,24,7,9,10,13,13,18,8
6
01
𝑀𝑖𝑛𝑖𝑚𝑢𝑚 𝑉𝑎𝑙𝑢𝑒 = 8 𝑀𝑎𝑥𝑖𝑚𝑢𝑚 𝑉𝑎𝑙𝑢𝑒 = 24
34
𝑅𝑎𝑛𝑔𝑒 = 24 − 8 = 16
80
VARIANCE
Variance measures the average squared deviation of each data point
(0
from the mean. It quantifies how much the data points spread out
around the mean.
Y
U
The formula for variance for any ungrouped set of data is given by
2
2
TS
Σ(𝑥−𝑥)
𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒(σ ) = 𝑁
Example 1.81 :Calculate the Variance for the set of data :
A
13,12,24,7,9,10,13,13,18,8
ST
𝑥 2
TH
𝑥 −𝑥 (𝑥 − 𝑥)
13 0.3 0.09
12 -0.7 0.49
24 11.3 127.69
7 -5.7 32.49
19
STA 201/209/221/229 TUTORIAL NOTE
9 -3.7 13.69
10 -2.7 7.29
13 0.3 0.09
13 0.3 0.09
18
)
5.3 28.09
3
8 -4.7 22.09
51
2
6
Σ(𝑥 − 𝑥) = 232. 1
01
2
2 Σ(𝑥−𝑥) 232.1
𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒(σ ) = = = 23. 21
34
𝑁 10
80
VARIANCE FOR GROUPED DATA
The variance of a grouped data is given by the formula
(0
2
2 Σ𝑓(𝑥−𝑥)
𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒(σ ) = Σ𝑓
Example 1.82 : The following data represent the weight of products
Y
Weig 0-9 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-99
TS
ht
Solution:
T
Class x
A
𝑓𝑥
Interva
TH
2 2
l 𝑓 (𝑥 − 𝑥) (𝑥 − 𝑥) 𝑓(𝑥 − 𝑥)
0-9 20 4.5 90 -44.7 1998.09 39961.8
10-19 41 14.5 594.5 -34.7 1204.09 49367.69
20-29 98 24.5 2401 -24.7 610.09 59788.82
30-39 149 34.5 5140.5 -14.7 216.09 32197.41
20
STA 201/209/221/229 TUTORIAL NOTE
)
90-99 10 94.5 945 45.3 2052.09 20520.9
3
Σ𝑓𝑥 = 492 2
51
Σ𝑓 = 1000 Σ𝑓(𝑥 − 𝑥) == 36411
6
01
Σ𝑓𝑥 49200
Mean = Σ𝑓
= 1000
= 49, 2
2
34
2 Σ𝑓(𝑥−𝑥) 364110
𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒(σ ) = Σ𝑓
= 1000
= 364. 11
STANDARD DEVIATION
80
Standard deviation is the square root of variance and provides a
(0
measure of dispersion in the same units as the original data. The
standard deviation can be found by finding the square root of the
Y
The formula for standard deviation for any ungrouped set of data is
given by
A
2
ST
Σ(𝑥−𝑥)
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛(σ) = 𝑁
Example 1.91 :Calculate the Standard Deviation for the set of data :
T
13,12,24,7,9,10,13,13,18,8
A
21
STA 201/209/221/229 TUTORIAL NOTE
Weig 0-9 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-99
ht
)
Freq 20 41 98 149 192 197 152 97 44 10
3
51
Solution:
6
From Example 1.82 we found the variance to be 364.11 therefore
01
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 = 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒 = 364. 11 = 19. 08
34
COEFFICIENT OF VARIATION
80
The coefficient of variation is the ratio of the standard deviation to the
mean, expressed as a percentage. It measures the relative variability of
(0
the data
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
𝐶𝑂𝑉 = ( 𝑀𝑒𝑎𝑛
)𝑥100
Y
U
The IQR measures the range of the middle 50% of the data, calculated
TS
as the difference between the third quartile (Q3) and the first quartile
(Q1).
A
𝐼𝑄𝑅 = 𝑄3 − 𝑄1
ST
𝑄3−𝑄1
𝑄𝑢𝑎𝑟𝑡𝑖𝑙𝑒 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 = 2
22
STA 201/209/221/229 TUTORIAL NOTE
)
Σ𝑥𝑖 13+12+24+7+9+10+13+13+18+8 127
𝑀𝑒𝑎𝑛(𝑥) = = = = 12. 7
3
𝑁 10 10
6 51
𝑥 𝑥 −𝑥 |𝑥 − 𝑥|
01
13 0.3 0.3
34
12 -0.7 0.7
24 11.3
80
11.3
(0
7 -5.7 5.7
9 -3.7 3.7
Y
U
10 -2.7 2.7
G
13 0.3 0.3
TS
13 0.3 0.3
A
18 5.3 5.3
ST
8 -4.7 4.7
T
Σ|𝑥 − 𝑥| = 35
A
Σ|𝑥−𝑥| 35
TH
𝑀𝑒𝑎𝑛 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 = 𝑁
= 10
= 3. 5
23
STA 201/209/221/229 TUTORIAL NOTE
Weig 0-9 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-99
ht
)
3
Solution:
6 51
Class
01
Interval 𝑓 𝑥 𝑓𝑥 𝑥 − 𝑥 |𝑥 − 𝑥| 𝑓|𝑥 − 𝑥|
34
0-9 20 4.5 90 -44.7 44.7 894
10-19 41 14.5 594.5 -34.7 34.7 1422.7
20-29 98 24.5 2401
80
-24.7 24.7 2420.6
(0
30-39 149 34.5 5140.5 -14.7 14.7 2190.3
40-49 192 44.5 8544 -4.7 4.7 902.4
Y
Mean = Σ𝑓
= 1000
= 49, 2
A
Σ𝑓|𝑥−𝑥| 15660
𝑀𝑒𝑎𝑛 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 = = = 15. 66
TH
Σ𝑓 1000
I only did my best as at this time, this should not limit your reading to this
guide only.
24
STA 201/209/221/229 TUTORIAL NOTE
)
3
6 51
01
34
THIS MATERIAL IS NOT TO BE SOLD FOR 80
(0
ANY REASON
Y
U
G
TS
A
ST
T
A
TH
25
STA 201/209/221/229 TUTORIAL NOTE
MEASURE OF SYMMETRY
In statistics, the symmetry of a data distribution refers to how evenly the
data is spread around the central point. A perfectly symmetrical
distribution is one where the left side of the distribution mirrors the
)
right side. The concept of symmetry is essential for understanding the
3
shape of a data distribution and for making assumptions about the
51
data when applying statistical methods. Measures of symmetry
6
quantify the degree to which a distribution deviates from perfect
01
symmetry.
34
SKEWNESS
Skewness is the most common measure of symmetry in a data
80
distribution. It indicates the direction and degree of asymmetry. If a
distribution is symmetrical, the skewness is zero. If it is not, skewness
(0
tells us whether the data is skewed to the left (negatively skewed) or to
the right (positively skewed).
Y
U
TYPES OF SKEWNESS
G
Positive Skewness: The distribution has a longer tail on the right side.
TS
Most data points are clustered on the left, with a few larger values
pulling the mean to the right.
A
Negative Skewness: The distribution has a longer tail on the left side.
TH
Most data points are clustered on the right, with a few smaller values
pulling the mean to the left.
Example: Test scores where most students perform well, but a few have
low scores.
26
STA 201/209/221/229 TUTORIAL NOTE
)
3
6 51
01
34
80
For a Negatively skewed distribution Mean < Median < Mode
(0
For a Positively Skewed Distribution Mean > Median >Mode
Y
𝑀𝑒𝑎𝑛−𝑀𝑜𝑑𝑒
𝑆𝑘𝑒𝑤𝑛𝑒𝑠𝑠 = 𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
OR
3(𝑀𝑒𝑎𝑛−𝑀𝑒𝑑𝑖𝑎𝑛)
𝑆𝑘𝑒𝑤𝑛𝑒𝑠𝑠 = 𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
27
STA 201/209/221/229 TUTORIAL NOTE
METHODS OF MOMENT
3
𝑀3 Σ𝑓(𝑥−𝑥)
𝑆𝑘𝑒𝑤𝑛𝑒𝑠𝑠(ψ3) = 3 = 3
)
2
𝑀22 [Σ𝑓(𝑥−𝑥) ]
2
3
Example 2.0 : The following data represent the weight of products
51
manufactured in the Factory ( in Kg ). Evaluate the Skewness using
6
method of Moments
01
Weig 0-9 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-99
34
ht
80
Freq 20 41 98 149 192
(0 197 152 97 44 10
Solution:
Y
Interv
2 2 3 3
al f x fx
U
𝑥 −𝑥 (𝑥 − 𝑥) 𝑓(𝑥 − 𝑥) (𝑥 − 𝑥) 𝑓(𝑥 − 𝑥)
G
3
𝑀3 Σ𝑓(𝑥−𝑥) −459774 −459774
𝑆𝑘𝑒𝑤𝑛𝑒𝑠𝑠(ψ3) = 3 = 3 = 3 = 219709537.6
=− 0. 00209
2 2 2 2
𝑀2 [Σ𝑓(𝑥−𝑥) ] (364110)
28
STA 201/209/221/229 TUTORIAL NOTE
KURTOSIS
While primarily a measure of the "peakedness" of a distribution, kurtosis
also provides insights into the symmetry of data. It complements
skewness by focusing on the tails of the distribution rather than the
symmetry around the center.
)
Types of Kurtosis:
3
51
Leptokurtic (Kurtosis > 3): Distributions with heavy tails and a sharp
6
peak, indicating that data values are concentrated around the mean
01
with a higher likelihood of extreme values.
34
Mesokurtic (Kurtosis = 3): A distribution with tails similar to a normal
distribution, such as the standard bell curve.
80
Platykurtic (Kurtosis < 3): Distributions with light tails and a flatter peak,
(0
indicating that data values are more spread out and less likely to be
extreme.
Y
U
G
TS
A
ST
T
𝑀4
A
𝐾𝑢𝑟𝑡𝑜𝑠𝑖𝑠(φ4) = 2
𝑀2
TH
4 2
Σ𝑓(𝑥−𝑥) Σ𝑓(𝑥−𝑥)
𝑤ℎ𝑒𝑟𝑒 𝑀4 = Σ𝑓
𝑎𝑛𝑑 𝑀2 = Σ𝑓
29
STA 201/209/221/229 TUTORIAL NOTE
Weig 0-9 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89 90-99
ht
Solution:
)
Interv
2 2 4 4
3
al f x fx 𝑥 −𝑥 (𝑥 − 𝑥) 𝑓(𝑥 − 𝑥) (𝑥 − 𝑥) 𝑓(𝑥 − 𝑥)
51
0-9 20 4.5 90 -44.7 1998.09 39961.8 3992363.648 79847272.96
6
10-19 41 14.5 594.5 -34.7 1204.09 49367.69 1449832.728 59443141.85
01
20-29 98 24.5 2401 -24.7 610.09 59788.82 372209.8081 36476561.19
34
46694.8881 6957538.327
80
50-59 197 54.5 10736.5 5.3 28.09 5533.73 789.0481 155442.4757
4
Σ𝑓(𝑥−𝑥) 341476643.7
𝑀4 = Σ𝑓
= 1000
= 341476. 6437
A
2
Σ𝑓(𝑥−𝑥) 364110
𝑀2 = = = 364. 11
ST
Σ𝑓 1000
𝑀4 341476.6437
𝐾𝑢𝑟𝑡𝑜𝑠𝑖𝑠(φ4) = 2 = 2 = 2. 58 < 3(𝑃𝑙𝑎𝑡𝑦𝑘𝑢𝑟𝑡𝑖𝑐)
T
𝑀2 (364.11)
A
TH
I only did my best as at this time, this should not limit your reading to this
guide only.
30
STA 201/209/221/229 TUTORIAL NOTE
LESSON 4 : PROBABILITY
Probability is often described as the Mathematics of Chance or the
likeliness of an event to occur. We shall develop foundation knowledge
of Probability in this Lesson
)
3
experiment is called a Random Experiment.
51
The set of all possible outcomes of any random experiment will be
called a sample space and it will be denoted as S. A subset of the
6
sample space which may be a collection of Outcomes is called an Event
01
Space, and it will be denoted by A.
34
The Probability of an Event A denoted by P(A) is defined as
𝑃(𝐴) =
𝑛(𝐴)
80
(0
𝑛(𝑆)
1. 𝑃(𝐴) ≥ 0
G
2. 0 ≤ 𝑃(𝐴) ≤ 1
TS
𝑐 𝑐
3. 1 − 𝑃(𝐴 ) = 𝑃(𝐴) 𝑤ℎ𝑒𝑟𝑒 𝐴 = 𝐶𝑜𝑚𝑝𝑙𝑒𝑚𝑒𝑛𝑡 𝑜𝑓 𝑒𝑣𝑒𝑛𝑡 𝐴
Let’s treat a few examples
A
ST
Example 3.0
1. In a single throw of a fair coin, find the probability that: (a) a head
T
appears
A
2. In a single throw of two fair coins, find the probability that: (a) two
TH
31
STA 201/209/221/229 TUTORIAL NOTE
Solutions:
1. 𝐿𝑒𝑡 𝐴 𝑏𝑒 𝑡ℎ𝑒 𝑒𝑣𝑒𝑛𝑡 𝑡ℎ𝑎𝑡 𝑎 ℎ𝑒𝑎𝑑 𝑜𝑐𝑐𝑢𝑟𝑠
𝑆 = {𝐻, 𝑇} 𝑛(𝑆) = 2 𝑛(𝐴) = 1
𝑛(𝐴) 1
𝑃(𝐴) = 𝑛(𝑆)
= 2
= 0. 5
2. 𝐿𝑒𝑡 𝐵 𝑏𝑒 𝑡ℎ𝑒 𝑒𝑣𝑒𝑛𝑡 𝑡ℎ𝑎𝑡 𝑡𝑤𝑜 ℎ𝑒𝑎𝑑𝑠 𝑜𝑐𝑐𝑢𝑟𝑠
)
𝑆 = {𝐻𝐻, 𝑇𝐻, 𝐻𝑇. 𝑇𝑇} 𝑛(𝑆) = 4 𝑛(𝐵) = 1
3
51
𝑛(𝐵) 1
𝑃(𝐴) = 𝑛(𝑆)
= 4
= 0. 25
6
01
𝐿𝑒𝑡 𝐶 𝑏𝑒 𝑡ℎ𝑒 𝑒𝑣𝑒𝑛𝑡 𝑡ℎ𝑎𝑡 𝑜𝑛𝑒 ℎ𝑒𝑎𝑑 𝑜𝑛𝑒 𝑡𝑎𝑖𝑙 𝑜𝑐𝑐𝑢𝑟𝑠
34
𝑆 = {𝐻𝐻, 𝑇𝐻, 𝐻𝑇. 𝑇𝑇} 𝑛(𝑆) = 4 𝑛(𝐶) = 2
𝑛(𝐶) 2
𝑃(𝐴) = = = 0. 5
80
𝑛(𝑆) 4
3. Let’s construct a table for the sums
(0
+ 1 2 3 4 5 6
Y
1 2 3 4 5 6 7
U
2 3 4 5 6 7 8
G
3 4 5 6 7 8 9
TS
4 5 6 7 8 9 10
A
ST
5 6 7 8 9 10 11
6 7 8 9 10 11 12
T
A
𝑛(𝑠) = 36
TH
32
STA 201/209/221/229 TUTORIAL NOTE
𝑛(𝐸) 15
𝑃(𝐴) = 𝑛(𝑆)
= 36
= 0. 417
4. 𝐿𝑒𝑡 𝐹 𝑏𝑒 𝑡ℎ𝑒 𝑒𝑣𝑒𝑛𝑡 𝑡ℎ𝑎𝑡 𝑎 ℎ𝑒𝑎𝑑 𝑎𝑛𝑑 𝑡𝑤𝑜 𝑡𝑎𝑖𝑙𝑠 𝑜𝑐𝑐𝑢𝑟𝑠
𝑆 = {𝐻𝐻𝐻, 𝐻𝐻𝑇, 𝐻𝑇𝐻, 𝐻𝑇𝑇, 𝑇𝐻𝐻, 𝑇𝐻𝑇, 𝑇𝑇𝐻, 𝑇𝑇𝑇} 𝑛(𝑆) = 8 𝑛(𝐹) = 3
𝑛(𝐹) 3
𝑃(𝐴) = 𝑛(𝑆)
= 8
= 0. 375
COMBINATION OF EVENTS
)
Two or more events which are subsets of a sample space can be
3
combined using a set notation ∩ 𝑜𝑟 ∪. For example, if 𝐸 ⊂ 𝑆 and 𝐹 ⊂ 𝑆
51
(a) 𝐸 ∩ 𝐹 𝑖𝑠 𝑡ℎ𝑒 𝑒𝑣𝑒𝑛𝑡 𝑡ℎ𝑎𝑡 𝑒𝑖𝑡ℎ𝑒𝑟 𝐸 𝑜𝑟 𝐹 𝑜𝑟 𝑏𝑜𝑡ℎ 𝑜𝑐𝑐𝑢𝑟
6
(b)𝐸 ∪ 𝐹 𝑖𝑠 𝑡ℎ𝑒 𝑒𝑣𝑒𝑛𝑡 𝑏𝑜𝑡ℎ 𝐸 𝑎𝑛𝑑 𝐹 𝑜𝑐𝑐𝑢𝑟
01
MUTUALLY EXCLUSIVE EVENTS
34
Two events E and F are said to be mutually exclusive if they cannot
80
occur simultaneously. We say that the two sets are disjoint.
𝑖. 𝑒 𝐸 ∩ 𝐹 = ϕ
(0
Addition of two events (say E and F) is given by
Y
INDEPENDENT EVENTS
A
Two events E and F are said to be mutually exclusive if they can occur
ST
simultaneously.
For Independent Events,
T
CONDITIONAL PROBABILITY
Given a sample space S, Let E be a non-empty proper subset of S. The
Probability of an event F occurs after E has occurred denoted by 𝑃𝑟(𝐹|𝐸)
is defined as
𝑃𝑟(𝐹∩𝐸)
𝑃𝑟(𝐹|𝐸) = 𝑃𝑟(𝐸)
33
STA 201/209/221/229 TUTORIAL NOTE
More exercises with solutions are in the 500 MCQ PDF. Want more? Pay
for more and set yourself ahead of 80% of your peers
)
08034016513 with the code “500 MCQ” and wait for a reply. The earlier, the
3
better. Take action now!!
6 51
Note: This is different from the Tutorials
01
34
80
(0
Y
ANY REASON
TS
A
ST
T
A
TH
34
STA 201/209/221/229 TUTORIAL NOTE
)
they bridge the gap between real-world randomness and mathematical
3
analysis.
6 51
There are two main types of random variables:
01
1. Discrete Random Variables
34
2. Continuous Random Variables
80
Discrete Random Variables
(0
A discrete random variable takes on a finite or countably infinite set of
values. These values are distinct and separate, meaning there are gaps
between possible outcomes. Discrete random variables are often
Y
associated with processes where the results are counted, like the
U
classroom.
TS
1. Number of heads when flipping a coin three times: It can take the
ST
values 0, 1, 2, or 3.
2. Number of students in a class: It can take values such as 20, 21, 22,
T
etc.
A
6.
35
STA 201/209/221/229 TUTORIAL NOTE
)
2. Time to complete a task: It could be 5.5 minutes, 6.7 minutes, or
3
51
any other value in between.
3. Temperature: It could be measured as 25.3°C, 25.35°C, or any other
6
value in a continuous range.
01
PROBABILITY MASS FUNCTION
34
For discrete random variables, the Probability Mass Function (PMF)
80
describes the probability of each possible value that the random
variable can take. It is denoted as:
(0
𝑃(𝑋 = 𝑥)
Y
U
Where 𝑃(𝑋 = 𝑥) gives the probability that the random variable X takes
the value x.
G
TS
1
𝑃𝑟(𝑋 = 1) = 𝑃𝑟(𝑋 = 2) = 𝑃𝑟(𝑋 = 3) = 𝑃𝑟(𝑋 = 4) = 𝑃𝑟(𝑋 = 5) = 𝑃𝑟(𝑋 = 6) = 6
ST
Properties of a PMF
36
STA 201/209/221/229 TUTORIAL NOTE
𝑛
Σ𝑖=1𝑃𝑟(𝑋 = 𝑥) = 1
)
of a continuous random variable taking any exact value is zero. Instead,
3
probabilities are calculated over intervals.
6 51
The PDF is denoted as 𝑓(𝑥), where the probability that the random
01
variable 𝑋 lies within a particular interval [𝑎, 𝑏] is given by:
34
𝑏
𝑃𝑟(𝑎 ≤ 𝑋 ≤ 𝑏) = ∫ 𝑓(𝑥)𝑑𝑥
80
𝑎
∞
3. ∫ 𝑓(𝑥)𝑑𝑥 = 1 𝑓𝑜𝑟 𝑎𝑛𝑦 𝑣𝑎𝑙𝑢𝑒 𝑜𝑓 𝑥 ∈ [𝑎, 𝑏]
TS
−∞
A
BERNOULLI DISTRIBUTION
ST
Note:
37
STA 201/209/221/229 TUTORIAL NOTE
BINOMIAL DISTRIBUTION
)
3
The Binomial distribution extends the Bernoulli distribution to multiple
51
independent trials. It represents the number of successes in a fixed
6
number of n trials, where each trial is independent, and the probability
01
of success p remains constant. The PMF of the binomial distribution is
given by
34
𝑥 𝑛−𝑥
𝑃(𝑋 = 𝑥) = 𝑛𝐶𝑋𝑃 (1 − 𝑝)
Where: 80
(0
● 𝑋 is the number of successes.
● 𝑛 is the number of trials.
Y
𝑛!
● 𝑛𝐶𝑋 = (𝑛−𝑥)!𝑥!
TS
A
Properties:
ST
2
● The mean is µ = 𝑛𝑝 and the variance is σ = 𝑛𝑝(1 − 𝑝) 𝑜𝑟 𝑛𝑝𝑞
● Used when there are a fixed number of independent trials with
T
Example:
Past experience has shown that 7% of all luncheon vouchers are in error.
If a random sample of 5 vouchers is selected, what is the probability
that;
a. Exactly two will have error
b. At least two will have error
38
STA 201/209/221/229 TUTORIAL NOTE
Solution: 𝑛 = 5 𝑝 = 0. 07 𝑞 = 1 − 0. 07 = 0. 93
𝑥 𝑖𝑠 𝑡ℎ𝑒 𝑖𝑠 𝑡ℎ𝑒 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑙𝑢𝑛𝑐ℎ𝑒𝑜𝑛𝑠 ℎ𝑎𝑣𝑖𝑛𝑔 𝑎𝑛 𝑒𝑟𝑟𝑜𝑟
a. 𝑥 = 2
2 5−2
𝑃(𝑋 = 2) = 5𝐶2(0. 07) (0. 93)
2 3
𝑃(𝑋 = 2) = 10 × (0. 07) (0. 93) = 0. 0394
)
3
b. 𝑥 ≥ 2
651
𝑃(𝑋 ≥ 2) = 𝑃(𝑋 = 2) + 𝑃(𝑋 = 3) + 𝑃(𝑋 = 4) + 𝑃(𝑋 = 5)
01
2 3
𝑃(𝑋 = 2) = 10 × (0. 07) (0. 93) = 0. 0394
34
3 2
𝑃(𝑋 = 3) = 5𝐶3 × (0. 07) (0. 93) = 0. 002967
4
80 1
𝑃(𝑋 = 4) = 5𝐶4 × (0. 07) (0. 93) = 0. 0001116
(0
5 0
𝑃(𝑋 = 5) = 5𝐶5 × (0. 07) (0. 93) = 0. 00001681
Y
U
POISSON DISTRIBUTION
A
ST
time or space, given that these events happen independently and the
A
Where:
39
STA 201/209/221/229 TUTORIAL NOTE
Properties:
)
● The average number of successes that occur in a specified region
3
is known as λ
51
● The probability that a success will occur in an extremely small
region is virtually zero
6
01
Example:
34
The average number of defective 100-watts electric bulbs in every
production batch is given as 4. What is the probability that the current
production batch will contain;
80
(0
a. 6 defectives
b. At most 1 defective
Y
Solution:
TS
a. 𝑥 = 6
ST
6 −4
4𝑒 4096×(0.0183156)
𝑃(𝑋 = 6) = 6!
= 720
= 0. 1042
T
A
b. 𝑥 ≤ 1
TH
1 −4
4𝑒 4×(0.0183156)
𝑃(𝑋 = 1) = 1!
= 1
= 0. 0732626
40
STA 201/209/221/229 TUTORIAL NOTE
c. 2 < 𝑥 < 5
)
3
4 −4
4𝑒 256×(0.0183156)
𝑃(𝑋 = 4) = = = 0. 19537
51
4! 24
6
𝑃(2 < 𝑋 < 5) = 0. 19537 + 0. 19537 = 0. 3907
01
34
Want to join me and your peers as we go on a challenge of solving 500+
likely to come out in your forthcoming exams? Text me on WhatsApp at
80
08034016513 with the code “500 MCQ” and wait for a reply. The earlier, the
better. Take action now!!
(0
Note: This is different from the Tutorials
Y
U
G
TS
A
ST
ANY REASON
41
STA 201/209/221/229 TUTORIAL NOTE
LESSON 6 : ESTIMATION
Estimation theory is a framework that uses a combination of effect
sizes, confidence intervals and precision to plan experiments, analyzes
data and interpret results originated from samples as a means of
getting the true estimate of population parameter
.
There are two forms of estimation theory namely:
)
3
1. Point Estimation
51
2. Interval Estimation
6
Some important Definitions
01
1. Sample Statistic: A sample is a subset of the population chosen
34
for study. A sample statistic is a measure that describes a
characteristic of this sample, which we use to estimate the
80
corresponding population parameter. Since a sample is more
feasible to obtain than the entire population, sample statistics are
(0
practical for data analysis. Examples of sample statistics include:
a. Sample mean (𝑥): the average value of a characteristic in the
Y
sample.
U
2
b. Sample variance (𝑠 ): a measure of data dispersion in the
G
sample.
TS
variables rather than fixed values. However, with proper sampling and
analysis methods, they can serve as good estimators for population
T
parameters.
A
TH
42
STA 201/209/221/229 TUTORIAL NOTE
)
3
measuring an entire population can be impractical or impossible.
51
Here’s a summary of populations characteristics (parameter) and their
6
corresponding sample statistic
01
34
Characteristics Parameter Statistic
80
Mean µ 𝑥
(0
Standard Deviation σ 𝑠
Variance σ
2
𝑠
2
Y
U
Proportion 𝑃 𝑝
G
TS
POINT ESTIMATION
This is the procedure that allow us to calculate a single sample statistic
A
ST
INTERVAL ESTIMATION
TH
43
STA 201/209/221/229 TUTORIAL NOTE
For instance, a 95% C.I estimate implies that, if all possible samples of
)
3
the same size were drawn, then 95% of them would include the true
51
population mean somewhere within the interval around their sample
mean and only 5% of them would not. Generally, we expressed
6
confidence interval as 100(1 − 𝛼)% and for the determination of level of
01
significance (𝛼), the value of Student-t distribution at infinity (∞) is
34
always equal to that of Normal distribution.
80
CONFIDENCE INTERVAL FOR MEAN OF LARGE SAMPLE AND SMALL
SAMPLES
(0
We’ll treat this in two cases which are small and Large Samples
Y
σ
𝑋 ± 𝑍α
𝑛
TS
𝑆
ST
𝑋 ± 𝑡α
2
, (𝑛−1) 𝑛
2
Σ(𝑥−𝑥)
T
44
STA 201/209/221/229 TUTORIAL NOTE
)
3
51
σ
Margin of Error = 𝑍 α
2 𝑛
6
5
= 𝑍 0.05
01
2 40
5
= 𝑍 0.05
34
2 40
= 𝑍0.025 × 0. 7905694
= 1. 96 × 0. 7905694
= 1.54952 80
(0
Confidence Interval = 𝑋 ± 𝑀𝑎𝑟𝑔𝑖𝑛 𝑜𝑓 𝐸𝑟𝑟𝑜𝑟
= 25 ± 1. 54952
Y
60, 45, 80. Construct a 95% confidence interval for the average score of
ST
Σ𝑋 59+40+38+67+78+45+56+60+45+80 568
𝑋= = = = 56. 8
A
𝑛 10 10
TH
2
Σ(𝑥−𝑥)
𝑆= 𝑛−1
𝑥 2
𝑥 −𝑥 (𝑥 − 𝑥)
59 2.2 4.84
45
STA 201/209/221/229 TUTORIAL NOTE
40 -16.8 282.24
38 -18.8 353.44
67 10.2 104.04
78 21.2 449.44
45 -11.8
)
139.24
3
56 -0.8 0.64
51
60 3.2 10.24
6
01
48 -8.8 77.44
34
80 23.2 538.24
80
(0 1959.8
1959.8
𝑆= 9
= 14. 8
14.8
Margin of Error = 𝑡 0.05
Y
2
, (10−1) 10
U
= 2. 26 × 4. 68017
TS
= 10. 577
Confidence Interval = 𝑋 ± 𝑀𝑎𝑟𝑔𝑖𝑛 𝑜𝑓 𝐸𝑟𝑟𝑜𝑟
A
𝑥
where 𝑃 = 𝑛
Example: Out of a random sample of 800 people who casted their votes
at Lagos polling post, it was discovered that only 400 people voted for
46
STA 201/209/221/229 TUTORIAL NOTE
𝑝(1−𝑝)
Margin of Error =𝑍 α 𝑛
2
)
0.5(1−0.5)
=𝑍 0.05
3
800
51
2
0.5(0.5)
=𝑍0.025
6
800
01
0.25
=𝑍0.025 800
34
=1. 96 × 0. 0177
=0. 0346
Confidence Interval = 𝑃 ± 𝑀𝑎𝑟𝑔𝑖𝑛 𝑜𝑓 𝐸𝑟𝑟𝑜𝑟
= 0. 5 ± 0. 0346 80
(0
Interval: 0. 5 − 0. 0346 < 𝑃 < 0. 5 + 0. 0346
0. 4654 < 𝑃 < 0. 5346
Y
U
G
08034016513 with the code “500 MCQ” and wait for a reply. The earlier, the
better. Take action now!!
A
ST
47
STA 201/209/221/229 TUTORIAL NOTE
)
3
Level of significance: This is the probability of rejecting the null hypothesis
51
when it is true. It’s denoted by α
6
Null Hypothesis (𝐻0): It is the hypothesis being tested and it is usually referred
01
to as the hypothesis of no significance or no change.
34
Alternative Hypothesis (𝐻1): It is the hypothesis of practical interest which is
80
expected to disprove the null hypothesis. It is the hypothesis of change or
(0
hypothesis that shows difference.
hypothetical statements
TS
𝐻0: µ = µ0
A
𝐻0: µ > µ0
𝐻0: µ = µ0
T
48
STA 201/209/221/229 TUTORIAL NOTE
Critical Region: Is that region such that the null hypothesis is rejected if the
test statistic computed from a particular sample takes on a value in the region.
The critical region for one-tail hypothesis occupies only one side of the normal
curve tails while that of two-tail lies on the two tails. This can be illustrated in
the figures below
)
3
6 51
01
34
80
(0
Y
U
Acceptance Region: Is that region such that the null hypothesis is not
G
Type I Error: Error of rejecting null hypothesis when it is true (α) it's also known
A
as a false positive.
ST
Type II Error: Error of failing to reject null hypothesis when (β)it is false it's also
known as a false negative
T
Size of a test: Probability of falsely rejecting the null hypothesis. It is the same
as the Type I Error and is also denoted by α
Critical Value: is the value of the sample statistic that separates the regions of
acceptance and rejection.
49
STA 201/209/221/229 TUTORIAL NOTE
)
5. Write out the Test Statistic
3
6. Write your Decision Rule
51
7. Perform all necessary calculations
6
8. State your Conclusions
01
HYPOTHESIS TEST FOR MEAN OF LARGE AND SMALL SAMPLES
34
Case 1 (Large Samples) : When n≥30
80
NB: n is known as the Sample Size and N is known as the population size
(0
The hypothesis test can be one tailed or two tailed and can be stated as
follows
Y
at α level of significance
G
𝑋−µ0 𝑋−µ0
𝑍 = = (σ𝑋 = σ/ 𝑛)
A
σ𝑋 σ/ 𝑛
ST
The hypothesis test can be one tailed or two tailed and can be stated as
follows
𝐻0: µ = µ0 𝑣𝑠 𝐻1: µ ≠ µ0 𝑜𝑟 𝐻1: µ > µ0 𝑜𝑟 𝐻1: µ < µ0
at α level of significance
The test statistic is given by
50
STA 201/209/221/229 TUTORIAL NOTE
𝑋−µ0
𝑡 =
𝑠/ 𝑛
Example 1 : According to a voters register in Lagos state, it was found that the
mean and standard deviation of ages of a random sample of 40 registered
voters in a particular ward were 25 years and 5 years respectively. Test the
)
hypothesis that the average age of the entire registered voters is significantly
3
different from 27 years at 5% level of significance.
51
Solution:
6
Step 1: From the problem context, Identify the parameter of interest
01
Parameter of Interest: Mean ages of registered voters
34
Step 2: Identify the Null Hypothesis 𝐻0
𝐻0: µ = 27 𝑦𝑒𝑎𝑟𝑠
80
Step 3: Identify the Alternative Hypothesis 𝐻1
(0
𝐻1: µ ≠ 27 𝑦𝑒𝑎𝑟𝑠
Step 4: Identify the Level of Significance
Y
α = 0. 05
U
since we are dealing with a large sample i.e n>30 we’ll be using Case 1 stated
above
TS
𝑋−µ0
𝑍𝑐𝑎𝑙𝑐 =
A
σ/ 𝑛
ST
51
STA 201/209/221/229 TUTORIAL NOTE
𝑍𝑐𝑎𝑙𝑐 =− 2. 53
− 𝑍α/2 =− 𝑍0.05/2 = − 𝑍0.025 =− 1. 96
Step 8: State your Conclusions
Conclusion: Since 𝑍𝑐𝑎𝑙𝑐 <− 𝑍α/2 we reject 𝐻0 and therefore conclude that the
mean ages of registered voters may not be 27 years at 5% level of significance.
)
3
Example 2: The following represent the scores of 10 randomly selected
51
students in Statistics methods exams: 59, 40, 38, 67, 78, 45, 56, 60, 45, 80.
Test hypothesis against the claim that average scores of the entire
6
01
students in Statistics methods exams is different from 72 at 5% level of
significance.
34
Solution:
𝑛 = 10 𝑋 = 56. 8 𝑠 = 14. 8 µ = 72 α = 0. 05
80
Step 1: From the problem context, Identify the parameter of interest
Parameter of Interest: Mean Scores of Students in Statistical Methods Exam
(0
Step 2: Identify the Null Hypothesis 𝐻0
Y
𝐻0: µ = 72
U
𝐻1: µ ≠ 72
TS
since we are dealing with a small sample i.e n<30 we’ll be using Case 2 stated
above
T
𝑋−µ0
A
𝑡𝑐𝑎𝑙𝑐 =
TH
𝑠/ 𝑛
Step 6: Write your Decision Rule
Since it is a two tailed test our decision rule is stated as follows
Decision Rule: Reject 𝐻0 if 𝑡𝑐𝑎𝑙𝑐 <− 𝑡 α or 𝑡𝑐𝑎𝑙𝑐 > 𝑡 α
2
, (𝑛−1) 2
, (𝑛−1)
52
STA 201/209/221/229 TUTORIAL NOTE
𝑋−µ0
𝑡𝑐𝑎𝑙𝑐 =
𝑠/ 𝑛
56.8−72
𝑡𝑐𝑎𝑙𝑐 =
14.8/ 10
𝑡𝑐𝑎𝑙𝑐 =− 3. 25
)
− 𝑡α =− 𝑡 0.05 =− 𝑡0.025, 9 =− 2. 26
, (𝑛−1) , (10−1)
3
2 2
51
Conclusion: Since 𝑡𝑐𝑎𝑙𝑐 <− 𝑡 α we reject 𝐻0 and therefore conclude that
6
2
, (𝑛−1)
01
the Mean Scores of Students in Statistical Methods Exam may not be 72 at 5%
level of significance.
34
HYPOTHESIS TEST FOR PROPORTION
The null and Alternative Hypothesis are stated as follows
𝐻0: 𝑃 = 𝑃0 𝑣𝑠 𝐻1: 𝑃 ≠ 𝑃0 𝑜𝑟 𝐻1: 𝑃 > 𝑃0 𝑜𝑟 80 𝐻1: 𝑃 < 𝑃0
(0
at α level of significance
The test statistic is given by
Y
U
𝑃−𝑃0 𝑃−𝑃0
𝑍 = =
G
σ𝑃
𝑃(1−𝑃)/𝑛
TS
Example: Out of a random sample of 800 people who casted their votes at
Lagos polling post, it was discovered that only 400 people voted for APC, test
A
hypothesis against the claim that the proportion of registered voters who
ST
53
STA 201/209/221/229 TUTORIAL NOTE
)
Since it is a two tailed test our decision rule is stated as follows
3
Decision Rule: Reject 𝐻0 if 𝑍𝑐𝑎𝑙𝑐 <− 𝑍α/2 or 𝑍𝑐𝑎𝑙𝑐 > 𝑍α/2
51
Step 7: Perform all necessary calculations
6
01
𝑛 = 800 α = 0. 05 𝑃0 = 0. 30 𝑃 = 0. 5
34
0.5−0.3
𝑍𝑐𝑎𝑙𝑐 =
80
0.5(1−0.5)/800
0.2
𝑍𝑐𝑎𝑙𝑐 =
(0
0.5(0.5)/800
0.2
𝑍𝑐𝑎𝑙𝑐 =
Y
0.25/800
U
0.2
𝑍𝑐𝑎𝑙𝑐 =
G
0.0176777
TS
𝑍𝑐𝑎𝑙𝑐 = 11. 31
A
Conclusion: Since 𝑍𝑐𝑎𝑙𝑐 > 𝑍α/2 we reject 𝐻0 and therefore conclude that the
Proportion of voters who voted for APC may not be 0.3 at 5% level of
T
significance.
A
TH
54