Statistics Unit 3
SYLLABUS:
• Hypothesis testing: one-tailed and two-tailed tests for means
of small samples (t-test); F-test; one-way and two-way analysis of variance
(ANOVA); chi-square test for single sample standard deviation, independence
of attributes and goodness of fit.
Sample Design
𝐻0: 𝜇 = 𝜇0
Right tailed test at 12
5% Significance level
         X1    X1²     X2    X2²     X3    X3²
          8     64      7     49     12    144
         10    100      5     25      9     81
          7     49     10    100     13    169
         14    196      9     81     12    144
         11    121      9     81     14    196
Total    50    530     40    336     60    734
1. Number of observations, N = 15
2. Total sum of all observations, T = 50 + 40 + 60 = 150
3. Correction factor = T²/N = (150)²/15 = 22500/15 = 1500
4. Total sum of squares, SST = 530 + 336 + 734 - 1500 = 100
5. Sum of squares between samples, SSC = (50)²/5 + (40)²/5 + (60)²/5 - 1500 = 40
6. Sum of squares within samples, SSE = SST - SSC = 100 - 40 = 60
ANOVA Table
SOURCE OF VARIATION       SUM OF SQUARES   DEGREES OF FREEDOM   MEAN SUM OF SQUARES             VARIANCE RATIO
Between Columns           SSC = 40         c-1 = 3-1 = 2        MSC = SSC/(c-1) = 40/2 = 20     F = MSC/MSE = 20/5 = 4 (since MSC > MSE)
Within Columns (Errors)   SSE = 60         N-c = 15-3 = 12      MSE = SSE/(N-c) = 60/12 = 5
Total                     SST = 100        N-1 = 15-1 = 14
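The steps and table above can be checked with a short script. This is a minimal sketch in plain Python, assuming the three columns of the data table are the three samples; the function name one_way_anova and the variable names are illustrative and not part of the original notes.

```python
# Three samples (columns) from the data table, five observations each.
samples = [
    [8, 10, 7, 14, 11],
    [7, 5, 10, 9, 9],
    [12, 9, 13, 12, 14],
]


def one_way_anova(groups):
    """Return (SST, SSC, SSE, MSC, MSE, F) using the correction-factor method."""
    n_total = sum(len(g) for g in groups)        # N, number of observations
    grand_total = sum(sum(g) for g in groups)    # T, total of all observations
    correction = grand_total ** 2 / n_total      # T^2 / N

    # Total sum of squares: sum of all squared observations minus correction.
    sst = sum(x * x for g in groups for x in g) - correction
    # Between-samples sum of squares: squared column totals over column sizes.
    ssc = sum(sum(g) ** 2 / len(g) for g in groups) - correction
    # Within-samples (error) sum of squares.
    sse = sst - ssc

    msc = ssc / (len(groups) - 1)        # mean square between, c - 1 d.f.
    mse = sse / (n_total - len(groups))  # mean square within, N - c d.f.
    return sst, ssc, sse, msc, mse, msc / mse


sst, ssc, sse, msc, mse, f_ratio = one_way_anova(samples)
print(sst, ssc, sse, msc, mse, f_ratio)
```

The printed values should agree with the ANOVA table: SST = 100, SSC = 40, SSE = 60, MSC = 20, MSE = 5 and F = 4.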
Table Value:
• N.d.f = (k-1) = (4-1) = 3
• Level of Significance = 5%
Final Interpretation
• Calculated Value = 23.67
• Table Value = 7.8147
• Calculated Value > Table Value
• So, the null hypothesis is rejected and the alternative
hypothesis is accepted.
• Hence the results of the four categories of
students do not follow the ratio of 4:3:2:1.
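The goodness-of-fit calculation behind this decision can be sketched as follows. The observed counts are not shown in this section, so the counts below are HYPOTHETICAL, chosen only because they reproduce the stated calculated value of 23.67 under the 4:3:2:1 ratio; the function name chi_square_gof is also illustrative.

```python
def chi_square_gof(observed, ratio):
    """Chi-square statistic for observed counts against a hypothesised ratio."""
    total = sum(observed)
    # Expected counts: split the total in proportion to the ratio.
    expected = [total * r / sum(ratio) for r in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# HYPOTHETICAL observed counts (assumption, not taken from the notes).
observed_counts = [220, 170, 90, 20]
hypothesised_ratio = [4, 3, 2, 1]

gof_statistic = chi_square_gof(observed_counts, hypothesised_ratio)

# Table value quoted in the notes for 3 degrees of freedom at 5%.
GOF_TABLE_VALUE = 7.8147
reject_null = gof_statistic > GOF_TABLE_VALUE

print(round(gof_statistic, 2), reject_null)
```

With these assumed counts the expected frequencies are 200, 150, 100 and 50, and the statistic exceeds the table value, which matches the rejection of the null hypothesis above.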
CHI SQUARE TEST FOR A SPECIFIED POPULATION VARIANCE OR STANDARD DEVIATION
• To test a claim about the value of the variance
or the standard deviation of a population, use the
test statistic below, which follows a chi-square
distribution with n-1 degrees of freedom.
χ² = ns² / σ²
• where n = sample size, s = sample S.D. and
σ = population S.D. under the null hypothesis
Example:
1. A random sample of size 20 from a population
gives a sample standard deviation of 6. Test the
hypothesis that the population standard deviation
is 9 at the 1% level of significance.
Solution:
H0: σ = 9 and H1: σ ≠ 9
Formula: χ² = ns² / σ²
Given: n = 20, s = 6 and σ = 9
Substituting the values in the formula,
χ² = (20 × 6²)/9² = 720/81 = 8.89
Final Interpretation
• Table Value:
• N.d.f = n-1 = 20-1 =19
• Level of Significance = 1%
• Table value = 36.1909
• Calculated value < Table value.
• So, the null hypothesis is accepted: the sample is
consistent with a population standard deviation
of 9.
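The worked example above can be reproduced in a few lines. This is a minimal sketch that follows the formula as given in these notes (χ² = ns²/σ²) and takes the table value 36.1909 from the text rather than computing it; the function and variable names are illustrative.

```python
def chi_square_variance(n, s, sigma):
    """Test statistic n * s^2 / sigma^2 for a specified population S.D."""
    return n * s ** 2 / sigma ** 2


# Values from the example: n = 20, s = 6, hypothesised sigma = 9.
variance_statistic = chi_square_variance(20, 6, 9)  # 720 / 81

# Table value quoted in the notes for 19 degrees of freedom at 1%.
VARIANCE_TABLE_VALUE = 36.1909
accept_null = variance_statistic < VARIANCE_TABLE_VALUE

print(round(variance_statistic, 2), accept_null)
```

The statistic rounds to 8.89, well below the table value, so the null hypothesis is accepted, as in the interpretation above.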
Parametric vs Non-parametric
• Parametric tests => have information about
population, or can make certain assumptions
– Assume normal distribution of population.
– Data is distributed normally.
– Population variances are the same.
• Non-parametric tests are used when there are no
assumptions made about population distribution
– Also known as distribution free tests.
– But information about the sampling distribution is known.