0% found this document useful (0 votes)

5 views24 pages

Analysis of Variance

This document provides an introduction to Analysis of Variance (ANOVA), covering key concepts such as null and alternative hypotheses, types of errors, and the assumptions required for ANOVA. It explains the procedure for one-way and two-way ANOVA, including the calculation of the F-statistic and the interpretation of results. Additionally, it includes examples and practice exercises to illustrate the application of ANOVA in testing for differences in population means.

Uploaded by

harold chisunzi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views24 pages

Analysis of Variance

Uploaded by

harold chisunzi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 24

Unit 2

Introduction to Analysis of
Variance
(ANOVA)

C. Chafuwa April 2019 1

Null and Alternative hypothesis

• In hypothesis testing we begin by making a tentative

assumption about a population parameter. This tentative
assumption is called the null hypothesis and is denoted by H0.

• We then define another hypothesis, called the alternative

hypothesis, which is the opposite of what is stated in the null
hypothesis. The alternative hypothesis is denoted by Ha.

2
Some concepts
• Type I Error: Rejecting the null hypothesis when it is true.

• Type II Error: Accepting the null hypothesis when it is false.

• Level of confidence (α): Probability of committing Type I

Error.
• β is the probability of committing Type II Error.

• Power (1-β) is the probability of rejecting the null

hypothesis when in fact it is false.

• P-value: Probability that provides a measure of the evidence

against the null hypothesis provided by the sample. Smaller p-
values indicate more evidence against H0. 3
Errors and correct conclusions in hypothesis
testing

Population condition
H0 True Ha True

Accept H0 Correct conclusion Type II Error

Conclusion

Reject H0 Type I Error Correct conclusion

4
ANOV
A
• ANOVA – Analysis of Variance

• ANOVA can be used to test for the equality of k

population means using data obtained from a
completely randomized design as well as data
obtained from an observational study.

• ANOVA is the statistical procedure used to determine

whether the observed differences in three or more
sample means are large enough to reject H0.
5
Introduction to ANOVA
• The null hypothesis is that the several population
means are mutually equal.

• The sampling procedure used is that several

independent random samples are collected, one for
each of the data categories (treatment levels).

• The assumption underlying the use of the analysis of

variance is that the several sample means
were obtained
from normally distributed populations
having the same variance.
6
Assumptions for ANOVA
Three assumptions are required to use analysis of variance.

1. For each population, the response variable is

normally distributed
– Populations follow a normal distribution.

2. The variance of the response variable, denoted σ2, is the same

for all of the populations.
– σ2 1 = σ2 2 = σ2 3 = … = σ2 k
– This means the populations have equal standard deviations.

3. The populations are independent.

When these conditions are met, the F-statistic is used as the

test statistic. 7
Test statistic for ANOVA

Recall:
• To compare means of 2 groups we use a Z or a T statistic.
– Compare means from two groups to see whether they are so far
apart that the observed difference cannot reasonably be attributed to
sampling variability.

• To compare means of more than 2 groups, we use a new test

called ANOVA and a new statistic called F-statistic.
– Compare the means from two or more groups to see whether they
are so far apart that the observed differences cannot all reasonably be
attributed to sampling variability

8
Between- and Within- variability

• Variability between groups: amount of variation among sample

means due to assigned causes or treatments

• Variability within groups: the amount of variation within the

sample observations due to error (or unexplained chance
causes)

9
What are the characteristics of the F
distribution?
• The F distribution is continuous. This means that it can
assume an infinite number of values between zero and
positive infinity.
• The F distribution cannot be negative. The smallest value
F can assume is 0.
• It is positively skewed. The long tail of the distribution is
to the right-hand side. As the number of degrees of
freedom increases in both the numerator and denominator
the distribution approaches a normal distribution.
• It is asymptotic. As the values of X increase, the F curve
approaches the X-axis but never touches it. This is similar to
the behavior of the normal distribution.
10
Two types of ANOVA

One-way ANOVA
• The different populations are classified according to one
attribute or factor
– E.g. levels of crop production classified according to
type of fertilizer used

Two-way ANOVA
• The different populations are classified according to 2
attributes or factors
– E.g. levels of crop production classified according to
type of fertilizer used and seed type 11
One-way ANOVA
• Classification of populations is based on a single
factor or attribute or treatment.

• To make inferences about several popln means based

on the sample data, the null hypothesis for k
population means is

– H0: μ1= μ2= · · · = μk

– Ha: Not all k means are equal

12
What are the steps for testing the equality of
means using the one-way ANOVA procedure?
• Step 1: State the null and alternative hypotheses as follows:
– H0: μ1= μ2= · · · = μk
– Ha: Not all k means are equal.

• Step 2: Use the F-distribution table and the level of

significance, α, to determine the rejection region.

• Step 3: Build the ANOVA table, and from the table determine
the computed value of the F-ratio.

• Step 4: State your conclusion. The null hypothesis is rejected if

the computed value of the test statistic falls in the rejection
region. Otherwise, the null hypothesis is not rejected. 13
Testing procedure
• Estimate the popln variance from the variance between
sample means (sum of squares between treatments, SSTR)

• Treatment variation: sum of squared differences between each

treatment and the grand overall mean
– Calculate mean values for each sample
– Calculate the grand mean
– Calculate the difference between the mean of each sample
and the grand mean, square the difference, multiply by each
sample size, and sum over to the number of samples
(SSTR
– sum of squares between sample means)
– Between treatments mean squares (MSTR) = SSTR/k-1
14
Testing procedure cont’d

• Estimate the variance from the variance within the samples

• Random variation: sum of squared differences between each

observation and its treatment mean
– Calculate mean values for each sample
– Calculate the difference of each observation in k samples
from the mean value of the respective samples
– Finally, compute the sum of squares within samples (SSE –
Error Sum of Squares)
– Compute the variance from the variance within the
samples
(MSE)
15
Testing procedure cont’d

• Calculate the F-ratio or F-test statistic using the 2

population variances
• F = MSTR/MSE

• Using the calculated F-value, make a decision on the null

hypothesis (No difference among the means)

• Decision rule:
– If Fcal > Ftab, reject the null hypothesis
– Ftab from F-table using a given level of significance
and degrees of freedom k-1 and n-k
16
Alternative short-cut method

• Calculate the total of observations in samples from each

sample
• Calculate the correction factor (CF)
• Calculate the total sum of squares (SST)
– Total variation: sum of squared differences between each
observation and the overall mean
• Calculate the treatment sum of squares (between samples)
(SSTR) and MSTR
• Calculate the error sum of squares (within samples) (SSE) and
MSE
• Compute Fk-1, n-k

17
18
Example
4 NGOs were randomly selected in 3 districts in Malawi
to test whether significant variations exist in helping
farmers in adoption of farm technologies. The following
table records the number of farmers they have reached
with disseminating the farm technology.

District 1 District 2 District 3

NGO1 20 15 16

NGO2 10 10 5

NGO3 12 15 20

NGO4 15 7 5
19
Example cont’d
• Formulate the hypotheses to test if there are
significant differences in mean number of farmers
reached with disseminating the farm technology.

• Set up the ANOVA table, clearly showing your

calculations.

• Are there significant differences in the mean number

of farmers reached? Use the 95% confidence level.

20
Two-way ANOVA

• Variation within samples can be due to some measurable rather

than pure error
• SSE partitioned into 2:
– Unwanted variation due to some excluded variables
– Actual variation due to random error

• Therefore investigate 2 factors of interest for testing

the difference between sample means
• Also consider interaction between the 2 factors under
investigation (beyond scope of this course)

21
Two-way ANOVA

• The two-way ANOVA tests the null hypotheses of equal means for each
factor
• E.g suppose that an agricultural experiment consists of examining
the yields per acre of 4 different varieties of wheat, where each
variety is grown on 5 different plots of land.
– n=20
– Yield differences can be due to either of the 2 factors or
classifications:
1) type of wheat grown 2) block (or plot) used
– The 2 factors are referred to as treatments and blocks, or simply
factor 1 and factor 2
• Assume we have a treatments (factor 1) and b blocks (factor 2)
• It is supposed that there is one experimental value (such as yield per
acre) corresponding to each treatment and block combination.
22
Practice exercise
Table 1 gives fresh graduates daily earnings (in
thousands of MK) of former students with bachelor’s
degrees from 5 colleges and for 3 class rankings at
graduation.

Test at the 5% level of significance that the means are

identical
(a) for college populations and
(b) for class-ranking populations

23
Table 1
Bunda Chanco Poly CoM Nursing

Top 20 18 16 14 12

Middle 19 16 13 12 8

Bottom 18 14 10 10 10

LCA - Group 2
No ratings yet
LCA - Group 2
9 pages
Module 1 Introduction To Ecology
No ratings yet
Module 1 Introduction To Ecology
36 pages
ANOVA
0% (1)
ANOVA
26 pages
Corporate Strategic Human Resource Audit: A Paper Presented As A Final Requirement in
100% (1)
Corporate Strategic Human Resource Audit: A Paper Presented As A Final Requirement in
54 pages
Lecture 2
No ratings yet
Lecture 2
13 pages
16 Anova Updated
No ratings yet
16 Anova Updated
68 pages
Anova and Design of Experiments
No ratings yet
Anova and Design of Experiments
35 pages
Hypothesis Testing ANOVA Module 5
No ratings yet
Hypothesis Testing ANOVA Module 5
49 pages
Design of Experiments and ANOVA
No ratings yet
Design of Experiments and ANOVA
45 pages
T (Ea) For Two
No ratings yet
T (Ea) For Two
31 pages
Hypothesis Testing ANOVA
No ratings yet
Hypothesis Testing ANOVA
61 pages
Anova Mab2024
No ratings yet
Anova Mab2024
30 pages
ANOVA
No ratings yet
ANOVA
11 pages
ANOVA PPT Explained PDF
No ratings yet
ANOVA PPT Explained PDF
50 pages
Lecture 9: Analysis of Variance: Statistics For Economics 1
No ratings yet
Lecture 9: Analysis of Variance: Statistics For Economics 1
50 pages
Chapter10_ANOVA - Student(1)
No ratings yet
Chapter10_ANOVA - Student(1)
38 pages
UNIT 3 a
No ratings yet
UNIT 3 a
36 pages
Topic: ANOVA (Analysis of Variation) : Md. Jiyaul Mustafa
No ratings yet
Topic: ANOVA (Analysis of Variation) : Md. Jiyaul Mustafa
49 pages
12 Anova
No ratings yet
12 Anova
43 pages
Chapter 5, ANOVA
No ratings yet
Chapter 5, ANOVA
6 pages
Analysis of Variance ANOVA
No ratings yet
Analysis of Variance ANOVA
39 pages
Hypothesis Testing - Analysis of Variance
No ratings yet
Hypothesis Testing - Analysis of Variance
19 pages
Unit 8 8614 Research
No ratings yet
Unit 8 8614 Research
38 pages
STAT-107 Statistical Inference: Topic:)
No ratings yet
STAT-107 Statistical Inference: Topic:)
22 pages
Chapter 4 Hypotheses Testing of More Than Two Populations
No ratings yet
Chapter 4 Hypotheses Testing of More Than Two Populations
90 pages
Triola Cap 12 Slide
No ratings yet
Triola Cap 12 Slide
65 pages
Analysis of Variance
No ratings yet
Analysis of Variance
45 pages
Chapter 4 Stat
No ratings yet
Chapter 4 Stat
14 pages
Bst 32202 Linear Regression 3 Anova One Way
No ratings yet
Bst 32202 Linear Regression 3 Anova One Way
29 pages
Readings For Lecture 5,: S S N N S N
No ratings yet
Readings For Lecture 5,: S S N N S N
16 pages
Chapter 15 PDF
No ratings yet
Chapter 15 PDF
16 pages
CH V Anova
No ratings yet
CH V Anova
22 pages
Ch.6 ANOVA & F Distribution Overview-Module(2)
No ratings yet
Ch.6 ANOVA & F Distribution Overview-Module(2)
17 pages
Unit 3 Theoretical Questions
No ratings yet
Unit 3 Theoretical Questions
5 pages
QT Module-6
No ratings yet
QT Module-6
10 pages
Module 11
No ratings yet
Module 11
52 pages
Just Learn Stats
No ratings yet
Just Learn Stats
9 pages
Analysis of Variance
No ratings yet
Analysis of Variance
40 pages
An Ova 1
No ratings yet
An Ova 1
21 pages
Unit-12
No ratings yet
Unit-12
27 pages
Introduction To Analysis of VarianceC
No ratings yet
Introduction To Analysis of VarianceC
35 pages
Anovappt-141025002857-Conversion-Gate01 (1) - 240403 - 185855
No ratings yet
Anovappt-141025002857-Conversion-Gate01 (1) - 240403 - 185855
33 pages
Week5 ANOVA 202425
No ratings yet
Week5 ANOVA 202425
56 pages
Anova
67% (3)
Anova
55 pages
Last Lecture 1
No ratings yet
Last Lecture 1
17 pages
BBADM 221 Unit 10 - With Notes
No ratings yet
BBADM 221 Unit 10 - With Notes
51 pages
ANOVA-Notes
No ratings yet
ANOVA-Notes
26 pages
20-Introduction To Analysis of Variance
No ratings yet
20-Introduction To Analysis of Variance
31 pages
Chapter 7 Anova
No ratings yet
Chapter 7 Anova
20 pages
Analysis of Variance
No ratings yet
Analysis of Variance
20 pages
Session 12 - 2023
No ratings yet
Session 12 - 2023
43 pages
Analysis of Variance
No ratings yet
Analysis of Variance
4 pages
PLU Quantitative Techniques 4
No ratings yet
PLU Quantitative Techniques 4
13 pages
ANova & experiemntal design
No ratings yet
ANova & experiemntal design
40 pages
Anova-Ppt For Sonia Kalra Ma'Am
No ratings yet
Anova-Ppt For Sonia Kalra Ma'Am
31 pages
Chapter 6 - S
No ratings yet
Chapter 6 - S
66 pages
OneWayANOVA LectureNotes
No ratings yet
OneWayANOVA LectureNotes
13 pages
CHAPTER 12 Analysis of Variance
No ratings yet
CHAPTER 12 Analysis of Variance
49 pages
Unit 5 - STUDENTS - ANOVA
No ratings yet
Unit 5 - STUDENTS - ANOVA
32 pages
Chapter - 13 Correlation and Linear Regression
No ratings yet
Chapter - 13 Correlation and Linear Regression
26 pages
The One-Way Analysis of Variance (ANOVA) Process For STAT 461 Students at Penn State University
No ratings yet
The One-Way Analysis of Variance (ANOVA) Process For STAT 461 Students at Penn State University
7 pages
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet
Statistics II Essentials
From Everand
Statistics II Essentials
Emil Milewski
2.5/5 (1)
LEAP-ZPCET-PT04-PHYSICS PRACTICE PAPER
No ratings yet
LEAP-ZPCET-PT04-PHYSICS PRACTICE PAPER
5 pages
Chapter 10 Polar Coordinates
No ratings yet
Chapter 10 Polar Coordinates
4 pages
Copy of Mathematics SSS3 Scheme of Work - SyllabusNG
No ratings yet
Copy of Mathematics SSS3 Scheme of Work - SyllabusNG
16 pages
INCLUSIVE LEARNING ENVIRONMENT FOR CHILDREN WITH AUTISM
No ratings yet
INCLUSIVE LEARNING ENVIRONMENT FOR CHILDREN WITH AUTISM
130 pages
Understanding Employee Motivation and Organizational Performance: Arguments For A Set-Theoretic Approach
No ratings yet
Understanding Employee Motivation and Organizational Performance: Arguments For A Set-Theoretic Approach
8 pages
(Ch-10) Wave Optics DPP 3
No ratings yet
(Ch-10) Wave Optics DPP 3
4 pages
Technical Information - Sterikon
No ratings yet
Technical Information - Sterikon
4 pages
Huawei Gabinete - Indoor
No ratings yet
Huawei Gabinete - Indoor
2 pages
ISL88738HRTZ
No ratings yet
ISL88738HRTZ
1 page
GE3 - Learning Module 9 - Midterm Period
No ratings yet
GE3 - Learning Module 9 - Midterm Period
4 pages
BRI402
No ratings yet
BRI402
3 pages
Sampling Station: Source: Sampling Date: Testing Date:: Determination of Ten Per Cent Fines Value (TFV)
No ratings yet
Sampling Station: Source: Sampling Date: Testing Date:: Determination of Ten Per Cent Fines Value (TFV)
2 pages
Flow Vision Lab Assignment-2: Submitted in Fulfillment of The Requirements of
No ratings yet
Flow Vision Lab Assignment-2: Submitted in Fulfillment of The Requirements of
8 pages
Worksheet G11 Art
No ratings yet
Worksheet G11 Art
3 pages
Solution Ej 4 EDO
No ratings yet
Solution Ej 4 EDO
7 pages
Super Dual Band MW Antenna - Design Doc - 2022
No ratings yet
Super Dual Band MW Antenna - Design Doc - 2022
4 pages
Artificial_Intelligence_and_Machine_Learning_for_G
No ratings yet
Artificial_Intelligence_and_Machine_Learning_for_G
17 pages
Introduction to English language-Final paper-Phạm Hà Kiều Oanh-D21NNAN10
No ratings yet
Introduction to English language-Final paper-Phạm Hà Kiều Oanh-D21NNAN10
5 pages
Lec12HW
No ratings yet
Lec12HW
3 pages
Autonomous Urban Garden
No ratings yet
Autonomous Urban Garden
6 pages
Evolution of Pharmacy: The Drug-Taking Animal
No ratings yet
Evolution of Pharmacy: The Drug-Taking Animal
20 pages
Select The Next Number For IP Address
No ratings yet
Select The Next Number For IP Address
17 pages
Shanu Resume DBL New
No ratings yet
Shanu Resume DBL New
2 pages
Grammar and Vocabulary
100% (2)
Grammar and Vocabulary
3 pages
Scent Marketing The Results Are In, Happier Customers Who Remember Your Brand and Linger Longer
No ratings yet
Scent Marketing The Results Are In, Happier Customers Who Remember Your Brand and Linger Longer
21 pages
Engineering Management-2
No ratings yet
Engineering Management-2
23 pages
Class 5B Promotion List
No ratings yet
Class 5B Promotion List
2 pages

Analysis of Variance

Uploaded by

Analysis of Variance

Uploaded by

Unit 2

C. Chafuwa April 2019 1

• In hypothesis testing we begin by making a tentative

• We then define another hypothesis, called the alternative

• Type II Error: Accepting the null hypothesis when it is false.

• Level of confidence (α): Probability of committing Type I

• Power (1-β) is the probability of rejecting the null

• P-value: Probability that provides a measure of the evidence

Accept H0 Correct conclusion Type II Error

Reject H0 Type I Error Correct conclusion

• ANOVA can be used to test for the equality of k

• ANOVA is the statistical procedure used to determine

• The sampling procedure used is that several

• The assumption underlying the use of the analysis of

1. For each population, the response variable is

2. The variance of the response variable, denoted σ2, is the same

3. The populations are independent.

When these conditions are met, the F-statistic is used as the

• To compare means of more than 2 groups, we use a new test

• Variability between groups: amount of variation among sample

• Variability within groups: the amount of variation within the

• To make inferences about several popln means based

– H0: μ1= μ2= · · · = μk

– Ha: Not all k means are equal

• Step 2: Use the F-distribution table and the level of

• Step 4: State your conclusion. The null hypothesis is rejected if

• Treatment variation: sum of squared differences between each

• Estimate the variance from the variance within the samples

• Random variation: sum of squared differences between each

• Calculate the F-ratio or F-test statistic using the 2

• Using the calculated F-value, make a decision on the null

• Calculate the total of observations in samples from each

District 1 District 2 District 3

• Set up the ANOVA table, clearly showing your

• Are there significant differences in the mean number

• Variation within samples can be due to some measurable rather

• Therefore investigate 2 factors of interest for testing

Test at the 5% level of significance that the means are

You might also like