0% found this document useful (0 votes)

20 views

Just Learn Stats

Uploaded by

MounicaRasagyaPalla

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views

Just Learn Stats

Uploaded by

MounicaRasagyaPalla

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

www.analyttica.

com

Introduction to
ANOVA

© Analy Datalab Inc., 2016. All rights reserved.

https://leaps.analyttica.com

Table of Contents

What is ANOVA?

Assumptions of ANOVA

One-Way ANOVA

Two-Way ANOVA

Advantages and Limitations of ANOVA

Page 2
https://leaps.analyttica.com

What is ANOVA?
Analysis of Variance (ANOVA) is a statistical method used to test differences between
two or more means by analysing the variations in observations between and within
different groups. It was developed by Ronald Fisher.

ANOVA is based on the principal of total variance, where the total observed variation is
partitioned into two subcomponents, namely, the variance between the groups and the
variance within the groups. Using these two variances, we can statistically test whether
the groups are significantly different or not. This process is explained later in this
document.

The three principles of ANOVA are as follows:

1) Randomization: Consider a hospital is analysing the effect of three drugs, Drug A, Drug
B and Drug C. These are to be tested on 10 patients (say). Randomization implies that
each patient is equally likely to receive any of the three drugs. There is no pre-
experiment bias while administering the drugs on the patients. One-way ANOVA takes
care of the randomization priniciple. It is also called a completely randomized design
(CRD). In this example, patients 1,3,4 and 7 may receive Drug 1, patients 2,5 and 9 may
receive Drug 2, and the remaining patients may receive Drug 3.

2) Replication: Consider a farmer wants to test the effectiveness of three fertilizers on his
crops, Fertilizer A, Fertilizer B and Fertilizer C. He also wants to consider the effect of
soil type on his crops. Suppose his plot of land has five different types of soil, and he
wants to plant 15 crops in total. The ideal design in such a case would be to divide the
land into five heterogenous groups (called as blocks), each block corresponding to one
particular type of soil, and each block having three crops. Then, he would take each
block, and apply the three fertilizers to the block in a random manner. One thing to be
kept in mind is that, he would have to apply all three fertilizers in each block. Such a
design is called is called a randomized block design or two-way ANOVA, and it takes into
consideration both the randomization principle, as within each block the treatments are
applied randomly, and also the replication principle, as the same treatments are applied in
every block.

Figure 1: Sample Design of RBD

Page 3
https://leaps.analyttica.com

Assumptions of ANOVA
While carrying out ANOVA, one must keep in mind the following assumptions:

1) Normality: Each sample unit is taken from a normal distribution

2) Independence: All sample units are independent of each other

3) Homoscedasticity: The variance across different groups must be equal.

4) Continuity: The dependant variable must be a continuous numerical variable

Although these assumptions are necessary and essential while theoretically deriving the
results obtained from ANOVA, however, practically, very few real-life datasets follow
any of these criteria, and in such cases, the user may choose to carry out ANOVA
anyway, ignoring the violation of the assumptions.

One-Way ANOVA
One-way ANOVA involves one independent categorical variable, and one dependant
continuous variable. This technique is used to analyse whether the different categories of
the independent variable differ significantly, based on the differences in the mean value
of the dependent variable for each category. It involves dividing the total variation in the
dependant variable into the explained variation and the unexplained variation.
The explained variation is due to the application of the different treatments. The
unexplained variation is the variation which cannot be numerically explained. It may be
due experimental error or sampling error.

Suppose the independent variable has k classes, and y represents the dependent variable.
Consider the following notations:

y!" = the j#$ unit in the i#$ class.

where, i = 1,2, … , k and j = 1,2, … , n!

n = total sample units

n! = number of units in the i#$ class and ∑%!&' n! = n
y=!. = mean of the i#$ class
y=. . = overall mean
Then, the total sum of squares is defined as,
% *!
)
Total sum of squares = A ABy!" − y=. . D
!&' "&'

Page 4
https://leaps.analyttica.com

The total sum of squares, after simplification, can be written as follows,

% *! % *! % *!
) )
A ABy!" − y=. . D = A A(y!. − y=. . )) + A ABy!" − y=!. D
!&' "&' !&' "&' !&' "&'

The first term on the right hand side is called the Treatment Sum of Squares, or the
Between Sum of squares, and the second term on the right hand side is called the Error
Sum of Squares or the Within Sum of Squares.

Using these values, our objective is to test the null hypothesis H+ vs the alternative
hypothesis H' , where,

H+ = All categories have equal mean

H' = Not all categories have equal mean
We use the F test to test the hypothesis, where the test statistic is defined as
Variance between Treatments SS./,0#1,*# /(k − 1)
F#,-# = =
Variance within Treatments SS2//3/ /(n − k)
If the value of F#,-# is close to 1, we can accept the null hypothesis. As the value increases
above 1, the evidence against the null hypothesis increases.
As an example, we consider a data case where we want to find whether the final score in
a Mathematics exam is significantly different among different study groups.
Our null hypothesis and alternative hypothesis are as follows:

H+ : Maths score does not differ significantly across study groups

H' : Maths score differs significantly across study groups
Here, our independent variable is ‘Study Group’ and the dependent variable is ‘Math
Score’. We obtain the final ANOVA table:

Page 5
https://leaps.analyttica.com

Figure 2: Summary Table for One-Way ANOVA

As we see from the table, the F value is high, which indicates that we should reject the
null hypothesis.

Another indicator of testing the hypothesis is the p-value. The p-value is essentially the
probability that the null hypothesis is true. Usually, we keep a level of significance of 5%.
This means that, if the p-value is less than 5%, we reject the null hypothesis. Else, we
accept the null hypothesis. However, the desired level of significance can change
depending on our requirement, and correspondingly, our inference and decision to accept
or reject the null hypothesis will change.

In the above data case, we see that the p-value is extremely low. So, we can safely reject
the null hypothesis at 5% level of significance. Our final inference will be that all study
groups do not have the same mean test score in Mathematics. In other words, study
groups have a significant effect on the Maths score.

Two-way ANOVA
In two-way ANOVA, we have two independent categorical variables, and a dependent
variable. Along with testing the equality of means of the individual categorical variables,
two-way ANOVA also helps in testing the significance of the interaction between the
two independent variables on the dependent variable.
The calculations and testing of hypotheses are similar to one-way ANOVA. The only
difference is that we have the additional terms for the sum of squares of the second
categorical variable, along with the interaction sum of squares.
If we have two categorical variables A and B, then the total sum of squares can be written
as,

SS.3#04 = SS5 + SS6 + SS56 + SS2//3/

For two-way ANOVA, we have the following null hypotheses:

Page 6
https://leaps.analyttica.com

H+' = The levels (or categories) of variable A do not differ significantly

H+) = The levels of variable B do not differ significantly
H+7 = There is no significant interaction effect between variables A and B
The alternative hypotheses are the complement of the corresponding null hypotheses.
Just like in one-way ANOVA, we use the F test to test each of the hypotheses. In each
case, if the test statistic value is close to 1, we accept the corresponding null hypothesis.
Else, we reject the null hypothesis.
In one-way ANOVA, we had studied the effect of Study Group on the Maths score. Let us
introduce another categorical variable: Test Preparation. We want to test whether the
Maths score varies significantly across different study groups and different levels of test
preparation, and we also want to find out the significance of the interaction effect of
study group and test preparation on Maths score.

The null hypotheses are:

H+' : Maths score does not differ significantly across study groups
H+) : Maths score does not differ significantly across test preparation levels
H+7 : There is no significant interaction effect between study group
and test preparation on Maths score
The corresponding alternative hypotheses are:

H'' : Maths score differs significantly across study groups

H') : Maths score differs significantly across test preparation levels
H'7 : There is significant interaction effect between study group and
test preparation on Maths score
We obtain the following ANOVA table:

Figure 3: Summary Table for Two-Way ANOVA

Page 7
https://leaps.analyttica.com

As we see, the p-values for both Study Group and Test Preparation is very small,
indicating that the Maths score differs significantly between different Study Groups and
different Test Preparation levels.

Advantages and Limitations of ANOVA

Advantages

1) Compared to other tests, ANOVA is a robust test against violations of its assumptions.

2) ANOVA facilitates testing of differences among multiple means without increasing the
Type I error rate i.e. increases statistical power.

3) Two-way ANOVA looks at interaction between factors, reduces random variability, it

provides a mechanism to look at effect on second variable after controlling the first
variable.

Limitations

1) Requires that the population distributions are normal. It assumes equality of variances
for each group which may not be true at times.

2) A one-way ANOVA will confirm that at least two groups are different from each other;
however, it does not confirm what groups are different. If H+ is rejected, to find out
which exact groups have a difference in means, you need to run Fisher’s LSD or pairwise t
test.

Page 8
https://leaps.analyttica.com

Write to us at

[email protected]

USA Address
Analyttica Datalab Inc.
1007 N. Orange St, Floor-4,
Wilmington, Delaware - 19801
Tel: +1 917 300 3289/3325

India Address
Analyttica Datalab Pvt. Ltd.
702, Brigade IRV Centre,2nd Main Rd,
Nallurhalli,
Whitefield, Bengaluru - 560066.
Tel : +91 80 4650 7300

Page 9

PS4 PDF
No ratings yet
PS4 PDF
10 pages
LSCM 3403 Assigned Problems - F 2018
No ratings yet
LSCM 3403 Assigned Problems - F 2018
2 pages
SMuR Complete
No ratings yet
SMuR Complete
114 pages
18MEO113T - DOE - Unit 5 - AY2023 - 24 ODD
No ratings yet
18MEO113T - DOE - Unit 5 - AY2023 - 24 ODD
76 pages
Unit 8 8614 Research
No ratings yet
Unit 8 8614 Research
38 pages
DAV 2 UNIT
No ratings yet
DAV 2 UNIT
7 pages
Anovaparametrictest 240312091837 c0b4bb94
No ratings yet
Anovaparametrictest 240312091837 c0b4bb94
12 pages
Business Statics
No ratings yet
Business Statics
28 pages
ANALYSIS OF VARIANCE
No ratings yet
ANALYSIS OF VARIANCE
11 pages
What Is Analysis of Variance
No ratings yet
What Is Analysis of Variance
15 pages
T (Ea) For Two
No ratings yet
T (Ea) For Two
31 pages
5 ASAP Advanced Statistics - ANOVA - Total
No ratings yet
5 ASAP Advanced Statistics - ANOVA - Total
127 pages
One Way Annova (SPSS)
No ratings yet
One Way Annova (SPSS)
10 pages
Hypothesis Testing ANOVA Module 5
No ratings yet
Hypothesis Testing ANOVA Module 5
49 pages
Analysis of Variance
No ratings yet
Analysis of Variance
25 pages
ANOVA
No ratings yet
ANOVA
29 pages
Statistics FOR Management Assignment - 2: One Way ANOVA Test
No ratings yet
Statistics FOR Management Assignment - 2: One Way ANOVA Test
15 pages
Analysis of Variance
No ratings yet
Analysis of Variance
4 pages
-WEEK 8- Analysis of Variance_copy
No ratings yet
-WEEK 8- Analysis of Variance_copy
11 pages
ANOVA Test in Python1
No ratings yet
ANOVA Test in Python1
12 pages
TWO WAY ANOVA Final
No ratings yet
TWO WAY ANOVA Final
12 pages
BBADM 221 Unit 10 - With Notes
No ratings yet
BBADM 221 Unit 10 - With Notes
51 pages
Lecture 10 - ANOVA
No ratings yet
Lecture 10 - ANOVA
27 pages
ANOVA-Reader
No ratings yet
ANOVA-Reader
7 pages
Anova
No ratings yet
Anova
4 pages
ANOVA Executive Summary
No ratings yet
ANOVA Executive Summary
6 pages
Anova and Design of Experiments
No ratings yet
Anova and Design of Experiments
35 pages
One-Way ANOVA Is Used To Test If The Means of Two or More Groups Are Significantly Different
No ratings yet
One-Way ANOVA Is Used To Test If The Means of Two or More Groups Are Significantly Different
17 pages
ANOVA
No ratings yet
ANOVA
36 pages
ANOVA
No ratings yet
ANOVA
4 pages
Chapter7 ANOVA
No ratings yet
Chapter7 ANOVA
20 pages
Session 10
No ratings yet
Session 10
10 pages
One Way ANOVA, Two Way ANOVA and Interaction ANOVA
No ratings yet
One Way ANOVA, Two Way ANOVA and Interaction ANOVA
25 pages
What Is Analysis of Variance (ANOVA) ?: Z-Test Methods
No ratings yet
What Is Analysis of Variance (ANOVA) ?: Z-Test Methods
7 pages
F Test
No ratings yet
F Test
19 pages
17-18.anova Doe
No ratings yet
17-18.anova Doe
18 pages
Anova
No ratings yet
Anova
38 pages
ANOVA
No ratings yet
ANOVA
19 pages
Data Preparation & Analysis
No ratings yet
Data Preparation & Analysis
27 pages
Mm13 Content Module 9
No ratings yet
Mm13 Content Module 9
12 pages
Topic: ANOVA (Analysis of Variation) : Md. Jiyaul Mustafa
No ratings yet
Topic: ANOVA (Analysis of Variation) : Md. Jiyaul Mustafa
49 pages
Aritra Majumder QUANTATIVE TECHNIQUES
No ratings yet
Aritra Majumder QUANTATIVE TECHNIQUES
10 pages
One Way Final
No ratings yet
One Way Final
9 pages
ESD 515 Research Methodology and Methods
No ratings yet
ESD 515 Research Methodology and Methods
5 pages
11-Anova For BRM
No ratings yet
11-Anova For BRM
39 pages
Hypothesis Testing Using The One-Way Analysis of Variance
No ratings yet
Hypothesis Testing Using The One-Way Analysis of Variance
52 pages
Anova
No ratings yet
Anova
46 pages
Techniques of Annova_20241103_232802_0000
No ratings yet
Techniques of Annova_20241103_232802_0000
32 pages
Advanced Statistics: Analysis of Variance (ANOVA) Dr. P.K.Viswanathan (Professor Analytics)
No ratings yet
Advanced Statistics: Analysis of Variance (ANOVA) Dr. P.K.Viswanathan (Professor Analytics)
19 pages
Analysis of Variance (ANOVA)
No ratings yet
Analysis of Variance (ANOVA)
23 pages
Annova
0% (1)
Annova
19 pages
Correlation Regression Hypo ANOVA
No ratings yet
Correlation Regression Hypo ANOVA
22 pages
Chapter 5 Analysis of Variance (ANOVA)
No ratings yet
Chapter 5 Analysis of Variance (ANOVA)
10 pages
IGNOU MBA MS-95 Solved Assignment Dec 2012
No ratings yet
IGNOU MBA MS-95 Solved Assignment Dec 2012
14 pages
ANova & experiemntal design
No ratings yet
ANova & experiemntal design
40 pages
unit 4- notes
No ratings yet
unit 4- notes
14 pages
Class Notes
No ratings yet
Class Notes
25 pages
Analysis of Variance (ANOVA)
No ratings yet
Analysis of Variance (ANOVA)
8 pages
T-Tests Type I Errors: Developed by Ronald Fisher, ANOVA Stands For Analysis of Variance
No ratings yet
T-Tests Type I Errors: Developed by Ronald Fisher, ANOVA Stands For Analysis of Variance
5 pages
Lecture 2 Anova Erb
No ratings yet
Lecture 2 Anova Erb
26 pages
One Way Anova
No ratings yet
One Way Anova
5 pages
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Design and Analysis of Disc Plate in Hot Blast Valve #DN1800
No ratings yet
Design and Analysis of Disc Plate in Hot Blast Valve #DN1800
8 pages
Condition Monitoring Engineer (13010) : Perfectly
No ratings yet
Condition Monitoring Engineer (13010) : Perfectly
1 page
Mounica Palla M.Tech Design Engineer
No ratings yet
Mounica Palla M.Tech Design Engineer
3 pages
NPD Baker Huges JD
No ratings yet
NPD Baker Huges JD
3 pages
Spectradaq-200 Is A Precision Data Acquisition Sound Card Optimized For Test and Measurement
No ratings yet
Spectradaq-200 Is A Precision Data Acquisition Sound Card Optimized For Test and Measurement
2 pages
To Whom It May Concern
No ratings yet
To Whom It May Concern
1 page
2020 09 25 - Masterclass On EV NVH - RoadNoiseTPA
No ratings yet
2020 09 25 - Masterclass On EV NVH - RoadNoiseTPA
78 pages
Experiment Evolution For Downdraft Gasifier: With Using Various Biomass Wood, Bagasse and Coconut Shell
No ratings yet
Experiment Evolution For Downdraft Gasifier: With Using Various Biomass Wood, Bagasse and Coconut Shell
5 pages
Impact Test by LMS - Mounica Palla
No ratings yet
Impact Test by LMS - Mounica Palla
8 pages
2020 09 17 - Masterclass On EV NVH - SoundDesign
No ratings yet
2020 09 17 - Masterclass On EV NVH - SoundDesign
45 pages
Impact Test by LMS - Mounica Palla
No ratings yet
Impact Test by LMS - Mounica Palla
8 pages
SpectraPLUS-RT Features and Specifications
No ratings yet
SpectraPLUS-RT Features and Specifications
1 page
Report
No ratings yet
Report
1 page
Driver Installation: Selecting The Device
No ratings yet
Driver Installation: Selecting The Device
3 pages
System Features:: RH560 Wireless Data Collector Input and Output
No ratings yet
System Features:: RH560 Wireless Data Collector Input and Output
2 pages
SpectraPLUS Product Brochure
No ratings yet
SpectraPLUS Product Brochure
1 page
SpectraPLUS-SC Features and Specifications
No ratings yet
SpectraPLUS-SC Features and Specifications
1 page
Spectraplus-Sc Product Options
No ratings yet
Spectraplus-Sc Product Options
2 pages
SpectraPLUS-SC Features and Specifications
No ratings yet
SpectraPLUS-SC Features and Specifications
1 page
RONDS Intelligent Wireless Condition Monitoring System: Anhui Rong Zhi Ri Xin Information Technology Co., LTD
No ratings yet
RONDS Intelligent Wireless Condition Monitoring System: Anhui Rong Zhi Ri Xin Information Technology Co., LTD
12 pages
RH802 Dual Channel Vibration Analyzer Manual
No ratings yet
RH802 Dual Channel Vibration Analyzer Manual
7 pages
RONDS RH711 - RH802 Portable Vibraion Analyzer Introduction
No ratings yet
RONDS RH711 - RH802 Portable Vibraion Analyzer Introduction
37 pages
Relative Humidity and Temperature Sensor
No ratings yet
Relative Humidity and Temperature Sensor
23 pages
Advanced Mechatronics Portfolio: Vibration Control A V C - Add
No ratings yet
Advanced Mechatronics Portfolio: Vibration Control A V C - Add
2 pages
Hasil Data Project Spasial - Summary
No ratings yet
Hasil Data Project Spasial - Summary
8 pages
Pert
No ratings yet
Pert
52 pages
Formula Sheet
No ratings yet
Formula Sheet
5 pages
Bivariate Data Analysis
100% (1)
Bivariate Data Analysis
34 pages
Lect - 00 - Course Information
No ratings yet
Lect - 00 - Course Information
16 pages
Forecasting Using Simple Exponential Smoothing Method: Acta Electrotechnica Et Informatica December 2012
No ratings yet
Forecasting Using Simple Exponential Smoothing Method: Acta Electrotechnica Et Informatica December 2012
6 pages
Econometric Project - Linear Regression Model
No ratings yet
Econometric Project - Linear Regression Model
17 pages
Brand Loyalty Data Logistic Regression
No ratings yet
Brand Loyalty Data Logistic Regression
4 pages
Business Statistics A First Course 8th Edition David Levine Kathryn Szabat download
100% (1)
Business Statistics A First Course 8th Edition David Levine Kathryn Szabat download
39 pages
Arima Modeling With R Listendata
No ratings yet
Arima Modeling With R Listendata
12 pages
Measures of Spread
No ratings yet
Measures of Spread
9 pages
Machine Learning: Notes by Aniket Sahoo - Part II
No ratings yet
Machine Learning: Notes by Aniket Sahoo - Part II
140 pages
Using R For Introductory Statistics Second Edition John Verzanidownload
100% (1)
Using R For Introductory Statistics Second Edition John Verzanidownload
52 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
Guitguit, Jazmine B. - Module 3
No ratings yet
Guitguit, Jazmine B. - Module 3
3 pages
Stat Prob Q4 Module 4
50% (2)
Stat Prob Q4 Module 4
20 pages
1 PB
No ratings yet
1 PB
8 pages
Short Quiz 1
No ratings yet
Short Quiz 1
3 pages
Logistic Regression Analysis
100% (4)
Logistic Regression Analysis
65 pages
Sample Size Determination
No ratings yet
Sample Size Determination
42 pages
Lecturenotes12 10
No ratings yet
Lecturenotes12 10
22 pages
Tobit Postestimation - Postestimation Tools For Tobit
No ratings yet
Tobit Postestimation - Postestimation Tools For Tobit
5 pages
PHD Thesis (Eaves)
No ratings yet
PHD Thesis (Eaves)
372 pages
CPK Calculation
No ratings yet
CPK Calculation
7 pages
FORCASTING Example
No ratings yet
FORCASTING Example
17 pages
Formula Sheet and Statistical Tables
100% (1)
Formula Sheet and Statistical Tables
11 pages
Statistics Syllabus gyanSHiLA
No ratings yet
Statistics Syllabus gyanSHiLA
8 pages
A-Cat Corp - Forecasting
No ratings yet
A-Cat Corp - Forecasting
7 pages

Just Learn Stats

Uploaded by

Just Learn Stats

Uploaded by

www.analyttica.

© Analy Datalab Inc., 2016. All rights reserved.

Advantages and Limitations of ANOVA

The three principles of ANOVA are as follows:

Figure 1: Sample Design of RBD

1) Normality: Each sample unit is taken from a normal distribution

2) Independence: All sample units are independent of each other

3) Homoscedasticity: The variance across different groups must be equal.

4) Continuity: The dependant variable must be a continuous numerical variable

y!" = the j#$ unit in the i#$ class.

n = total sample units

The total sum of squares, after simplification, can be written as follows,

H+ = All categories have equal mean

H+ : Maths score does not differ significantly across study groups

Figure 2: Summary Table for One-Way ANOVA

SS.3#04 = SS5 + SS6 + SS56 + SS2//3/

For two-way ANOVA, we have the following null hypotheses:

H+' = The levels (or categories) of variable A do not differ significantly

The null hypotheses are:

H'' : Maths score differs significantly across study groups

Figure 3: Summary Table for Two-Way ANOVA

Advantages and Limitations of ANOVA

3) Two-way ANOVA looks at interaction between factors, reduces random variability, it

You might also like