Multivariate_Analysis_of_Variance-MANOVA
Multivariate_Analysis_of_Variance-MANOVA
com
Chapter 415
Introduction
Multivariate analysis of variance (MANOVA) is an extension of common analysis of variance (ANOVA). In
ANOVA, differences among various group means on a single-response variable are studied. In MANOVA, the
number of response variables is increased to two or more. The hypothesis concerns a comparison of
vectors of group means. When only two groups are being compared, the results are identical to Hotelling’s
T² procedure.
The multivariate extension of the F-test is not completely direct. Instead, several test statistics are available,
such as Wilks’ Lambda and Lawley’s trace. The actual distributions of these statistics are difficult to calculate,
so we rely on approximations based on the F-distribution.
Technical Details
A MANOVA has one or more factors (each with two or more levels) and two or more dependent variables.
The calculations are extensions of the general linear model approach used for ANOVA.
Unlike the univariate situation in which there is only one statistical test available (the F-ratio), the
multivariate situation provides several alternative statistical tests. We will describe these tests in terms of
two matrices, H and E. H is called the hypothesis matrix and E is the error matrix. These matrices may be
computed using a number of methods. In NCSS, we use the standard general linear models (GLM) approach
in which a sum of squares and cross-products matrix is computed. This matrix is based on the dependent
variables and independent variables generated for each degree of freedom in the model. It may be
partitioned according to the terms in the model.
415-1
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
𝜙𝜙𝑖𝑖
𝜃𝜃𝑖𝑖 = 1 − 𝜆𝜆𝑖𝑖 =
1 + 𝜙𝜙𝑖𝑖
𝜃𝜃𝑖𝑖 1 − 𝜆𝜆𝑖𝑖
𝜙𝜙𝑖𝑖 = =
1 − 𝜃𝜃𝑖𝑖 𝜆𝜆𝑖𝑖
1
𝜆𝜆𝑖𝑖 = 1 − 𝜃𝜃𝑖𝑖 =
1 + 𝜙𝜙𝑖𝑖
Wilks’ Lambda
Define Wilks’ Lambda as follows:
|𝐸𝐸|
Λ 𝑝𝑝,ℎ,𝑒𝑒 =
|𝐸𝐸 + 𝐻𝐻|
𝑝𝑝
= ��1 − 𝜃𝜃𝑗𝑗 �
𝑗𝑗=1
with e ≥ p.
The following approximation based on the F-distribution is used to determine significance levels:
1
𝑓𝑓 = 𝑒𝑒 − (𝑝𝑝 − ℎ + 1)
2
𝑝𝑝ℎ − 2
𝑔𝑔 =
2
𝑝𝑝2 ℎ2 − 4 2 2
�
𝑡𝑡 = � 𝑝𝑝2 + ℎ2 − 5 if 𝑝𝑝 + ℎ − 5 > 0
1 otherwise
415-2
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
𝑠𝑠
𝑇𝑇𝑔𝑔2 = 𝑒𝑒 � 𝜙𝜙𝑗𝑗
𝑗𝑗=1
where
𝑠𝑠 = min(𝑝𝑝, ℎ)
The following approximation based on the F-distribution is used to determine significance levels:
𝑇𝑇𝑔𝑔2
𝐹𝐹𝑎𝑎,𝑏𝑏 =
𝑐𝑐𝑐𝑐
where
𝑎𝑎 = 𝑝𝑝ℎ
𝑏𝑏 = 4 + (𝑎𝑎 + 2)/(𝐵𝐵 − 1)
𝑎𝑎(𝑏𝑏 − 2)
𝑐𝑐 =
𝑏𝑏(𝑒𝑒 − 𝑝𝑝 − 1)
(𝑒𝑒 + ℎ − 𝑝𝑝 − 1)(𝑒𝑒 − 1)
𝐵𝐵 =
(𝑒𝑒 − 𝑝𝑝 − 3)(𝑒𝑒 − 𝑝𝑝)
Pillai’s Trace
Pillai’s trace statistic, 𝑉𝑉 (𝑠𝑠) , is defined as follows:
𝑠𝑠
(𝑠𝑠)
𝑉𝑉 = � 𝜃𝜃𝑗𝑗 = tr(𝐻𝐻(𝐸𝐸 + 𝐻𝐻)−1 )
𝑗𝑗=1
where
𝑠𝑠 = min(𝑝𝑝, ℎ)
The following approximation based on the F-distribution is used to determine significance levels:
415-3
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
where
𝑠𝑠 = min(𝑝𝑝, ℎ)
𝑚𝑚 = (|𝑝𝑝 − ℎ| − 1)/2
𝑛𝑛 = (𝑒𝑒 − 𝑝𝑝 − 1)/2
2𝜈𝜈2 + 2
𝐹𝐹(2𝜈𝜈1 +2),(2𝜈𝜈2 +2) = 𝜙𝜙
2𝜈𝜈1 + 2 𝑚𝑚𝑚𝑚𝑚𝑚
where
𝑠𝑠 = min(𝑝𝑝, ℎ)
415-4
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Linearity
MANOVA assumes linear relationships among the dependent variables within a particular cell. You should
study scatter plots of each pair of dependent variables using a different color for each level of a factor. Look
carefully for curvilinear patterns and for outliers. The occurrence of curvilinear relationships will reduce the
power of the MANOVA tests.
415-5
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Data Structure
The data must be entered in a format that places the dependent variables and values of each factor side by
side. An example of the data for a MANOVA design is shown in the table below. In this example, WRATR and
WRATA are the two dependent variables. Treatment and Disability are two factor variables. This database is
stored in the file MANOVA1.
415-6
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Setup
To run this example, complete the following steps:
Variables Tab
___________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________________
The Expected Mean Square expressions are provided to show the appropriate error term for each factor.
The correct error term for a factor is that term that is identical except for the factor being tested.
Source Term
The source of variation or term in the model.
415-7
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
DF
The degrees of freedom. The number of observations “used” by this term.
Term Fixed?
Indicates whether the term is “fixed” or “random.”
Denominator Term
Indicates the term used as the denominator in the F-ratio.
MANOVA Tests
─────────────────────────────────────────────────────────────────────────
Term(DF): Name Decision
Test Statistic Test Value DF1 DF2 F-Ratio P-Value (α = 0.05)
───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
A(1): Treatment
Wilks' Lambda 0.137721 2 11 34.44 0.000018 Reject
Lawley - Hotelling Trace 6.261036 2 11 34.44 0.000018 Reject
Pillai's Trace 0.862279 2 11 34.44 0.000018 Reject
Roy's Largest Root 6.261036 2 11 34.44 0.000018 Reject
F (WRATR) 2090.888889 1 12 46.12 0.000019 Reject
F (WRATA) 1494.222222 1 12 33.25 0.000089 Reject
B(2): Disability
Wilks' Lambda 0.255263 4 22 5.39 0.003528 Reject
Lawley - Hotelling Trace 2.895034 4 20 7.24 0.000896 Reject
Pillai's Trace 0.750481 4 24 3.60 0.019460 Reject
Roy's Largest Root 2.887241 2 12 17.32 0.000290 Reject
F (WRATR) 260.388889 2 12 5.74 0.017784 Reject
F (WRATA) 563.388889 2 12 12.54 0.001151 Reject
AB(2)
Wilks' Lambda 0.908068 4 22 0.27 0.893037 Accept
Lawley - Hotelling Trace 0.100954 4 20 0.25 0.904790 Accept
Pillai's Trace 0.092192 4 24 0.29 0.881598 Accept
Roy's Largest Root 0.098039 2 12 0.59 0.570550 Accept
F (WRATR) 1.055556 2 12 0.02 0.977029 Accept
F (WRATA) 26.388889 2 12 0.59 0.571116 Accept
─────────────────────────────────────────────────────────────────────────
This report gives the results of the various significance tests. Usually, the four multivariate tests will lead to
the same conclusions. When they do not, refer to the discussion of these tests found earlier in this chapter.
415-8
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Once a multivariate test has found a term significant, use the univariate ANOVA to determine which of the
variables and factors are “causing” the significance.
Term(DF): Name
The term in the design model with the degrees of freedom of the term in parentheses. For main effects, the
variable name is also given.
Test Statistic
The name of the statistical test shown on this row of the report. The four multivariate tests are followed by
the univariate F-tests of each variable.
Test Value
The value of the test statistic.
DF1
The numerator degrees of freedom of the F-ratio corresponding to this test.
DF2
The denominator degrees of freedom of the F-ratio corresponding to this test.
F-Ratio
The value of the F-test corresponding to this test. In some cases, this is an exact test. In other cases, this is
an approximation to the exact test. See the discussion of each test to determine if it is exact or approximate.
P-Value
The significance level of the above F-ratio. The probability of an F-ratio larger than that obtained by this
analysis. For example, to test at an alpha of 0.05, this probability would have to be less than 0.05 to make
the F-ratio significant.
Decision (α = 0.05)
The decision to accept or reject the null hypothesis at the given level of significance. Note that you specify
the level of significance when you select Alpha.
415-9
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This report displays the correlations and covariances formed by averaging across all of the individual group
covariance matrices. The correlations are shown in the lower-left half of the matrix. The within-group
covariances are shown on the diagonal and in the upper-right half of the matrix.
This report analyzes the within-cell correlation matrix. It lets you diagnose multicollinearity problems as well
as determine the number of dimensions that are being used. This is useful in determining if Pillai’s trace
should be used.
Canonical Variate
The identification numbers of the canonical variates that are generated during the analysis. The total
number of variates is the smaller of the number of variables and the number of degrees of freedom in the
model.
Eigenvalue
The eigenvalues of the within correlation matrix. Note that this value is not associated with the variable at
the beginning of the row, but rather with the canonical variate number directly to the left.
415-10
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Percent of Total
The percent that the eigenvalue is of the total. Note that the sum of the eigenvalues will equal the number
of variates. If the percentage accounted for by the first eigenvalue is relatively large (70 or 80 percent),
Pillai's trace will be less powerful than the other three multivariate tests.
Cumulative Total
The cumulative total of the Percent of Total column.
This is the standard ANOVA report as documented in the General Linear Models chapter. A separate report
is displayed for each of the response variables.
415-11
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
A: Treatment
1 9 99.88889 2.234687
2 9 78.33334 2.234687
B: Disability
1 6 95.83334 2.736922
2 6 88.83334 2.736922
3 6 82.66666 2.736922
A: Treatment
1 9 96.33334 2.234687
2 9 78.11111 2.234687
B: Disability
1 6 96.5 2.736922
2 6 88 2.736922
3 6 77.16666 2.736922
This report provides the least-squares means and standard errors for each variable. Note that the standard
errors are calculated using the mean square error from the ANOVA table. They are not the standard errors
that would be calculated from the individual cells.
415-12
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
Means Plots
Means Plots
─────────────────────────────────────────────────────────────────────────
415-13
© NCSS, LLC. All Rights Reserved.
NCSS Statistical Software NCSS.com
This section provides the means plots for main effects and interactions.
415-14
© NCSS, LLC. All Rights Reserved.