STA3005 Exploratory Data Analysis Notes
STA3005 Exploratory Data Analysis Notes
Etc.
Example2
Variables
X1 X2 X3
7 10 12
5 15 18
10 12 20
6 11 10
Example 2 (continued)
The mean vector is the vector with the means for each of the
variables.
Mean of X1 is (7+5+10+6)/4=7
Mean of X2 is (10+15+12+11)/4=12
Mean of X3 is 15
The mean vector is represented by x-bar in bold type or for
the purpose of hand written notes X-bar with a line below
That is = or =
Also,
The sample variance may be found for each variable.
S11 or = =4.7
Cov(X1,X3)==
=
= = 4.7
Similarly, Cov(X2,X3)==6.7
But generally, . Therefore, and
And finally,
All of these value make up the sample variance-covariance
matrix S.
The variance-covariance matrix is given by
Properties of Matrix Algebra and Random Vectors related to
multivariate data
These many properties are stated within MAT1043
The prerequisite module, Linear Algebra
Important properties
Population Variance-Covariance matrices represented by upper
case sigma is a non-singular symmetric matrix. This makes it
ideal for all the mathematical manipulations in multivariate
statistical methods.
This is the star actor
on which the entire module is based
The Variance-Covariance matrix
Suppose for
The diagonal elements form the variance matrix
consisting of the variances of the individual variables.
The standard deviation matrix is the diagonal
matrix with the square roots of each element
.
Similarly the correlation matrix is given by
eigen values are all strictly positive and the eigen vectors are
,
Multivariate parameter estimates
1. is distributed as
2. is distributed as a Wishart random matrix
with n-1 degrees of freedom
3. and are independent