0% found this document useful (0 votes)

131 views

Multivariate Normal - Chi Square

Multivariate normal distributions generalize normal (gaussian) to m-dimensions. Because mean and covariance are easy to estimate from a data set, it's easy to fit a normal distribution to data. The only way I know how to do this integral is by trickery involving the Cholesky decomposition.

Uploaded by

Wagner Jorge

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

131 views

Multivariate Normal - Chi Square

Uploaded by

Wagner Jorge

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

CS395T Computational Statistics with Application to Bioinformatics

Prof. William H. Press Spring Term, 2010 The University of Texas at Austin Unit 6: Multivariate Normal Distributions and Chi Square

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

(Let me explain where were going here)

Building up prerequisites to do a fairly sophisticated treatment of model fitting
Bayes parameter estimation p-value tail tests really understand multivariate normal and covariance really understand chi-square

Then, we get to appreciate the actual model fitting stuff

fitted parameters their uncertainty expressed in several different ways goodness-of-fit

And it will in turn be a nice platform for learning some other things
bootstrap resampling

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

Multivariate Normal Distributions Generalizes Normal (Gaussian) to M-dimensions Like 1-d Gaussian, completely defined by its mean and (co-)variance Mean is a M-vector, covariance is a M x M matrix

N (x|, ) =

1 T 1 1 exp[ ( x ) (x )] 2 M/ 2 1 / 2 (2 ) det()

The mean and covariance of r.v.s from this distribution are*

= hxi

= (x )(x )

In the one-dimensional case is the standard deviation, which can be visualized as error bars around the mean. In more than one dimension can be visualized as an error ellipsoid around the mean in a similar way.

1 = (x )T 1 (x )
*really?
The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

Because mean and covariance are easy to estimate from a data set, it is easy perhaps too easy to fit a multivariate normal distribution to data.

1 X = hxi xi N i

I.e., estimate by sample averages.

1 X T = (x )(x ) (xi )(xi )T N i

But back to really? The mean follows from the symmetry argument Z Z 1 T 1 M 1 0 = (x ) exp[ ( x ) ( x )] d x 2 (2)M/2 det()1/2 Its not obvious that the covariance in fact obtains from the definition of the multivariate Normal. One has to do the multidimensional (and tensor) integral: M2 = Z Z (x)(x)T 1 T 1 M 1 exp[ ( x ) ( x )] d x 2 (2)M/2 det()1/2

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

The only way I know how to do this integral is by trickery involving the Cholesky decomposition (square root of a positive definite matrix):
were setting to 0 for convenience

y p(y) = p(x) x

Jacobian determinant. The transformation law for multivariate probability distributions.

This is the distribution of N independent univariate Normals N(0,1)!

Ha!

(I dont know an elementary proof, i.e., without some matrix decomposition. Can you find one?)

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

Reduced dimension properties of multivariate normal

1 T 1 1 N (x|, ) = exp[ ( x ) (x )] 2 (2 )M/2 det()1/2

1. Any slice through a m.v.n. is a m.v.n (constraint or conditioning) 2. Any projection of a m.v.n. is a m.v.n (marginalization)

You can prove both assertions by completing the square in the exponential, producing an exponential in (only) the reduced dimension times an exponential in (only) the lost dimensions. Then the second exponential is either constant (slice case) or can be integrated over (projection case).

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

How to generate multivariate normal deviates N():

Cholesky:

= LLT y = {yi } N(0, 1)

Fill y with independent Normals: Transform: Proof:

x = Ly +
T

Thats it! x is the desired m.v.n.

T T (x )(x ) = (Ly)(Ly) T T T T = L(yy )L = L yy L = LLT =

Even easier: MATLAB has a built-in function mvnrnd(MU,SIGMA). But be sure you get a bunch of m.v.n.s all in one call, because it (probably) re-does the Cholesky decomposition on each call!
Notice that the proof never used Normality. You can fill y with anything with zero mean and variance one, and youll reproduce . But the result wont be Normal!
The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

So, easy operations are: 1. Fitting a multivariate normal to a set of points (just compute the sample mean and covariance!) 2. Sampling from the fitted m.v.n.
mu = mean([len1 len2]) sig = cov(len1,len2)

mu = 3.2844 3.2483 sig = 0.6125 0.2476

In MATLAB, for example, these are one-line operations.

0.2476 0.5458

Example:

rsamp = mvnrnd(mu,sig,1000);

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

A related, useful, Cholesky trick is to draw error ellipses (ellipsoids, )

= LLT
So, locus of points at 1 standard deviation is

1 = (x ) x = Lz +

(x )

So, if z is on the unit circle (sphere, ) then

1 L (x ) = 1

will be on the error ellipse.

my coding of this idea looks like this

function [x y] = errorellipse(mu,sigma,stdev,n) L = chol(sigma,'lower'); circle = [cos(2*pi*(0:n)/n); sin(2*pi*(0:n)/n)].*stdev; ellipse = L*circle + repmat(mu,[1,n+1]); x = ellipse(1,:); y = ellipse(2,:);

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

The distribution we have been looking at has some interesting biology in it!

file genestats.dat (on course web site) contains 20694 lines like this:
number of exons N N exon lengths N-1 intron lengths

ENST00000341866 17470 3262 0.00002 4 1290 349 1412 211 169 678 13361 <EOL> ENST00000314348 22078 1834 0.00001 7 100 166 113 178 165 262 850 5475 385 3273 1149 2070 7892 ENST00000313081 13858 1160 0.00001 6 496 150 107 85 151 171 2068 76 2063 674 7817 ENST00000298622 80000 6487 0.00001 24 135 498 216 120 147 132 36 60 129 129 84 63 99 99 54 66 69 78 204 66 73 1081 397 2452 12133 15737 1513 769 942 103 829 2272 1340 3058 327 2371 1361 471 2922 735 85 9218 1257 2247 897 822 12104

total length gene name

ignore for now

total length of exons

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

Log10 of size of 1st and 2nd introns for 1000 genes:

This is kind of fun, because its not just the usual featureless scatter plot
notice the hard edges this is biology!

log10(second intron length)

log10(first intron length) Is there a significant correlation here? If the first intron is long, does the second one also tend to be? Or is our eye being fooled by the non-Gaussian shape?

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

Biology: The hard lower bounds on intron length are because the intron has to fit around the big spliceosome machinery! Its all carefully arranged to allow exons of any length, even quite small. Why? Could the spliceosome have evolved to require a minimum exon length, too? Are we seeing chance early history, or selection?
credit: Alberts et al. Molecular Biology of the Cell

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

The covariance matrix is a more general idea than just for multivariate Normal. You can compute the covariances of any set of random variables. Its the generalizaton to M-dimensions of the (centered) second moment Var.

For multiple r.v.s, all the possible covariances form a (symmetric) matrix:

Notice that the diagonal elements are the variances of the individual variables.

The variance of any linear combination of r.v.s is a quadratic form in C :

This also shows that C is positive definite, so it can still be visualized as an ellipsoid in the space of the r.v.s., where the directions are the different linear combinations.
The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

The covariance matrix is closely related to the linear correlation matrix.

rij = p

Cij Cii Cjj

more often seen written out as

When the null hypothesis is that X and Y are independent r.v.s, then r is useful as a p-value statistic (test for correlation), because 1. For large numbers of data points N, it is normally distributed,

r N(0, N 1/2 ) so r N is a normal t-value

2. Even with small numbers of data points, if the underlying distribution is multivariate normal, there is a simple form for the pvalue (comes from a Student t distribution).

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

For the exon length data, we can easily now show that the correlation is highly significant.

r = sig ./ sqrt(diag(sig) * diag(sig)') tval = sqrt(numel(len1))*r

r = 1.0000 0.3843 tval = 31.6228 12.1511 rr = 1.0000 0.3843 p = 1.0000 0.0000 0.0000 1.0000
not clear why Matlab reports 1 on the diagonals. Id call it 0!

0.3843 1.0000 12.1511 31.6228

statistical significance of the correlation in standard deviations (but note: uses CLT) Matlab has built-ins

[rr p] = corrcoef(i1llen,i2llen)

0.3843 1.0000

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

Lets talk more about chi-square. Recall that a t-value is (by definition) a deviate from 2 is a statistic defined as the sum of the squares of n independent t-values.

2 =

X xi i 2
i

xi N(i , i )

is a distribution (special case of Gamma), defined as

The important theorem is that 2 is in fact distributed as Chisquare. Lets prove it.

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

Prove first the case of =1: Suppose and

1 1 x2 pX (x) = e 2 2

x N(0, 1)

y = x2

pY (y ) dy = 2pX (x) dx

So, pY (y) = y

1/2

pX (y

1/2

1 1 2y e 2 y

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

To prove the general case for integer , compute the characteristic function

Since we already proved that =1 is the distribution of a single t2-value, this proves that the general case is the sum of t2-values.
The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

Question: What is the generalization of

2 =

X xi i 2
i

xi N(i , i )

to the case where the xis are normal, but not independent? I.e., x comes from a multivariate Normal distribution? Answer:

2 = (x )T 1 (x ),

x N(, )

Proof is one of those Cholesky things,

= LLT ,

show that y is product of independent N(0,1)s, as we did before, and that

Ly = x , X
2 yi

=y y=

The University of Texas at Austin, CS 395T, Spring 2010, Prof. William H. Press

Hinge Theorem
No ratings yet
Hinge Theorem
3 pages
Solusi Soal Bab 4
No ratings yet
Solusi Soal Bab 4
9 pages
Craig Turnbull (Auth.) - A History of British Actuarial Thought-Palgrave Macmillan (2017)
No ratings yet
Craig Turnbull (Auth.) - A History of British Actuarial Thought-Palgrave Macmillan (2017)
350 pages
Computational Statistics With Application To Bioinformatics: Unit 9: Working With Multivariate Normal Distributions
No ratings yet
Computational Statistics With Application To Bioinformatics: Unit 9: Working With Multivariate Normal Distributions
9 pages
3 CommonDistributions
No ratings yet
3 CommonDistributions
13 pages
Murphy Gaussians
No ratings yet
Murphy Gaussians
15 pages
CS395T Computational Statistics With Application To Bioinformatics
No ratings yet
CS395T Computational Statistics With Application To Bioinformatics
28 pages
1) Common Univariate Summaries: I) I) Iii) I) Ii)
No ratings yet
1) Common Univariate Summaries: I) I) Iii) I) Ii)
5 pages
Tut2 Questions
No ratings yet
Tut2 Questions
3 pages
Random Vectors and Multivariate Normal Distribution
No ratings yet
Random Vectors and Multivariate Normal Distribution
6 pages
Multivariate Statistical Analysis: The Multivariate Normal Distribution
No ratings yet
Multivariate Statistical Analysis: The Multivariate Normal Distribution
13 pages
Mathematics of The Linear Model and Linear Mixed Model: Brian Zhang February 2020
No ratings yet
Mathematics of The Linear Model and Linear Mixed Model: Brian Zhang February 2020
20 pages
Multivariate Normal Distribution
100% (1)
Multivariate Normal Distribution
8 pages
Multivariate Methods Assignment Help
No ratings yet
Multivariate Methods Assignment Help
17 pages
My Notes For Discrete and Continuous Distributions 987654
No ratings yet
My Notes For Discrete and Continuous Distributions 987654
28 pages
w2e_multivariate_gaussian
No ratings yet
w2e_multivariate_gaussian
6 pages
R Commands
No ratings yet
R Commands
5 pages
4 PvalueTests
No ratings yet
4 PvalueTests
24 pages
AE - Tema 3 - The Multivariate Gaussian Distribution
No ratings yet
AE - Tema 3 - The Multivariate Gaussian Distribution
6 pages
Capitulo 1 Rencher
No ratings yet
Capitulo 1 Rencher
19 pages
PSF_week8_samp.pdf-BIVARIATE NORMAL DISTRIBUTION
No ratings yet
PSF_week8_samp.pdf-BIVARIATE NORMAL DISTRIBUTION
25 pages
1.12.2024-BSC-301-CSBS-class note_2024-25
No ratings yet
1.12.2024-BSC-301-CSBS-class note_2024-25
58 pages
The Mvtnorm Package: R Topics Documented
No ratings yet
The Mvtnorm Package: R Topics Documented
12 pages
stat
No ratings yet
stat
53 pages
STAT456 Study Guide
No ratings yet
STAT456 Study Guide
31 pages
Multivariate
0% (1)
Multivariate
319 pages
Multivariate Statistical Analysis: Old School
No ratings yet
Multivariate Statistical Analysis: Old School
319 pages
STAT3006: Tutorial 2
No ratings yet
STAT3006: Tutorial 2
3 pages
2018dec_02402_solution_en
No ratings yet
2018dec_02402_solution_en
31 pages
STAT3006: Tutorial 1: Sample Solutions
No ratings yet
STAT3006: Tutorial 1: Sample Solutions
10 pages
Lecture01 Uppsala EQG 12
No ratings yet
Lecture01 Uppsala EQG 12
39 pages
Presentation B 6 Sep 2021
No ratings yet
Presentation B 6 Sep 2021
68 pages
Statistics
No ratings yet
Statistics
60 pages
Gaussian Process Intuitive
No ratings yet
Gaussian Process Intuitive
17 pages
斯坦福大学机器学习数学基础 33-40
No ratings yet
斯坦福大学机器学习数学基础 33-40
8 pages
Document
No ratings yet
Document
234 pages
Advanced Machine Learning: CS 281
100% (1)
Advanced Machine Learning: CS 281
88 pages
Multivariate_normal
No ratings yet
Multivariate_normal
24 pages
murphysolns
No ratings yet
murphysolns
45 pages
CH 1 Introduction
No ratings yet
CH 1 Introduction
19 pages
Multivariate Normal Distribution: 3.1 Basic Properties
No ratings yet
Multivariate Normal Distribution: 3.1 Basic Properties
13 pages
Unit 19
No ratings yet
Unit 19
16 pages
Multivariate Material
No ratings yet
Multivariate Material
58 pages
MVA Section1 2012
No ratings yet
MVA Section1 2012
14 pages
STAT3006 Lecture Notes 2021 Aug8 2021
No ratings yet
STAT3006 Lecture Notes 2021 Aug8 2021
110 pages
Probability and Statistics
No ratings yet
Probability and Statistics
28 pages
Manual For Instructors: TO Linear Algebra Fifth Edition
No ratings yet
Manual For Instructors: TO Linear Algebra Fifth Edition
12 pages
Error and Uncertainty: General Statistical Principles
No ratings yet
Error and Uncertainty: General Statistical Principles
8 pages
Package Mvtnorm': R Topics Documented
No ratings yet
Package Mvtnorm': R Topics Documented
17 pages
Intro To Data Science Lecture 2
No ratings yet
Intro To Data Science Lecture 2
12 pages
Sst304 Lesson 1
No ratings yet
Sst304 Lesson 1
8 pages
Multivariate Analysis
No ratings yet
Multivariate Analysis
25 pages
HMWK 4
No ratings yet
HMWK 4
5 pages
The Multivariate Gaussian Distribution: 1 Relationship To Univariate Gaussians
No ratings yet
The Multivariate Gaussian Distribution: 1 Relationship To Univariate Gaussians
10 pages
Presentation 3
No ratings yet
Presentation 3
29 pages
handbook 3rd sem
No ratings yet
handbook 3rd sem
15 pages
Joining Instructions Lisboa
No ratings yet
Joining Instructions Lisboa
8 pages
Bera 2 - Introduction to Statistics for Econometricians, Part II Apostila
No ratings yet
Bera 2 - Introduction to Statistics for Econometricians, Part II Apostila
114 pages
Intro To Statistic Using R - Session 1
No ratings yet
Intro To Statistic Using R - Session 1
1 page
Chapter 6 - The Multivariate Normal Distribution and Copulas - 2013 - Simulation
No ratings yet
Chapter 6 - The Multivariate Normal Distribution and Copulas - 2013 - Simulation
13 pages
Ordinary Differential Equations and Stability Theory: An Introduction
From Everand
Ordinary Differential Equations and Stability Theory: An Introduction
David A. Sanchez
No ratings yet
Useful Formulae: Mathematical & Physical
From Everand
Useful Formulae: Mathematical & Physical
Matthew Watkins
No ratings yet
Fundamentals of Mathematics 9th Enhanced Edition James Van Dyke All Chapters Instant Download
100% (12)
Fundamentals of Mathematics 9th Enhanced Edition James Van Dyke All Chapters Instant Download
81 pages
Examination Paper For TTT4120 Digital Signal Processing: Department of Electronic Systems
No ratings yet
Examination Paper For TTT4120 Digital Signal Processing: Department of Electronic Systems
7 pages
c6d - Channel Coding Part 1
No ratings yet
c6d - Channel Coding Part 1
78 pages
2016febreal Analysis Problems
No ratings yet
2016febreal Analysis Problems
159 pages
Laplace Transforms notes
No ratings yet
Laplace Transforms notes
35 pages
EQAO - Question Breakdown by Strand
No ratings yet
EQAO - Question Breakdown by Strand
4 pages
2lessons Circle Theorems
100% (1)
2lessons Circle Theorems
29 pages
Term 2 - Week 14 - Activity 1 - Angles in Circles
No ratings yet
Term 2 - Week 14 - Activity 1 - Angles in Circles
2 pages
Force Estimation Using Vibration Data
No ratings yet
Force Estimation Using Vibration Data
9 pages
Basics of Monte Carlo Simulation
No ratings yet
Basics of Monte Carlo Simulation
10 pages
Fundamentals of Computers: Reema Thareja
100% (1)
Fundamentals of Computers: Reema Thareja
34 pages
Kiran Kedlaya - A Is Less Than B
No ratings yet
Kiran Kedlaya - A Is Less Than B
37 pages
Table of Specification Grade 9 Patience
No ratings yet
Table of Specification Grade 9 Patience
1 page
Notes On Plane Wave Expansions
No ratings yet
Notes On Plane Wave Expansions
6 pages
University of Kerala: Syllabus For Iii Semester Computer Science & Engineering
No ratings yet
University of Kerala: Syllabus For Iii Semester Computer Science & Engineering
21 pages
Allocation of Support Department Costs, Common Costs, and Revenues
No ratings yet
Allocation of Support Department Costs, Common Costs, and Revenues
17 pages
PAMO2024
No ratings yet
PAMO2024
4 pages
Priority Order of Chapters
No ratings yet
Priority Order of Chapters
3 pages
Using Some of Microsoft Office Excel Fun
No ratings yet
Using Some of Microsoft Office Excel Fun
79 pages
Engineering Drawing Questions
No ratings yet
Engineering Drawing Questions
26 pages
DLD -UNIT-1
No ratings yet
DLD -UNIT-1
14 pages
Marking Scheme PERCUBAAN STPM 2018 P 1 SEC (A)
No ratings yet
Marking Scheme PERCUBAAN STPM 2018 P 1 SEC (A)
7 pages
Paper 1
No ratings yet
Paper 1
16 pages
Verbal Reasoning Measure Reading Comprehension
No ratings yet
Verbal Reasoning Measure Reading Comprehension
6 pages
Ratio of Exponentials
No ratings yet
Ratio of Exponentials
2 pages
The Copperbelt University School of Mathematics and Natural Sciences Department of Mathematics
No ratings yet
The Copperbelt University School of Mathematics and Natural Sciences Department of Mathematics
12 pages
Kumaduan Eksamin Ed Mathematics III S.Y. 2016 - 2017
No ratings yet
Kumaduan Eksamin Ed Mathematics III S.Y. 2016 - 2017
3 pages

Multivariate Normal - Chi Square

Uploaded by

Multivariate Normal - Chi Square

Uploaded by

CS395T Computational Statistics with Application to Bioinformatics

(Let me explain where were going here)

Then, we get to appreciate the actual model fitting stuff

The mean and covariance of r.v.s from this distribution are*

I.e., estimate by sample averages.

1 X T = (x )(x ) (xi )(xi )T N i

Jacobian determinant. The transformation law for multivariate probability distributions.

This is the distribution of N independent univariate Normals N(0,1)!

Reduced dimension properties of multivariate normal

1 T 1 1 N (x|, ) = exp[ ( x ) (x )] 2 (2 )M/2 det()1/2

How to generate multivariate normal deviates N():

= LLT y = {yi } N(0, 1)

Fill y with independent Normals: Transform: Proof:

Thats it! x is the desired m.v.n.

T T (x )(x ) = (Ly)(Ly) T T T T = L(yy )L = L yy L = LLT =

mu = 3.2844 3.2483 sig = 0.6125 0.2476

In MATLAB, for example, these are one-line operations.

A related, useful, Cholesky trick is to draw error ellipses (ellipsoids, )

So, if z is on the unit circle (sphere, ) then

will be on the error ellipse.

my coding of this idea looks like this

total length gene name

ignore for now

total length of exons

Log10 of size of 1st and 2nd introns for 1000 genes:

log10(second intron length)

The variance of any linear combination of r.v.s is a quadratic form in C :

The covariance matrix is closely related to the linear correlation matrix.

Cij Cii Cjj

more often seen written out as

r N(0, N 1/2 ) so r N is a normal t-value

r = sig ./ sqrt(diag(sig) * diag(sig)') tval = sqrt(numel(len1))*r

0.3843 1.0000 12.1511 31.6228

is a distribution (special case of Gamma), defined as

Prove first the case of =1: Suppose and

Question: What is the generalization of

Proof is one of those Cholesky things,

show that y is product of independent N(0,1)s, as we did before, and that

You might also like