Review of Stat
Review of Stat
Statistics
Course outline
Review of Statistics
A random variable
Some definitions:
• Let random variable S be the number of days it will snow in the last
week of January
Probability distribution of S
Outcome 0 1 2 3 4 5 6 7
Probability 0.20 0.25 0.20 0.15 0.10 0.05 0.04 0.01
14
• A probability distribution that lists all values and the probability of each
value is not suitable for a continuous random variable.
.08 1
.02
.2
0 0
-30 -20 -10 0 10 -30 -20 -10 0 10 20
Tomorrow's temperature Tomorrow's temperature
16
The expected value or mean of a random variable is the average value over
many repeated trails or occurrences.
h k
i X
2 2
σY2 = Var (Y ) = E (Y − µY ) = (yi − µY ) · Pr (Y = yi )
i=1
Var (S) = (0 − 2.06)2 · 0.2 + (1 − 2.06)2 · 0.25 + (2 − 2.06)2 · 0.2 + (3 − 2.06)2 · 0.15
+(4 − 2.06)2 · 0.1 + (5 − 2.06)2 · 0.05 + (6 − 2.06)2 · 0.04 + (7 − 2.06)2 · 0.01
2.94
p
The standard deviation σY = Var (Y ) has the same units as Y
18
• For example, let B be a random variable which equals 1 if you pass the
exam and 0 if you don’t pass
(
1 with probability p
B=
0 with probability (1 − p)
• The variance of B is
Pk
Var (B) = σB2 = (bi − µB )2 · Pr (B = bi )
i=1
= (0 − p)2 · (1 − p) + (1 − p)2 · p
= p(1 − p)
p
• The standard deviation of B is σB = p(1 − p)
19
• In this course we will consider random variables (say X and Y ) that are
related by a linear function
Y =a+b·X
E (Y ) = µY = E (a + b · X ) = a + b · E (X ) = a + b · µX
E b2 (X − µX )2
=
b2 E (X − µX )2
=
= b2 · σX2
20
.4 .4
.3 .3
.2 .2
.
.
.1 .1
0 0
-4 -2 0 2 4 -10 -5 0 5 10
Low kurtosis High kurtosis
E (Y | X = 1) = 1 · Pr (Y = 1 |X = 1) + 0 · Pr (Y = 0 |X = 1)
1 · 0.5 + 0 · 0.5 = 0.5
25
Men Women
∙ Pr (IQ|G=male) (IQ|G=female)
= E (IQ | G = m) · Pr (G = m) + E (IQ | G = f ) · Pr (G = f )
26
Independence
Pr (Y = y | X = x) = Pr (Y = y )
Pr (X = x, Y = y ) = Pr (X = x) · Pr (Y = y |X = x)
(see slide 23)
= Pr (X = x) · Pr (Y = y )
E (Y | X ) = E (Y )
E (Y | X = 1) = E (Y | X = 0) = E (Y )
27
Covariance
Example: the covariance between snow (Y) and it being very cold (X):
Cov (X , Y ) = (1 − 0.3) (1 − 0.22) · 0.15
+ (1 − 0.3) (0 − 0.22) · 0.15
+ (0 − 0.3) (1 − 0.22) · 0.07
+ (0 − 0.3) (0 − 0.22) · 0.63
= 0.084
28
Correlation
Let
Z = aX + bY
The mean of Z equals
E (Z ) = E (aX + bY ) = aE (X ) + bE (Y )
µ-1.96*σ µ µ-1.96*σ
.
33
BY: [email protected]. Printing is for personal, private use only. No part of this book may be reproduced or transmitted without
The Normal distribution
sion. Violators will be prosecuted.
Y ∼ N (µ, σ)
= Pr (Z ≤ −2.5)
= 0.0062
35
• Appendix table 3 shows the 90th 95th and 99th percentiles of the
Y: [email protected]. Printing is for personal, private use only. No part of this book may be reproduced or transmitted wi
χ2 -distribution
on. Violators will be prosecuted.
P
3
• For example Pr i=1 Zi2 ≤ 7.81 = 0.95
37
-2.57 2.57
Student t distribution with 5 degrees of freedom
• The t distribution has fatter tails than the standard normal distribution.
• When m ≥ 30 it is well approximated by the standard normal
distribution.
38
The F-distribution
Let W a chi-squared random variable with m degrees of freedom and V a
chi-squared random variable with n degrees of freedom.
3.52 5.19
F distribution with m=4 and n=5 degrees of freedom
tion to Econometrics, Update, Global Edtion 40
Page 1
The F-distribution
The 90th, 95th & 99th percentiles of the Fm,n distribution are shown in Table 5
Y: [email protected]. Printing is for personal, private use only. No part of this book may be reproduced or transmitted without publi
ion. Violators will be prosecuted.
• For example with m = 4 and n = 5;
[email protected]. Printing is for personal, private use only. No part of this book may be reproduced or transmitted without publishe
n. Violators will be prosecuted.
41
The F-distribution
• A special case of the F distribution which is often used in econometrics
Introduction to Econometrics, Update, Global Edtion Page 1 of 1
is when Fm,n can be approximated by Fm,∞
• In this limiting case the denominator is the mean of infinitely many
squared standard normal random variables, which equals 1.
PRINTED BY: [email protected]. Printing is for personal, private use only. No part of this book may be reproduced or transmitted without publisher's
prior permission. Violators will be prosecuted.
• Appendix Table 4 shows the 90th, 95th and 99th percentiles of the Fm,∞
distribution.