Econ Review Stat W2 Jan2023
Econ Review Stat W2 Jan2023
CONCEPTS (continued)
1. Probability
2. Random variables
• Descriptive Statistics
• Normal distribution, Student’s t-distribution,
• Chi-square distribution, F distribution
• Point Estimation
• Interval Estimation (Confidence Interval)
• The geometric mean involves calculating the Nth root of the product of
the N observations, more relevant for growth
• So the geometric mean of six numbers in a series would be obtained by
multiplying them together and taking the sixth root
• In finance, when the numbers in the series can be negative or 0 (like
returns), we can use a slightly different method to calculate the geometric
mean
• Here we add one to each data point, then multiply together, take the Nth
root and then subtract one at the end
• where r1, r2 etc. are the data points that we wish to take the geometric
mean of
• The geometric mean will always be smaller than the arithmetic mean
unless all of the data points are the same.
‘Introductory Econometrics for Finance’ © Chris Brooks 2019 5
Measures of spread
• The spread of a series about its mean value can be measured using the
variance or standard deviation (which is the square root of the variance)
• This quantity is an important measure of risk in finance
• The standard deviation scales with the data, whereas the variance scales
with the square of the data. So, for §example, if the units of the data
points are US dollars, the standard deviation will also be measured in
dollars whereas the variance will be in dollars squared
• Other measures of spread include the range (the difference between the
largest and smallest of the data points) and the interquartile range (the
difference between the third and first quartile points in the series)
• The coefficient of variation divides the standard deviation by the sample
mean to obtain a unit-free measure of spread that can be compared across
series with different scales.
Means
•The mean of a random variable y is also known as its expected value, written
E(y).
•The expected value of a constant is the constant, e.g. E(c) = c
•The expected value of a constant multiplied by a random variable is equal to
the constant multiplied by the expected value of the variable: E(cy)=c E(y). It
can also be stated that E(cy+d)= cE(y)+d, where d is also a constant.
•For two independent random variables, y1 and y2, E(y1y2) = E(y1) E(y2)
Variances
•The variance of a random variable y is usually written var(y).
•The variance of a random variable y is given by var(y) = E[y − E(y)]2. The
variance of a constant is zero: var(c) = 0
•For c and d constants, var(cy + d) = c2var(y)
•For two independent random variables, y1 and y2, var(cy1 + dy2) = c2var(y1) +
d2var(y2)
Covariances
•The covariance between two random variables, y1 and y2 may be expressed as
cov(y1, y2)
•cov(y1, y2) = E[(y1 − E(y1))(y2 − E(y2))]
•For two independent random variables, y1 and y2, cov(y1, y2) = 0
•For four constants, c, d, e, and f, cov(c+dy1, e+fy2)=dfcov(y1, y2).
‘Introductory Econometrics for Finance’ © Chris Brooks 2019 15
4-16
1. Normal Distribution
1 −
2
f ( x) = e for − x
2
N o rm a l D is trib u tio n : = 0 , = 1
0.4
0.3
0.2
f(x)
0.1
0.0
-5 0 5
x
4-17
0 .4
0 .3
f(z)
0 .2
0 .1
0 .0
-5 -4 -3 -2 -1 0 1 2 3 4 5
=0
Z
Properties of normal distribution
E ( S n ) = n and SD ( S n ) = n ,
where = E ( X i ) and = SD( X i )
6-20
f( )
df = 30
2
0 .0 5
approaches a normal 0 .0 2
0 .0 1
0 .0 0
distribution as the degree of 0 50 100
2
freedom increases.
⚫ Sum of independent chi-square
RV is also a chi-square RV.
3. The t Distribution (Student’)
Z1
t= ~ tk
/k
2
k
k1
2
F( k1 , ) = 1
k2
k2
2
2
The F Distribution
F Distributions with different Degrees of Freedom
1.0 F(25,30)
f(F)
F(10,15)
0.5
F(5,6)
0.0
0 1 2 3 4 5
F
{
Bias
Consistency
n = 10 n = 100
6. Confidence Interval (Interval
Estimate)
2 types of estimators:
• Point Estimate
✓A single-valued estimate.
✓Conveys little information about the actual value of the
population parameter, about the accuracy of the estimate.
0.4
0.3
f( z)
0.2
0.1
0.0
-4 -3 -2 -1 0 1 2 3 4
z
7. Statistical Hypothesis Testing
✓H0: = 100
• The alternative hypothesis, denoted by H1, is the assertion of
all situations not covered by the null hypothesis.
✓H1: 100
The Null Hypothesis, H0
Consider H0: = 100. We may have a decision rule that says: “Reject
H0 if the sample mean is less than 95 or more than 105.”
The tails of a statistical test are determined by the need for an action. If action
is to be taken if a parameter is greater than some value a, then the alternative
hypothesis is that the parameter is greater than a, and the test is a right-tailed
test. H0: 50
H1: 50
Rejection Region
• Step 5: Conclusion.
Five-Step Procedure for
Hypothesis Testing: 2nd approach
The p-Value
The p-value is the area of the rejection region when the Test
Statistics is equal to the Critical Value.