Lecture 3
Lecture 3
Quantitative Analysis
Distributions
Page 2
Describe the key properties of the uniform distribution, Bernoulli
distribution & Binomial distribution and identify common occurrences
of each distribution.
Uniform distribution
Page 3
Describe the key properties of the uniform distribution, Bernoulli
distribution & Binomial distribution and identify common occurrences
of each distribution (continued)
Uniform distribution
If the random variable, X, is continuous, the uniform distribution
is given by the following probability density function (pdf):
1
for ≤ x ≤
= −
0 for x < or x >
The mean, , is calculated as the average of the start and end values of the
distribution. Similarly, the variance, is calculated as shown below.
1
= ( + )
2
1
= ( − )
12
The uniform distribution is characterized by the following cumulative
distribution function (CDF):
−
≤ =
−
Page 4
Describe the key properties of the uniform distribution, Bernoulli
distribution & Binomial distribution and identify common occurrences
of each distribution (continued)
Bernoulli distribution
A random variable X is called Bernoulli distributed with parameter (p) if it has
only two possible outcomes, often encoded as 1 (“success” or “survival”) or 0
(“failure” or “default”), and if the probability for realizing “1” equals p and the
probability for “0” equals 1 – p. The classic example for a Bernoulli-distributed
random variable is the default event of a company.
1
=
0
Page 5
Describe the key properties of the uniform distribution, Bernoulli
distribution & Binomial distribution and identify common occurrences
of each distribution (continued)
Binomial distribution
A binomial distributed random variable is the sum of (n) independent and
identically distributed (i.i.d.) Bernoulli-distributed random variables. The
probability of observing (k) successes is:
!
= = ( − ) where = ! !
Page 6
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Poisson distribution
A Poisson-distributed random variable is used to describe
the random number of events occurring over a certain
time interval. The Poisson distribution depends upon only
one parameter, lambda λ, and can be interpreted as an
approximation to the binomial distribution. The lambda
parameter (λ) indicates the rate of occurrence of the
random events.
If the rate at which events occur over time is constant, and the probability of any one
event occurring is independent of all other events, then the events follow a Poisson
process, where t is the amount of time elapsed (i.e, the expected number of events
before time t is equal to λt):
= =
!
Page 7
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Poisson distribution
In Poisson, lambda is both the expected value (the mean) and the
variance!
The exhibit below represents a Poisson distribution for λ and n =2, 4 and 10.
= =
!
Page 8
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Normal distribution
The normal or Gaussian distribution is often referred to as
the bell curve because of the shape of its probability density
function. Characteristics of the normal distribution include:
Page 9
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Normal distribution
Properties of normal distribution:
• Location-scale invariance: Imagine random variable X, which is
normally distributed with the parameters µ and σ. Now consider
random variable Y, which is a linear function of X, such that: Y = aX
+ b. In general, the distribution of Y might substantially differ from
the distribution of X, but in the case where X is normally distributed,
the random variable Y is again normally distributed with parameters
mean (= a*µ + b) and variance (=a^2* σ). Specifically, we do not
leave the class of normal distributions if we multiply the
random variable by a factor or shift the random variable.
Page 10
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Normal distribution
Here we show an exhibit of a
normal distribution for µ =10 and at
various levels of σ (1, 2 and 3)
1
=
2
Page 11
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Normal distribution
Why is the normal distribution commonplace?
The central limit theorem (CLT) says that sampling distribution of sample
means tends to be normal regardless of the shape of the underlying distribution;
this explains much of the “popularity” of the normal distribution.
Parsimony: It requires (or is fully described by) only two parameters: mean and
variance
Page 12
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Normal distribution
Standard Normal distribution
A normal distribution is fully specified by two parameters,
mean and variance (or standard deviation). We can
transform a normal into a unit or standardized variable:
• Standard normal has mean = 0, and variance = 1
• No parameters required!
This unit or standardized variable is normally distributed with zero mean and
variance of one. Its standard deviation is also one (variance = 1.0 and standard
deviation = 1.0).
Page 13
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Normal distribution
Standard normal distribution: Critical Z values
Memorize the two common critical values: 1.65 and 2.33. These correspond to
confidence levels, respectively, of 95% and 99% for a one-tailed test. For VAR,
the one-tailed test is relevant because we are concerned only about losses (left-tail)
not gains (right-tail).
Page 14
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Normal distribution
Multivariate normal distributions
Page 15
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Examples
In the FRM, four distributions are quite common:
• The Bernoulli is invoked when there are only two outcomes. It is
used to characterize a default: an obligor or bond will either default
or survive. Most bonds “survive” each year, until perhaps one year
they default. At any given point in time, or during any given year,
the bond will be in one of two states.
Page 16
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Examples
Page 17
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Lognormal
The lognormal is common in finance: If an asset return (r) is normally distributed,
the continuously compounded future asset price level (or ratio or prices; i.e., the
wealth ratio) is lognormal. Expressed in reverse, if a variable is lognormal, its natural
log is normal. Here is an exhibit of lognormal distribution for µ =10 and at various
levels of σ (0.25, 0.5 and 1)
Page 18
Describe the key properties of the Poisson distribution, normal
distribution & lognormal distribution and identify common
occurrences of each distribution (continued)
Lognormal
The lognormal distribution is extremely common in finance because it is often the
distribution assumed for asset prices (e.g., stock prices). Specifically, it is common to
assume that log (i.e., continuously compounded) asset returns are normally
distributed such that, by definition, asset prices have a lognormal distribution.
Page 19
Describe the key properties of the Chi-squared distribution,
Student’s t and F-distributions, and identify common occurrences of
each distribution.
Chi-squared
Page 20
Describe the key properties of the Chi-squared distribution,
Student’s t and F-distributions, and identify common occurrences of
each distribution (continued)
Chi-squared
Page 21
Describe the key properties of the Chi-squared distribution,
Student’s t and F-distributions, and identify common occurrences of
each distribution (continued)
Chi-squared
Page 22
Describe the key properties of the Chi-squared distribution,
Student’s t and F-distributions, and identify common occurrences of
each distribution (continued)
Student’s t
Student t’s distribution (for large samples, approximates the normal)
As the degrees of freedom (d.f.) increases, the t-distribution converges with the
normal distribution. It is similar to the normal, except that it exhibits slightly heavier
tails (the lower the d.f., heavier the tails). The below exhibit shows the basic shape of
the student t’s distribution and how it changes with k (specifically the shape of its tail).
−
=
/
Page 23
Describe the key properties of the Chi-squared distribution,
Student’s t and F-distributions, and identify common occurrences of
each distribution (continued)
Student’s t
Page 24
Describe the key properties of the Chi-squared distribution,
Student’s t and F-distributions, and identify common occurrences of
each distribution (continued)
Student’s t
We need the critical (lookup) t value. The critical t value is a function of:
• Degrees of freedom (d.f.); e.g., 10-1 =9 in this example
• Significance: 1-95% confidence = 5% in this example
Page 25
Describe the key properties of the Chi-squared distribution,
Student’s t and F-distributions, and identify common occurrences of
each distribution (continued)
Student’s t
The critical-t is just a lookup (reference to) the student's t distribution as opposed to a computed t-
statistic, aka t-ratio. In this way, a critical t is an inverse CDF (quantile function) just like, for a
normal distribution, the "critical one-tailed value" at 1% is -2.33 and at 5% is -1.645. In this case
we want the critical t for (n-1) degrees of freedom and two-tailed 5% significance (= one tailed
2.5%). We can find 2.262 on the student's t lookup table where column = 2-tail 0.05 and d.f. = 9.
In Excel, 2.262 = T.INV.2t (5%, 9). The 95% confidence interval can be computed.
1.54%
+ 2.262 = 1.12%
10
1.54%
− 2.262 = −1.08%
10
Page 26
Describe the key properties of the Chi-squared distribution,
Student’s t and F-distributions, and identify common occurrences of
each distribution (continued)
Student’s t
Both the normal (Z) and student’s t (t) distribution characterize the sampling
distribution of the sample mean.
The difference is that the normal is used when we know the population
variance; the student’s t is used when we must rely on the sample variance. In
practice, we don’t know the population variance, so the student’s t is typically
appropriate.
( − ) ( − )
= =
Page 27
Describe the key properties of the Chi-squared distribution,
Student’s t and F-distributions, and identify common occurrences of
each distribution (continued)
F-Distribution
The F distribution is also called the variance ratio distribution. The F ratio is
the ratio of sample variances, with the greater sample variance in the
numerator:
Properties of F distribution:
• Nonnegative (>0) and skewed to the
right
• Like the chi-square distribution, as
d.f. increases, it approaches normal
• Square of a variable with t-distribution
and k d.f. has an F distribution with
(1, k) d.f: X2 ~ F(1,k)
Page 28
Describe the key properties of the Chi-squared distribution,
Student’s t and F-distributions, and identify common occurrences of
each distribution (continued)
F-Distribution
GOOG YHOO
=VAR() 0.0237% 0.0084%
=COUNT() 10 10
F ratio 2.82
Confidence 90%
Significance 10%
=FINV() 2.44
• The F ratio, therefore, is 2.82 (divide higher variance by lower variance; the F ratio
must be greater than, or equal to, 1.0).
• At 10% significance, with (10-1) and (10-1) degrees of freedom, the critical F value
is 2.44. Because our F ratio of 2.82 is greater than (>) 2.44, we reject the null (i.e.,
that the population variances are the same).
• We conclude the population variances are different.
Page 29
Describe the central limit theorem and the implications it has when
combining i.i.d. random variables.
• The central limit theorem says that this random variable is itself normally
distributed, regardless of the shape of the underlying population. Given a
population described by any probability distribution having mean () and finite
variance (2), the distribution of the sample mean computed from samples
(where each sample equals size n) will be approximately normal. If the size
of the sample is at least 30 (n 30), then we can assume the sample mean
is approximately normal!
Page 30
Describe the central limit theorem and the implications it has when
combining i.i.d. random variables (continued)
Each sample has a sample mean. There are many sample means. The
sample means have variation: a sampling distribution. The central limit
theorem (CLT) says the sampling distribution of sample means is
asymptotically normal.
Page 31
Describe the central limit theorem and the implications it has when
combining i.i.d. random variables (continued)
Page 32
Describe the central limit theorem and the implications it has when
combining i.i.d. random variables (continued)
• The mean of the sample or the sample mean is a random variable defined by:
+ +
=
Page 33
Describe the properties of independent and identically distributed
(i.i.d.) random variables.
Page 34
Describe a mixture distribution and explain the creation and
characteristics of mixture distributions.
= , =1
where fi(x)’s are the component distributions, and wi’s are the mixing proportions
or weights.
Page 35
Describe a mixture distribution and explain the creation and
characteristics of mixture distributions (continued)
Consider a stock whose log returns follow a normal distribution with low volatility
90% of the time, and a normal distribution with high volatility 10% of the time.
Most of the time the stock just bounces along but occasionally, the stock’s
behavior may be more extreme. In this Miller’s example, the mixture distribution
is:
= +
According to Miller, “Mixture distributions are extremely flexible. In a sense they occupy
a realm between parametric distributions and non-parametric distributions. In a typical
mixture distribution, the component distributions are parametric but the weights are
based on empirical (non-parametric) data. Just as there is a trade-off between
parametric distributions and non-parametric distributions, there is a trade-off between
using a low number and a high number of component distributions. By adding more and
more component distributions, we can approximate any data set with increasing
precision. At the same time, as we add more and more component distributions, the
conclusions that we can draw become less and less general in nature.”
Page 36
Describe a mixture distribution and explain the creation and
characteristics of mixture distributions (continued)
Page 37
Describe a mixture distribution and explain the creation and
characteristics of mixture distributions (continued)
By shifting the mean of one distribution, we can also create a distribution with
positive or negative skew. Finally, if we move the means of far enough apart, the
resulting mixture distribution will be bimodal. This exhibit below shows that we
have a PDF with two distinct maxima.
Page 38
The End
Distributions