Prob Stats Notes
Prob Stats Notes
NORMAL SAMPLES
Random samples: Let X1 , X2 , ..., Xn be a sample of observable measurements(i.e they are
random variables) of a population of size N . Further let each variable Xi ,
i = 1, 2, .., N
n
Y
f (xi )
i=1
X is a random variable since it is a function of the n random variables. The mean and
variance of X are
"
#
n
n
1X
1X
E[X] = E
Xi =
E[Xi ]
n i=1
n i=1
1X
=
=
n i=1
n
and
#
n
n
X
1 X
1
V ar[X] = V ar
Xi = 2
V ar[Xi ]
n i=1
n i=1
"
n
1 X 2 2
=
n2 i=1
n
For large samples (n > 30); using Central Limit Theorem, then X is normally distributed with mean and variance
2
;
n
distribution of the random sample. This distribution is referred to as a sampling distribution and the standard deviation of the distribution is called the standard error of the
distribution.
Examples
(1) Let X be the sample mean of a random sample of size 60 from a distribution with
mean=42.5 and variance=135. Determine k such that P (X < k) = 0.65
; hence
The sampling distribution of X is X N 42.5, 135
60
P (X < k) = 0.65
k 42.5
X 42.5
P q
< q
= 0.65
135
60
135
60
k 42.5
= 0.65
P Z < q
135
60
k = 43.085
(a) Let X be the mean of a random sample of size 121 from the distribution that has
probability density function
f (x) = 4(1 x)3,
0<x<1
E[X ] =
x24(1 x)3dx
0
=
=
Z
2x(1 x)4dx =
0
Z 1
2
4 1
2x(1 x)4 dx
x (1 x) 0 +
0
Z 1
2x(1 x)4dx
0+
0
1 Z 1
(1 x)5
(1 x)5
dx
x
+
5
5
0
0
1
(1 x)6
1
0 + 2 x
=
30
15
0
Thus
V ar(X) = E[X 2] (E[X])2
1
2
1
=
=
15 25
75
2
The sampling distribution of X is X N 15 , 9075
The required probability is given by
0.165 0.2
0.234 0.2
X 0.2
P (0.165 < X < 0.234) = P q
< q
< q
2
9075
2
9075
2
9075
Proof:
(n 1)S
n
X
i=1
n
X
n
X
(Xi X ) =
n
X
(Xi + X)2
i=1
2
(Xi ) (X )
i=1
2
(Xi ) 2
n
X
i=1
(Xi )(X ) +
i=1
n
X
(X )2
i=1
Now
n
X
i=1
n
X
(X )2 = n(X )2
i=1
Thus
2
(n 1)S =
n
X
i=1
n
X
i=1
(Xi )2 n(X )2
Dividing through by 2;
n
X
(n 1)S 2
n
(Xi )2
=
2 (X )2
2
2
i=1
Finally
n
X
(Xi )2
i=1
n
X
Zi2;
Zi2
(1) hence
i=1
Therefore
(n1)S 2
2
Zi2 2(n)
i=1
and
n
(X )2 2(1);
2
n
X
since
n(X )
N (0, 1)
2
of chi square distribution, this quantity is a chi square randon variable with n-1 degrees of
freedom.
Example A random sample of 81 high-school students was selected and their IQ measurements taken. Assuming the measurements are normally distributed with mean = 105
and standard deviation = 15; Find Pr(150.581 < S 2 < 299.894)
(n1)S 2
2
2(80) then
2
80
S
80
299.894
80
150.581
<
<
P r(150.581 < S 2 < 299.894) = P r
225
225
225
2
= P r(53.539 < < 106.629)
Solution Given
Example
Let X1 , X2 , .., Xn be a random sample drawn from a population having exponential distribution with parameter . Find the probability density function of Z=max(X1 , X2 , .., Xn)
using the distribution function technique.
Solution:
Let G(z) be the cumulative distribution function of Z. Now since Z is the largest of the n
random variables,
G(z) = Pr(Z z) = Pr(X1 z, X2 z, .., Xn z)
n
n Z z
Y
Y
Pr(Xi z) =
exdx
=
=
i=1
n
Y
i=1
n
1 ez = 1 ez
i=1
z>0
x1
1
1
.
.
.
y
y2
yn
1
x2 x2 . . . x2
y1 y2
yn
.
.
.
.
.
.
.
.
.
.
.
.
xn
y1
xn
y2
...
xn
yn
Example:
Let the joint probability density function of X1 , X2 , X3 be
(
1,
0 < x1 < 1, 0 < x2 < 1, 0 < x3 < 1
f (x1, x2 , x3) =
0, elsewhere
Find the joint probability density function of Y1 , Y2 , Y3 where Y1 = X1 + X2 + X3 ,Y2 =
X2 + X3 and Y3 = X3 .
Solution:
Now
x3 = y3;
x2 = y2 y3 ;
x1 = y1 y2
x1
y1
x1
y2
x1
y3
x2
y1
x2
y2
x2
y3
x3
y1
x3
y2
x3
y3
1 1 0
= 0 1 1
,
0 0
1
J=1;
hence
g(y1 , y2, y3) = f (y1 y2 , y2 y3, y3 ) | J |
= 1 | 1 |= 1
The joint probability density function of Y1 , Y2 , Y3 is
(
1,
0 < y3 < 1, y3 < y2 < y3 + 1, y2 < y1 < y2 + 1
g(y1 , y2, y3) =
0, elsewhere
x1
x1
for discrete variables. If the resulting function can be recognized as the joint moment
generating function of some known probability distribution, it follows that the joint probability distribution of Y1 , Y2 , .., Yk has that probability distribution since moment generating
function when it exists is unique. The same applies to using the characteristic function.
This technique is usually used to determine the probability distribution of functions of
independent random variables.
ExampleFind the probability distribution of the sum of n independent random variables
X1 , X2 , .., Xn that have Poisson distribution with parameters 1 , 2 , .., n respectively.
P
Solution Let Z = ni=1 Xi ; the m.g.f of Z is given by
X
tZ
MZ (t) = E[e ] =
etz f (x1 , x2, .., xn)
z=0
z=0
Y
n
X
z=0 i=1
n X
etxi
n
Y
n
Y
ei xi
i
i=1
xi !
i xi
i
txi e
i=1 xi =0
xi !
ei xi i
xi !
i=1
Pn
i=1
i , therefore Z has