Conditional Expectation: Scott Sheffield

This document summarizes a lecture on conditional expectation. It begins by reviewing conditional probability distributions, defining the conditional probability mass or density function. It then defines conditional expectation as the expected value of a random variable X under the conditional probability measure given some other random variable Y equals y. Conditional expectation can be written as a sum or integral involving the conditional probability distribution. Finally, it notes that conditional expectation E[X|Y] can itself be viewed as a random variable that depends on the value of Y.


18.600: Lecture 25
Conditional expectation

Scott Sheffield

MIT
Outline

Conditional probability distributions

Conditional expectation

Interpretation and examples



Recall: conditional probability distributions

- It all starts with the definition of conditional probability: P(A|B) = P(AB)/P(B).
- If X and Y are jointly discrete random variables, we can use this to define a probability mass function for X given Y = y.
- That is, we write p_{X|Y}(x|y) = P{X = x | Y = y} = p(x, y)/p_Y(y).
- In words: first restrict the sample space to pairs (x, y) with the given y value. Then divide the original mass function by p_Y(y) to obtain a probability mass function on the restricted space.
- We do something similar when X and Y are continuous random variables. In that case we write f_{X|Y}(x|y) = f(x, y)/f_Y(y).
- It is often useful to think of sampling (X, Y) as a two-stage process. First sample Y from its marginal distribution, obtaining Y = y for some particular y. Then sample X from its probability distribution given Y = y.
- The marginal law of X is a weighted average of the conditional laws.
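The last bullet can be checked numerically. The following Python sketch (not part of the lecture; the joint pmf is a made-up example) builds p_{X|Y} from a joint pmf and verifies that the marginal of X equals the p_Y-weighted average of the conditional laws:

```python
# Hypothetical joint pmf on {0,1} x {0,1}; any pmf summing to 1 would do.
p = {
    (0, 0): 0.1, (0, 1): 0.3,
    (1, 0): 0.2, (1, 1): 0.4,
}

xs = sorted({x for x, _ in p})
ys = sorted({y for _, y in p})

p_Y = {y: sum(p[(x, y)] for x in xs) for y in ys}            # marginal of Y
cond = {(x, y): p[(x, y)] / p_Y[y] for x in xs for y in ys}  # p_{X|Y}(x|y)

# Marginal of X two ways: directly, and as a weighted average of conditionals.
p_X_direct = {x: sum(p[(x, y)] for y in ys) for x in xs}
p_X_mixed = {x: sum(p_Y[y] * cond[(x, y)] for y in ys) for x in xs}

for x in xs:
    assert abs(p_X_direct[x] - p_X_mixed[x]) < 1e-12
```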
Example

- Let X be the value on one die roll, Y the value on a second die roll, and write Z = X + Y.
- What is the probability distribution for X given that Y = 5?
- Answer: uniform on {1, 2, 3, 4, 5, 6}.
- What is the probability distribution for Z given that Y = 5?
- Answer: uniform on {6, 7, 8, 9, 10, 11}.
- What is the probability distribution for Y given that Z = 5?
- Answer: uniform on {1, 2, 3, 4}.
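A quick Monte Carlo sketch in Python (a supplementary check, not from the slides) confirms the two conditional distributions involving a conditioning event: given Y = 5, X is roughly uniform on {1,...,6}, and given Z = 5, Y is roughly uniform on {1, 2, 3, 4}:

```python
import random
from collections import Counter

random.seed(0)
rolls = [(random.randint(1, 6), random.randint(1, 6)) for _ in range(200_000)]

x_given_y5 = Counter(x for x, y in rolls if y == 5)          # X | Y = 5
y_given_z5 = Counter(y for x, y in rolls if x + y == 5)      # Y | Z = 5

# Each conditional frequency should be close to uniform.
n1 = sum(x_given_y5.values())
assert all(abs(x_given_y5[x] / n1 - 1 / 6) < 0.02 for x in range(1, 7))

n2 = sum(y_given_z5.values())
assert all(abs(y_given_z5[y] / n2 - 1 / 4) < 0.02 for y in range(1, 5))
```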
Outline

Conditional probability distributions

Conditional expectation

Interpretation and examples


Conditional expectation

- Now, what do we mean by E[X | Y = y]? This should just be the expectation of X in the conditional probability measure for X given that Y = y.
- Can write this as E[X | Y = y] = Σ_x x P{X = x | Y = y} = Σ_x x p_{X|Y}(x|y).
- Can make sense of this in the continuum setting as well.
- In the continuum setting we had f_{X|Y}(x|y) = f(x, y)/f_Y(y). So E[X | Y = y] = ∫ x f(x, y)/f_Y(y) dx.
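The discrete formula can be computed exactly for the two-dice example, where p(x, y) = 1/36 for all pairs. This Python sketch (illustrative, not from the lecture) evaluates E[X | Y = y] = Σ_x x · p(x, y)/p_Y(y):

```python
from fractions import Fraction

# Joint pmf of two independent fair dice.
p = {(x, y): Fraction(1, 36) for x in range(1, 7) for y in range(1, 7)}

def cond_expectation(y):
    """E[X | Y = y] computed from the conditional pmf."""
    p_y = sum(p[(x, y)] for x in range(1, 7))        # p_Y(y)
    return sum(x * p[(x, y)] / p_y for x in range(1, 7))

# Given Y = 5, X is uniform on {1,...,6}, so its mean is 7/2.
assert cond_expectation(5) == Fraction(7, 2)
```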
Example

- Let X be the value on one die roll, Y the value on a second die roll, and write Z = X + Y.
- What is E[X | Y = 5]?
- What is E[Z | Y = 5]?
- What is E[Y | Z = 5]?
- Answers: by the conditional distributions from the earlier example, E[X | Y = 5] = 7/2, E[Z | Y = 5] = 17/2, and E[Y | Z = 5] = 5/2.
Conditional expectation as a random variable

- Can think of E[X|Y] as a function of the random variable Y. When Y = y it takes the value E[X|Y = y].
- So E[X|Y] is itself a random variable. It happens to depend only on the value of Y.
- Thinking of E[X|Y] as a random variable, we can ask what its expectation is. What is E[E[X|Y]]?
- Very useful fact: E[E[X|Y]] = E[X].
- In words: what you expect to expect X to be after learning Y is the same as what you now expect X to be.
- Proof in discrete case: E[X|Y = y] = Σ_x x P{X = x | Y = y} = Σ_x x p(x, y)/p_Y(y).
- Recall that, in general, E[g(Y)] = Σ_y p_Y(y) g(y).
- So E[E[X|Y]] = Σ_y p_Y(y) Σ_x x p(x, y)/p_Y(y) = Σ_y Σ_x x p(x, y) = E[X].
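The tower property E[E[X|Y]] = E[X] can be verified exactly on a small example. This Python sketch (with a made-up joint pmf, chosen only for illustration) computes both sides with exact rational arithmetic:

```python
from fractions import Fraction

# A hypothetical joint pmf on {1,2} x {0,1}; the weights sum to 1.
p = {(1, 0): Fraction(1, 6), (2, 0): Fraction(1, 3),
     (1, 1): Fraction(1, 4), (2, 1): Fraction(1, 4)}

xs, ys = [1, 2], [0, 1]
p_Y = {y: sum(p[(x, y)] for x in xs) for y in ys}

def e_x_given(y):
    """E[X | Y = y] = sum_x x * p(x, y) / p_Y(y)."""
    return sum(x * p[(x, y)] / p_Y[y] for x in xs)

lhs = sum(p_Y[y] * e_x_given(y) for y in ys)   # E[E[X|Y]]
rhs = sum(x * p[(x, y)] for (x, y) in p)       # E[X]
assert lhs == rhs
```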
Conditional variance

- Definition: Var(X|Y) = E[(X − E[X|Y])² | Y] = E[X² − E[X|Y]² | Y].
- Var(X|Y) is a random variable that depends on Y. It is the variance of X in the conditional distribution for X given Y.
- Note E[Var(X|Y)] = E[E[X²|Y]] − E[E[X|Y]²] = E[X²] − E[E[X|Y]²].
- If we subtract E[X]² from the first term and add the equivalent value E[E[X|Y]]² to the second, the right-hand side becomes Var(X) − Var(E[X|Y]), which implies the following:
- Useful fact: Var(X) = Var(E[X|Y]) + E[Var(X|Y)].
- One can discover X in two stages: first sample Y from its marginal distribution and compute E[X|Y], then sample X from its distribution given the Y value.
- The fact above breaks the variance into two parts, corresponding to these two stages.
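The two-stage view suggests a simulation check. In this Python sketch (an assumed toy model, not from the slides), Y is a fair coin and, given Y, X is normal with mean 2Y and standard deviation 1 + Y. Then Var(E[X|Y]) = Var(2Y) = 1 and E[Var(X|Y)] = (1² + 2²)/2 = 2.5, so Var(X) should be about 3.5:

```python
import random
import statistics

random.seed(1)
ys = [random.randint(0, 1) for _ in range(200_000)]          # stage 1: sample Y
xs = [random.gauss(2 * y, 1 + y) for y in ys]                # stage 2: X given Y

total_var = statistics.pvariance(xs)

# Var(X) = Var(E[X|Y]) + E[Var(X|Y)] = 1 + 2.5 = 3.5 in this model.
assert abs(total_var - 3.5) < 0.1
```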
Example

- Let X be a random variable of variance σ_X² and Y an independent random variable of variance σ_Y², and write Z = X + Y. Assume E[X] = E[Y] = 0.
- What are the covariances Cov(X, Y) and Cov(X, Z)?
- How about the correlation coefficients ρ(X, Y) and ρ(X, Z)?
- What is E[Z|X]? And how about Var(Z|X)?
- Both of these values are functions of X. The former is just X. The latter happens to be a constant-valued function of X, i.e., happens not to actually depend on X. We have Var(Z|X) = σ_Y².
- Can we check the formula Var(Z) = Var(E[Z|X]) + E[Var(Z|X)] in this case?
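The check can be carried out exactly for a concrete choice of X and Y: two independent fair dice, each with variance 35/12. Here Var(E[Z|X]) = Var(X) and E[Var(Z|X)] = Var(Y), so both sides should equal 35/12 + 35/12. A Python sketch (illustrative, assuming this dice setup):

```python
from fractions import Fraction

vals = range(1, 7)
mean = Fraction(sum(vals), 6)                    # 7/2
var = sum((v - mean) ** 2 for v in vals) / 6     # 35/12

# Left side: Var(Z) computed over the uniform joint pmf on all 36 pairs.
z_mean = 2 * mean
var_z = sum(Fraction(1, 36) * (x + y - z_mean) ** 2
            for x in vals for y in vals)

# Right side: E[Z|X] = X + 7/2 has variance Var(X); Var(Z|X) = Var(Y) always.
assert var_z == var + var
```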
Outline

Conditional probability distributions

Conditional expectation

Interpretation and examples


Interpretation

- Sometimes we think of the expectation E[Y] as a best guess or best predictor of the value of Y.
- It is best in the sense that, among all constants m, the expectation E[(Y − m)²] is minimized when m = E[Y].
- But what if we allow non-constant predictors? What if the predictor is allowed to depend on the value of a random variable X that we can observe directly?
- Let g(x) be such a function. Then E[(Y − g(X))²] is minimized when g(X) = E[Y|X].
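This best-predictor property can be seen in a simulation. In the Python sketch below (an assumed toy model, not from the lecture), Y = X² plus standard normal noise, so E[Y|X] = X²; its mean squared error beats some alternative predictors:

```python
import random

random.seed(2)
pairs = [(x, x * x + random.gauss(0, 1))
         for x in (random.uniform(-1, 1) for _ in range(100_000))]

def mse(g):
    """Empirical estimate of E[(Y - g(X))^2]."""
    return sum((y - g(x)) ** 2 for x, y in pairs) / len(pairs)

best = mse(lambda x: x * x)          # g(X) = E[Y|X]

# Any other predictor should do at least as badly (up to sampling noise).
for other in (lambda x: 0.0, lambda x: x, lambda x: x * x + 0.5):
    assert best < mse(other)
```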
Examples

- Toss 100 coins. What's the conditional expectation of the number of heads given that there are k heads among the first fifty tosses?
- k + 25
- What's the conditional expectation of the number of aces in a five-card poker hand given that the first two cards in the hand are aces?
- 2 + 3 · 2/50
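The coin answer can be checked by simulation: the last fifty tosses are independent of the first fifty, so they contribute 25 heads on average regardless of k. A Python sketch (supplementary, with k = 20 chosen arbitrarily):

```python
import random

random.seed(3)
k, totals = 20, []
for _ in range(300_000):
    first = bin(random.getrandbits(50)).count("1")     # heads in first 50 tosses
    if first == k:
        last = bin(random.getrandbits(50)).count("1")  # heads in last 50 tosses
        totals.append(first + last)

avg = sum(totals) / len(totals)
assert abs(avg - (k + 25)) < 0.5   # conditional expectation is k + 25
```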
