2. Probability Measure and Probability Space
2. Probability Measure and Probability Space
Md Aktar Ul Karim
Probability - Theory and Applications
Symbiosis Statistical Institute, Pune
Contents
2 Probability Measure 2
2.1 Properties of a Probability Measure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
2.2 Solved Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2.3 Continuity of probability measure: . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3 Probability Space 14
3.1 Example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3.2 Properties of a Probability Space: . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.3 Theorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.4 Solved Problems: . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.5 Advanced Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
A sigma field (or σ field) is a collection of sets that is closed under certain operations. It forms the foundation
for probability theory, as it determines the events for which we can assign probabilities.
Definition
Let Ω be a set. A collection F of subsets of Ω is called a sigma field if:
F = {∅, {1}, {2}, {3}, {1, 2}, {1, 3}, {2, 3}, Ω}
F = {∅, Ω}
Sigma Field Generated by an Interval Consider Ω = R and the interval A = (0, 1]. Then we can
generate a sigma field by A and will be denoted by σ(A), which contains all sets that can be formed using A
and the operations in the sigma field definition.
In this case, it includes A, Ac , ∅, and R.
2. Probability Measure
A probability measure is a function P : F → [0, 1] on a sigma field F that satisfies the following axioms:
• Normalization: P (Ω) = 1.
P (B) = P (A) + P (B \ A)
2
This follows from countable additivity.
Proof: We prove this using mathematical induction. For n = 1, the statement is trivially true because:
P (A1 ) = P (A1 )
1. Let Ω = {1, 2, 3}, and suppose P ({1}) = 0.3, P ({2}) = 0.4, and P ({3}) = 0.3. Show that for A = {1, 2}
and B = {2, 3}, P (A ∪ B) = P (A) + P (B) − P (A ∩ B).
Ans: We have:
P (A ∪ B) = P ({1, 2, 3}) = 1, P (A) = P ({1, 2}) = 0.7, P (B) = P ({2, 3}) = 0.7
3
2. Let Ω = [0, 1], and define a sequence of sets An = [1/n, 1]. Compute limn→∞ P (An ) assuming P is the
Lebesgue measure.
1
Sol: The probability of An = [1/n, 1] is the length of the interval, P (An ) = 1 − n. As n → ∞,
P (An ) → 1 − 0 = 1. Therefore, limn→∞ P (An ) = 1.
3. Let P be a probability measure defined on Ω = {1, 2, 3, 4, 5, 6} and F = 2Ω is the power set of Ω, and P
is a probability measure defined as follows:
P ({1}) = 0.1, P ({2}) = 0.2, P ({3}) = 0.1, P ({4}) = 0.2, P ({5}) = 0.1, P ({6}) = 0.3
(a) P (Ω) = 1.
(b) P (∅) = 0.
(c) For disjoint events A = {1, 2, 3} and B = {4, 5}, verify that P (A ∪ B) = P (A) + P (B).
P (∅) = 0
(c.) Here, sets A and B are disjoint, meaning A ∩ B = ∅. So, we need to prove that:
P (A ∪ B) = P (A) + P (B)
P (A) = P ({1, 2, 3}) = P ({1}) + P ({2}) + P ({3}) = 0.1 + 0.2 + 0.1 = 0.4
4
P (B) = P ({4, 5}) = P ({4}) + P ({5}) = 0.2 + 0.1 = 0.3
P (A ∪ B) = P ({1}) + P ({2}) + P ({3}) + P ({4}) + P ({5}) = 0.1 + 0.2 + 0.1 + 0.2 + 0.1 = 0.7
Therefore,
P (A) + P (B) = 0.4 + 0.3 = 0.7 = P (A ∪ B)
4. Let Ω = [0, 1] be the unit interval and F the Borel sigma algebra B([0, 1]). Define a probability measure
P on (Ω, F) such that for any interval [a, b] ⊆ [0, 1],
P ([a, b]) = b − a.
(a) Verify that P is a valid probability measure by checking the following properties:
i. P (Ω) = 1.
ii. For any disjoint intervals A = [a1 , b1 ] and B = [a2 , b2 ], verify that P (A ∪ B) = P (A) + P (B).
(b) Show that P ([0, 0.5]) = 0.5 and P ([0.25, 0.75]) = 0.5.
Ans: (a.) To verify that P is a valid probability measure we have to show that P (Ω) = 1.
The sample space Ω = [0, 1], and we are given that for any interval [a, b] ⊆ [0, 1],
P ([a, b]) = b − a
Thus, P (Ω) = 1
(b.) Let A = [a1 , b1 ] and B = [a2 , b2 ] be two disjoint intervals such that A ∩ B = ∅. We have to show that:
P (A ∪ B) = P (A) + P (B).
Since the intervals A and B are disjoint, the union A ∪ B is another interval. If we consider A = [a1 , b1 ]
and B = [a2 , b2 ] where b1 < a2 , then A ∪ B = [a1 , b1 ] ∪ [a2 , b2 ]
5
Now,
P (A) = P ([a1 , b1 ]) = b1 − a1 , P (B) = P ([a2 , b2 ]) = b2 − a2
Then
P (A) + P (B) = (b1 − a1 ) + (b2 − a2 ).
P (A ∪ B) = P ([a1 , b1 ] ∪ [a2 , b2 ])
P (A ∪ B) = (b1 − a1 ) + (b2 − a2 ).
Now, we have to show that P ([0, 0.5]) = 0.5 and P ([0.25, 0.75]) = 0.5.
5. Let Ω = N be the set of natural numbers and F = 2Ω be the sigma algebra of all subsets of N. Consider
the probability measure P defined as:
1
P ({n}) = , for all n ∈ N.
2n
(b) Prove that P is countably additive, i.e., if A1 , A2 , . . . are disjoint sets in F, then P ( ∞
S
n=1 An ) =
P∞
n=1 P (An ).
Ans: (a.) We are given that Ω = N, and for each n ∈ N, the probability of the singleton set {n} is given
by:
1
P ({n}) = .
2n
6
To show that P (Ω) = 1, we need to sum the probabilities of all singleton sets in N:
∞ ∞
X X 1
P (Ω) = P ({1}) + P ({2}) + P ({3}) + · · · = P ({n}) = .
2n
n=1 n=1
P∞ 1 1
The series n=1 2n is a geometric series with the first term a = 2 and the common ratio r = 12 . The sum
of an infinite geometric series with |r| < 1 is given by:
∞
X a
arn−1 = .
1−r
n=1
P∞ 1
For the series n=1 2n , the sum is:
∞ 1
X 1 2
= 1 = 1.
2n 1− 2
n=1
Thus, P (Ω) = 1.
(b.) To prove that P is countably additive, we need to show that for any collection of disjoint sets
A1 , A2 , . . . in F, the following holds:
∞ ∞
!
[ X
P An = P (An ).
n=1 n=1
Since the sets A1 , A2 , . . . are disjoint, the union will contain all elements from each set without overlap,
so:
∞ ∞ X
!
[ X X 1
P (A) = P An = P ({k}) = .
2k
n=1 k∈A n=1 k∈An
7
The probability measure for any finite set A is the sum of the probabilities of the individual elements:
1
Using the given formula P ({n}) = 2n , we compute:
1 1 1 1 1 1 1 1 53
P (A) = 2
+ 3+ 5+ 7 = + + + = .
2 2 2 2 4 8 32 128 128
53
Thus, P (A) = 128 .
6. Let Ω = {1, 2, 3} and let F be a sigma algebra on Ω that contains the following sets: F = {∅, {1}, {2, 3}, Ω}.
Define a probability measure P on F such that:
Sol. Given:
We need to prove that P is a valid probability measure and calculate P for all sets in F.
8
Also:
P (A) + P (B) = P ({1}) + P ({2, 3}) = 0.2 + 0.8 = 1.
P (∅) = 0.
P (Ω) = 1.
7. Let Ω = [0, 1] and let F be the Borel sigma algebra on [0, 1]. Define P as the Lebesgue measure on [0, 1],
where P ([a, b]) = b − a for any interval [a, b] ⊆ [0, 1].
(c) Prove that P ([a, b]) is countably additive over disjoint intervals.
9
Sol. (a.) To prove P is a valid probability measure, we have to show it satisfies all three properties:
non-negativity, normalization, and countable additivity.
For any interval [a, b] ⊆ [0, 1], the measure is given by:
P ([a, b]) = b − a.
Since b ≥ a, we have b − a ≥ 0, which ensures that P ([a, b]) ≥ 0. This confirms that P is non-negative.
For the whole space Ω = [0, 1], we need to prove P (Ω) = 1.
Since Ω = [0, 1], we compute:
P ([0, 1]) = 1 − 0 = 1.
Since P ([a, b]) = b − a for any interval [a, b], this property holds by the definition of the Lebesgue measure.
Therefore, P is countably additive.
Thus, P satisfies the properties of a probability measure.
Thus, P (A) = 1.
i ̸= j.
10
We want to prove that:
∞ ∞
!
[ X
P [ai , bi ] = P ([ai , bi ]).
i=1 i=1
P ([ai , bi ]) = bi − ai .
S∞
Since the intervals are disjoint, the union i=1 [ai , bi ] forms a new set of intervals with a total length equal
to the sum of the individual lengths.
Thus, we have:
∞ ∞
!
[ X
P [ai , bi ] = (bi − ai ).
i=1 i=1
But the right-hand side is just the sum of the individual measures:
∞ ∞
!
[ X
P [ai , bi ] = P ([ai , bi ]).
i=1 i=1
Proof. The event lim sup An means a set of outcomes in Ω that belong to infinitely many of the events An . So
we can write:
lim sup An = {ω ∈ Ω : ω ∈ An for infinitely many n} .
This can be written as:
∞ [
\ ∞
lim sup An = Ak .
N =1 k=N
This means that for every N , there is some k ≥ N such that the event Ak occurs.
Now, we define a sequence of events BN as:
∞
[
BN = Ak .
k=N
T∞
Notice that lim sup An = N =1 BN , which implies:
∞
!
\
P (lim sup An ) = P BN .
N =1
For each N , the probability of the event BN can be bounded using the union bound (also known as Boole’s
inequality):
∞ ∞
!
[ X
P (BN ) = P Ak ≤ P (Ak ).
k=N k=N
11
By assumption, ∞
P P∞
n=1 P (An ) < ∞. Therefore, the tail sum k=N P (Ak ) converges to zero as N → ∞.
Specifically, for any ϵ > 0, there exists an N such that:
∞
X
P (Ak ) < ϵ.
k=N
lim P (BN ) = 0.
N →∞
P∞
Theorem 2.2. Second Borel-Cantelli Lemma: If A1 , A2 , . . . are independent and n=1 P (An ) = ∞, then the
probability that infinitely many An occur is 1, i.e., P (lim sup An ) = 1.
The continuity of a probability measures the behaviour of the probabilities with respect to increasing or
decreasing sequences of events.
12
Example
1
P (An ) = 1 − , lim P (An ) = P ([0, 1]) = 1.
n n→∞
1
P (An ) = , lim P (An ) = 0.
n n→∞
1
P (Bn ) = 1 − , lim P (Bn ) = P (∅) = 0.
n n→∞
Since Bn → ∅ as n → ∞, we have:
∞
!
\
P An = lim P (An )
n→∞
n=1
13
3. Probability Space
A probability space is a mathematical framework, used to model random experiments. It consists of three
main components: a sample space, a sigma field, and a probability measure.
Hence, a probability space is a triple (Ω, F, P ), where:
• Ω is the sample space, the set of all possible outcomes of a random experiment.
• F is a sigma field (or sigma algebra) on Ω, which contains all the events we are interested in (subsets of
Ω).
• P is a probability measure, a function that assigns a probability to each event in F, satisfying the axioms
of probability.
3.1. Example
1. For a continuous random variable like the uniform distribution on [0, 1], we can write:
• Ω = [0, 1],
P ((a, b]) = b − a.
In this way, we can define a probability space for a continuous random variable.
2. Define the probability space for the result of tossing a fair coin.
3. Problem: Define the probability space for rolling a fair six-sided die.
14
The probability space is (Ω, F, P ), where P is uniform over Ω.
• Total Probability:
The probability of the entire sample space is 1, i.e., P (Ω) = 1.
• Additivity:
If A1 , A2 , . . . , An ∈ F are mutually disjoint (i.e., Ai ∩ Aj = ∅ for i ̸= j), then:
n n
!
[ X
P Ai = P (Ai )
i=1 i=1
• Monotonicity:
If A ⊆ B, then P (A) ≤ P (B).
3.3. Theorems
Theorem 3.1. Complement Rule: For any event A ∈ F, the probability of its complement is:
P (Ac ) = 1 − P (A)
Proof. Since A ∪ Ac = Ω and A ∩ Ac = ∅, by additivity:
P (A) + P (Ac ) = P (Ω) = 1
Thus, P (Ac ) = 1 − P (A).
Theorem 3.2. Inclusion-Exclusion Principle: For any two events A, B ∈ F, the probability of their union
is:
P (A ∪ B) = P (A) + P (B) − P (A ∩ B)
Proof. By considering the union A ∪ B, we can write:
P (A ∪ B) = P (A) + P (B \ A)
Since B \ A is the part of B not in A, then:
P (A ∪ B) = P (A) + P (B) − P (A ∩ B)
15
Theorem 3.3. Total Probability: Let {Ai }ni=1 be a partition of Ω, where each Ai ∈ F and Ai ∩ Aj = ∅ for
i ̸= j. Then, for any event B ∈ F, we have:
n
X
P (B) = P (B ∩ Ai )
i=1
1. Consider a company that insures two types of devices: smartphones and laptops. The probability that a
smartphone is insured is 0.7, and the probability that a laptop is insured is 0.6. What is the probability
that a randomly chosen smartphone is not insured?
Sol.
By using the complement rule:
4. Let Ω = R, and let F = B(R) be the Borel sigma algebra. Consider the probability measure P defined
as:
Z b
1 −x2
P ([a, b]) = √ e 2 dx,
2π a
16
Sol. First, we have to verify P (R) = 1
The probability measure P is defined as the standard normal distribution, and we know that the total
probability over the real line is 1, i.e.,:
Z ∞
1 x2
P (R) = √ e− 2 dx = 1.
2π −∞
As n → ∞, this probability decreases rapidly since the tail of the normal distribution decays exponentially.
1. Consider a biased coin that lands on heads with probability p and tails with probability 1 − p. Instead
of focusing on individual coin tosses, imagine you are interested in sequences of outcomes over an infinite
number of tosses.
• Sample Space: The sample space Ω consists of all possible infinite sequences of heads and tails:
Ω = {(H, T, H, . . . ), (T, H, T, . . . ), . . . }.
• Sigma Algebra: F is the Borel sigma algebra on infinite sequences, which also allows us to consider
complex events like ”the sequence has infinitely many heads or tails.”
• Probability Measure: The probability measure assigns probabilities to these sequences. For example,
the probability of obtaining a sequence where the first toss is head is:
17
2. Model a sequence of biased coin tosses where each toss has a different bias. What would the probability
space look like, and how would you define the measure?
Sol. We are modeling a sequence of biased coin tosses where each toss has a different bias. The triplets
of probability space will be defined by:
(a) Sample Space (Ω): Each toss results in either a Head (H) or Tail (T ).
For an infinite sequence of tosses, the sample space is Ω = {H, T }∞ , representing all possible infinite
sequences of heads and tails.
Each element in Ω represents a sequence like (H, T, H, H, . . . ).
(b) Sigma Algebra (F): The sigma algebra (F) is the Borel sigma algebra on Ω.
It contains all events (sets of sequences) that we are interested in. This includes sets like ”the first
toss is a head” or ”the second toss is a tail”, as well as more complex sets involving multiple tosses.
(c) Probability Measure (P ): Now we define the measure for a biased coin for each toss. Let pn be the
probability of getting a head (H) on the n-th toss.
Then the probability of a sequence ω = (ω1 , ω2 , . . . ), where ωn ∈ {H, T }, can be written as:
∞
Y
P (ω) = [pn · 1(ωn = H) + (1 − pn ) · 1(ωn = T )]
n=1
Exercise
Consider a continuous probability space (Ω, F, P ), where Ω = [0, 1] and F is the Borel sigma algebra B([0, 1]).
The probability measure P is given by the Lebesgue measure, so for any interval [a, b] ⊆ [0, 1], we have
P ([a, b]) = b − a.
h i S∞
n−1 n
1. Define An = n , n+1 . Calculate P (An ) and find P ( n=1 An ).
1 1
S∞
2. Let Bn = 2n , 2n−1 . Compute P ( n=1 Bn ), and verify whether countable additivity holds for this
sequence of disjoint sets.
18