0% found this document useful (0 votes)
4 views

Exponential Distribution Unit 2

The document discusses the exponential distribution, a continuous probability distribution used to model the time between successive events in a Poisson distribution. It covers key concepts such as the probability density function, mean, variance, and cumulative distribution function, along with worked examples and solutions. The document serves as a guide for understanding and applying the exponential distribution in statistical contexts.

Uploaded by

raj.mvr85
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views

Exponential Distribution Unit 2

The document discusses the exponential distribution, a continuous probability distribution used to model the time between successive events in a Poisson distribution. It covers key concepts such as the probability density function, mean, variance, and cumulative distribution function, along with worked examples and solutions. The document serves as a guide for understanding and applying the exponential distribution in statistical contexts.

Uploaded by

raj.mvr85
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

GCE

GCSE
FURTHER MATHEMATICS
AS UNIT 2: FURTHER STATISTICS A

Exponential Distribution
Further Mathematics AS Unit 2: Further Statistics A

Exponential Distribution ..................................................................................................... 2


Continuous Probability Distributions.............................................................................................. 2
The Exponential Distribution........................................................................................................... 3
Specification Content ....................................................................................................................... 3
Worked Example 1 ........................................................................................................................... 4
Worked Example 2 ........................................................................................................................... 5
Worked Example 3 ........................................................................................................................... 6
Summary of Key Points for the Exponential Distribution ........................................................... 6
Questions ............................................................................................................................ 7
Question 1 Worked Solution ........................................................................................................... 9
Question 2 Worked Solution ........................................................................................................... 9
Question 3 Worked Solution ......................................................................................................... 10
Question 4 Worked Solution ......................................................................................................... 10
Question 5 Worked Solution ......................................................................................................... 11
Question 6 Worked Solution ......................................................................................................... 12

1
Further Mathematics AS Unit 2: Further Statistics A

Exponential Distribution
The exponential distribution is a continuous probability distribution. It can be used to
describe the time successive events from a Poisson distribution. For example, the number of
cars passing a point on a motorway could be modelled by a Poisson distribution (assuming
the modelling assumptions are met). In this case, the intervals of time between successive
cars follow an exponential distribution.

Before introducing the exponential distribution, let us revisit the key concepts for continuous
probability distributions.

Continuous Probability Distributions


In this unit, you have already been introduced to continuous probability distributions. Recall
that a continuous random variable 𝑋 can be defined using a probability density function,
denoted 𝑓(𝑥), such that areas under the curve represent probabilities. We assume that 𝑓(𝑥)
is defined between in some interval 𝑎 ≤ 𝑥 ≤ 𝑏, and zero elsewhere. Let 𝑐 and 𝑑 be such that
𝑎 ≤ 𝑐 ≤ 𝑑 ≤ 𝑏, then:
𝑏 𝑑 𝑑 𝑏
∫ 𝑓(𝑥)dx = 1 , 𝑃(𝑐 ≤ 𝑋 ≤ 𝑑) = ∫ 𝑓(𝑥) dx , 𝑃(𝑋 ≤ 𝑑) = ∫ 𝑓(𝑥)dx , 𝑃(𝑋 ≥ 𝑐) = ∫ 𝑓(𝑥) dx.
𝑎 𝑐 𝑎 𝑐

Note that 𝑃(𝑋 = 𝑥) = 0 for any value of 𝑥. The probability density function can also be used
to calculate expected values as well as the variance and standard deviation of the random
variable 𝑋 as follows:
𝑏 𝑏 𝑏
𝐸(𝑋) = ∫ 𝑥𝑓(𝑥) dx, 𝐸(𝑋 2 ) = ∫ 𝑥 2 𝑓(𝑥) dx, 𝐸(𝑔(𝑋)) = ∫ 𝑔(𝑥)𝑓(𝑥) dx.
𝑎 𝑎 𝑎

From this, we can calculate the variance and standard deviation using:
2
Var(𝑋) = 𝐸(𝑋 2 ) − (𝐸(𝑋)) , SD(𝑋) = √𝑉𝑎𝑟(𝑋).

Calculation of probabilities is often easier using the cumulative distribution function 𝐹(𝑥).
This is calculated using:
𝑥
𝐹(𝑥) = ∫ 𝑓(𝑡) dt.
𝑎

Note that if 𝑓(𝑥) is defined for all real values of 𝑥, the lower limit of 𝑎 is replaced by −∞.

Assuming 𝑓(𝑥) is defined on the interval 𝑎 ≤ 𝑥 ≤ 𝑏, and is therefore zero elsewhere, then:
0 if 𝑥 < 𝑎
𝑥
𝐹(𝑥) = {∫ 𝑓(𝑡) dt if 𝑎 ≤ 𝑥 ≤ 𝑏
𝑎
1 if 𝑥 > 𝑏

Observe that since 𝐹 is obtained from 𝑓 by integration, we can also obtain 𝑓 from 𝐹 by
differentiation. Hence:
𝑓(𝑥) = 𝐹 ′ (𝑥).

Let 𝑐 and 𝑑 be such that 𝑎 ≤ 𝑐 ≤ 𝑑 ≤ 𝑏, then:


𝑃(𝑋 ≤ 𝑐) = 𝐹(𝑐), 𝑃(𝑐 ≤ 𝑋 ≤ 𝑑) = 𝐹(𝑑) − 𝐹(𝑐), 𝑃(𝑋 ≥ 𝑑) = 1 − 𝐹(𝑑).

In this section, we consider a specific example of a continuous probability distribution


function, namely the exponential distribution.

2
Further Mathematics AS Unit 2: Further Statistics A

The Exponential Distribution


The exponential distribution with parameter 𝜆 can be used to model the time between two
successive events from a Poisson distribution with mean 𝜆.

Specification Content
• Statistical distributions: exponential distribution
• Find and use the mean and variance of an exponential distribution
1 1
– knowledge and use of: If 𝑌~Exp(𝜆), then 𝐸(𝑌) = 𝜆 and 𝑉𝑎𝑟(𝑌) = 𝜆2 .
• Use the exponential distribution as a model for intervals between events
𝑑
– learners will be expected to know that 𝑑𝑥 (𝑒 𝑘𝑥 ) = 𝑘𝑒 𝑘𝑥 .

Probability Density Function


The probability density function for the exponential distribution is:
−𝜆𝑥
𝑓(𝑥) = { 𝜆𝑒 , 𝑥≥0
0 otherwise.

Here, 𝜆 is a parameter for the distribution, which is constant. If a random variable 𝑋 has this
distribution, we write 𝑋~Exp(𝜆).

Mean and Variance


1 1
The mean of the exponential distribution is 𝜆. The standard deviation is also 𝜆. Therefore, the
1
variance is 𝜆2 . Let 𝑋~Exp(𝜆), then:
1 1 1
𝐸(𝑋) = , Var(𝑋) = , SD(𝑋) = .
𝜆 𝜆2 𝜆

These results can be derived using integration, but are not required for this unit as
knowledge of integration from Mathematics A2 Unit 3, as well as knowledge of improper
integrals from Further Mathematics A2 Unit 4 is needed.

Derivative and integral of 𝑒 𝑘𝑥


For this module, it is important to know the following derivative:
𝑑 𝑘𝑥
(𝑒 ) = 𝑘𝑒 𝑘𝑥 .
𝑑𝑥

Conversely, we have that:


𝑘𝑥 𝑘𝑥 𝑘𝑥
𝑒 𝑘𝑥
∫ 𝑘𝑒 dx = 𝑒 +𝐶 and ∫𝑒 dx = + 𝐶.
𝑘

Cumulative Distribution Function


We derive the cumulative distribution function as follows:
𝑥
𝐹(𝑥) = ∫ 𝜆𝑒 −𝜆𝑡 dt
0
𝑥
𝜆𝑒 −𝜆𝑡
=[ ]
−𝜆 0
𝑥
= [−𝑒 −𝜆𝑡 ]0
= −𝑒 −𝜆𝑥 + 𝑒 0
= 1 − 𝑒 −𝜆𝑥 .

Therefore, the cumulative distribution function is given by:


−𝜆𝑥
𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥) = { 1 − 𝑒 , 𝑥≥0
0 otherwise.

3
Further Mathematics AS Unit 2: Further Statistics A

Calculating Probabilities for Continuous Random Variables


The exponential distribution is an example of a continuous random variable. The rules for
calculating probabilities using continuous random variables are different to those for discrete
random variables. Let 𝑋 be a continuous random variable and let 𝑎, 𝑏, 𝑐 and 𝑑 be constants.
Then:
• 𝑃(𝑋 = 𝑎) = 0
• 𝑃(𝑋 ≥ 𝑏) = 𝑃(𝑋 > 𝑏) = 1 − 𝑃(𝑋 ≤ 𝑏)
• 𝑃(𝑐 ≤ 𝑋 ≤ 𝑑) = 𝑃(𝑐 < 𝑋 ≤ 𝑑) = 𝑃(𝑐 ≤ 𝑋 < 𝑑) = 𝑃(𝑐 < 𝑋 < 𝑑) = 𝑃(𝑋 ≤ 𝑑) − 𝑃(𝑋 ≤ 𝑐).

Worked Example 1
The interval, 𝑋 seconds, between cars passing a point on a motorway follows an exponential
distribution with probability density function
−2𝑥
𝑓(𝑥) = { 2𝑒 , 𝑥≥0
0 otherwise.
(i) State the mean and variance of 𝑋.
(ii) State the cumulative distribution function.
(iii) Calculate the probability that (give all answers to 3 significant figures):
a. The interval until the next car passes is between 1 and 2 seconds.
b. The interval until the next car passes is longer than 3 seconds.
c. The interval until the next car passes is less than 1.5 seconds.
(iv) State a distribution that could be used to model the number of cars passing the
point each second, giving the values of any parameters.

Solution:
(i) This is an exponential distribution with parameter 𝜆 = 2. Therefore, the mean is
1 1 1 1
𝐸(𝑋) = = and Var(𝑋) = 2 = .
𝜆 2 𝜆 4

−2𝑥
(ii) The cumulative distribution function is 𝐹(𝑥) = { 1 − 𝑒 , 𝑥≥0
0 otherwise.

(iii) Each probability can be calculated using the cumulative distribution function.
a. We calculate 𝑃(1 ≤ 𝑋 ≤ 2) as follows:
𝑃(1 ≤ 𝑋 ≤ 2) = 𝑃(𝑋 ≤ 2) − 𝑃(𝑋 ≤ 1)
= 𝐹(2) − 𝐹(1)
= (1 − 𝑒 −4 ) − (1 − 𝑒 −2 )
= 𝑒 −2 − 𝑒 −4
= 0.117.
b. We calculate 𝑃(𝑋 > 3) as follows:
𝑃(𝑋 > 3) = 1 − 𝑃(𝑋 ≤ 3)
= 1 − 𝐹(3)
= 1 − (1 − 𝑒 −6 )
= 𝑒 −6
= 0.00248.
c. We calculate 𝑃(𝑋 < 1.5) as follows:
𝑃(𝑋 < 1.5) = 𝐹(1.5)
= 1 − 𝑒3
= 0.950.

(iv) The Poisson distribution with mean 2 could be used to model the number of cars
passing the point each second.
Note: the exponential distribution says that the mean length of an interval is 0.5
seconds. This could correspond to an average of 2 cars passing the point every
second, as specified in the corresponding Poisson distribution.
4
Further Mathematics AS Unit 2: Further Statistics A

Worked Example 2
The number of potholes on a random 1 km section of rural road has a Poisson distribution
with mean 1.6. Let 𝑋 km be the distance between successive potholes on the road.
(i) Show that, for 𝑥 ≥ 0,
𝑃(𝑋 > 𝑥) = 𝑒 −1.6𝑥 .
(ii) Derive the cumulative distribution function (CDF) and probability density function
(PDF) for 𝑋.
(iii) Calculate the mean and median distance between successive potholes. State which
of the mean or median is largest, and what this suggests about the distribution of the
distance between successive potholes.

Solution:
(i) The number of potholes on a randomly chosen 1 km stretch of rural road can be
modelled by Po(1.6). We know that 𝑋 is the distance between two successive potholes. If
𝑋 > 𝑥, this means there are no faults in the first 𝑥 km. The number of the faults in the first 𝑥
km can be modelled using 𝑌~Po(1.6𝑥).

Hence:
𝑒 −1.6𝑥 × (1.6𝑥)0
𝑃(𝑋 > 𝑥) = 𝑃(𝑌 = 0) = = 𝑒 −1.6𝑥 .
0!

(ii) The CDF can be obtained using 𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥) = 1 − 𝑃(𝑋 > 𝑥) = 1 − 𝑒 −1.6𝑥 .

Therefore:
−1.6𝑥
𝐹(𝑥) = { 1 − 𝑒 , 𝑥≥0
0 otherwise.

The PDF can then be derived by differentiating, i.e.


𝑓(𝑥) = 𝐹 ′ (𝑥) = 0 − (−1.6𝑒 −1.6𝑥 ) = 1.6𝑒 −1.6𝑥 .

Hence:
−1.6𝑥
𝑓(𝑥) = { 1.6𝑒 , 𝑥≥0
0 otherwise.
1 1
(iii) The mean distance is = = 0.625 km = 625 m.
𝜆 1.6

The median, 𝑚, satisfies 𝐹(𝑚) = 0.5. Hence:


1 − 𝑒 −1.6𝑚 = 0.5
𝑒 −1.6𝑚 = 0.5
−1.6𝑚 = ln 0.5
ln 0.5
𝑚=
−1.6
𝑚 = 0.433.
Therefore, the median distance is 433 m.

The median distance of 433 m is smaller than the mean distance of 625 m. This suggests
that the distribution of the distance between successive potholes is skewed to the right
(positively skewed).

5
Further Mathematics AS Unit 2: Further Statistics A

Worked Example 3
A monitor issues a warning signal when an action is needed as part of a production process.
The interval, 𝑋 hours, between successive signals follows an exponential distribution with
parameter 0.08.
(i) Find the probability that the interval between the next two signals is:
a. Between 10 and 20 hours;
b. Less than two hours;
c. Longer than 50 hours.
(ii) State the mean and standard deviation of the intervals between successive
signals.
(iii) Following a warning signal, what is the longest time the production process could
be left unsupervised whilst ensuring the probability of missing the next signal is
less than 0.01?

Solution:
(i) We have an exponential distribution with parameter 𝜆 = 0.08. The cumulative
−0.08𝑥
distribution function is given by 𝐹(𝑥) = { 1 − 𝑒 , 𝑥≥0
0 otherwise.

a. 𝑃(10 ≤ 𝑋 ≤ 20) = 𝐹(20) − 𝐹(10) = (1 − 𝑒 −1.6 ) − (1 − 𝑒 −0.8 ) = 0.247.


b. 𝑃(𝑋 < 2) = 𝐹(2) = 1 − 𝑒 −0.16 = 0.148.
c. 𝑃(𝑋 > 50) = 1 − 𝑃(𝑋 ≤ 50) = 1 − 𝑒 −4 = 0.0183.
1 1 1
(ii) 𝐸(𝑋) = 𝜆 = 0.08 = 12.5 hours and SD(𝑋) = 𝜆 = 12.5 hours.

(iii) We wish to find the time, 𝑡, for which:


𝑃(𝑋 < 𝑡) = 0.01
1 − 𝑒 −0.08𝑡 = 0.01
𝑒 −0.08𝑡 = 0.99
−0.08𝑡 = ln 0.99
ln 0.99
𝑡=
−0.08
𝑡 = 0.1256 hours
𝑡 = 7.54 minutes.
Therefore, the production process should be left for no longer than 7.54 minutes
to ensure the probability of missing a signal is less than 0.01.

Summary of Key Points for the Exponential Distribution


If 𝑋~Exp(𝜆), then:
1. The probability density function 𝑓(𝑥) is:
−𝜆𝑥
𝑓(𝑥) = { 𝜆𝑒 , 𝑥≥0
0 otherwise.

2. The mean, variance and standard deviation are:


1 1 1
𝐸(𝑋) = , Var(𝑋) = 2 , SD(𝑋) = .
𝜆 𝜆 𝜆

3. The cumulative distribution function 𝐹(𝑥) is:


−𝜆𝑥
𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥) = { 1 − 𝑒 , 𝑥≥0
0 otherwise.

Useful Results:
𝑑 𝑘𝑥 𝑒 𝑘𝑥
(𝑒 ) = 𝑘𝑒 𝑘𝑥 , ∫ 𝑘𝑒 𝑘𝑥 dx = 𝑒 𝑘𝑥 + 𝐶, ∫ 𝑒 𝑘𝑥 dx = + 𝐶.
𝑑𝑥 𝑘
6
Further Mathematics AS Unit 2: Further Statistics A

Questions

1. The lifetime of a light bulb is 𝑋 hours, where 𝑋 can be modelled by an exponential


distribution with parameter 𝜆 = 0.0125.
a) Find the mean and variance of the lifetime of a light bulb.
b) Find the probability that the lifetime of a bulb is:
(i) less than 100 hours;
(ii) between 50 hours and 150 hours.

2. The time, 𝑇 seconds, between the arrival of successive vehicles at a zebra crossing on
a road can be modelled by an exponential distribution with parameter 𝜆 = 0.025.
a) Write down the mean and the variance of 𝑇.
b) An elderly pedestrian takes 30 seconds to cross the road using this zebra
crossing. Calculate the probability that:
(i) no vehicle arrives whilst the pedestrian is crossing.
(ii) no vehicle arrives whilst the pedestrian makes two independent crossings.
c) A person starts crossing the road immediately after a vehicle has passed. How
long should this person take to cross the road to ensure the probability of a
vehicle arriving before they have crossed is less than 0.2?

3. The interval, 𝑋 metres, between consecutive minor faults on a roll of cloth has a
distribution whose probability density function is given by:
−𝜆𝑥
𝑓(𝑥) = { 𝜆𝑒 , 𝑥≥0
0 otherwise,
where 𝜆 is a positive constant.
a) It is given that 𝑃(𝑋 < 1) = 2𝑃(𝑋 > 2). Find the mean of the distribution.
b) Find the probability that the interval between two minor faults is greater than 3
metres.

4. The probability that the lifetime, 𝐻, of a certain type of electrical component is more

than ℎ hours is given by 𝑃(𝐻 > ℎ) = 𝑒 −1000 , ℎ > 0.
a) Calculate the probability that a randomly selected component has a lifetime of:
(i) more than 1500 hours;
(ii) between 1000 and 2000 hours;
(iii) precisely 1200 hours.
b) Calculate the probability that three components, chosen at random, all have
lifetimes of more than 1500 hours.
c) Derive the probability density function for 𝐻, and hence state the mean and
variance of the component lifetime.

5. The number of arrivals per hour at a toll booth has a Poisson distribution with mean
90. Let 𝑇 minutes denote the time between successive arrivals at the toll booth.
a) Show that 𝑃(𝑇 > 𝑡) = 𝑒 −1.5𝑡 .
b) Use your answer in a) to find the CDF and PDF for 𝑇. Hence state the distribution
of 𝑇.
c) Calculate the mean and standard deviation of 𝑇.
d) Determine the median time between successive arrivals at the toll booth.
e) Use your answers from c) and d) to comment on the distribution of time between
successive arrivals at the toll booth.

6. The number of flaws in a randomly chosen 100 metre length of ribbon is modelled by a
Poisson distribution with mean 1.6. The random variable 𝑋 metres is the distance
between two successive flaws.
7
Further Mathematics AS Unit 2: Further Statistics A

a) Show that the cumulative distribution function of 𝑋 is given by:


−0.016𝑥
𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥) = { 1 − 𝑒 , 𝑥≥0
0 otherwise.
b) Deduce that X has a negative exponential distribution, stating its mean.
c) Find the median and interquartile range for the distance between successive flaws.
d) Find the probability that there is a distance of at least 50 metres between two
successive flaws.

8
Further Mathematics AS Unit 2: Further Statistics A

Question 1 Worked Solution


The lifetime of a light bulb is 𝑋 hours, where 𝑋 can be modelled by an exponential
distribution with parameter 𝜆 = 0.0125.
a) Find the mean and variance of the lifetime of a light bulb.
b) Find the probability that the lifetime of a bulb is:
(i) less than 100 hours;
(ii) between 50 hours and 150 hours.
1 1 1 1
a) The mean is 𝜆 = 0.0125 = 80 and the variance is 𝜆2 = 0.01252 = 6400.

b) Firstly, we note that the CDF is given by:


−0.0125𝑥
𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥) = { 1 − 𝑒 , 𝑥≥0
0 otherwise.

(i) 𝑃(𝑋 < 100) = 𝑒 −1.25 = 0.713.

(ii) 𝑃(50 ≤ 𝑋 ≤ 150) = 𝑃(𝑋 ≤ 150) − 𝑃(𝑋 ≤ 50)


= (1 − 𝑒 −1.875 ) − (1 − 𝑒 −0.625 )
= 𝑒 −0.625 − 𝑒 −1.875
= 0.382.

Question 2 Worked Solution


The time, 𝑇 seconds, between the arrival of successive vehicles at a zebra crossing on a
road can be modelled by an exponential distribution with parameter 𝜆 = 0.025.
a) Write down the mean and the variance of 𝑇.
b) An elderly pedestrian takes 30 seconds to cross the road using this zebra crossing.
Calculate the probability that:
(i) no vehicle arrives whilst the pedestrian is crossing;
(ii) no vehicle arrives whilst the pedestrian makes two independent crossings.
c) A person starts crossing the road immediately after a vehicle has passed. How long
should this person take to cross the road to ensure the probability of a vehicle
arriving before they have crossed is less than 0.2?
1 1 1 1
a) The mean is = = 40, and the variance is = = 1600.
𝜆 0.025 𝜆2 0.0252

b) Firstly, we note that the CDF is given by:


−0.025𝑡
𝐹(𝑡) = 𝑃(𝑇 ≤ 𝑡) = { 1 − 𝑒 , 𝑡≥0
0 otherwise.

(i) The probability that no vehicle arrives is:


𝑃(𝑇 > 30) = 1 − 𝑃(𝑇 ≤ 30) = 1 − (1 − 𝑒 −0.75 ) = 𝑒 −0.75 = 0.4724.

(ii) From the previous part, we deduce the probability of no vehicles arriving during two
independent crossings is (𝑒 −0.75 )2 = 𝑒 −1.5 = 0.2231.

c) We wish to find the value of 𝑡 such that 𝑃(𝑇 ≤ 𝑡) = 0.2. Therefore:


1 − 𝑒 −0.025𝑡 = 0.2
𝑒 −0.025𝑡 = 0.8
−0.025𝑡 = ln 0.8
ln 0.8
𝑡=
−0.025
𝑡 = 8.93.

9
Further Mathematics AS Unit 2: Further Statistics A

Therefore, a person needs to cross in 8.93 seconds to ensure the probability of a vehicle
arriving before they have crossed is less than 0.2.

Question 3 Worked Solution


The interval, 𝑋 metres, between consecutive minor faults on a roll of cloth has a distribution
whose probability density function is given by:
−𝜆𝑥
𝑓(𝑥) = { 𝜆𝑒 , 𝑥≥0
0 otherwise
where 𝜆 is a positive constant.
a) It is given that 𝑃(𝑋 < 1) = 2𝑃(𝑋 > 2). Find the mean of the distribution.
b) Find the probability that the interval between two minor faults is greater than 3
metres.

a) The cumulative distribution function is:


−𝜆𝑥
𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥) = { 1 − 𝑒 , 𝑥≥0
0 otherwise.

We have 𝑃(𝑋 < 1) = 1 − 𝑒 −𝜆 , and 𝑃(𝑋 > 2) = 1 − 𝑃(𝑋 ≤ 2) = 1 − (1 − 𝑒 −2𝜆 ) = 𝑒 −2𝜆 .


Therefore, 1 − 𝑒 −𝜆 = 2𝑒 −2𝜆 . Multiplying by 𝑒 2𝜆 gives 𝑒 2𝜆 − 𝑒 𝜆 = 2, i.e. 𝑒 2𝜆 − 𝑒 𝜆 − 2 = 0.

Let 𝑦 = 𝑒 𝜆 . Then 𝑦 2 − 𝑦 − 2 = 0 and so (𝑦 − 2)(𝑦 + 1) = 0.

Hence, 𝑦 = 2 or 𝑦 = −1. Therefore, 𝑒 𝜆 = 2 or 𝑒 𝜆 = −1. Since 𝑒 𝜆 = −1 is impossible, we


1 1
must have 𝑒 𝜆 = 2, and so 𝜆 = ln 2. Therefore, the mean is 𝜆 = ln 2 = 1.4427.

b) Using the CDF from a):


1 1
𝑃(𝑋 > 3) = 1 − 𝑃(𝑋 ≤ 3) = 1 − 𝐹(3) = 1 − (1 − 𝑒 −3 ln 2 ) = 𝑒 −3 ln 2 = 𝑒 ln8 = .
8

Question 4 Worked Solution


The probability that the lifetime, 𝐻, of a certain type of electrical component is more than ℎ

hours is given by 𝑃(𝐻 > ℎ) = 𝑒 −1000 , ℎ > 0.
a) Calculate the probability that a randomly selected component has a lifetime of:
(i) more than 1500 hours;
(ii) between 1000 and 2000 hours;
(iii) precisely 1200 hours.
b) Calculate the probability that three components, chosen at random, all have lifetimes
of more than 1500 hours.
c) Derive the probability density function for 𝐻, and hence state the mean and variance
of the component lifetime.

a) Firstly, we note that 𝐹(ℎ) = 𝑃(𝐻 ≤ ℎ) = 1 − 𝑒 −1000 = 1 − 𝑒 −0.001ℎ .
1500
(i) 𝑃(𝐻 > 1500) = 𝑒 −1000 = 𝑒 −1.5 = 0.223.
(ii) 𝑃(1000 ≤ 𝐻 ≤ 2000) = 𝑃(𝐻 ≤ 2000) − 𝑃(𝐻 ≤ 1000) = (1 − 𝑒 −2 ) − (1 − 𝑒 −1 ) = 0.233.
(iii) 𝑃(𝐻 = 1200) = 0, since 𝐻 is a continuous random variable.

b) From a)(i), we know that the probability of a randomly selected component having a
lifetime of more than 1500 hours is 𝑒 −1.5 . Hence, the probability of three randomly selected
components all having lifetimes more than 1500 is (𝑒 −1.5 )3 = 𝑒 −4.5 = 0.0111.

c) We find 𝑓(ℎ) from 𝐹(ℎ) through differentiation:

10
Further Mathematics AS Unit 2: Further Statistics A

𝑓(ℎ) = 0 − (−0.001)𝑒 −0.001ℎ = 0.001𝑒 −0.001ℎ for ℎ > 0.


i.e.
−0.001ℎ
𝑓(ℎ) = { 0.001𝑒 , ℎ≥0
0 otherwise.

This means that 𝐻 follows an exponential distribution with parameter 𝜆 = 0.001. Hence, the
1 1
mean lifetime is 0.001 = 1000 and the variance is 0.0012 = 1,000,000.

Question 5 Worked Solution


The number of arrivals per hour at a toll booth has a Poisson distribution with mean 90. Let
𝑇 minutes denote the time between successive arrivals at the toll booth.
a) Show that 𝑃(𝑇 > 𝑡) = 𝑒 −1.5𝑡 .
b) Use your answer in a) to find the CDF and PDF for 𝑇. Hence state the distribution of 𝑇.
c) Calculate the mean and standard deviation of 𝑇.
d) Determine the median time between successive arrivals at the toll booth.
e) Use your answers from c) and d) to comment on the distribution of time between
successive arrivals at the toll booth.

a) The number of arrivals per hour can be modelled by Po(90). Therefore, the number of
90
arrivals per minute can be modelled by Po (60) = Po(1.5). We know that 𝑇 is the time
between two successive arrivals. If 𝑇 > 𝑡, this means there are no arrivals for 𝑡 minutes. The
number of the faults in the first 𝑡 minutes can be modelled using 𝑌~Po(1.5𝑡). Hence:
𝑒 −1.5𝑡 × (1.5𝑡)0
𝑃(𝑇 > 𝑡) = 𝑃(𝑌 = 0) = = 𝑒 −1.5𝑡 .
0!

b) The CDF can be obtained using 𝐹(𝑡) = 𝑃(𝑇 ≤ 𝑡) = 1 − 𝑃(𝑇 > 𝑡) = 1 − 𝑒 −1.5𝑡 . Therefore:
−1.5𝑡
𝐹(𝑡) = { 1 − 𝑒 , 𝑡≥0
0 otherwise.

The PDF can then be derived by differentiating: 𝑓(𝑡) = 𝐹 ′ (𝑡) = 0 − (−1.5𝑒 −1.5𝑡 ) = 1.5𝑒 −1.5𝑡 .
−1.5𝑡
Hence, 𝑓(𝑡) = { 1.5𝑒 , 𝑡≥0
0 otherwise.

This is the PDF for an exponential distribution with parameter 1.5. Therefore, 𝑇~Exp(1.5).
1 1 2 1 2
c) 𝐸(𝑇) = = = and SD(𝑇) = = . Therefore, the mean and standard deviation of 𝑇 is
𝜆 1.5 3 𝜆 3
2
3
× 60 = 40 seconds.

d) The median, 𝑚, satisfies 𝐹(𝑚) = 0.5. Hence:


1 − 𝑒 −1.5𝑚 = 0.5
𝑒 −1.5𝑚 = 0.5
−1.5𝑚 = ln 0.5
ln 0.5
𝑚=
−1.5
𝑚 = 0.462.

Therefore, the median time is 0.462 × 60 = 27.726 seconds.

e) Since the median time is less than the mean time, this suggests that the distribution of the
time between successive arrivals at the toll booth is skewed to the right (positively skewed).

11
Further Mathematics AS Unit 2: Further Statistics A

Question 6 Worked Solution


The number of flaws in a randomly chosen 100 metre length of ribbon is modelled by a
Poisson distribution with mean 1.6. The random variable 𝑋 metres is the distance between
two successive flaws.
a) Show that the cumulative distribution function of 𝑋 is given by:
−0.016𝑥
𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥) = { 1 − 𝑒 , 𝑥≥0
0 otherwise.
b) Deduce that X has a negative exponential distribution, stating its mean.
c) Find the median and interquartile range for the distance between successive flaws.
d) Find the probability that there is a distance of at least 50 metres between two
successive flaws.

a) The number of flaws in 100 metre of ribbon can be modelled by Po(1.6), hence the
number of flaws in 1 metre of ribbon can be modelled by Po(0.016). We know that 𝑋 is the
distance between two successive serious faults. If 𝑋 > 𝑥, this means there are no faults in
the first 𝑥 metres. The number of the faults in the first 𝑥 metres can be modelled using
𝑌~Po(0.016𝑥).
Hence:
𝑒 −0.016𝑥 × (0.016𝑥)0
𝑃(𝑋 > 𝑥) = 𝑃(𝑌 = 0) = = 𝑒 −0.016𝑥 .
0!

The CDF can be obtained using 𝐹(𝑥) = 𝑃(𝑋 ≤ 𝑥) = 1 − 𝑃(𝑋 > 𝑥) = 1 − 𝑒 −0.016𝑥 .

Therefore:
−0.016𝑥
𝐹(𝑥) = { 1 − 𝑒 , 𝑥≥0
0 otherwise.

b) The PDF can then be derived by differentiating, i.e.


𝑓(𝑥) = 𝐹 ′ (𝑥) = 0 − (−0.016𝑒 −0.016𝑥 ) = 0.016𝑒 −0.016𝑥 .

Hence:
−0.016𝑥
𝑓(𝑥) = { 0.016𝑒 , 𝑥≥0
0 otherwise.

This is the PDF for an exponential distribution with parameter 0.016, therefore
1 1
𝑋~Exp(0.016). The mean is 𝜆 = 0.016 = 62.5.

c) The median, 𝑚, satisfies 𝐹(𝑚) = 0.5. Hence:


1 − 𝑒 −0.016𝑚 = 0.5
𝑒 −0.016𝑚 = 0.5
−0.016𝑚 = ln 0.5
ln 0.5
𝑚=
−0.016
𝑚 = 43.32.

Therefore, the median is 43.32 metres.

The lower quartile, 𝑙, satisfies 𝐹(𝑙) = 0.25. Hence:


1 − 𝑒 −0.016𝑙 = 0.25
𝑒 −0.016𝑙 = 0.75
−0.016𝑙 = ln 0.75
ln 0.75
𝑙=
−0.016
𝑙 = 17.98.

12
Further Mathematics AS Unit 2: Further Statistics A

The upper quartile, 𝑢, satisfies 𝐹(𝑢) = 0.25. Hence:


1 − 𝑒 −0.016𝑢 = 0.75
𝑒 −0.016𝑢 = 0.25
−0.016𝑢 = ln 0.25
ln 0.25
𝑢=
−0.016
𝑢 = 86.64.

Therefore, the interquartile range is 𝑢 − 𝑙 = 86.64 − 17.98 = 68.66 metres.

d) We calculate:
𝑃(𝑋 > 50) = 1 − 𝑃(𝑋 ≤ 50)
= 1 − 𝐹(50)
= 1 − (1 − 𝑒 −0.016×50 )
= 𝑒 −0.8
= 0.449.

13

You might also like