CH 8 Probability Distributions
CH 8 Probability Distributions
Statistics
Mr M Dominguez
[email protected]
Chapter 8 Probability distributions
§ 8.1 Binomial distributions
A binomial expansion is when a two term expressions
is raised to a power.
a) Expand (a + b)0 ?1
b) Expand (a + b)1 1a ?+ 1b
c) Expand (a + b)2 1a2 + 2ab
? + b2
d) Expand (a + b)3 1a3 + 3a2b ? + 3ab2 + 1b3
e) Expand (a + b)4 1a4 + 4a3b + 6a
? b
2 2
+ 4ab3 + b4
2: 1 2 1 ?
Quickfire Pascal
What coefficients in your expansion do you use if the power is:
4: 1 4 6 ? 4 1
Quickfire Pascal
What coefficients in your expansion do you use if the power is:
3: 1 3 3 ?1
Quickfire Pascal
What coefficients in your expansion do you use if the power is:
5: 1 5 10 ? 10 5 1
Quickfire Pascal
What coefficients in your expansion do you use if the power is:
2: 1 2 1 ?
Quickfire Pascal
What coefficients in your expansion do you use if the power is:
4: 1 4 6 ? 4 1
Quickfire Pascal
What coefficients in your expansion do you use if the power is:
3: 1 3 3 ?1
Quickfire Pascal
What coefficients in your expansion do you use if the power is:
5: 1 5 10 ? 10 5 1
Quickfire Pascal
What coefficients in your expansion do you use if the power is:
4: 1 4 6 ? 4 1
Step 1: You could first put in the first term with decreasing powers.
Step 2: Put in your second term with increasing powers, starting from 0
(i.e. so that 2y doesn’t appear in the first term of the expansion, because the power is 0)
Step 3: Add the coefficients according to Pascal’s Triangle.
* Remember to use brackets to remind your self to raise each term to the correct power.
Expand (x + 2y)4 =
We could think of the coefficients of each term as the number of ways of ordering the letters
Eg in the expansion of the coefficient is 2 as we have
In the expansion of the coefficient can be found by writing down all the different ways of
getting
Hence the coefficient is 4. This can also be called 4 choose 1, as there are 4 options and we are
finding the position of one x
? ? ?
? ?
? ? ?
Binomial Coefficients – non calculator
To calculate Binomial Coefficients easily:
Because when we divide 8! by 6!, we cancel out
all the numbers between 1 and 6 in the product.
i.e. The bottom number of the binomial
coefficient (2) tells us how many consecutive
numbers we multiply together.
? ?
? ?
?
In general we can calculate any binomial expansion using the formula
𝑋 𝐵 ( 𝑛, 𝑝 ) Success probability of
my random variable:
e.g: The probability of
getting a one.
𝑛 𝑟 𝑛 −𝑟
𝑃 ( 𝑟 )= 𝐶 𝑟 × 𝑝 ( 1 −𝑝 )
❑
Quickfire Questions
Show the calculation required to find the indicated probability given the distribution.
?
What is ?
What is ? ?
2. I have a bag of 2 red and 8 white balls. represents the number of red
balls I chose after 5 selections (with replacement).
?
a) How is distributed?
?
The random variable A student suggests using a binomial distribution
1 4 to model the following situations. Give a
Find description of the random variable, state any
assumptions that must be made and give possible
values for and .
?
A sample of 20 bolts is checked for defects from a
a large batch. The production process should
The random variable produce 1% of defective bolts.
2 Find assuming bolts being defective are independent
from each other. ?
? Some traffic lights have three phases: stop 48% of
b
? the time, wait or get ready 4% of the time and go
48% of the time. Assuming that you only cross a
?
traffic light when it is in the go position, model
A balloon manufacturer claims that 95% the number of times that you have to wait or stop
3 of his balloons will not burst when blown on a journey passing through 6 sets of traffic
up. You have 20 balloons. lights.
assuming lights operate independently.
What is the probability that none of ?
them burst? When Stephanie plays tennis with Tim on average
one in eight of her serves is an ‘ace’. How many
c ‘aces’ does Stephanie serve in the next 30 serves
?
What is the probability exactly 2 burst?
against Tim?
assuming serves are independent and
? probability of an ace is constant.
?
How many trials: e.g:
How many times am I
going to roll my die
The random variable.
e.g: rolling a die and
getting a one.
𝑋 𝐵 ( 𝑛, 𝑝 ) Success probability of
my random variable:
e.g: The probability of
getting a one.
𝑛 𝑟 𝑛 −𝑟
𝑃 ( 𝑟 )= 𝐶 𝑟 × 𝑝 ( 1 −𝑝 )
❑
§ 8.2 Normal distributions
The Normal distribution is a suitable model to calculate probabilities if
• The data is continuous
• The distribution is symmetrical
Remember that if the data is symmetrical then mean = median = mode.
Conversely if the data is skewed the normal distribution is not suitable. (The
Poisson distribution can be used)
68%
95%
Sketching the Normal distribution
Key point:
The area
underneath the
graph is the
probability.
Hence the area
under every graph
must add to one.
b) Work out an estimate of the heights within which 95% of the applicants are likely
to be.
is , hence, ?
3. Students sat two examination papers. Paper 1 had a mean mark of 52 and a standard
deviation of 9.5. paper 2 had a mean mark of 60 and a standard deviation of 2.5.
Julia achieved a mark of 70 in one of the papers. Using a sketch of the distributions or
otherwise explain which of the papers this mark is more likely to have been achieved.
Then go to page 349 and 352 etc and answer all questions.
§ 8.3 Standardised scores
English Art
Student Test Test
Score Score The table shows the scores (out of 50) of 10
Abby 30 45 students for an English test and an art test.
Max 25 31
Sarah 45 24 Why is it unwise to simply compare the raw
Joe 15 23 scores for each student?
Claire 50 19
Lee 20 39
Jan 42 30
Matthew 17 42 The results for the English test and Art test are
Aimee 38 47 symmetrical. We can therefore modelled the
Ethan 41 28 two sets of data using the normal distribution.
When we are asked to compare two sets of data (most often exam data) we will
expected to compare standardised scores. It may of course be possible to compare
the sets of data with of standardising the scores.
𝑥−𝜇
Standardised score , 𝑧 =
𝜎
√
∑ 𝑥
∑𝑥
( )
∑𝑥
2 2
𝜇=
𝑛 𝜎= −
𝑛 𝑛
In order to be considered for a place on a mechanics course at a local college,
Wing and Mia took tests in English and Mathematics. Each test had a maximum
mark of 100.
The table shows some information about the tests.
Overall Standard
Wing Mia
Mean deviation
English mark 48 55 50 10
Mathematics mark 59 51 55 8
(c) Who do you think did best overall? Give a reason for your answer.
Wing, as their total standardised score in English and Mathematics was
higher than Mia. They both achieved 0.5 ? in one exam and in the other
Wing’s standardised score of -0.2 is better than Mia’s -0.5.
English Art
English Test Art Test
Student Standardised Standardised
Score (x) Score (y) Score Score
Abby 30 45 ?
-0.18 1.24
Max 25 31 -0.58 -0.18
Sarah 45 24 1.01 -0.89
Joe 15 23 -1.38 -1.0
Claire 50 19 1.41 -1.40
?
Lee 20 39 ?
-0.98 0.63
Jan 42 30 0.77 -0.29
Matthew 17 42 -1.22 0.94
Aimee 38 47 0.45 1.44
Ethan 41 28 0.69 -0.49
1) For each subject, calculate the mean and standard deviation of the test scores.
2) For each student, calculate their standardised scores for each subject. (2 d.p)
3) Decide which subject each student did better in.
4) Which student(s) did the best overall and why?
Given that:
English Art
μ 32.3 32.8
?
σ 11.9 9.3
English Art English Art
English
Student Test Test Squared
Art Squared Standardised Standardised
Score Score Score Score
Abby 30 45
Max 25 31
Sarah 45 24
Joe 15 23
Claire 50 19
Lee 20 39
Jan 42 30
Matthew 17 42
Aimee 38 47
Ethan 41 28
Student English Art Test English Art English Art
Test Score Squared Squared Standardised Standardised
Score Score Score
Abby 30 45 900 2025 -0.183 1.240
Max 25 31 625 961 -0.581 -0.183
Sarah 45 24 2025 576 1.011 -0.894
Joe 15 23 225 529 -1.377 -0.996
Claire 50 19 2500 361 1.409 -1.402
Lee 20 39 400 1521 -0.979 0.630
Jan 42 30 1764 900 0.772 -0.285
Matthew 17 42 289 1764 -1.218 0.935
Aimee 38 47 1444 2209 0.454 1.443
Ethan 41 28 1681 784 0.693 -0.488
323 328 11853 11630
1) For each subject, calculate the mean and standard deviation of the test scores.
2) For each student, calculate their standardised scores for each
subject.
3) Decide which subject each student did better in.
4) Which student(s) did the best overall and why?
§ 8.4 Quality assurance and control charts
Quality Assurance:
• Collecting a sample
• Comparing the sample with the required standards, (mean and range)
• Actions may then need to be taken if the limits are not met.
Why it is important that we look at the mean of the sample and the range?
95% of the samples should lie between the two warning limits. Or 1 in 20 will fall outside the
warning limit.
Assuming that the samples are normally distributed, this will be from the mean.
Upper warning limit
Lower warning limit
99.8% (almost all) of the samples should lie between the two action limits. Or 2 in 1000 will
fall outside the action limits.
Upper Action limit
Lower Action limit
If the sample is between
the warning limits the
You do not need to know how to calculate product is acceptable
warning and action limits for a control chart
showing the range. If the sample is between
the warning and action
limits another sample
should be taken.
If the sample is outside of
the action limits,
production is stopped
Sample 1 2 3 4 5 6 7 8
88.24 90.18 89.88 90.04 90.07 90.12 89.82 90.04
90.04 90.25 94.35 90.00 90.01 89.78 89.86 98.81
Mass (g)
89.46 89.66 93.52 89.59 90.06 87.08 90.28 88.92
90.43 89.50 92.43 89.48 89.93 90.10 93.80 90.03
Mean 89.54 89.90 94.3 89.78 90.02 89.27 90.94 91.95
?
Range 2.19 0.75 4.45 0.56 0.14 3.04 3.98 9.89
?
?