Sample Mean Distribution

maths a level stats for ocr mei, this helps a lot

Uploaded by

Ryan Chung

MEI A level Maths Hypothesis testing

Section 1: Using the Normal distribution

Notes and Examples

These notes contain subsections on:
• The distribution of sample means
• Standardising the distribution of the sample means
• Hypothesis tests
• Using estimated standard deviation
• The left-hand tail
• Two-tailed tests

The distribution of sample means

Suppose you use a random number generator to choose three numbers at random
from the integers 1 – 100, and find the average of the three numbers you have
chosen. There are a very large number of possible results you could obtain for the
mean of your sample of three, ranging from 1 (if the numbers you obtain are all 1’s)
to 100 (if the numbers you obtain are all 100’s). Clearly, it is quite unlikely that the
mean would be 1 or 100 – it is much more likely to be fairly close to 50.

You could work out the probability distribution for the sample means, by calculating
the probability of each possible value for the mean. What sort of shape would this
probability distribution have, and what would be the mean and standard deviation of
the distribution?

You can investigate the distribution of sample means using a simple example:
throwing an ordinary, fair die. This means that you are dealing with the population
{1, 2, 3, 4, 5, 6}. Throwing one die is equivalent to taking a sample of size 1 from the
population; throwing two dice is equivalent to taking a sample of size 2 from the
population, and so on.

Samples of size 1
If you throw one die, then there are six possible samples you could obtain:

{1} {2} {3} {4} {5} {6}

Each of these samples is equally likely to occur. The sample mean in each case is,
of course, just the value of the score on the die.

So the probability distribution of the sample means for a sample of size 1 is:

x          1    2    3    4    5    6
P(X = x)   1/6  1/6  1/6  1/6  1/6  1/6

1 of 10 07/02/19 © MEI
integralmaths.org
MEI A level Hypothesis testing 1 Notes and examples
[Figure: probability distribution of the sample mean for samples of size 1 (p against x)]

It can be shown that $E(X) = 3.5$ and $\mathrm{Var}(X) = \frac{35}{12}$.

Samples of size 2
If you throw two dice, then there are 36 possible samples you could obtain (some of
which are the same, e.g. {1, 2} and {2, 1}).

The table below shows the possible values of the sample mean.

1 2 3 4 5 6
1 1 1.5 2 2.5 3 3.5
2 1.5 2 2.5 3 3.5 4
3 2 2.5 3 3.5 4 4.5
4 2.5 3 3.5 4 4.5 5
5 3 3.5 4 4.5 5 5.5
6 3.5 4 4.5 5 5.5 6

So the probability distribution of the sample means for a sample of size 2 is:

y          1     1.5   2     2.5   3     3.5   4     4.5   5     5.5   6
P(Y = y)   1/36  2/36  3/36  4/36  5/36  6/36  5/36  4/36  3/36  2/36  1/36

[Figure: probability distribution of the sample mean for samples of size 2 (p against x)]

It can be shown that $E(Y) = 3.5$ and $\mathrm{Var}(Y) = \frac{35}{24}$.

Samples of size 3
If you throw three dice, then there are 216 possible samples you could obtain (again,
some are the same, such as {1, 1, 2}, {1, 2, 1} and {2, 1, 1}).
If a complete list is made of all the possible samples, and the sample mean
calculated for each, you can find the probability distribution of the sample mean in
the same way as for samples of size 2.

The probability distribution of the sample means for a sample of size 3 is:

z          1       1⅓      1⅔      2       2⅓      2⅔      3       3⅓
P(Z = z)   1/216   3/216   6/216   10/216  15/216  21/216  25/216  27/216

z          3⅔      4       4⅓      4⅔      5       5⅓      5⅔      6
P(Z = z)   27/216  25/216  21/216  15/216  10/216  6/216   3/216   1/216


[Figure: probability distribution of the sample mean for samples of size 3 (p against x)]

It can be shown that $E(Z) = 3.5$ and $\mathrm{Var}(Z) = \frac{35}{36}$.

Comparing the distributions for samples of size 1, 2 and 3, you can see that whereas
a sample of size 1 has a uniform distribution, for samples of size 2 and 3 the
distribution has a peak in the centre corresponding to the mean value of 3.5.

In addition, the distribution for sample size 2 is triangular, whereas the one for
sample size 3 is more “bell-shaped”, suggesting that the standard deviation is
smaller. In fact, this trend continues with larger sample sizes.

We have used the theoretical distribution of throwing a die to model the outcomes of
sampling from a very simple population (the numbers 1, 2, 3, 4, 5 and 6). The mean
(3.5) and standard deviation ($\sqrt{35/12}$) are the same as the population mean, μ, and
standard deviation, σ (the population standard deviation is calculated using divisor
n, since we are dealing with a complete population). All three probability distributions
have mean 3.5, which is the same as the population mean μ.

The standard deviation of the distribution for sample size 2 is $\sqrt{35/24}$, which can be
written as $\frac{\sigma}{\sqrt{2}}$. The standard deviation of the distribution for sample size 3 is
$\sqrt{35/36}$, which can be written as $\frac{\sigma}{\sqrt{3}}$.

Generalising: given a population with a mean of μ and a standard deviation of σ, the
sampling distribution of the mean has a mean of μ and a standard deviation of
$\frac{\sigma}{\sqrt{n}}$, where n is the sample size.

Notice that the standard deviation of the distribution of sample means (sometimes
called the standard error of the mean) is smaller than the population standard
deviation and decreases as the sample size increases.
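
The dice results above can be checked by brute force. The following is a minimal sketch, not part of the original notes: it lists every possible sample of n throws, builds the exact distribution of the sample mean, and prints its mean and variance. The function names `sample_mean_distribution` and `mean_and_variance` are illustrative choices.

```python
from fractions import Fraction
from itertools import product


def sample_mean_distribution(n):
    """Exact distribution of the mean of n throws of a fair die."""
    faces = range(1, 7)
    dist = {}
    # Every ordered sample is equally likely, with probability 1/6^n.
    for throws in product(faces, repeat=n):
        mean = Fraction(sum(throws), n)
        dist[mean] = dist.get(mean, 0) + Fraction(1, 6 ** n)
    return dist


def mean_and_variance(dist):
    """Mean and variance of a discrete distribution given as {value: probability}."""
    mu = sum(value * p for value, p in dist.items())
    var = sum((value - mu) ** 2 * p for value, p in dist.items())
    return mu, var


for n in (1, 2, 3):
    mu, var = mean_and_variance(sample_mean_distribution(n))
    print(n, mu, var)
# 1 7/2 35/12
# 2 7/2 35/24
# 3 7/2 35/36
```

In each case the variance is the population variance 35/12 divided by n, so the standard deviation is σ/√n.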

As the distribution of the sample means is so important, it is often referred to simply as
the sampling distribution. However, other sampling distributions exist: for example, the
sampling distribution of the median.

In this topic we assume that the underlying population is Normally distributed.
Given a population X with a mean of μ and a standard deviation of σ,
i.e. $X \sim N(\mu, \sigma^2)$, if a sample of size n is taken, the distribution of the sample
means is given by $\bar{X} \sim N\left(\mu, \frac{\sigma^2}{n}\right)$.

You can therefore use the skills learnt when working with the Normal distribution to
calculate probabilities with a sample mean.

Take care not to confuse the theoretical distribution with a practical experiment.
If you are conducting a biology experiment you will normally be collecting just one
sample of data. When analysing the results you are applying the theory of the
theoretical distribution to that single sample.

Standardising the distribution of the sample means

As you saw in the work on the Normal distribution, any Normal distribution
$X \sim N(\mu, \sigma^2)$ can be transformed to the standard Normal distribution $Z \sim N(0, 1)$.
The variable X has mean μ and standard deviation σ,
so x, a particular value of X, is transformed into z by the formula:

$z = \frac{x - \mu}{\sigma}$

So for the distribution of the sample means, $\bar{X}$, you can standardise by using

$z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$.
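
As a small illustration of standardising a sample mean, the sketch below divides the distance from the mean by σ/√n. The function name `z_for_sample_mean` and the numbers in the usage line are hypothetical, chosen only so the arithmetic is easy to follow.

```python
from math import sqrt


def z_for_sample_mean(x_bar, mu, sigma, n):
    """Standardise a sample mean using the standard error sigma / sqrt(n)."""
    standard_error = sigma / sqrt(n)
    return (x_bar - mu) / standard_error


# Illustrative values (not from the notes): standard error is 8 / 4 = 2.
z = z_for_sample_mean(x_bar=52.0, mu=50.0, sigma=8.0, n=16)
print(z)  # 1.0
```

Note that the divisor is the standard error, not σ itself; using σ would understate how unusual the sample mean is.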

Hypothesis tests
You have already met hypothesis tests involving the binomial distribution
B(n, p), in which you test whether the population parameter p takes a particular
hypothesised value.

You will now look at hypothesis tests using the Normal distribution $N(\mu, \sigma^2)$, in which
you test whether the population mean takes a particular value.

In the test, you are assuming that the value of the population mean is the one given
in the null hypothesis, and then considering the value of the sample mean. If your
sample mean is too far away from the assumed population mean, then you conclude
that as it is very unlikely that a randomly chosen sample would have such a high (or
low) sample mean, the population mean does not in fact have the value that you
assumed it to have. This means that you are rejecting the null hypothesis.

There are two main approaches that can be used in the hypothesis test. They are
equivalent but you should know both.

Suppose that you are using the hypotheses

H0: μ = m
H1: μ > m

where μ is the true population mean, and you are testing at the 5% level.

Method 1: Using a p-value

You need to look at the probability that the mean of a sample of size n, taken from a
distribution with mean m and standard deviation σ, has a value at least as extreme (in
this case, at least as large) as $\bar{x}$, the mean of the given sample. If this probability is
less than the significance level you will reject H0. In such a case you are saying that it
is so unlikely that a sample from a distribution with mean m would give this value for
$\bar{x}$ that you conclude that in fact the distribution does not have mean m, but a larger
mean.

The distribution of the sample means is $N\left(m, \frac{\sigma^2}{n}\right)$.
You use your calculator to find $P(\bar{X} \geq \bar{x})$ (the p-value).
You reject H0 if $P(\bar{X} \geq \bar{x}) < 0.05$.

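The diagram shows the Normal distribution of the sample means, with mean m and
standard deviation $\frac{\sigma}{\sqrt{n}}$. If the area $P(\bar{X} \geq \bar{x})$ shown is less than the
significance level, we reject H0.

[Diagram: Normal curve with m and $\bar{x}$ marked on the horizontal axis; the area to the right of $\bar{x}$ represents $P(\bar{X} \geq \bar{x})$]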

Example 1
Test results are normally distributed with a mean of 65 and a standard deviation of 10. After
the introduction of a dynamic new teacher the results for a group of 8 students had a mean of
72. Is there evidence that the results have significantly improved at a 5% level of
significance?

Solution
H0 : μ = 65
H1 : μ > 65
where μ is the population mean test score.
(Remember to define μ as the population mean – there is often a mark awarded for this.)

You want to see if the results could have come from a distribution where the population
mean has remained unchanged.

Let X be the distribution of test scores.

$X \sim N(65, 10^2)$
$\bar{X} \sim N\left(65, \frac{10^2}{8}\right)$

$P(\bar{X} > 72) = 0.0239$

This is the p-value: the probability that (if the mean is 65) the sample mean is more than 72.

Since 0.0239 < 0.05 (the required significance level of 5%) the null hypothesis is rejected.
There is evidence to suggest that the mean score has increased, i.e. the teacher has had some
effect.
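
The p-value in Example 1 can be reproduced without a calculator. This is a minimal sketch using `NormalDist` from Python's standard `statistics` module; the variable names are illustrative, and the numbers are those given in the example.

```python
from math import sqrt
from statistics import NormalDist

mu0, sigma, n, x_bar = 65, 10, 8, 72

# Distribution of the sample mean under H0: N(65, 10^2 / 8).
sampling_dist = NormalDist(mu=mu0, sigma=sigma / sqrt(n))

# Upper-tail probability of a sample mean at least as large as 72.
p_value = 1 - sampling_dist.cdf(x_bar)
print(round(p_value, 4))  # 0.0239

reject_h0 = p_value < 0.05
print(reject_h0)  # True
```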


Method 2: Finding a critical region

In this method, you find the range of values of the sample mean for which the null
hypothesis would be rejected. This is the critical region. You can then simply look to
see if the sample mean lies in the critical region.

You can use your calculator to find the critical value (the boundary of the critical
region). For an alternative hypothesis of the form H1: μ > m, you are looking at the
right-hand tail, so for a 5% significance level you need the inverse normal value for
0.95 for $N\left(m, \frac{\sigma^2}{n}\right)$.

Example 2
Test results are normally distributed with a mean of 65 and a standard deviation of 10. After
the introduction of a dynamic new teacher the results for a group of 8 students had a mean of
72. Is there evidence that the results have significantly improved at a 5% level of
significance?

Solution
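H0 : μ = 65
H1 : μ > 65
where μ is the population mean test score.

Let X be the distribution of test scores.

$X \sim N(65, 10^2)$
$\bar{X} \sim N\left(65, \frac{10^2}{8}\right)$

Using a calculator, the inverse normal of 0.95 for this distribution is 70.8.
The critical region is $\bar{X} \geq 70.8$.

[Diagram: Normal curve with 65 and 70.8 marked; the 5% right-hand tail beyond 70.8 is the critical region]

The sample mean $\bar{x} = 72$ lies in the critical region, so reject H0.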

There is evidence to suggest that the mean score has increased, i.e. the teacher has had some
effect.
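
The critical value in Example 2 can be found the same way a calculator's inverse normal function finds it. The sketch below assumes Python's standard `statistics.NormalDist`, whose `inv_cdf` method plays the role of the inverse normal; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

# Distribution of the sample mean under H0: N(65, 10^2 / 8).
sampling_dist = NormalDist(mu=65, sigma=10 / sqrt(8))

# Value with 95% of the distribution below it, i.e. 5% in the right-hand tail.
critical_value = sampling_dist.inv_cdf(0.95)
print(round(critical_value, 1))  # 70.8

# The observed sample mean of 72 is beyond the critical value.
in_critical_region = 72 >= critical_value
print(in_critical_region)  # True
```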

Notice from the example above that the conclusion should always be given in terms
of the problem. First state whether H0 is to be accepted or rejected, then make a
statement beginning “there is evidence to suggest that …” or “there is not sufficient
evidence to suggest that …”. You should NOT write “this proves that ….” or “so the
claim is right”. You are not proving anything, only considering evidence.

Using estimated standard deviation

The hypothesis test described above requires the value of the standard deviation of
the parent population.

In reality the standard deviation of the parent population will usually not be known.
So in this case the standard deviation will have to be estimated from the sample
data.

In order for us to proceed with the same style of analysis we require the sample size
to be sufficiently large. It is usual to require the sample size n to be 30 or above.

Given a Normal population X with a mean of μ and unknown standard deviation, and a
sufficiently large sample of size n, the sampling distribution of the mean is approximately:

$\bar{X} \sim N\left(\mu, \frac{s^2}{n}\right)$

where $s^2$ is the estimated variance from the sample data.

This is illustrated in the next example.

Example 3
The time taken for a bus to go from Oundle to Thrapston is normally distributed with a mean
time of 18 minutes. A new roundabout is introduced, which it is hoped will speed up the
journey.

A large number of observations are taken, following complaints from students that the
journey is now taking longer than 18 minutes.
From the 50 observations, the mean was found to be 19.1 minutes, with a sample standard
deviation of 5 minutes.
Investigate the students’ complaint: state suitable null and alternative hypotheses for the
test and carry out the test at the 5% level of significance, stating your conclusion carefully.

Solution
H0 : μ = 18
H1 : μ > 18
where μ is the population mean journey time.

Let X be the distribution of bus times.

We do not know the population variance, but as the sample size is large (n = 50) we can
estimate the distribution of the sample mean to be:

$\bar{X} \sim N\left(18, \frac{5^2}{50}\right)$

Method 1: Using a p-value

$P(\bar{X} > 19.1) = 0.0599$
Since 0.0599 > 0.05 (the required significance level of 5%) the null hypothesis is accepted.
There is not sufficient evidence to suggest that the journey time has increased.

Method 2: Using critical regions

The inverse normal of 0.95 for $N\left(18, \frac{5^2}{50}\right)$ is 19.16.

So the critical region is $\bar{X} \geq 19.16$.

Since the sample mean of 19.1 is not in the critical region, the null hypothesis is accepted.
There is not sufficient evidence to suggest that the journey time has increased.
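
Both methods in Example 3 can be wrapped in one helper. This is a sketch under the example's assumption that the sample standard deviation s can stand in for σ when n is large; the function name `upper_tail_z_test` is hypothetical and `NormalDist` comes from Python's standard `statistics` module.

```python
from math import sqrt
from statistics import NormalDist


def upper_tail_z_test(x_bar, mu0, s, n, alpha=0.05):
    """Right-hand-tail test of H0: mu = mu0, using s as an estimate of sigma."""
    dist = NormalDist(mu=mu0, sigma=s / sqrt(n))
    p_value = 1 - dist.cdf(x_bar)             # Method 1: p-value
    critical_value = dist.inv_cdf(1 - alpha)  # Method 2: critical value
    return p_value, critical_value, p_value < alpha


p_value, critical_value, reject_h0 = upper_tail_z_test(19.1, 18, 5, 50)
print(round(p_value, 4), round(critical_value, 2), reject_h0)
# 0.0599 19.16 False
```

The two methods always agree: the p-value is below the significance level exactly when the sample mean lies beyond the critical value.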

The left-hand tail

In the examples above, you were looking at the right-hand tail of the distribution,
since the alternative hypothesis suggested that the mean might have increased. If
the alternative hypothesis suggests a possible decrease in the mean, then you will
be looking at the left-hand tail of the distribution. This means that the critical region
will be on the left-hand side of the distribution and so at a significance level of 5%
you need to use the inverse normal of 0.05. If using the p-value, you will need to find
$P(\bar{X} \leq \bar{x})$.

Example 4
The supplier of LITE light bulbs claims that the mean life of a LITE light bulb is 130 hours.
Responding to customer complaints that the light bulbs did not last as long as expected, a
trading standards organisation tested 400 bulbs and found the mean to be 128.5 hours and
the sample standard deviation to be 13 hours.
Is there evidence at the 2% level that the mean is lower than 130 hours?

Solution
H0 : μ = 130
H1 : μ < 130
where μ is the population mean lifetime.

Let X be the distribution of times of LITE light bulbs.

$X \sim N(130, 13^2)$
$\bar{X} \sim N\left(130, \frac{13^2}{400}\right)$

Method 1: Using the p-value

$P(\bar{X} < 128.5) = 0.0105$
Since 0.0105 < 0.02 (the required significance level of 2%) the null hypothesis is rejected.
There is evidence to suggest that the lifespan of a LITE light bulb is less than 130 hours.

Method 2: Using critical regions

The inverse normal of 0.02 for $N\left(130, \frac{13^2}{400}\right)$ is 128.67.
The critical region is $\bar{X} \leq 128.67$.
Since the sample mean of 128.5 is in the critical region, the null hypothesis is rejected. There
is evidence to suggest that the lifespan of a LITE light bulb is less than 130 hours.
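
For a left-hand tail the only changes are that the p-value is the lower-tail probability and the critical value comes from the inverse normal of the significance level itself. The sketch below uses the figures from the solution to Example 4 (a sample size of 400, as in the distribution of the sample mean); the function name `lower_tail_z_test` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def lower_tail_z_test(x_bar, mu0, s, n, alpha):
    """Left-hand-tail test of H0: mu = mu0 against H1: mu < mu0."""
    dist = NormalDist(mu=mu0, sigma=s / sqrt(n))
    p_value = dist.cdf(x_bar)             # P(sample mean <= x_bar)
    critical_value = dist.inv_cdf(alpha)  # boundary of the left-hand tail
    return p_value, critical_value, x_bar <= critical_value


p_value, critical_value, reject_h0 = lower_tail_z_test(128.5, 130, 13, 400, alpha=0.02)
print(round(p_value, 4), round(critical_value, 2), reject_h0)
# 0.0105 128.67 True
```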

Two-tailed tests

In the examples so far, the alternative hypothesis has been of the form
μ > k (in which case you are looking at the right-hand tail) or μ < k (in which case you
are looking at the left-hand tail). These are all one-tailed tests.

However, sometimes you will need to look at situations where the alternative
hypothesis is of the form μ ≠ k (in which you are testing whether the mean is as
stated or not, without specifying in which direction it is likely to be wrong). A test like
this is a two-tailed test, as you are looking at both tails of the distribution.

In a two-tailed test, there are two parts to the critical region. If you are asked to give
the critical region for a test, you must give both parts. However, if you are just asked
to carry out the hypothesis test, you need only look at the relevant tail, depending on
whether the sample mean is higher or lower than the value given in the null
hypothesis. At the 5% significance level, you find the lower tail critical region using
the inverse normal of 0.025, and the upper tail critical region using the inverse
normal of 0.975, so that the two tails correspond to a total probability of 5%.

Similarly, if you are using p-values, you compare with half the significance level,
since you are looking at just the relevant tail.

Example 5
The lengths of the leaves of a certain species of rare plant are Normally distributed with mean
8.6 cm and standard deviation 1.2 cm. A botanist finds a clump of plants and wants to find
out whether they are of the rare species. She collects and measures 50 leaves and finds that
the total of their lengths is 442 cm. Carry out a test at the 5% level. What should the botanist
conclude?

Solution
This is a two-tailed test, as the alternative hypothesis is that the mean is not 8.6, rather than
being specifically more or less than 8.6.

H0: μ = 8.6
H1: μ ≠ 8.6
where μ is the population mean leaf length.
(In this test, we are looking for evidence that the plants are not of the rare species.)

$\bar{x} = \frac{442}{50} = 8.84$

Let X be the distribution of the lengths of the leaves.
$\bar{X} \sim N\left(8.6, \frac{1.2^2}{50}\right)$

[Diagram: Normal curve with a 2.5% critical region in each tail]

As the sample mean is greater than 8.6, we are looking at the right-hand tail.

Method 1: Using a p-value

$P(\bar{X} > 8.84) = 0.0786$

Since 0.0786 > 0.025 (the required significance level of 2.5% in each tail) the null hypothesis
is accepted. There is not sufficient evidence to suggest that the plants are not of the rare
species.
(This is not the same as evidence that they are of the rare species.)

Method 2: Using critical regions
The critical value for the upper tail is found using the inverse normal of 0.975.
For $N\left(8.6, \frac{1.2^2}{50}\right)$ this is 8.93.
The upper part of the critical region is $\bar{X} \geq 8.93$.
Since the sample mean of 8.84 is not in the critical region, the null hypothesis is accepted.
There is not sufficient evidence to suggest that the plants are not of the rare species.
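
A two-tailed test can be sketched in the same style, splitting the significance level equally between the two tails. The function name `two_tailed_z_test` is hypothetical; the data are those of Example 5, and `NormalDist` is from Python's standard `statistics` module.

```python
from math import sqrt
from statistics import NormalDist


def two_tailed_z_test(x_bar, mu0, sigma, n, alpha=0.05):
    """Two-tailed test of H0: mu = mu0, with alpha / 2 in each tail."""
    dist = NormalDist(mu=mu0, sigma=sigma / sqrt(n))
    lower = dist.inv_cdf(alpha / 2)      # e.g. inverse normal of 0.025
    upper = dist.inv_cdf(1 - alpha / 2)  # e.g. inverse normal of 0.975
    # Probability in the tail on the same side as the sample mean.
    tail = dist.cdf(x_bar) if x_bar < mu0 else 1 - dist.cdf(x_bar)
    return lower, upper, tail, tail < alpha / 2


x_bar = 442 / 50  # total leaf length divided by number of leaves
lower, upper, tail, reject_h0 = two_tailed_z_test(x_bar, 8.6, 1.2, 50)
print(round(lower, 2), round(upper, 2), round(tail, 3), reject_h0)
# 8.27 8.93 0.079 False
```

The full critical region has two parts, a sample mean of at most 8.27 or at least 8.93, and the observed 8.84 lies in neither.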
