Normal Distribution: It Can Be Spread Out More On The Left or More On The Right
Normal Distribution: It Can Be Spread Out More On The Left or More On The Right
But there are many cases where the data tends to be around a central value
with no bias left or right, and it gets close to a "Normal Distribution" like this:
A Normal Distribution
heights of people
size of things produced by machines
errors in measurements
blood pressure
marks on a test
Quincunx
Assuming this data is normally distributed can you calculate the mean and
standard deviation?
It is good to know the standard deviation, because we can say that any value
is:
Standard Scores
The number of standard deviations from the mean is also called the
"Standard Score", "sigma" or "z-score". Get used to those words!
You can see on the bell curve that 1.85m is 3 standard deviations from the
mean of 1.4, so:
It is also possible to calculate how many standard deviations 1.85 is from the
mean
How many standard deviations is that? The standard deviation is 0.15m, so:
We can take any Normal Distribution and convert it to The Standard Normal
Distribution.
26, 33, 65, 28, 34, 55, 25, 44, 50, 36, 26, 37, 43, 62, 35, 38, 45, 32, 28, 34
To convert 26:
Standard Score
Original Value Calculation
(z-score)
26 (26-38.8) / 11.4 = -1.12
33 (33-38.8) / 11.4 = -0.51
65 (65-38.8) / 11.4 = +2.30
... ... ...
And here they are graphically:
20, 15, 26, 32, 18, 28, 35, 14, 26, 22, 17
Most students didn't even get 30 out of 60, and most will fail.
The test must have been really hard, so the Prof decides to Standardize all the
scores and only fail people 1 standard deviation below the mean.
The Mean is 23, and the Standard Deviation is 6.6, and these are the
Standard Scores:
-0.45, -1.21 , 0.45, 1.36, -0.76, 0.76, 1.82, -1.36 , 0.45, -0.15, -0.91
Now only 2 students will fail (the ones who scored 15 and 14 on the test)
Much fairer!
It also makes life easier because we only need one table (the Standard Normal
Distribution Table ), rather than doing calculations individually for each value of
mean and standard deviation.
In More Detail
Here is the Standard Normal Distribution with percentages for every half of a
standard deviation, and cumulative percentages:
Example: Your score in a recent test was 0.5 standard deviations above the
average, how many people scored lower than you did?
In theory 69.1% scored less than you did (but with real data the percentage
may be different)
A Practical Example: Your company
packages sugar in 1 kg bags.
When you weigh a sample of bags you get these results:
Some values are less than 1000g ... can you fix that?
It is a random thing, so we can't stop bags having less than 1000g, but we can
try to reduce it a lot.
Let's adjust the machine so that 1000g is:
at −3 standard deviations:
From the big bell curve above we see that 0.1% are less. But maybe that is
too small.
increase the amount of sugar in each bag (which changes the mean), or
make it more accurate (which reduces the standard deviation)
10g / 2.5 = 4g