Statistical Errors
Statistical Errors
The most recent Advanced Placement Statistics Outline of Topics includes the concepts of type
I and type II errors, and power. The purpose of this paper is to provide simple examples of
these topics.
Assume that two samples of people have the indicated ethnic distributions. The sample sizes are
25 and 20, respectively.
3 1 15 6
SAMPLE #1
10 4 3 3
SAMPLE #2
Sometimes a visual display of data is helpful. Here are dot plots for each sample.
SAMPLE #1
African-
***
American
Native-
*
American
**************
Caucasian
*
Oriental ******
SAMPLE #2
African-
**********
American
Native-
****
American
Caucasian ***
Oriental ***
Here is our challenge: One of the two samples is randomly chosen, and then one individual is
randomly picked from the chosen sample. Based on our observation of the individual, we must
make a conjecture as to which sample the chosen individual belongs.
Based on our observation of the randomly chosen individual, we must decide to either accept or
reject Ho. A rejection of Ho is, of course, equivalent to asserting that the individual came from
SAMPLE #2.
There are two types of errors that can be made during this process.
Here are four possible tests relating to Ho. A test consists of randomly choosing an individual
and then accepting or rejecting Ho based on an observation of the individual. As defined, each
test can produce a correct conclusion or an incorrect conclusion.
Let's examine Test #1. A Type I error can only occur when Ho is true. Hence, the probability of
a Type I error with Test #1 is 10/25 = 40%. A Type II error can only occur when Ho is false. In
this situation, if Ho is false, then the selected individual came from Sample #2. The probability
of a Type II error is 3/20 = 15%. Here is a probability summary for Test #1.
Sample is #1 Sample is #2
The power of a test is the probability that an incorrect null hypothesis is rejected. In this
situation, the power is 85%, which is calculated by the formula
1 - probability(Type II error).
In this situation, if the randomly chosen individual is from SAMPLE #2, one can expect this test
to correctly conclude that the individual is not from SAMPLE #1 in 85 out of every 100 trials.
Here are corresponding probabilities for all of the indicated tests. (You can easily check these
out!)
4% 80% 20%
Test #3
There are, of course, other tests that could be used. Of the four tests examined, Test #3 produces
the smallest Type I error, but yields a whopping 80% Type II error. Strategy #1 has the smallest
Type II error, but also the largest Type I error.
As suggested by the examples above, decreasing the chance of one type of error frequently
increases the chance for the other error type. In real-life situations, one can decrease the
probability of both error types by collecting more data or having more information available.
However, one must frequently decide which error type should be minimized. Here are two
simple examples:
Example #1:
In the legal world, a null hypothesis might be "This person is innocent." A Type I error would
be judging the person guilty when he is innocent. A Type II error would involve declaring the
person innocent when he is guilty. If one accepts the thought that it is better to release a guilty
person than to convict an innocent one, then it would be important to minimize the chances of a
Type I error.
Example #2:
In the world of medicine, a null hypothesis might be "This drug will cure an illness." A Type I
error would be concluding that the drug does not work when it actually does. A Type II error
would conclude that the drug does work when it actually doesn't. One could argue that a Type II
error should be minimized here if one agrees that spending time and money on a useless drug
would replace what might be some other effective treatment.
Here are three important facts that can help minimize the confusion that sometimes results when
working with these error types: