Week-5
Week-5
Modar Shbat
Division of Engineering
[email protected]
2
Variables were defined as characteristics that change or vary over time and/or for
different individuals or objects under consideration. Quantitative variables generate
numerical data, whereas qualitative variables generate categorical data. However, even
qualitative variables can generate numerical data if the categories are numerically coded
to form a scale. For example, if you toss a single coin, the qualitative outcome could be
recorded as “0” if a head and “1” if a tail.
Random Variables
A numerically valued variable x will vary or change depending on the particular outcome
of the experiment being measured. For example, suppose you toss a die and measure
x, the number observed on the upper face. The variable x can take on any of six
values—1, 2, 3, 4, 5, 6—depending on the random outcome of the experiment. For this
reason, we refer to the variable x as a random variable.
Definition: A variable x is a random variable if the value that it assumes,
corresponding to the outcome of an experiment, is a chance or random event.
We also learned how to use the mean and standard deviation to measure the center and
variability of this data set.
In this section, we defined probability as the limiting value of the relative frequency as
the experiment is repeated over and over again. Now we define the probability
distribution for a random variable x as the relative frequency distribution constructed for
the entire population of measurements.
The values of x represent mutually exclusive numerical events. Summing p(x) over all
values of x is equivalent to adding the probabilities of all simple events and therefore
equals 1.
Recall that the expected value of x is the average of the theoretical population that
would result if the lottery were repeated an infinitely large number of times. If this were
done, your average or expected gain would be a loss of $14.
Therefore, if the insurance company charged a yearly premium of $200, the average
gain calculated for a large number of similar policies would equal zero. The actual
premium would equal $200 plus administrative costs and profit.
Note: The method for calculating the expected value of x for a continuous random
variable is similar to what we have done, but in practice it involves the use of calculus.
Nevertheless, the basic results concerning expectations are the same for continuous
and discrete random variables.
Regardless of whether x is continuous or discrete:
Before, we tossed two fair coins and constructed the probability distribution for x, the
number of heads—a binomial experiment with n=2 and p=.5. The general binomial
probability distribution is constructed in the same way.
Prob. & Stat. 18
BINOMIAL PROBABILITY DISTRIBUTION
Fortunately, the probabilities p(x) follow a general pattern. This allows us to use a single
formula to find p(x) for any given value of x.
However, many other sequences also result in x=2 successes. The binomial formula
uses to count the number of sequences and gives the exact probability when you
use the binomial formula with k=2:
We could repeat the procedure in this example for each value of x—0, 1, 2, . . . ,10—
and find all the values of p(x) necessary to construct a probability histogram for x.
The following figures show the results for different values of p.
Although we could calculate P(x=1), P(x=2), P(x=3) and P(x=4) to find this probability,
using the complement of the event makes your job easier; that is,
In each example, x represents the number of events that occur in a period of time or
space during which an average of m such events can be expected to occur.
The only assumptions needed when one uses the Poisson distribution to model
experiments such as these are that the counts or events occur randomly and
independently of one another.
2. During a 2-week period, the average number of accidents on this section of highway
is 2(2)=4. The probability of at most three accidents during a week period is:
26
THE HYPERGEOMETRIC PROBABILITY DISTRIBUTION
Example:
A case of wine has 12 bottles, 3 of which contain spoiled wine. A sample of 4 bottles is
randomly selected from the case.
1. Find the probability distribution for x, the number of bottles of spoiled wine in the
sample.
2. What are the mean and variance of x?
Solution:
27
THE HYPERGEOMETRIC PROBABILITY DISTRIBUTION
Example (Cont.):
28
THE HYPERGEOMETRIC PROBABILITY DISTRIBUTION
Example:
A particular industrial product is shipped in lots of 20. Testing to determine whether an
item is defective is costly; hence, the manufacturer samples production rather than using
a 100% inspection plan. A sampling plan constructed to minimize the number of
defectives shipped to customers calls for sampling five items from each lot and rejecting
the lot if more than one defective is observed. (If the lot is rejected, each item in the lot is
then tested.) If a lot contains four defectives, what is the probability that it will be
accepted?
Solution:
Let x be the number of defectives in the sample. Then N=20, M=4, (N-M)=16, and n=5.
The lot will be rejected if x=2, 3, or 4. Then:
29