Part 2
Part 2
Distributions
&
Sampling
Techniques
Pop Quiz
– In a cake factory, the standard deviation of sugar per cup is 129 gram. What is
the mean if not more than 62% has less than 440 gram of sugar per cup?
– What if not more than 62% has more than 440 gram of sugar per cup?
– In a vitamin factory, the standard deviation of vitamin C is 16 milligram. What is
the mean if more than 75% should have 51 milligram of vitamin C or more?
Discrete & Continuous
Distributions
– A random variable is discrete if the set of all possible values is at most a finite or a countably infinite number of possible
values.
Examples:
1. Randomly selecting 25 people who consume soft drinks and determining how many people prefer diet
soft drinks
2. Counting the number of people who arrive at a store during a five-minute period
• A random variable is continuous if it can take on values at every point over a given interval.
Examples:
1. Measuring the time between customer arrivals at a retail outlet
2. Measuring the weight of grain in a grain elevator at different points of time
• Discrete distributions (binomial, Poisson, hypergeometric) are constructed from discrete random
variables.
• Continuous distributions (uniform, normal, exponential, and others) are constructed from
continuous random variables.
I. Discrete Distribution
– A histogram is the most common graphical way of describing a discrete
distribution.
• An executive is considering out-of-town business travel for a given Friday. She recognizes
that at least one crisis could occur on the day that she is gone and she is concerned about
that possibility. Table 5.2 shows a discrete distribution that contains the number of crises
that could occur during the day that she is gone and the probability that each number will
occur.
5.2 Describing a Discrete Distribution
where
long-run average
an outcome
probability of that outcome
• In the long run, the mean or expected number
of crises on a given Friday for this executive is
1.15 crises.
• However, there will never be exactly 1.15
crises.
II.
Continuous
Distributions
6.2 The Normal Distribution
Characteristics of the Normal Distribution
• It is a continuous distribution.
• It is a symmetrical
distribution about its mean.
• It is asymptotic to the
horizontal axis.
• It is unimodal.
• It is a family of curves.
• Area under the curve is 1.
6.2 The Normal Distribution
Probability Density Function of the Normal Distribution
– Shows area under the normal curve for a given mean and standard deviation.
– Since it is difficult to use the formula, common to use a table or computer.
6.2 The Normal Distribution
– A z score is the number of standard deviations that a value, x, is above or below the
mean.
– The z distribution is a normal distribution with a mean of 0 and a standard deviation of 1.
6.2 The Normal Distribution
14-15
7.1 Sampling
Systematic Sampling
– Every kth item is selected to produce a sample of size n from a population of size N.
Example: A business researcher wanted to sample Texas manufacturers as part of a management study.
– Wanted to sample 1,000 companies.
– Frame-- most recent edition of the Texas Manufacturers Register® which listed 26,000 manufacturing
companies in alphabetic order.
– The value of k was 26 (26,000/1,000).
– Use random number table to choose the first element in the study.
7.1 Sampling
Nonrandom Sampling
– Any method that does not involve a random selection process.
Convenience Sampling
– Selected for the convenience of the researcher.
Judgment Sampling
– Chosen by the judgement of the researcher.
– Since the probability of an element being selected cannot be determined, cannot determine
sampling error.
– Can be biased due to systematic errors in judgment.
7.1 Sampling
Quota Sampling
– Population subclasses, such as age or gender, are used as strata.
– Can be useful if no frame is available for the population.
– Can be less costly.
– But nonrandom, and thus probabilities cannot be calculated.
Snowball Sampling
7.1 Sampling
Sampling Error
– Occurs when the sample is not representative of the population.
Nonsampling Error
– All other errors other than sampling error.
– Missing data
– Recording errors
– Measurement errors
– Input processing errors
– Analysis errors
– Response errors
– And many more!
7.2 Sampling Distribution of
Suppose that a small, finite population contains only N = 8 numbers:
54 55 59 63 64 68 69 70
– Suppose that all possible samples of size n = 2 are taken from this population.
7.2 Sampling Distribution of
Population:
54 55 59 63 64 68 69 70
– Similarly, the histogram of a Poisson distribution and its samples are different.
7.2 Sampling Distribution of
The Central Limit Theorem
– If random samples of size n are repeatedly drawn from a population that has a mean of μ and a
standard deviation of σ, the sample means,, are approximately normally distributed for sufficiently
large sample sizes (n ≥ 30), regardless of the shape of the population distribution. If the population
is normally distributed, the sample means are normally distributed for any size sample.
– It can be shown that the mean of the sample means is the population mean:
– The standard deviation of the sample means (the standard error of the mean) is:
7.2 Sampling Distribution of