Lecture 1
Lecture 1
Sampling is called with replacement when a unit selected at random from the
random. Whenever a unit is selected, the population contains all the same units, so
a unit may be selected more than once. There is no change at all in the size of the
population at any stage. We can assume that a sample of any size can be selected
Consider a population of potato sacks, each of which has either 12, 13, 14, 15, 16, 17,
or 18 potatoes, and all the values are equally likely. Suppose that, in this population,
there is exactly one sack with each number. So the whole population has seven
sacks. If I sample two with replacement, then I first pick one (say 14). I had a 1/7
probability of choosing that one. Then I replace it. Then I pick another. Every one of
them still has 1/7 probability of being chosen. And there are exactly 49 different
possibilities here (assuming we distinguish between the first and second.) They are:
(12,12), (12,13), (12, 14), (12,15), (12,16), (12,17), (12,18), (13,12), (13,13), (13,14),
etc.
Insert
Question
Let’s say you had a population of 7 people, and you wanted to sample 2. Their names
are:
John
Jack
Qiu
Tina
Hatty
Jacques
Des
You could put their names in a hat. If you sample with replacement, you would choose
one person’s name, put that person’s name back in the hat, and then choose another
name. The possibilities for your two-name sample are:
John, John
John, Jack
John, Qui
Jack, Qui
Jack Tina
…and so on.
When you sample with replacement, your two items are independent. In other words,
one does not affect the outcome of the other. You have a 1 out of 7 (1/7) chance of
choosing the first name and a 1/7 chance of choosing the second name.
Note that P(John, John) just means “the probability of choosing John’s name, and then
John’s name again.” You can figure out these probabilities using the multiplication rule.
John, Jack
John, Qui
Jack, Qui
Jack Tina…
But now, your two items are dependent, or linked to each other. When you choose the
first item, you have a 1/7 probability of picking a name. But then, assuming you don’t
replace the name, you only have six names to pick from. That gives you a 1/6 chance of
choosing a second name. The odds become:
You can tell how dramatic these results are by calculating the covariance. That’s a
measure of how much probabilities of two items are linked together; the higher the
covariance, the more dramatic the results. A covariance of zero would mean there’s no
difference between sampling with replacement or sampling without.
Key points:
The sampling distribution of a statistic is the distribution of the statistic for all
possible samples from the same population of a given size.
https://online.stat.psu.edu/stat500/lesson/4/4.1
How Does it Work?
https://online.stat.psu.edu/stat500/node/510