Unit 3 Statistics Notes

The document provides an overview of data collection methods, including definitions of population, census, sample, and various sampling techniques such as simple random sampling and stratified sampling. It also discusses observational studies and experiments, highlighting the importance of random assignment and control in experimental design. Additionally, it addresses poor sampling methods, response bias, and ethical considerations in studies involving human participants.

Uploaded by

megan.s.aversa

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Unit 3 Statistics Notes

Uploaded by

megan.s.aversa

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

3A Introduction to Data Collection

★ Population = a statistical study’s entire group of individuals that we want

information about
★ Census = collects data from EVERY individual
★ Sample = subset that we collect data from
★ Sample Survey = study that collects data from a sample to learn abt the population
○ first step is to pick population, then decide what to measure
★ Random sampling = using a chance process to determine which members of the
population are chosen for the sample
★ Observational study = observes individuals and measures variables of interest,
DOES NOT attempt to influence responses
○ Examples: sample surveys, recording behavior of animals to see food
preferences, tracking lung cancer for smokers vs nonsmokers overtime
○ They can be retrospective (examining existing data for individuals) or
prospective (track individuals into the future)
○ GOAL = describe group/situation, compare groups, examine relationships
between variables
○ Inferences can only be made to the general population IF the sample was
randomly selected (otherwise it can only be generalized to those in sample)
○ CANNOT conclude cause and effect from observational studies, even if they
have random sampling
★ Experiment = deliberately imposes treatments on experimental units to measure
responses
○ GOAL = determine whether treatment causes change in response
3B Sampling and Surveys
★ Simple Random Sampling (SRS) = a sample chosen in a way where every group of
the sample size in the population has an equal chance of being selected as the
sample
○ EX: Mrs. Smith writes each student of her class of 30 on pieces of paper, and
then randomly selects 5 of them out of a hat to call on
■ each group of 5 has an equal chance of being selected
○ This can also be done with technology
■ give each individual in the population a number 1-N (N being the # of
the population), use a random # generator to get n different numbers
from 1-N, then choose the individuals that correspond
○ Can also be done with table
■ give each individual in the population a number with the same # of
digits, read groups of the appropriate length from left to right across a
line in the table (ignore spaces, groups that were not used as labeled
#s, and duplicates), stop when you choose n different labels, choose
those that correspond
○ SRS samples without replacement, meaning the individual can only be
selected once, so repeated numbers should be ignored
■ on calc, press math, then prob, then randInt
★ Strata = groups of individuals in a population that share characteristics thought to
be associated with the variables being measured in a study
★ Stratified random sample = a sample selected by choosing an SRS from each strata
and combining the SRSs into the overall sample
○ works best when the individuals within
each strata are very similar with respect
to what is measured
○ EX. In a study about sleep habits, the
population is divided into 4 strata: 9th,
10th, 11th, and 12th graders. It works
because 9th graders will have different
sleep habits than 12th graders but they
should be fairly similar within each
grade.
○ How many individuals should be selected in each strata? keep the sample size
proportional to how many are in the population
■ EX: if 20% of the high school students are 12th graders, and we want a
stratified sample of 250, then we should have (0.20)(250)= 50 12th
graders
■ when strata are chosen wisely, the estimate is much more precise than
in a simple random sample of the same size
★ Cluster = group of individuals in the population that are located near each other
★ Cluster sample = a sample selected by randomly choosing clusters and including
each member of the selected clusters in the sample
○ used when populations are large and spread out over a wide area
○ it means dividing the population into non overlapping groups of individuals
that are ‘near’ each other then randomly
selecting whole clusters to form the overall
sample
○ save time and money, but clusters should
be similar to each other in composition
■ EX: Administrators want to survey
100 students, but it would be
difficult to track down 100 different
students, so instead they select an
SRS of 4 homerooms and give the survey ti all 25 students in each
selected homeroom
★ Systematic Random Sample = a sample selected from an ordered arrangement of
the population by randomly selecting one of the first k individuals and choosing
every kth individual from there
○ might be helpful in exit polls, as it would be impossible to use an SRS because
we would have to know which voters will show up, numbering them all, and
then identifying them as they leave
○ select k by dividing the population size by the desired sample size, if possible
○ EX: A poller is asked to poll every 20th voter. They randomly select a number
1-20. If the number was 6, they would poll the 6th, then the 26th, then the
46th, etc
○ HOWEVER, if there are patterns in the way the population is ordered that
coincide with the pattern in a systematic random sample, the sample may not
be representative of the population
★ multistage campaigning combines two or more sampling methods
Poor Sampling Methods
★ Convenience Sample = consists of individuals from a population that is easy to
reach (EX: going to the library to ask 30 students abt their hw time)
○ produces unreliable results bc the members of the sample often differ from
the population; the example would overestimate hw times
○ shows bias, meaning that the design of the study is very likely to
overestimate or underestimate the desired value
★ Voluntary Response Sample = consists of people who choose to be in the sample
by responding to a general invitation, sometimes called self-selected samples (EX:
advice columnist asked her followers if they’d want to have kids again if they had a
redo, and 70% said no)
○ shows bias bc people who respond are likely to feel strongly about it; the
example overestimates those who said no bc only those who felt strongly abt
it (that they wouldn’t want kids again) responded
★ Undercoverage = occurs when some members of the population are less likely to be
chosen or cannot be chosen in the sample (EX: randomly calling telephone numbers
doesn’t include those who do not have telephones in the sample)
○ ideally, sample is chosen from a list of all the individuals in the population
(called the sampling frame)
★ Nonresponse = occurs when an individual chosen for the sample can’t be contacted
or refuses to participate (EX: some people are rarely at home and do not pick up the
phone)
○ can only occur after the sample is selected
★ Response Bias = occurs when there is consistent pattern of inaccurate responses to
a survey question (EX: questions ordered/worded weirdly, characteristics or
behavior of interviewer effects responses, non anonymous survey)
3C Experiments
★ Response variable = measures outcome of study
★ Explanatory variable = may help explain or predict changes in response variable
○ EX: in experiment studying if vitamin D lowers risk of diabetes, the response
is diabetes status and the explanatory is vitamin D level
★ However, we cannot say that more vitamin D lowers risk of diabetes
○ there are too many confounding variables
○ confounding variables = occurs when two variables are associated in such a
way that their effects on a response variable cannot be distinguished from
each other
■ EX: it is possible that those with healthier diets eat foods rich in
vitamin D, and that diet is a confounding variable because it is related
to both vitamin D consumption and diabetes status
★ An experiment was set up to determine whether vitamin D actually impacted
diabetes status
○ Treatment = specific condition applied to individuals in an experiment; if the
experiment has several explanatory variables, the treatment is a combination
of specific values of these variables
■ dose of vitamin D, no dose of vitamin D
○ Experimental Unit = the object to which a treatment is randomly assigned;
when experimental units are human beings, they are also called subjects
■ 500 patients with pre-diabetes
○ Placebo = a treatment that has no active ingredients but is otherwise like the
other treatments
■ treatment with no actual vitamin D given
★ placebo effect = describes the fact that some subjects in an experiment will respond
favorably to any treatment, even inactive treatment
○ because of this, it’s important that the subjects don’t know which treatment
they have; sometimes it’s beneficial that the experiment givers are also
unaware
■ double-blind experiment: neither the subjects nor those who interact
with them and measure the response variable know which treatment
a subject is receiving
■ sing-blind experiment: either the subjects or the people who interact
with them and measure the response don’t know which treatment a
subject is receiving
○ They avoided confounding variables by randomly assigning which
participants got the vitamin D and which didn’t, so people with healthier
diets were evenly split between treatment groups
○ Factor = an explanatory variable that is manipulated and may cause change
in the response variable
■ there is one factor (explanatory variable) = vitamin D level
○ Levels = different values of a factor
■ there is two levels = 20000 mg vitamin D dose, 0 mg vitamin D dose
○ Control group = used to provide a baseline for comparing the effects of other
treatments; depending on the experiment, it can be the inactive treatment,
the active treatment, or no treatment at all
■ not all experiments actually require a control group, as long as there is
comparison in place
★ Basic principles of experimental design
○ Comparison: use a design that compares two or more treatments
○ Random Assignment: use a chance process to assign treatments to
experimental units to create roughly equivalent groups before treatments are
imposed
■ vitamin D or no vitamin D was randomly assigned to pre-diabetes
patients
○ Replication: use each treatment with enough experimental units so that the
effects of the treatments can be distinguished from chance differences
between the groups
■ does NOT mean replicating the experiment
■ EX: using 100 patients per treatment group instead of 1 patient
○ Control: keep other variables the same for all groups; it helps avoid
confounding and reduces the variation in the response variable, making it
easier to decide if the treatment is effective
★ Completely Randomized Design = the experimental units are assigned to the
treatments at random
★ Randomized Block Design = forms groups (blocks) of experimental units that are
similar with respect to a variable that is expected to affect the response. Treatments
are assigned at random within each block then the responses are compared within
each block and combined with the responses of other blocks after accounting for the
differences between each.
○ it is easier to determine if one treatment is more effective than another this
way
★ Matched Pairs Design = a common form of randomized block design for comparing
two treatments, where each subject received both treatments in a random order
○ in others, two very similar subjects are paired, and the two treatments are
randomly assigned within each pair
★ Statistically Significant = when the observed difference in responses between the
groups in an experiment is so large that it is unlikely to be explained by chance
variation in the random assignment
★ The scope of inference = describes the types of conclusions we can make based on
how data is collected
○ Inference about a population = requires that individuals are randomly
selected from the population
○ Inference about cause and effect = requires a well-designed experiment
with random assignment of treatments and statistically significant results
★ Data Ethics: studies involving humans must be screened in advance by an
institutional review board. All participants must give informed consent before
taking part; any info about the participants must be confidential

MYP Science Lab Report:: Checklist
100% (1)
MYP Science Lab Report:: Checklist
5 pages
SOCIAL EXPERIMENT Format
100% (3)
SOCIAL EXPERIMENT Format
1 page
Statistics Chapter 4 Notes Section 4.1 Designing Studies: Definition: Population and Sample
No ratings yet
Statistics Chapter 4 Notes Section 4.1 Designing Studies: Definition: Population and Sample
6 pages
Chapter 4 Designning Studies(1)
No ratings yet
Chapter 4 Designning Studies(1)
59 pages
Levels of Measurement: Study
No ratings yet
Levels of Measurement: Study
13 pages
AP Stats Module 3 Notes
No ratings yet
AP Stats Module 3 Notes
2 pages
Unit 3 - Sampling and Experimental Design New - Read-Only
No ratings yet
Unit 3 - Sampling and Experimental Design New - Read-Only
44 pages
Collecting Data Sensibly: Chapter Is VERY Important!
No ratings yet
Collecting Data Sensibly: Chapter Is VERY Important!
52 pages
5.1 Notes (1)
No ratings yet
5.1 Notes (1)
6 pages
6) BIOSTATISTICs
No ratings yet
6) BIOSTATISTICs
99 pages
Chapter 1 - Sampling and Experimental Design
No ratings yet
Chapter 1 - Sampling and Experimental Design
9 pages
Stat 102 Module 1
No ratings yet
Stat 102 Module 1
11 pages
Reviewer in Practical Reseach - 074601
No ratings yet
Reviewer in Practical Reseach - 074601
5 pages
Psychological Assessment Unit 2
No ratings yet
Psychological Assessment Unit 2
123 pages
7/10/15 SR - Muzaitul Akma Mustapa Kamal Basha: Biostatistics NUR 3163
No ratings yet
7/10/15 SR - Muzaitul Akma Mustapa Kamal Basha: Biostatistics NUR 3163
32 pages
6) BIOSTATISTICs
No ratings yet
6) BIOSTATISTICs
99 pages
Chapter 4 Vocab
No ratings yet
Chapter 4 Vocab
2 pages
2 Collecting Data
No ratings yet
2 Collecting Data
43 pages
4 - Sampling and Sample Size - SFB
No ratings yet
4 - Sampling and Sample Size - SFB
52 pages
Statistics Class Work # 1-3
No ratings yet
Statistics Class Work # 1-3
8 pages
Sampling - 2019
No ratings yet
Sampling - 2019
38 pages
00 Unit 3 Collecting Data Student Notes 2023-24
No ratings yet
00 Unit 3 Collecting Data Student Notes 2023-24
10 pages
Notes 2 Study Design RJMurden 2021
No ratings yet
Notes 2 Study Design RJMurden 2021
44 pages
Applied Statistics - MIT
100% (1)
Applied Statistics - MIT
654 pages
Lecture 1 - Biostat Basic
No ratings yet
Lecture 1 - Biostat Basic
60 pages
AP Review Packet 1 - Important Concepts not on the AP Statistics Formula Sheet
No ratings yet
AP Review Packet 1 - Important Concepts not on the AP Statistics Formula Sheet
16 pages
STATISTICS-REVIEWER-SAMPLING-METHODS
No ratings yet
STATISTICS-REVIEWER-SAMPLING-METHODS
7 pages
Lecture 2
No ratings yet
Lecture 2
65 pages
WK1 Topics - Cs
No ratings yet
WK1 Topics - Cs
22 pages
Chapter 2
No ratings yet
Chapter 2
56 pages
Population and Sampling
No ratings yet
Population and Sampling
27 pages
Sampling
No ratings yet
Sampling
31 pages
RM (WK 6) - Sampling and Generalizability - 2019
No ratings yet
RM (WK 6) - Sampling and Generalizability - 2019
23 pages
Sampling Designs in Operational Health Research: Dr. Syed Irfan Ali
No ratings yet
Sampling Designs in Operational Health Research: Dr. Syed Irfan Ali
35 pages
Sampling Method - CGS
No ratings yet
Sampling Method - CGS
36 pages
Session Ix: Causal Research Design: Experimentation & Introduction To Sample & Sampling
No ratings yet
Session Ix: Causal Research Design: Experimentation & Introduction To Sample & Sampling
50 pages
6NUBioEpiSampling2T24-25
No ratings yet
6NUBioEpiSampling2T24-25
54 pages
Lecture Notes - Data
No ratings yet
Lecture Notes - Data
26 pages
Lesson 2.4 the Sample and Sampling Procedure Copy
No ratings yet
Lesson 2.4 the Sample and Sampling Procedure Copy
40 pages
Sampling: Iiird Year Resident
No ratings yet
Sampling: Iiird Year Resident
26 pages
Unit 2 Statistics PDF
No ratings yet
Unit 2 Statistics PDF
18 pages
Chapter 1 INTRODUCTION TO STATISTICS (New)
No ratings yet
Chapter 1 INTRODUCTION TO STATISTICS (New)
34 pages
Sampling Techniques Ali 2014
No ratings yet
Sampling Techniques Ali 2014
43 pages
Statistics For Beginners 2024
No ratings yet
Statistics For Beginners 2024
37 pages
Lecture 3 Sampling
No ratings yet
Lecture 3 Sampling
83 pages
Hypothesis Sampling - Day9
No ratings yet
Hypothesis Sampling - Day9
40 pages
Introduction To Sampling: Situo Liu Spry, Inc. 10/25/2013
No ratings yet
Introduction To Sampling: Situo Liu Spry, Inc. 10/25/2013
22 pages
UNIT 10 POPULATION AND SAMPLE
No ratings yet
UNIT 10 POPULATION AND SAMPLE
51 pages
Notability Notes 2
No ratings yet
Notability Notes 2
6 pages
Reviewer for Quiz 1 in Practical Research 1(March 5, 2025)
No ratings yet
Reviewer for Quiz 1 in Practical Research 1(March 5, 2025)
5 pages
Unit 3 Sampling
No ratings yet
Unit 3 Sampling
22 pages
Econ 522- Chapter 4
No ratings yet
Econ 522- Chapter 4
55 pages
1.sampling Methods and Sample Size Determination
No ratings yet
1.sampling Methods and Sample Size Determination
80 pages
Sample and Sampling Process
No ratings yet
Sample and Sampling Process
37 pages
Lecture 4 Sampling Techniques
No ratings yet
Lecture 4 Sampling Techniques
26 pages
CS210 Statistics Notes.pdf (1)
No ratings yet
CS210 Statistics Notes.pdf (1)
8 pages
Important Statistical Terms: Population
No ratings yet
Important Statistical Terms: Population
27 pages
Dr. Pius Ochwo: Sampling Methods
No ratings yet
Dr. Pius Ochwo: Sampling Methods
24 pages
4 1 Sampling Techniques Ali 2021
No ratings yet
4 1 Sampling Techniques Ali 2021
43 pages
Selection & formulation of Research Question, Data collection, Types of data & Sampling
No ratings yet
Selection & formulation of Research Question, Data collection, Types of data & Sampling
42 pages
Statistics II Essentials
From Everand
Statistics II Essentials
Emil Milewski
2.5/5 (1)
Sampling in Statistics
From Everand
Sampling in Statistics
Stephanie Glen
No ratings yet
Exploratory and Formal Studies
100% (2)
Exploratory and Formal Studies
3 pages
Writing Empirical Research Report PDF
100% (1)
Writing Empirical Research Report PDF
164 pages
PB ZN Acumulacion
No ratings yet
PB ZN Acumulacion
10 pages
Trail Smelter Arbitration Case Full Case
No ratings yet
Trail Smelter Arbitration Case Full Case
80 pages
Human Development A Life Span View 3rd Edition Testbank Compress
100% (1)
Human Development A Life Span View 3rd Edition Testbank Compress
32 pages
Module 1 - Chapter 1 - 2
No ratings yet
Module 1 - Chapter 1 - 2
10 pages
Methods of Research
100% (6)
Methods of Research
252 pages
Lesson Plan in Science 7
100% (2)
Lesson Plan in Science 7
4 pages
Annex 1 General Information
No ratings yet
Annex 1 General Information
3 pages
Quality Control by Taguchi Method: Presented By: Pankaj Raj 1002009 Amit Kumar Sharma 1002010
No ratings yet
Quality Control by Taguchi Method: Presented By: Pankaj Raj 1002009 Amit Kumar Sharma 1002010
24 pages
Module+6++Research+Designs+revised+2024
No ratings yet
Module+6++Research+Designs+revised+2024
32 pages
De anh DH Vinh lan 22015 - keys
No ratings yet
De anh DH Vinh lan 22015 - keys
5 pages
Reviewer Answer Key Reviewer
No ratings yet
Reviewer Answer Key Reviewer
20 pages
METHODOLOGY
No ratings yet
METHODOLOGY
4 pages
Composition 1 S2 2020 Final
No ratings yet
Composition 1 S2 2020 Final
46 pages
分层随机分配
100% (2)
分层随机分配
10 pages
Investigating Factors Needed For Photosynthesis
No ratings yet
Investigating Factors Needed For Photosynthesis
3 pages
Approaching Microbiological Method Validation
No ratings yet
Approaching Microbiological Method Validation
19 pages
What Is Natural Science
No ratings yet
What Is Natural Science
6 pages
Research Goals and Types of Research Designs
No ratings yet
Research Goals and Types of Research Designs
10 pages
Valencea - Organic Chemistry - Lab Report - Grade 9
No ratings yet
Valencea - Organic Chemistry - Lab Report - Grade 9
10 pages
Catalysis Through Cultural Synergism To The Target
No ratings yet
Catalysis Through Cultural Synergism To The Target
7 pages
4-RESEARCH-METHODS-IN-CLINICAL-PSYCHOLOGY
No ratings yet
4-RESEARCH-METHODS-IN-CLINICAL-PSYCHOLOGY
25 pages
Francis Bacon's Philosophy of Science
No ratings yet
Francis Bacon's Philosophy of Science
13 pages
Physics Laboratory Report Guidelines For Students
No ratings yet
Physics Laboratory Report Guidelines For Students
2 pages
Cambridge Upper Secondary Science Project
No ratings yet
Cambridge Upper Secondary Science Project
13 pages
Nordtest Incerteza Ambiental PDF
No ratings yet
Nordtest Incerteza Ambiental PDF
52 pages

Unit 3 Statistics Notes

Uploaded by

Unit 3 Statistics Notes

Uploaded by

3A Introduction to Data Collection

★ Population = a statistical study’s entire group of individuals that we want

You might also like