Biostatistics Unit 6. Sampling
Biostatistics Unit 6. Sampling
.
Definition
1. Sample. Subset of the population that is selected for the study. Also called
subjects or respondents of the study
2. Element. Most basic unit about which information is collected
3. Representativeness. Means that the sample must be like the population in as
many ways as possible. The accessible population must be representative of
the target population
Sample and population
The population
Sampling unit. Refers to specific place or location which can be used during
sampling process Sampling frame. Describes the complete list of sampling units
from which the sample is drawn.
Sampling criteria: Refers to the essential characteristics of a subject or respondent
such as ability to read and write responses on the data collecting instruments
Eligibility criteria: A description chosen by the researcher to define which elements
should be included in or excluded from the population. Such criteria may include
sex, age, marital status, education level, and diagnosis.
The steps involved in sampling.
Use Slovin's formula to find out what sample of a population of 1,000 people you
need to take for a study.
Sampling techniques
1. Degree of accuracy
2. Resources
3. Time
4. Advanced knowledge of the population
5. National versus local
6. Need for statistical analysis
Probability sampling
Where:
Z = Z score value based on the confidence interval (approx. = 1.96)
σ = Population standard deviation.
n = Size of the sample.
Conclusion
Occurs when the sample used in the study does not represent the entire
population.
1. Systematic error (or bias). a consistent or proportional difference between the
observed and true values. (eg. a miscalibrated scale consistently records
weights as higher than they actually are)
2. Population-specific error. when a researcher doesn't understand who to
survey.
3. Selection error. when the survey is self-selected, or when only those
participants who are interested in the survey respond to the questions
4. Sampling error (random error). Sample frame error. when a sample is selected
from the wrong population data.
5. Non-response error. Inaccurate response, selection bias.
Primary and secondary data
.
Terminology definitions
Primary data. Data collected by the original user who also actively take part in
collection of the data
Secondary data. Data collected from records. This is also data collected by persons
not the users of the data at the time it collected from the records and used for
different purpose.
Numerical data. Date collected in form of numbers or figures.
Categorical data. Data collected in form of categories or description of the data in
question.
Grouped data. Data that has been grouped in frequencies.
Ungrouped data. Data that is taken the way it is collected without grouping.
Primary data
Data collected by the original user who also actively takes part in collection of the
data. Primary data is first-hand information, original in nature collected from a unit
or individual respondent directly for a predetermined purpose with active
involvement of the researcher.
Methods of collection primary data
1. Observation.
2. Interview
3. Questionnairre
4. Sampling.
5. Focused group discussion (FGD).
6. Personal enquiry.
7. Snowballing.
Secondary data
Data collected from records. This is also data collected by persons not the users of
the data at the time it collected from the records and used for different purpose
These are second hand information, statistically processed, obtained from an
already collected.
Source initially collected for some purpose and are now available for the present
purpose.
Sources of collecting secondary data
Thank you