AEB801_20222023-lecture_02-1
AEB801_20222023-lecture_02-1
Data Presentation
• Population is often very large, such as the number of Zooplankton in the ocean,
population.
It is important that the sample we select represent the target population by making sure that the
subjects in it are similar to the other members of the target population.
The more representative the sample, the more confident the researcher can be that the results can be
generalized to the target population.
One of the problems that can occur when selecting a sample from a target population is sampling bias.
Sampling bias refers to situations where the sample does not reflect the characteristics of the target
population.
Populations and Samples, Sampling Methods, Parameters and
11/7/2024 4
Statistics, Data Presentation
Samples can be obtained from the populations in a number of ways; however to reach valid
conclusions about populations by induction from samples, it is important that samples are
obtained in a random fashion.
Only results obtained from probabilistic (random) sampling are generalizable to the
reference population.
Also known as deliberate or purposive sampling, it uses a subjective method of selecting units from a
When samples are selected from a population on the basis of ease of access, it can be called
Quota sampling techniques – gathering of data from individuals possessing similar characteristics.
To obtain a random sample from a population requires that each member of the
population has an equal and independent chance of being selected.
That is, not only must each measurement in the population have an equal chance of
being chosen as a member of the sample,
But the selection of any member of the population must in no way influence the
selection of any other member.
It assures the researcher that the samples represent, to an acceptable extent, the
Scientists use it to relate his findings about the sample to the entire population
It ensures that no systematic attempt has been made to bias the sample.
The one chosen will depend on a number of factors (such as time, money etc.).
It is most achieved using a set of table of values known as random numbers. The most
important feature of the table is the lack of order or discernible organizational structure.
The use of these two methods requires that we be able to list all the members of the
population. Quite often this is rather impracticable, at times impossible.
Advantages: sample will represent the target population and eliminate sampling bias,
This method involves listing all the members of a population on a numerical interval
1, 2, 3, e.t.c.
If a population is 100 and you want a sample of 25 subjects, divide 100 by 25 =
4. Therefore, select every 4th number, such as 3, 7, 11, 15 etc.
• The basis of randomness in this method is that the first subject in the sample is
selected randomly.
Where the differences among the members of a population subgroup are seen as factors that
might influence their responses, the preferred mode of action is to divide the population to
their respective homogeneous subgroup and then work out the proportions needed for
the sample to be representative.
Then a number of individuals or elements are randomly selected from each homogeneous
subgroup to constitute the sample.
Advantage: The sample will be highly representative of the target population and
therefore we can generalize from the results obtained.
Disadvantage: Gathering such a sample would be extremely time consuming and difficult
to do.
The population or the geographical area is subdivided into clusters or properly defined segments using
simple, systematic or stratified random sampling techniques.
The aim is to reduce the population or geographical spread to such a size as would permit realistic and
adequate representation of a defined population while greatly reducing the cost of the research.
Note that this is significantly different from the stratified sampling where the homogeneity of the various
subgroups constitutes the basis for segmenting the population.
Advantage: The sample will be highly representative of the target population and therefore we can generalize
from the results obtained.
Disadvantage: Gathering such a sample would be extremely time consuming and difficult to do.
3. the degree of variability in the attributes being measured (Miaoulis and Michener, 1976).
Constraint brought about by time and money sometimes imposes serious limitations on selecting
adequate number of individual objects to be included in a sample.
But these should not dictate the sample size. — There is no point in carrying out a study that is too
small, only to come up with results that are inconclusive.
Several formulas have been devised for estimating the appropriate sample size with reference to
the size of any defined population.
A 95% confidence level and P = 0.5 are assumed for the Equation.
Where n is the sample size, N is the population size, and e is the level of precision (5% or 0.05).
representative sample
2. For smaller (N<100) population, there is no need in sampling. Survey the entire population.
5. Beyond a certain point (N=5000), the population size is almost irrelevant and a sample size of