Statistics and Probability Q3
Statistics and Probability Q3
Normal Distribution
The graph of a normal distribution is
bell-shaped and symmetric about the
mean.
It is often referred to as a bell curve.
Data points are more frequent around
the mean and decrease as they move
away from the center.
Properties of Normal Distribution
1. The distribution curve is bell-shaped.
2. The curve is symmetrical about its
center.
The mean, the median, and the
mode are equal and coincide
at the center.
3. The width of the curve is determined
by the standard deviation of the
distribution.
4. The tails of the curve flatten out
indefinitely along the horizontal
axis, always approaching the axis but Interpretation of z-Score:
never touching it. That is, the curve is A positive z-score indicates the value
asymptotic to the base line. is above the mean.
5. The area under the curve is 1. Exactly A negative z-score indicates the value
half of the values are to the left of is below the mean.
the center and exactly half the values
are to the right. The magnitude of the z-score tells how far
the value is from the mean in terms of
z-Table standard deviations.
The z-Table (or standard normal table)
provides the area under the normal
curve to the left of a given z-score.
It is used to find probabilities or
percentiles for normally distributed
data.
z-Score
The z-score measures how many
standard deviations a data point (x) is
from the mean
Advantages: Simple and ensures even
coverage of the population.
Disadvantages: Risk of periodicity if
the population has a hidden pattern
3. Stratified Sampling
Definition: Divides the population into
subgroups (strata) and randomly
n
Probability Sampling samples from each stratum. ∙ Strata
N
Every member of the population has a Uses: When the population has distinct
known, non-zero chance of being subgroups that need representation.
selected. Advantages: Ensures representation of
Used when the goal is to generalize all subgroups and improves accuracy.
results to the entire population. Disadvantages: Requires knowledge
Advantages: Unbiased, representative, of the population structure and can be
and allows for statistical inference. complex.
Disadvantages: Can be time-
consuming, expensive, and requires a
complete population list.
2. Systematic Sampling
Definition: Selects every k-th element
n 5. Multi-Stage
from a list after a random start.
N Definition: Divides the population into
Uses: Useful when the population is subgroups (strata) and randomly
large and evenly distributed. samples from each stratum.
Uses: When the population has distinct Advantages: Quick, easy, and
subgroups that need representation. inexpensive.
Advantages: Ensures representation of Disadvantages: Highly biased and not
all subgroups and improves accuracy. representative of the population.
Disadvantages: Requires knowledge
of the population structure and can be
complex.
2. Quota
6. Area Definition: Selects samples based on
Definition: A form of cluster sampling predefined quotas to reflect population
where geographic areas are selected subgroups.
as clusters. Uses: When time and resources are
Uses: Commonly used in surveys limited but some representation is
covering large regions. needed.
Advantages: Practical for Advantages: Ensures some diversity
geographically dispersed populations. and is faster than probability sampling.
Disadvantages: May not capture Disadvantages: Non-random selection
diversity within areas. can introduce bias.
3. Snowball
Definition: Existing participants recruit
future participants from their network.
Uses: For hard-to-reach or hidden
populations (e.g., rare diseases, niche
groups).
Advantages: Useful for studying rare
or hidden populations.
Disadvantages: Risk of bias and lack
Non-Probability Sampling of representativeness.
Definition: Members of the population
do not have a known or equal chance
of being selected.
Uses: Used when the goal is
exploratory or when resources are
limited.
Advantages: Cost-effective, quick, and 4. Purposive
easy to implement. Definition: Samples are selected
Disadvantages: Results may not be based on the researcher’s judgment or
generalizable and can be biased. specific criteria.
1. Convenience Uses: When studying a specific
Definition: Samples are taken from a subgroup or unique cases.
group that is easily accessible. Advantages: Targets specific
Uses: Exploratory research or pilot characteristics of interest.
studies.
Disadvantages: Highly subjective and
M=
∑x 1+ 3+5 9
3
not generalizable. N 3 3
5. Consecutive
Definition: Recruits every available
participant who meets criteria over a Population Variance
period.
Uses: Common in medical or clinical 2
σ =
∑ (x−M )
research. N
Advantages: Ensures a larger sample
size and reduces selection bias.
Disadvantages: May not be x x−M (x−M )
2
6. Panel Sampling
Definition: Repeatedly samples the
same group of participants over time.
σ=
√ ∑ (x−M )
N
σ x=
√ σ2
n
σ x=
σ
√n
Sample Standard Deviation (without
replacement)
√ √
σ (N−n)
2
σ ( N−n) σ x=
σ x= ∙ ∙
n ( N−1) √ n (N −1)