0% found this document useful (0 votes)

106 views11 pages

STAT 101 Module Handout 5.1

This document provides an overview of statistical methods for making inferences on two or more populations. It discusses comparing population means and proportions using related and independent samples. Related samples are obtained by matching or self-pairing units, while independent samples have no relationship between how units were selected. The document outlines estimating and testing hypotheses for differences between two population means using independent and related samples. It aims to help students understand and apply appropriate statistical tests to compare parameters between multiple populations.

Uploaded by

Ahl Rubianes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

106 views11 pages

STAT 101 Module Handout 5.1

Uploaded by

Ahl Rubianes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

STAT 101: Statistical Methods

MODULE 5.1 1st Semester 2022-2023

Inference on Two or More Populations

From the previous module, you have learned how to deal with inference on a single population. Now, you are
well-equipped on estimation and hypothesis testing on one population mean and proportion, as well as one
population with several proportions and one population variance. But what if our study deals with two or more
populations?

In this module, you will be learning about dealing with inference on two or more populations. Studies are usually
conducted to compare two or several populations. For example, researches are done to compare the average
length of time spent on social media sites of men and women, to determine if difference exists between the
mean pre-test and post-test scores of students in a bridging program, or to compare the cigarette consumption
among heavy smokers after a hypnotherapy program, with 3 time-points such as before, a month after, and six
months after the program. Estimation and hypothesis testing on parameters such as the population mean and
population proportion may also be done for such cases.

To further understand and appreciate the concepts about the inference on two or more populations, some real-
life applications can be found in this Youtube video.

LEARNING OBJECTIVES
At the end of this module, you must be able to:
1. differentiate related from independent samples;
2. use the most appropriate statistical method(s) for comparing two population means,
3. interpret the output of a statistical software for comparing two population means, and
4. recommend appropriate actions based on statistical results.

TOPIC 5A. RELATED VS. INDEPENDENT SAMPLES

In analyzing several populations, you need to first correctly distinguish between the two types of samples,
namely, related (or paired) and independent samples.

DEFINITION

Related or paired samples are obtained by matching two similar units with respect to some important
characteristic or by self-pairing in which two measurements are taken from the same unit. Independent
samples, on the other hand, are obtained when two unrelated sets of units are measured for a variable.
How the set of samples was selected from one population has no relationship with how the other set of
samples was selected from the other population.

For additional reference on the definition of related and independent samples, you may check this
supplementary reading material.

ILLUSTRATION: Differentiating between related and independent samples

Let us consider the following scenarios to better illustrate the use of paired and independent samples.

1. A professor wants to know if there are differences between the scores in midterm and pre-final exams
of her students.
a. She randomly selected 25 students from whom she obtained their midterm scores and another 25
students for their pre-final scores.
b. She randomly selected 25 students from whom she obtained both the midterm and pre-final scores.

Institute of Statistics, CAS, UPLB [1]

Module 5.1 [STAT 101 MODULE HANDOUT]
Inference on Two or More Populations

In scenario (a), the professor used independent samples since the midterm scores from
the 25 selected students have no relationship with the pre-final scores from the other 25
selected students. On the other hand, in scenario (b), the professor employed the use of
Icon made by related samples. This is because both midterm and pre-final scores are taken from each of
surang from
www.flaticon.com the 25 selected students.

2. Thirty individuals were randomly selected to rate Milk Teh, a weight gain milk, in terms of its taste.
Afterward, they also rated the taste of the newly introduced competitor, Milk Koya. The ratings for the
two brands of weight gain milk brands were then compared.

Each individual in the study rated both milk brands so the collected data can be organized
by pairs wherein every individual is associated with two ratings from each milk brand. Hence,
this scenario illustrates the use of related samples.

3. A badminton player wants to test if there is a difference in speed between two brands of shuttlecocks.
Twenty shuttlecocks from each brand were randomly selected from a production batch and each
sampled shuttlecock was subjected to a speed test.

The speed was measured on the twenty randomly selected shuttlecocks from each brand.
In this case, all the measurements are independent. Thus, this certain problem employed the
use of independent samples.

ACTIVITY 5.1 Test yourself: Related or independent samples?

To further evaluate if you have learned how to correctly identify the type of sample used for every scenario,
here are some more illustrations. For each item, identify if the type of sample is related or independent.

1. A study was conducted to examine the effect of thermal pollution on the growth of clams. A random
sample of six clams was collected at the intake site without thermal pollution while a random sample
of four clams was taken from the discharge site with thermal pollution. The length of each clam (in cm)
was measured.

2. An experiment aims to determine which of two concentrations of a certain chemical enhances the
growth of a certain type of plant better. The growth of 10 randomly selected plants in concentration A
and 10 randomly selected plants in concentration B were recorded.

3. Studies show that Filipinos are relatively friendlier compared to other races. To verify this, a Japanese
national randomly selected 50 Filipinos and another 50 non-Filipinos and asked them whether or not
they will talk to a stranger while on public transportation.

4. To verify if first-born children tend to be more independent than those who are not, five first-born
children with their second-born siblings were randomly chosen. All sampled children answered a test
that determines if a person can be considered independent. Their test scores were then recorded.

5. High concentration of trace metals in drinking water affects the flavor and poses a health hazard to
drinkers. A health officer wanted to compare the zinc concentration found in bottom water and surface
water for 20 randomly selected brands of bottled water.

6. It is of interest to determine the effect of a very difficult exam on a student’s belief in the existence of
a divine being. To do this, 240 randomly selected students were initially asked if they believe in the
existence of the divine being or not. Then, they were given a very difficult exam and asked again their
belief after taking the said exam.

7. A manager randomly selected 20 of his employees to investigate if losing one night’s sleep affects
employees’ work performance. Each sampled employee was given a problem-solving task at noon for
2 days, in which the employees had a night of good sleep for the first day, while they had no sleep at
all for the other day, then their scores were recorded.

Institute of Statistics, CAS, UPLB [2]

Module 5.1 [STAT 101 MODULE HANDOUT]
Inference on Two or More Populations

8. An ear specialist wants to examine if the right and left ears of children have a different mean threshold
of pain to noise. He randomly selected 20 children and recorded their maximum level of tolerance to
noise for both right and left ears.

9. A researcher is interested to determine if two instruments (A and B) differ in their measurement of the
diameter of the ball bearing. He randomly selects 10 ball bearings and measures their diameter using
instrument A. He again randomly selects another 10 ball bearings and measures their diameter using
instrument B.

10. It is of interest to compare the efficacy of a new feed formulation (NF) with the standard (SF) in
increasing the milk yield (kg) of cows. Dr. Beh Ca measured the milk yield from five pairs of cows,
each pair of the same parent and the same health status. For each pair, he assigned one cow to NF
and the other cow to SF.

TOPIC 5B. COMPARISON OF TWO POPULATION MEANS

There are several researches which involve comparing two populations to determine if there are significant
differences between the two populations. To answer such objectives, estimation and hypothesis testing of two
population means are commonly done.

ESTIMATION AND TEST OF HYPOTHESIS ON TWO POPULATION MEANS

In comparing two population means, the parameter of interest is the difference between the two population
means denoted by µD = µ1 − µ2, where µ1 is the first population mean and µ2 is the second population mean.
The estimation and test of significance on µD differ between related and independent samples.

USING INDEPENDENT SAMPLES

Suppose independent simple random samples of size n1 from population 1 and of size n2 from population 2 are
drawn, then a characteristic of interest X is measured on each unit. The data obtained may be presented using
the table below.

Sample values from Population 1 Sample values from Population 2

x11 x21
x12 x22

x1n1 x2n2
Sample Size n1 n2
Sample Mean x1 x2
Sample Variance s12 s22

where
xij represents the observed value of the random variable 𝑋 taken from the jth unit of the ith population,
where i = 1, 2; j = 1, 2, …, ni;

ni is the size of the sample drawn from the ith population;

ni
 xij
j =1
xi is the mean of the random sample obtained from the ith population, xi = ; and
ni
ni
 ( xij − xi )
2

j =1
si2 is the variance of the random sample obtained from the ith population, si =
2
.
ni − 1

Institute of Statistics, CAS, UPLB [3]

Module 5.1 [STAT 101 MODULE HANDOUT]
Inference on Two or More Populations

Estimation

A point estimator for the difference between two population means (µD) is given by ˆD = x1 − x2 .
 12  22
Furthermore, the sampling distribution of ̂D is normal with parameters mean µD and variance + .
n1 n2

Hypothesis Testing
In hypothesis testing, note that the null hypothesis is Ho: µD = D0, where D0 is the hypothesized population mean
difference, which could take values from −∞ to +∞ but is commonly set to 0. To test the significance of µD, we
can use different parametric tests depending on our knowledge about the population variances.

REMARK

In formulating the null and alternative hypotheses for one-tailed tests, note that the inequality sign may differ
according to how the parameter is defined. For example, suppose the parameter is defined as µD = µ1 − µ2
and you want to test the hypothesis µD > 0. The same parameter can be defined as µD = µ2 − µ1 but the
corresponding hypothesis will now be µD < 0. Both are correct as long as it is consistent with the defined
populations. Whichever the case is, results of the hypothesis test will still be the same.
.

However, before using these parametric tests, we need to satisfy first the following assumptions:
1. The variable of interest should be measured on at least interval scale.
2. Independent samples must be taken using simple random sampling.
3. In addition to the independence of samples, each of the two populations should be normally
distributed.

The table below provides a summary of the different interval estimators, test procedures, and test statistics to
be applied depending on the assumed condition of the population variances.

Given two Case 1: Case 2: Case 3:

normally Known population Unknown population Unknown population
distributed variances variances variances
populations (equal variances) (unequal variances)

 s12 s22 
 ˆD t + 
 1 1   2( ) n1
df n2 
 ˆD t ( n + n −2)sp +   
  2 1 2 n1 n2 
A (1−α) × 100% 12  22  where
 ˆD + 
( ) ( )
confidence Z where
 2 n1 n2   s2 n + s2 n 
interval estimator   df = 
1 1 2 2 
( n1 − 1) s12 + ( n2 − 1) s22
sp =
( ) ( )
2 2
n1 + n2 − 2 s12 n1 s22 n2
+
n1 n2

Hypothesis testing
Non-pooled t-test
Test procedure Z-test Pooled t-test
(Welch’s test)

Test statistic ˆD − D0 ˆD − D0 ˆD − D0

Zc = tc = tc =
 12  22 1
+
1 s12 s22
+ sp +
n1 n2 n1 n2 n1 n2

Institute of Statistics, CAS, UPLB [4]

Module 5.1 [STAT 101 MODULE HANDOUT]
Inference on Two or More Populations

From the given table, note that when the population variances are known (Case 1), Z-test is the
most appropriate test to be used. On the other hand, the t-test is more appropriate when the
population variances are unknown. Under this scenario, two cases may still arise; we may have
equal (Case 2) or unequal (Case 3) variances between the two populations. Thus, a test on
equality of variances must be performed first to know which between pooled or non-pooled t-
test is more appropriate to be used.
Icon made by mynamepong from www.flaticon.com

TAKE NOTE!

▪ The Z-test, pooled t-test, and the non-pooled t-test require independent and simple random samples,
and normal populations.
▪ If the population variances are different, the pooled t-test can result in a significantly larger Type I error.
▪ The non-pooled t-test applies whether or not the population variances are equal.
▪ However, pooled t-test is slightly more powerful, on the average, if the population variances are equal.

EXAMPLE 5.1. Indigenous People

Indigenous people (IP) or native people are ethnic groups who are the original settlers of
a place. They maintain the traditions, ways, language, religion, dress, or other aspects of
an early culture even in modern times. Originally, they occupy vast tracts of lands that are
usually rich in natural resources. As such, they play the role of protector of nature and
preserver of heritage.
Icon made by Eucalyp from www.flaticon.com

However, IPs are threatened with extinction due to poverty, encroachment by the outside world,
colonization, modernization, land grabbing, profit-oriented businesses, and corrupt governments, leading
to loss of cultural identity and ancestral lands. The Hinereben Foundation, a donor-based organization, is
aimed at promoting the welfare of IP in the country. Its activities, however, are limited, relying heavily on
donors for funds. In Alakan Valley, one of its project sites, IP farmers from Sitio Pawagaon and Sitio
Magsaysay were randomly selected and interviewed as part of a profiling procedure by the foundation.

Because of a limited budget, the executive director of Hinereben Foundation plans to concentrate its effort
on Sitio Magsaysay as he believes that the IP farmers in the said site are more financially in need. He
believes that the IP farmers in Sitio Pawagaon, being located near the Poblacion, have higher monthly
household incomes compared to those in Sitio Magsaysay. Is there enough evidence to support the
executive director’s claim? Would you advise him to pursue his plan?

To analyze this problem, let us first define the populations. Let population 1 be the set of IP farmers from
Sitio Pawagaon and population 2 be the set of IP farmers from Sitio Magsaysay. Since the IP farmers were
taken independently from each Sitio, we know that we are dealing with independent samples.

R COMMANDER OUTPUT #1 The variable of interest for each population is the monthly
household income which is measured in the ratio scale.
Sitio = Pawagaon Based on the Shapiro-Wilk normality test (see R
Shapiro-Wilk normality test Commander Output #1), the assumption of normality is
W = 0.96596, p-value = 0.3775 satisfied for each of the two populations (p-values are
-------- relatively large).
Sitio = Magsaysay
Shapiro-Wilk normality test Parameter of interest: µD = µP−µM; the difference between
W = 0.97861, p-value = 0.8690 the mean monthly household incomes of IP farmers from
Sitio Pawagaon and Sitio Magsaysay.

Institute of Statistics, CAS, UPLB [5]

Module 5.1 [STAT 101 MODULE HANDOUT]
Inference on Two or More Populations

To obtain a point estimate of µD, we simply get the difference between the mean monthly household
incomes of IP farmers from Sitio Pawagaon and Sitio Magsaysay (see R Commander Output #2).

R COMMANDER OUTPUT #2
Thus,
ˆD = xP − xM
95 percent confidence interval:
1188.810 3165.008 = 4862.242 − 2685.333
sample estimates: = 2176.909
mean in group Pawagaon mean in group Magsaysay
4862.242 2685.333

Aside from a point estimate, a confidence interval estimate about µD may also be constructed. Based on
the results (see R Commander Output #2), we are 95% confident that the difference between the mean
monthly household incomes of IP farmers from Sitio Pawagaon and Sitio Magsaysay lies between
1,188.810 and 3,165.008 pesos.

To test if there is enough evidence to support the director’s claim that the IP farmers from Sitio Magsaysay
are more financially in need than those from Sitio Pawagaon, the appropriate hypotheses are:

Ho: µD (µP−µM) = 0; There is no difference in the mean monthly household incomes between IP farmers
from Sitio Pawagaon and Sitio Magsaysay.

Ha: µD (µP−µM) > 0; The mean monthly household income of IP farmers from Sitio Pawagaon is greater
than that of IP farmers from Sitio Magsaysay.

R COMMANDER OUTPUT #3

Bartlett test of homogeneity of variances

Bartlett's K-squared = 13.788, df = 1, p-value = 0.0002047

We already know that the assumption of normality for each population was satisfied. Since we do not have
information on the population variances, we need to test if the variances are equal. Based on the results of
the Bartlett test of homogeneity of variances (see R Commander Output #3), the population variances are
not equal (p-value is too small). Therefore, the non-pooled t-test or Welch’s test should be used.

R COMMANDER OUTPUT #4

Welch Two Sample t-test

t = 4.431, df = 47.473, p-value = 0.00002755
alternative hypothesis: true difference in means is greater than 0

(Consider R Commander Output #4) Since the p-value is very small, we reject the null hypothesis. Hence,
there is sufficient evidence to say that the mean monthly household income of IP farmers from Sitio
Pawagaon is greater than that of IP farmers from Sitio Magsaysay. Results show that there is sufficient
evidence to support the director’s claim and it is advised to pursue his plan.

THINK ABOUT THIS!

What if the assumption(s) is(are) not satisfied for the original and transformed data?

Institute of Statistics, CAS, UPLB [6]

Module 5.1 [STAT 101 MODULE HANDOUT]
Inference on Two or More Populations

Mann-Whitney test (Wilcoxon rank-sum test), a non-parametric test, is also applicable to two independent
samples when it is of interest to know if the two groups have been drawn from the same population. Instead of
using the actual values of the observations, it utilizes the ranks of the data, so it is functional for variables at
least ordinal scale. Among the non-parametric alternative of the parametric t-test, this non-parametric test is the
most practical and useful.

For the results to be valid, the assumptions for this non-parametric test are the following:
1. The variable of interest is at least ordinal in scale.
2. Independent samples must be taken using simple random sampling from two populations.
3. The distributions of the populations must have the same shape.

The Mann-Whitney test can be used to perform a hypothesis test for both population median and population
mean (unless the variable of interest is in ordinal scale, then only median can be used).

EXAMPLE 5.2. Jazz or pop music?

Studies have shown that listening to music alters a person’s dopamine level, a
neurotransmitter associated with pleasure and learning. Since different types of music may
have different effects on the concentration, memory, and learning states of an individual, an
experiment was conducted using different songs from jazz and pop music.
Icons made by ultimatearm from www.flaticon.com

Twenty randomly selected STAT 101 students were asked to solve a maze while listening to music. Ten
students were assigned to listen to jazz music and the other 10 students to pop music. The time it took (in
seconds) for each student to solve the maze was recorded. Determine if there is a significant difference in
the completion time of students who listened to jazz and pop music, on the average.

The given scenario still exhibits the use of independent samples since students who were assigned to
listen to jazz and pop music were taken independently. Let population 1 be the set of STAT 101 students
assigned to listen to jazz music and population 2 be the set of STAT 101 students assigned to listen to pop
music while solving a maze.

R COMMANDER OUTPUT #5
Based on the Shapiro-Wilk normality test (see R
Music = Jazz Commander Output #5), both populations did not
Shapiro-Wilk normality test satisfy the normality assumption (say α = 0.10).
W = 0.82269, p-value = 0.02077 Thus, instead of using the mean difference as the
-------- parameter of interest, we use the median
Music = Pop difference.
Shapiro-Wilk normality test
W = 0.80915, p-value = 0.01872

Parameter of interest: MdD = Md1−Md2; the difference between the median completion time (in seconds)
of STAT 101 students who listened to jazz and pop music

To test if the students who listened to jazz music finished the maze faster than those who are assigned to
pop music, the appropriate hypotheses are:

Ho: MdD = 0; There is no difference in the median completion time of students who listened to jazz and pop
music.

Ha: MdD ≠ 0; There is a difference in the median completion time of students who listened to jazz and pop
music.

Institute of Statistics, CAS, UPLB [7]

Module 5.1 [STAT 101 MODULE HANDOUT]
Inference on Two or More Populations

Given that both populations do not follow a normal distribution, we

must use the non-parametric counterpart for the test on two
population means. Since both distributions have relatively the same
shapes (as shown in the boxplots on the left), the Mann-Whitney
test can be used to analyze the problem.

Based on the results (see R Commander Output #6), we reject the

null hypothesis at a given α = 0.10 Therefore, we have sufficient
evidence to say the there is a significant difference in the
completion time of students who listened to jazz and pop music.

R COMMANDER OUTPUT #6

Wilcoxon rank sum test

W = 79, p-value = 0.02881
alternative hypothesis: true location
shift is not equal to 0

TAKE NOTE!

Under the condition of normality, the pooled t-test is more powerful. Alternatively, under the non-normality
condition but with same shape distribution, Mann-Whitney test is more powerful.

USING RELATED (OR PAIRED) SAMPLES

Suppose a simple random sample of size n related or paired samples are drawn, then a characteristic of interest
X is measured for each member of a pair. The data obtained may be presented in the table below.
Pair Number (i) X1 X2 di
1 x11 x21 d1 = x11- x21
2 x12 x22 d2 = x12- x22
⋮ ⋮ ⋮ ⋮
n x1n x2n dn= x1n- x2n
where
x1i is the value of the random variable X observed from the first member of the ith pair;
x2i is the value of the random variable X observed from the second member of the ith pair;
di is the difference for the ith pair, di = x1i − x2i, i = 1, 2, …, n; and
n is the number of pairs of observations.

A similar data representation can be used when the observations are taken using the self-pairing method where
X1 represents the measurement at time 1 while X2 represents the measurement at time 2 taken on the ith unit.

Point Estimation
We now consider the difference (di) as the random variable of interest. A point estimator of µD is d , which is
computed as the mean of the observed sample differences given by
n
 di
i =1
d = .
n

Institute of Statistics, CAS, UPLB [8]

Module 5.1 [STAT 101 MODULE HANDOUT]
Inference on Two or More Populations

Interval Estimation

A (1−α) × 100% confidence interval estimator for µD is given by

n
 ( di − d )
2

 sd  i =1
d t  where sd = , the standard deviation of the observed sample differences.
2(
n −1)
 n n

Hypothesis Testing

Similar to the previous case, the null hypothesis for determining if there is a difference between two populations
using related samples is Ho: µD = D0, where D0 is the hypothesized population mean difference, which could
take values from −∞ to +∞ but is commonly set to 0. To test the significance of µD, the test procedure to be
performed is the paired t-test.

For the results to be valid, these are the assumptions that need to be satisfied first:
1. The variable of interest should be measured on at least interval scale.
2. Paired samples must be taken using simple random sampling.
3. The paired-differences (di’s) must be normally distributed or the sample size is large enough for the
Central Limit Theorem to apply.

The test statistic of the paired t-test is given by

d − D0
tc = ,
sd
n

which follows the Student’s t distribution with (n−1) degrees of freedom. For a large sample (n ≥ 25), the test
statistic tc is approximately distributed as standard normal, N (0,1). This implies that the Z-table can be used to
obtain the tabular values should you decide to make a decision by comparing the test statistic and its
corresponding tabular value.

EXAMPLE 5.3 Married Couples

Romantic couples with a large age gap often raise eyebrows. Studies found partners with more
than a ten-year age gap experience social disapproval. It is of interest to find if the mean age
of husbands differs from the mean age of their wives. Ten Filipino married couples are selected
at random and their ages (in years) were obtained.
Icon made by Freepik from www.flaticon.com

The study involved married couples wherein each husband and wife were asked about their age. This
illustrates the use of related (or paired) samples. Thus, let di = xhusband − xwife be the difference between the
ages of the ith couple where xhusband is the age of the husband and xwife is the age of the wife.

R COMMANDER OUTPUT #7 The variable of interest for each population is age, which is
measured on a ratio scale. Based on the results of the
Shapiro-Wilk normality test Shapiro-Wilk normality test (see R Commander Output #7),
W = 0.95629, p-value = 0.7429 data on the difference of the ages of the couples are normally
distributed (p-value is too large).

Let’s consider the parameter of interest: µD; the mean difference of the ages between the couple.

Institute of Statistics, CAS, UPLB [9]

Module 5.1 [STAT 101 MODULE HANDOUT]
Inference on Two or More Populations

R COMMANDER OUTPUT #8
Based on the results (see R Commander Output #8), the
95 percent confidence interval: mean difference of the ages between the couple is 3.6 years,
0.04394139 7.15605861 on average. Moreover, a confidence interval estimate can
sample estimates: also be constructed. At a 95% confidence level, the mean
mean of the differences difference of the ages between the couple lies from 0.04 to
3.6 7.16 years.

To determine if the mean age of husbands differ from the mean age of their wives, the appropriate
hypotheses are:

Ho: µD = 0; The mean difference of the ages between the couple is equal to zero.
Ha: µD ≠ 0; The mean difference of the ages between the couple is not equal to zero.

Given that the normality assumption is satisfied, we can use the paired t-test, a parametric test, to verify
the given claim (Consider R Commander Output #9).

R COMMANDER OUTPUT #9
We reject the null hypothesis at a given
Paired t-test α = 0.05. Therefore, we have sufficient
data: Husband and Wife evidence to say that the mean difference in
t = 2.2901, df = 9, p-value = 0.04777 the ages between couples is not equal to zero.
alternative hypothesis: true difference Thus, the data provide evidence that the mean
in means is not equal to 0 age of the husband differs from the mean age
of his wife.

THINK ABOUT THIS!

What if the assumption(s) is(are) not satisfied for the original and transformed data?

In the case in which one of the assumptions of the paired t-test has not been satisfied, the paired Wilcoxon
signed-ranks test should be used. This test considers the magnitude as well as the direction of the differences.

In addition, the following are the requirements that need to be satisfied:

o The variable of interest is at least ordinal.
o Samples must be paired taken using simple random sampling.
o The paired-difference variable (d) must have a symmetric distribution.

Since the mean and median of a symmetric distribution are equal, the paired Wilcoxon signed-ranks test can
be used to perform a hypothesis test for both median difference and mean difference (unless the variable of
interest is in ordinal scale, then only median can be used).

EXAMPLE 5.4. Pre- and post-tests

A carefully designed pre- and post-test can be used as a diagnostic and developmental tool
to assess students’ learning preparedness and improve instructors’ teaching strategy. To
evaluate the knowledge of first-year students on basic statistics concepts, Prof. Remia
recorded the scores obtained from the 20 multiple choice questions by 20 randomly selected
first-year students from the exams administered before and after a series of review sessions.
Icon made by Eucalyp from www.flaticon.com

Institute of Statistics, CAS, UPLB [ 10 ]

Module 5.1 [STAT 101 MODULE HANDOUT]
Inference on Two or More Populations

Prof. Remia wants to measure the amount of learning the students have acquired in their three-week review
sessions. She believes that her students have made great progress since they have been more
participatory in her lecture.

Since both the pre- and post-test scores were recorded for each student, this problem illustrates the use of
related samples. Thus, let di = xafter − xbefore be the difference of the pre- and post-test scores of the ith
student where xafter is the post-test score and xbefore is the pre-test score.

R COMMANDER OUTPUT #10 Based on the results of the Shapiro-Wilk normality test
(see R Commander Output #10, the assumption of
Shapiro-Wilk normality test normality is not satisfied (at a given α = 0.05). Hence, we
W = 0.8866, p-value = 0.0233 can use the median difference as the parameter of interest.

Consider the parameter of interest: MdD, the median difference between the pre- and post-test scores.

To find if Prof. Remia’s students have made great progress, the appropriate hypotheses are:

Ho: MdD = 0; The median difference between the pre- and post-test
scores is equal to zero.
Ha: MdD (Mdafter – Mdbefore) > 0;
The median difference between the pre- and post-test scores is
greater than zero.

Given that the assumption of normality is not satisfied, we should use a

non-parametric test. Since the histogram on the right shows that the data
on the paired-difference of pre- and post-test scores are relatively
symmetric in distribution, the paired Wilcoxon signed-ranks test may
be employed to verify the claim (consider R Commander Output #11).

R COMMANDER OUTPUT #11

Since the p-value is very small, we reject the null
Wilcoxon signed rank test hypothesis. Therefore, we have sufficient evidence to
with continuity correction say that the average difference between the pre- and
V = 190, p-value = 0.00006942 post-test scores is greater than zero. Hence, it can be
alternative hypothesis: true concluded that the review sessions helped improve the
location shift is greater than 0 students’ performance in their statistics subject.

TAKE NOTE!

Both paired t-test with paired Wilcoxon signed-ranks test require simple and random samples. However, for
a normally distributed paired-difference variable, the paired t-test is more powerful.

REFERENCE

WEISS, N. A. (2012) Introductory Statistics. 9th Ed. Pearson Education, Inc.

Institute of Statistics, CAS, UPLB [ 11 ]

AP Biology Premium, 2025: Prep Book with 6 Practice Tests + Comprehensive Review + Online Practice
From Everand
AP Biology Premium, 2025: Prep Book with 6 Practice Tests + Comprehensive Review + Online Practice
Barron's Educational Series
No ratings yet
Sniper Mathematics PP1 QS
100% (1)
Sniper Mathematics PP1 QS
16 pages
Module 1 Statistic PDF
No ratings yet
Module 1 Statistic PDF
7 pages
DL RS 299a
100% (3)
DL RS 299a
9 pages
Oultine 4
No ratings yet
Oultine 4
1 page
STS Reviewer
No ratings yet
STS Reviewer
129 pages
Inbound 8609162511062510069
No ratings yet
Inbound 8609162511062510069
28 pages
M10_Ch10-Notes-Hyp_Test_2_Pop_W19
No ratings yet
M10_Ch10-Notes-Hyp_Test_2_Pop_W19
9 pages
Stat 115 - Chapter 4
No ratings yet
Stat 115 - Chapter 4
62 pages
Quantatitive Analysis and Decision Making: Subject
No ratings yet
Quantatitive Analysis and Decision Making: Subject
9 pages
Hypothesis-Testing-for-Two-Population-Pa
No ratings yet
Hypothesis-Testing-for-Two-Population-Pa
11 pages
sanvi isp practical
No ratings yet
sanvi isp practical
17 pages
STA 204 Lecture Note 2 - Continuation
No ratings yet
STA 204 Lecture Note 2 - Continuation
25 pages
MS5 6
No ratings yet
MS5 6
17 pages
QM SLM - SEM 1-2021 Version - B2 (Units 56) with covers - LO - 11.11.2021
No ratings yet
QM SLM - SEM 1-2021 Version - B2 (Units 56) with covers - LO - 11.11.2021
62 pages
Adstat Final Exam Reviewer2highlighted
No ratings yet
Adstat Final Exam Reviewer2highlighted
29 pages
Draft Proof - Do Not Copy, Post, or Distribute: Estimating The Difference Between The Means of Independent Populations
No ratings yet
Draft Proof - Do Not Copy, Post, or Distribute: Estimating The Difference Between The Means of Independent Populations
37 pages
HS4510,LJ,U6
No ratings yet
HS4510,LJ,U6
5 pages
Week 4 - Statistical hypothesis testing (2)(1)
No ratings yet
Week 4 - Statistical hypothesis testing (2)(1)
22 pages
Research Iii & Iv Quarter 3 Week 6: Ca PS LE T
No ratings yet
Research Iii & Iv Quarter 3 Week 6: Ca PS LE T
11 pages
Chapter 5: Two Samples Tests of Hypothesis
No ratings yet
Chapter 5: Two Samples Tests of Hypothesis
5 pages
Statistical Inferences About Two Populations: Learning Objectives
No ratings yet
Statistical Inferences About Two Populations: Learning Objectives
26 pages
Hypothesis Testing Application 2
No ratings yet
Hypothesis Testing Application 2
32 pages
OLANTIGUE Written Report
No ratings yet
OLANTIGUE Written Report
15 pages
Student t test
No ratings yet
Student t test
12 pages
The Statistical Imagination: Bivariate Relationships: T-Test For Comparing The Means of Two Groups
No ratings yet
The Statistical Imagination: Bivariate Relationships: T-Test For Comparing The Means of Two Groups
31 pages
Two Sample Inference: By: Girma M
No ratings yet
Two Sample Inference: By: Girma M
33 pages
Chapter 5
No ratings yet
Chapter 5
23 pages
Adstat Final Exam Reviewer2
No ratings yet
Adstat Final Exam Reviewer2
29 pages
MD115 Wk05
No ratings yet
MD115 Wk05
86 pages
Data Analysis Lecture
No ratings yet
Data Analysis Lecture
17 pages
Chapter 5. Test Concerning Two Means PDF
No ratings yet
Chapter 5. Test Concerning Two Means PDF
23 pages
Group 8 (Semblante, Lague, Peras, Rama) T-Test: Value
No ratings yet
Group 8 (Semblante, Lague, Peras, Rama) T-Test: Value
11 pages
Inferential Statistics: Shaheena Bashir
No ratings yet
Inferential Statistics: Shaheena Bashir
18 pages
BIOstat T-Test Anova
No ratings yet
BIOstat T-Test Anova
10 pages
Comparing Two Means: T-Distribution To Calculate Confidence Intervals For The Mean Difference and Test
0% (2)
Comparing Two Means: T-Distribution To Calculate Confidence Intervals For The Mean Difference and Test
40 pages
Chapter 13
No ratings yet
Chapter 13
8 pages
Tests of Significance For Small Samples
No ratings yet
Tests of Significance For Small Samples
19 pages
Bus 173 - 3
No ratings yet
Bus 173 - 3
19 pages
Inferential Statistic II
No ratings yet
Inferential Statistic II
61 pages
Local Media1419236475208910846
No ratings yet
Local Media1419236475208910846
36 pages
Testing of Hypothesis - Two Samples
No ratings yet
Testing of Hypothesis - Two Samples
33 pages
Statistics in Research
No ratings yet
Statistics in Research
77 pages
Choosing A Significance Test Objectives
No ratings yet
Choosing A Significance Test Objectives
15 pages
25 25 Mean A 358 Mean B 345 10 14
No ratings yet
25 25 Mean A 358 Mean B 345 10 14
3 pages
Inferential Statistics
No ratings yet
Inferential Statistics
11 pages
Chapter 11
100% (1)
Chapter 11
15 pages
Statistics Summary
No ratings yet
Statistics Summary
26 pages
Chapter 10 Overview: Independent Vs Dependent Samples
No ratings yet
Chapter 10 Overview: Independent Vs Dependent Samples
15 pages
Isds361b Notes
No ratings yet
Isds361b Notes
103 pages
Chapter 3
No ratings yet
Chapter 3
28 pages
Inferential Statistics
No ratings yet
Inferential Statistics
42 pages
1 Hypothesis Testing Rev
No ratings yet
1 Hypothesis Testing Rev
122 pages
AST Practice Q.
No ratings yet
AST Practice Q.
4 pages
Lesson 12 Hypothesis Testing and Interpretation
No ratings yet
Lesson 12 Hypothesis Testing and Interpretation
10 pages
Bus-173 3
No ratings yet
Bus-173 3
19 pages
Biostatistics M1-1
No ratings yet
Biostatistics M1-1
57 pages
Research Methods: Inferential Statistics: Two Group Design
No ratings yet
Research Methods: Inferential Statistics: Two Group Design
36 pages
Science Research Iii: Second Quarter-Module 6 Hypothesis Testing For The Means - Two Sample (Independent Sample)
No ratings yet
Science Research Iii: Second Quarter-Module 6 Hypothesis Testing For The Means - Two Sample (Independent Sample)
9 pages
AQA Psychology A Level – Research Methods: Practice Questions
From Everand
AQA Psychology A Level – Research Methods: Practice Questions
Sheila Thomas
No ratings yet
Research in Psychology
From Everand
Research in Psychology
Connor Whiteley
No ratings yet
Simulasi Harga Pekerjaan Painting: 21,981.28 Price Per-M2 Area (m2)
No ratings yet
Simulasi Harga Pekerjaan Painting: 21,981.28 Price Per-M2 Area (m2)
1 page
TEST 1 G9
No ratings yet
TEST 1 G9
3 pages
Animals rights
No ratings yet
Animals rights
11 pages
4-Day Marmaris To Fethiye
No ratings yet
4-Day Marmaris To Fethiye
11 pages
The Psychology Of Art George Mather instant download
100% (1)
The Psychology Of Art George Mather instant download
42 pages
1.-Germans at Meat 1910
No ratings yet
1.-Germans at Meat 1910
4 pages
qatartribune-20250316-1
No ratings yet
qatartribune-20250316-1
16 pages
Surat Pesanan Apotek
No ratings yet
Surat Pesanan Apotek
1 page
Microprocessors and Microcontrollers
No ratings yet
Microprocessors and Microcontrollers
86 pages
8P
No ratings yet
8P
4 pages
Synthetic Hydrograph Models
No ratings yet
Synthetic Hydrograph Models
17 pages
Research Report: Bvlgari
No ratings yet
Research Report: Bvlgari
49 pages
HJHJBBBB
No ratings yet
HJHJBBBB
6 pages
ADDITOL-XL-481-Launch-Pack-final-10-21
No ratings yet
ADDITOL-XL-481-Launch-Pack-final-10-21
9 pages
ENGR 600 + Mid+term-2024
No ratings yet
ENGR 600 + Mid+term-2024
3 pages
Lesson 04-Chapter 4 Classification PDF
100% (1)
Lesson 04-Chapter 4 Classification PDF
86 pages
Euclid Bams 1183535407
No ratings yet
Euclid Bams 1183535407
4 pages
Toyota Hilux 4x4
No ratings yet
Toyota Hilux 4x4
4 pages
Chapter 2. TIME
No ratings yet
Chapter 2. TIME
24 pages
Thin Walled Pressure Vessels: ASEN 3112 - Structures
No ratings yet
Thin Walled Pressure Vessels: ASEN 3112 - Structures
11 pages
Chem 201935443552
No ratings yet
Chem 201935443552
10 pages
Advanced Computer Architecture CSE 8383
No ratings yet
Advanced Computer Architecture CSE 8383
56 pages
Two Worlds One Family
No ratings yet
Two Worlds One Family
81 pages
Model of Cellular Automata
No ratings yet
Model of Cellular Automata
5 pages
Webinar 1 Pfmea As13100 Requirements Overview Published
No ratings yet
Webinar 1 Pfmea As13100 Requirements Overview Published
44 pages
Overview of Transaction Processing and Enterprise Resource Planning Systems
No ratings yet
Overview of Transaction Processing and Enterprise Resource Planning Systems
34 pages
The ChemSep-COffdfdfCO Casebook - Air Separation Unit
No ratings yet
The ChemSep-COffdfdfCO Casebook - Air Separation Unit
5 pages
Selangor Times 15 June 2012
No ratings yet
Selangor Times 15 June 2012
24 pages
Credit Creation
No ratings yet
Credit Creation
12 pages

STAT 101 Module Handout 5.1

Uploaded by

STAT 101 Module Handout 5.1

Uploaded by

STAT 101: Statistical Methods

MODULE 5.1 1st Semester 2022-2023

Inference on Two or More Populations

TOPIC 5A. RELATED VS. INDEPENDENT SAMPLES

ILLUSTRATION: Differentiating between related and independent samples

Institute of Statistics, CAS, UPLB [1]

ACTIVITY 5.1 Test yourself: Related or independent samples?

Institute of Statistics, CAS, UPLB [2]

TOPIC 5B. COMPARISON OF TWO POPULATION MEANS

ESTIMATION AND TEST OF HYPOTHESIS ON TWO POPULATION MEANS

USING INDEPENDENT SAMPLES

Sample values from Population 1 Sample values from Population 2

ni is the size of the sample drawn from the ith population;

Institute of Statistics, CAS, UPLB [3]

Given two Case 1: Case 2: Case 3:

Test statistic ˆD − D0 ˆD − D0 ˆD − D0

Institute of Statistics, CAS, UPLB [4]

EXAMPLE 5.1. Indigenous People

Institute of Statistics, CAS, UPLB [5]

Bartlett test of homogeneity of variances

Welch Two Sample t-test

THINK ABOUT THIS!

Institute of Statistics, CAS, UPLB [6]

EXAMPLE 5.2. Jazz or pop music?

Institute of Statistics, CAS, UPLB [7]

Given that both populations do not follow a normal distribution, we

Based on the results (see R Commander Output #6), we reject the

Wilcoxon rank sum test

USING RELATED (OR PAIRED) SAMPLES

Institute of Statistics, CAS, UPLB [8]

A (1−α) × 100% confidence interval estimator for µD is given by

The test statistic of the paired t-test is given by

EXAMPLE 5.3 Married Couples

Institute of Statistics, CAS, UPLB [9]

THINK ABOUT THIS!

In addition, the following are the requirements that need to be satisfied:

EXAMPLE 5.4. Pre- and post-tests

Institute of Statistics, CAS, UPLB [ 10 ]

Given that the assumption of normality is not satisfied, we should use a

R COMMANDER OUTPUT #11

WEISS, N. A. (2012) Introductory Statistics. 9th Ed. Pearson Education, Inc.

Institute of Statistics, CAS, UPLB [ 11 ]

You might also like