Consistency of Stratified Random Sampling Estimators in Repetive Sampling
Consistency of Stratified Random Sampling Estimators in Repetive Sampling
ISSN No:-2456-2165
Abstract:- This study focuses on consistency of stratified without any recourse to a well stated and known principle of
random sampling in repeated sampling processes within selection (Lohr,2021; Wu,2022). Kanaki & Kalogiannakis
a population with heterogeneous characteristics. The (2023) posited that process of selection under non-random
data for the study is a real life data on number of sampling usually undermine the estimation and precision
students in schools where the stratification is on the basis due to implementation procedure. On the other hand, the
of ownership (public or private). Proportional allocation sampling methods operating under the principle of known
method was used to determine the number of units and non-zero probability of inclusion have advantage over
(schools) to be chosen from each stratum for a given the non-random sampling methods particularly in the areas
sample size and four (4) independent samples of equal of procedure, estimation and precision assessment (Pawar,
sample sizes were chosen and estimates of mean Verma, Daniel, & Sayyad,2023). Karunarathna, De Alvis,
,variance as well as confidence interval obtained with Gunasena and Jayawardana (2024) posited that researchers
the estimators of stratified random sampling. The have preference for random sampling methods over non
estimates obtained for each sample sizes were subjected random sampling methods in order to explore the
to a test of significance to test the null hypothesis of no advantages of estimation and efficiency. Random sampling
significance difference between the estimates and the can be based on equal probability of selection for units in a
actual value using the t-statistic. The analysis revealed given population as it is the case in simple random sampling
that the estimates obtained for different samples differs (Rahman, Tabash, Salamzadeh, Abduli, & Rahaman, 2022),
but the test of significance revealed that there is no interval selection of units as related to systematic
significant difference in the estimates across the sampling(Iliyasu & Etikan,2021), selection of units from a
independent samples as the P-values are less the level of population where there is intra-group heterogeneity as well
significance 0.05 except for n 30 which could be as inter-group homogeneity as it is applicable in cluster
considered as an outlier. Also, there is no significant sampling(Lohr, 2021) and selection of units from
difference in the estimates of variances for the various homogeneous sub-groups obtained after stratification of a
sample sizes considered for this study with a Pvalue of 0. heterogeneous population as it is in stratified
2344. sampling(Sharma,2023; Shah, 2024).Stratified random
sampling is a method applicable where a given
Keywords:- Resampling, Stratified Sampling, Proportional heterogeneous population is partitioned into a set of non-
Allocation, Estimators, P-value. overlapping homogenous sub-groups called strata (Verma,
Verma & Abhishek, 2024). Enzo(2021) mentioned that the
I. INTRODUCTION population is usually partitioned into sub-groups on the
basis of a stratification variable capable of grouping the
Sampling is obviously the most widely used concepts units in the population in a way that the groups are mutually
in day to day human activities as it forms the basis of exclusive. Pandey (2024) opined that stratified sampling
operations in virtually all areas of endeavour. It has an specifically have enormous areas of applications. Stratified
enormous range of application in very many real life sampling is known to have a good precision when adopted
situations which makes it a concept of concern to in selection of units from heterogeneous populations in the
researchers and stakeholder in the specific areas of need presence of a characteristic which could be used for
with the aim of maximizing benefits in its usage by ensuring stratification (Ahamed, Gupt, & Phukon, 2021). The
appropriateness, efficiency and cost economy. Samples principal concern of samplers and researchers is to deploy
survey methods have wide range of applications in both an appropriate method in order to ensure accuracy of
physical, natural, health science and economy (Olayiwola, estimates obtained through the sampling process. The major
Apantaku, Bisira &Adewara,2013).Sampling could be done concern of stratified sampling is to reduce the variability at a
in a probabilistic context where every units in the population given cost or reduce the cost for a given variance(Lawal,
has a known and non-zero probability of selection(Mweshi, Salami, Obisesan,Yusuff & Owolabi,2018; Adebola &
& Sakyi, 2020; Clark & Steel, 2022,). Non – probabilistic Ajayi,2014).Some of the issues earlier raised by researchers
sampling involves selection of units from the population on stratified sampling include the determination of
stratification boundary. Horgan (2006) considered the issues
x
L
different samples a bound to have unique measurements for
the characteristics of interest. The central idea of this paper X st N N 1
j i 1
ij
The data used for this study is the enrolment data into =( (W j )2 (1 f j )V ( kj ))
j 1
both public and private schools in Ede North local
government, Osun state Nigeria. The population consists of
a total of 74 schools comprising 28 public schools and 46
L nh
(W j )2 (1 f j ) (n j 1)1 ( kj )2 )
private schools in Ede North Local Government, Osun State
Nigeria. The data consists of the number of students in the =(
j 1 i 1
various grade levels in the schools. The summary of the data
is presented in Table 1. Four independent samples of the
(N ) n
1
=N
2
j
2
j ( N j n j ) j2 (2) The proportional allocation method was adopted to
j 1 ensure fairness in the determination of number of units to be
chosen from each stratum for a given sample size ( n ) such
that the stratum sample size depends on the stratum weight (
0.5
Se = [V ( X st )] Wh ).The estimators in 1, 2, 3 ,4 and 5 are used in the
computation to obtain the estimates on the k-samples shown
L
(N ) n
1 in table 2 to 5. A test of significance was performed using
=[N
2
j
2
j ( N j n j ) j2 ]0.5 (3) the F-statistic and Pvalue to test the hypothesis in (6) to
j 1
determine whether the estimated value of the statistic(s) at a
particular sample size (n ) is significantly different.
The lower and upper boundaries of the confidence
interval respectively can be expressed as H 0 : 1 2 3 4 vs H1 : 1 2 3 4
for at least two pairs of k.
L k X L
st Z1 / 2 [( (W j ) (1 f j ) j ]
2 2 0.5 Decision rule: Reject if the P<0.05, otherwise do not
j j 1
(4) reject.
Table 1 shows the distribution of schools with respect III. RESULTS AND DISCUSSIONS
to ownership. It shows that there are 28 and 46 public and
private schools respectively with corresponding weights of Table 2 shows the estimates of mean, variance and
0.3784 and 0.6216 respectively. The distribution of schools confidence interval for the four independently selected
suggests that the two categories do not have the same samples of equal sizes. It was observed that the estimates of
weights as shown in column 3 where the number of private the statistic(s) are reasonably close. The estimates for
schools is almost twice the number of public schools. In various sample sizes were subjected to test of significance
order to take into consideration the variations in the stratum using the t-statistic with overall mean of 85.2466 and the
weights, the proportional allocation method was adopted Pvalue obtained was as presented in table IX. The results
such the stratum sample size is directly proportional to the show that the estimates are not significantly different from
stratum weight and expressed as nh nWh . The various the overall mean of 85.2466 indicating that the deviation of
the estimates from the population mean is not significant
sample sizes leads to different values of nh in the two strata except for n=30 which can be considered as an outlier.
while approximation was observed in cases where the value
is a non-integer.