09 Introduction To Nonparametric Methods 2
Nonparametric Methods
Glyzel Grace M. Francisco
STAT1200 – Management Science
2nd Semester, 2022-2023

At the end of this lesson, you should be able to:
1. Determine when to perform nonparametric tests.
2. Perform Spearman Rank Correlation.
3. Perform Chi-square Test for Independence.

Parametric tests assume underlying statistical distributions in the data. Therefore, several conditions of validity must be met so that the result of a parametric test is reliable. For example, Student’s t-test for two independent samples is reliable only if each sample follows a normal distribution and if sample variances are homogeneous.

Nonparametric tests do not assume that the data follow a particular distribution. They can thus be applied even if parametric conditions of validity are not met.
Parametric Test vs. Nonparametric Test

| Conditions/Purposes | Parametric Test (Normal Distribution) | Nonparametric Test (Non-normal Distribution) |
|---|---|---|
| Compare a mean with standard value | One sample z-test (if σ is known) and One sample t-test (if σ is unknown) | Wilcoxon test |
| Compare two means of unpaired data sets | Two independent samples z-test (if σ1 and σ2 are known) and Two independent samples t-test (if σ1 and σ2 are unknown) | Mann-Whitney test |
| Compare two means of paired data sets | Paired-sample t-test | Wilcoxon test |
| Compare >2 means of unmatched data sets | One-way ANOVA | Kruskal-Wallis test |
| Compare >2 means of matched data sets | Multi-factor ANOVA | Friedman test |
| Find the relationship between two variables | Pearson’s correlation | Spearman’s correlation |
| Predict the values of one variable from another | Simple linear or nonlinear regression | Spearman’s correlation |
| Find the relationship among several variables | Multiple regression (linear/nonlinear) | Kendall’s coefficient of concordance |

Assessing normality using different statistical graphs/plots

• A normal quantile plot (or normal probability plot) is a graph of points (x, y) where each x value is from the original set of sample data, and each y value is the corresponding z score that is a quantile value expected from the standard normal distribution.

Procedure for determining whether it is reasonable to assume that sample data are from a normally distributed population:

1. Histogram: Construct a histogram. Reject normality if the histogram departs dramatically from a bell shape.
2. Outliers: Identify outliers. Reject normality if there is more than one outlier present. (Just one outlier could be an error or the result of chance variation, but be careful, because even a single outlier can have a dramatic effect on results.)
3. Normal quantile plot: If the histogram is basically symmetric and there is at most one outlier, use technology to generate a normal quantile plot. Use the following criteria to determine whether or not the distribution is normal. (These criteria can be used loosely for small samples, but they should be used more strictly for large samples.)

Normal Distribution: The population distribution is normal if the pattern of the points is reasonably close to a straight line and the points do not show some systematic pattern that is not a straight-line pattern.

Example (Normal)
The first case shows a histogram of IQ scores that is close to being bell-shaped, so the histogram suggests that the IQ scores are from a normal distribution. The corresponding normal quantile plot shows points that are reasonably close to a straight-line pattern, and the points do not show any other systematic pattern that is not a straight line. It is safe to assume that these IQ scores are from a normally distributed population.

Example (Uniform)
The second case shows a histogram of data having a uniform distribution. The corresponding normal quantile plot suggests that the points are not normally distributed because the points show a systematic pattern that is not a straight-line pattern. These sample values are not from a population having a normal distribution.
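The (x, y) points of a normal quantile plot can be computed by hand with Python's standard library, which may help when checking what a software package draws. The sketch below is illustrative only. The plotting position (i - 0.5)/n is one common convention and is an assumption here, the sample values are hypothetical, and the `straightness` helper (a plain correlation between x and z) is not part of the lesson's criteria; it is only a rough numeric aid to the visual check.

```python
from statistics import NormalDist, mean


def normal_quantile_points(sample):
    """Return (x, z) pairs: sorted data values paired with standard normal quantiles."""
    xs = sorted(sample)
    n = len(xs)
    std_normal = NormalDist()  # mean 0, standard deviation 1
    # Assumed plotting position: (i - 0.5) / n for the i-th smallest value.
    zs = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return list(zip(xs, zs))


def straightness(points):
    """Pearson correlation between x and z; values near 1 suggest a near-straight line."""
    xs = [x for x, _ in points]
    zs = [z for _, z in points]
    mx, mz = mean(xs), mean(zs)
    sxz = sum((x - mx) * (z - mz) for x, z in points)
    sxx = sum((x - mx) ** 2 for x in xs)
    szz = sum((z - mz) ** 2 for z in zs)
    return sxz / (sxx * szz) ** 0.5


# Hypothetical sample values, not data from the lesson.
iq_like = [85, 92, 96, 99, 100, 101, 104, 108, 115]
points = normal_quantile_points(iq_like)
r = straightness(points)

for x, z in points:
    print(f"x = {x:5.1f}   z = {z:6.3f}")
print(f"straightness (correlation) = {r:.3f}")
```

A correlation close to 1 is consistent with a straight-line pattern, but it does not replace looking at the plot, since a systematic curve can still produce a fairly high correlation.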
Example 1

Ho: Sex and hand preference are not related.
Ha: Sex and hand preference are related.

4. Computation

Observed frequencies:

| Sex (i) | Hand Preference (j): Left (1) | Hand Preference (j): Right (2) | Total |
|---|---|---|---|
| Female (1) | 12 | 108 | 120 |
| Male (2) | 24 | 156 | 180 |
| Total | 36 | 264 | 300 |

First, find the expected values for each cell of the contingency table.

$$E_{ij} = \frac{(\text{row total})(\text{column total})}{\text{grand total}}$$

$$E_{11} = \frac{(120)(36)}{300} = 14.4 \qquad E_{12} = \frac{(120)(264)}{300} = 105.6$$

$$E_{21} = \frac{(180)(36)}{300} = 21.6 \qquad E_{22} = \frac{(180)(264)}{300} = 158.4$$

Then compute the test statistic.

$$\chi^2_c = \sum_{i=1}^{k} \frac{(O_i - E_i)^2}{E_i} = \frac{(12 - 14.4)^2}{14.4} + \frac{(108 - 105.6)^2}{105.6} + \frac{(24 - 21.6)^2}{21.6} + \frac{(156 - 158.4)^2}{158.4} = 0.7576$$

5. Decision

Since 0.7576 < critical value (3.841), we fail to reject Ho.

6. Conclusion

At the 5% level of significance, there is not enough evidence to conclude that sex and hand preference are related.
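The arithmetic in Example 1 can be checked with a short script. This is a minimal sketch using only plain Python. The function names are hypothetical, the observed counts and the critical value 3.841 are taken from the example, and the decision rule assumed is to reject Ho only when the computed statistic exceeds the critical value.

```python
def expected_counts(observed):
    """Expected count per cell: (row total)(column total) / grand total."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


def chi_square_statistic(observed):
    """Sum of (O - E)^2 / E over all cells of the contingency table."""
    expected = expected_counts(observed)
    return sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )


# Rows: Female, Male. Columns: Left, Right (counts from Example 1).
observed = [[12, 108], [24, 156]]
expected = expected_counts(observed)
chi2 = chi_square_statistic(observed)

critical_value = 3.841  # value quoted in the example for the 5% level
reject_h0 = chi2 > critical_value

print(expected)
print(round(chi2, 4))  # 0.7576
print("Reject Ho" if reject_h0 else "Fail to reject Ho")
```

Because the computed statistic is below the critical value, the script reports that Ho is not rejected.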