Review and Non Parametric Using SPSS 2023
Review and Non Parametric Using SPSS 2023
Advanced Statistical
Tools Using SPSS
Prepared by:
Armele J. Mangaran
Professor College of Science
Bulacan State University
February 11, 2023
It was either the famous writer Mark Twain or the 19th-century British
Prime Minister Benjamin Disraeli who said: “There are three kinds
of lies: lies, damned lies, and statistics.”
This saying means that statistics can be used for very wrong
purposes and the persuasive power of numbers can assist to
strengthen weak arguments.
Scale Categorical
Continuous
Measurements Discrete: Ordinal: Nominal:
takes any Counts/ integers obvious order no meaningful
value order
The research problem
Problem Methodology
Independent Groups
Triglyceride level at
week 8 (mg/dl) -
-11.371 80.360 13.583 -38.976 16.233 -.837 34 .408
Triglyceride level at
baseline (mg/dl)
Null Hypothesis is:
P-value =
Decision (circle correct answer): Reject Null/ Do
not reject Null
Conclusion:
Ignore the
shaded part of the
Example 2: t-Test Results
output for now!
Levene's Test
95% CI of the
for Equality of T-test results
Difference
Variances
Sig. Mean Std. Error
F Sig. t df Lower Upper
(2-tailed) Difference Difference
Equal variances
2.328 .136 4.539 35 .000 3.648 .804 2.016 5.280
assumed
Equal variances
4.510 32.342 .000 3.648 .809 2.001 5.295
not assumed
P-value =
Decision (circle correct answer): Reject Null/ Do
not reject Null
Conclusion:
Ignore the
shaded part of the
Example 2: Solution
output for now!
Levene's Test
95% CI of the
for Equality of T-test results
Difference
Variances
Sig. Mean Std. Error
F Sig. t df Lower Upper
(2-tailed) Difference Difference
Equal variances
2.328 .136 4.539 35 .000 3.648 .804 2.016 5.280
assumed
Equal variances
4.510 32.342 .000 3.648 .809 2.001 5.295
not assumed
IS evidence of a
P(t< - P(t>4.539
difference in weight loss 4.539) )
between treatment and Is < 0.001 Is < 0.001
placebo
-4.539 4.539
Assumptions in t-Tests
Normality: Plot histograms
One plot of the paired differences for any paired data
Two (One for each group) for independent samples
Don’t have to be perfect, just roughly symmetric
Every sample
taken from a
population, will
Sample A contain different
n=20
Mean = 277 numbers so the
mean varies.
Population Sample B
Mean =? n=50 Which estimate
SD =? Mean = 274 is most reliable?
Levene's Test
95% CI of the
for Equality of T-test results
Difference
Variances
Sig. Mean Std. Error
F Sig. t df Lower Upper
(2-tailed) Difference Difference
Equal variances
2.328 .136 4.539 35 .000 3.648 .804 2.016 5.280
assumed
Exercise: Solution
Discuss what the interpretation is for the confidence interval from
Example 2 highlighted below:
Levene's Test
95% CI of the
for Equality of T-test results
Difference
Variances
Sig. Mean Std. Error
F Sig. t df Lower Upper
(2-tailed) Difference Difference
Equal variances
2.328 .136 4.539 35 .000 3.648 .804 2.016 5.280
assumed
Variables:
Favourite subject (Nominal)
Gender (Binary/ Nominal)
2
n
Oi Ei
2
i 1 Ei
Chi squared Test?
Null: There is NO association between
class and survival
Alternative: There IS an association between
class and survival
contingency
table
3x2
Using SPSS
p- value
p < 0.001
SPSS
Output
Low Cell Counts with the Chi-squared test
Presence of outliers
Statistic used:
r = correlation coefficient
Correlation Coefficient r
Measures strength of a relationship between
two continuous variables
r = 0.9
r = 0.01
r = -0.9
Correlation Interpretation
An interpretation of the size of the coefficient has
been described by Cohen (1992) as:
Relationship Correlation
y a x
Histogram of the
residuals looks
approximately normally
distributed
R2 = 0.152
Does the model result in reliable predictions?
Check the assumptions
Correlation
Interpretation:
There is a significant relationship between a mothers’ pre-pregnancy
weight and the weight of her baby (p = 0.011). Pre-pregnancy
weight has a positive affect on a baby’s weight with an increase of
0.03 lbs for each extra pound a mother weighs.
Interpretation:
There is a significant relationship between a mothers’ pre-pregnancy
weight and the weight of her baby (p = 0.011). Pre-pregnancy
weight has a positive affect on a baby’s weight with an increase of
0.03 lbs for each extra pound a mother weighs.
INDEPENDENT DEPENDENT
(explanatory/ (outcome)
affects
predictor) variable
variable
Scale Categorical
Ordinal Nominal
How many variables are
involved?
Two – interested in the relationship
Comparing
means
Comparing
Paired t-test
measurements
WITHIN the same
subject
Comparing means
Independent
t-test
2
Comparing
BETWEEN groups One way
ANOVA
Comparing
means
Comparing
Paired t-test
measurements
WITHIN the same
subject
Does Margarine X
reduce cholesterol?
Everyone has
cholesterol measured
on 3 occasions
Nonparametric
techniques are usually
Non-parametric based on ranks/ signs
rather than actual data
Non-parametric tests
Dependent
Scale Categorical
Ordinal: Nominal:
Normally
Skewed data Non-parametric Chi-squared
distributed
Non-parametric
Parametric test
Q&A
• "We must be careful not to confuse data with
the abstractions we use to analyze them."
William James.
• "Maturity is the capacity to endure uncertainty."
John Finley.
• "Natural selection is a mechanism for
generating an exceedingly high degree of
improbability."R. A. Fisher.