Define the null hypothesis (no difference between sample and theoretical distribution) and the alternative hypothesis (difference exists).
Define the null hypothesis (no difference between sample and theoretical distribution) and the alternative hypothesis (difference exists).
Statistics: Tests
& Examples
This presentation explores various statistical tests used to analyse
data and draw conclusions. We'll discuss the fundamentals of
each test, its applications, and relevant examples.
BY - GROUP 3
Small Sample Test
A small sample test refers to statistical tests used to analyze data when the sample size is relatively small,
typically less than 30 observations.
One-sample t-test
Tests if the mean of a sample is significantly different from a known
population mean.
Example: Comparing the average height of students in a class to
the national average.
Two-sample t-test
Tests if the means of two independent samples are significantly
different.
Example: Testing if there's a difference in the average performance
of two groups of students on a test.
Paired t-test
Tests if the means of two related samples are significantly different
Suppose a teacher wants to know if a new teaching method has
improved students' test scores. The teacher administers a test
before and after the new teaching method to the same group of 5
students..
Practical Example: Comparing Two Study Methods
Imagine you're a teacher and you want to find out if two different study methods lead to
different exam scores for your students. You divide your class into two groups:
Compare Calculated t-value to Critical t-value:- Suppose our calculated t-value is 2.7.
Significant Result: If our calculated t-value (2.7) is greater than the critical t-value (2.306), we have a significant result. This means the
difference in average scores between the two groups is unlikely to be due to chance.
Not Significant Result: If our calculated t-value was less than 2.306, it would mean that the difference might be due to random chance.
F-test
The F-test compares the variances of two populations. It's used to
determine if the variances are statistically significantly different.
Key
Conditions:
• Categorical Data: The data used in the chi-square test should
be categorical, meaning it falls into distinct categories or groups.
• Independence: Observations in the data should be
independent of each other. This means that one observation
should not influence another.
• Expected Frequencies: The expected frequency in each cell
of the contingency table should be at least 5. This condition is
crucial for the accuracy of the chi-square distribution
approximation.
χ² = Σ [(O - E)² /
Where:
E]
• χ² is the chi-square statistic
• O is the observed frequency
• E is the expected frequency
• Σ is the sum of all categories
• Purpose : To assess whether the observed frequencies differ significantly from the
expected frequencies under a specific hypothesis.
A chi-square (Χ^2) goodness of fit test is a goodness of fit test for a categorical variable
. Goodness of fit is a measure of how well a statistical model fits a set of observations.
• When goodness of fit is high, the values expected based on the model are close to
the observed values.
• When goodness of fit is low, the values expected based on the model are far from
the observed values.
Example :
You’re hired by a dog food company to help them test three new dog food flavors.
You recruit a random sample of 75 dogs and offer each dog a choice between the
three flavors by placing bowls in front of them. You expect that the flavors will be
equally popular among the dogs, with about 25 dogs choosing each flavor.
Once you have your experimental results, you plan to use a chi-square goodness of
fit test to figure out whether the distribution of the dogs’ flavor choices is
significantly different from your expectations.
• Null hypothesis (H0) : The dog population chooses the three flavors in equal
proportions (p1 = p2 = p3).
• Alternative hypothesis (Ha) : The dog population does not choose the three
flavors in equal proportions.
• The Degrees of Freedom (df) : For chi-square goodness of fit tests, the df is the
number of groups minus one.
Since there are three groups (Garlic Blast, Blueberry Delight, and Minty Munch),
there are two degrees of freedom.
For a test of significance at α = 0.05 and df = 2 , the Χ^2 critical value is 5.99 .
• If the Χ^2 value is less than the critical value, then the difference between the
observed and expected distributions is not statistically significant (p > α).
⚬ The data doesn’t allow you to reject the null hypothesis and doesn’t provide
support for the alternative hypothesis.
The Χ^2 value is less than the critical value. Therefore, you should not reject the null
hypothesis that the dog population chooses the three flavors in equal proportions.
There is no significant difference between the observed and expected flavor choice
distribution (p > .05). This suggests that the dog food flavors are equally popular in
the dog population
Independence of Attributes
This test determines if two categorical variables are independent of each other. It assesses whether there is a statistically
significant association between the variables.