Hypothesis Testing

Uploaded by

cryptobaratoe

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

Hypothesis Testing

Uploaded by

cryptobaratoe

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 29

Machine Learning

Hypothesis Testing
Lecturer: Professor Hadi Farahani

February, 2024
Content

● Basics of hypothesis testing

● Hypothesis tests: Z-test, T-test, Chi-Square test
● p-Value method
● Type Ⅰ and Type Ⅱ Errors
Basics of hypothesis testing

- Hypothesis testing is a statistical method used to make inferences or conclusions about a

population parameter based on sample data. It involves formulating a null hypothesis (typically
denoted as H₀) and an alternative hypothesis (Hₐ), which are mutually exclusive statements about
the population parameter. The purpose of hypothesis testing is to assess the evidence provided by
the sample data to determine whether there is enough evidence to reject the null hypothesis in
favor of the alternative hypothesis, or if there is not enough evidence to do so.
Basics of hypothesis testing

● Hypothesis: A claim or a premise that we want to test.

● Null hypothesis (H₀): Currently accepted claim. In other words, we could say that H₀is the default
state of belief about the world.
● Alternative hypothesis (Hₐ): Involves the claim to be tested.
● H₀ and Hₐ are mathematically opposites.
Basics of hypothesis testing

● Test statistics: Calculated from sampled data and used to decide.

● Statistically significant: Where do we draw a line to make a decision?
● Level of confidence: This represents the probability that the statistical test will lead to the correct
rejection of the null hypothesis when it is false. It's the probability that the confidence interval will
contain the true population parameter.
● Level of significance: In hypothesis testing, the level of significance is typically set before
conducting the test, and it represents the threshold beyond which you would reject the null
hypothesis. Commonly used levels of significance are 0.05, 0.01, etc., but they can vary depending
on the context and the requirements of the analysis.. It is calculated by the following formula:
α= 1-C
Basics of hypothesis testing

The possible outcome of hypothesis testing:

- Reject Null hypothesis (H₀)

- Fail to reject Null hypothesis (H₀)
Example
It is believed that a candy machine makes chocolate bars that are on average 5g. A worker claims
that the machine after maintenance make no longer 5g bars.

● H₀: μ = 5g
● Hₐ: μ ≠ 5g
Hypothesis tests

There are various hypothesis tests, each appropriate for various goals to calculate our test. This could be
a Z-test, Chi-square, T-test, and so on.

● Z-test: If population means and standard deviations are known and the sample size is greater than
30 then Z-statistic is commonly used.
● T-test: If population standard deviations are unknown and sample size is less than 30 then t-test
statistic is more appropriate.
● Chi-square test: Chi-square test is used for categorical data or for testing independence in
contingency tables
● F-test: F-test is often used in the analysis of variance (ANOVA) to compare variances or test the
equality of means across multiple groups.
Hypothesis tests

● Hypothesis test could be conducted on population mean or on population proportion.

● In hypothesis test for population mean the test statistic is computed by:

● In hypothesis test for population proportion the test statistic is computed by:
Example
A factory has a machine that dispenses 80 mL of fluid in a bottle. An employee believes the average
amount of fluid is not 80 mL. Using 40 samples, he measures the average amount dispensed by the
machine to be 78 mL with a standard deviation of 2.5. (a) State the null and alternative hypothesis. (b) At
a 95% confidence level, is there enough evidence to support the idea that the machine is not working
properly?
Example
a) H₀: μ = 80 mL, Hₐ: μ ≠ 80 mL
b) The first step is to determine the type of test. Is this a one tail test or two tail test? The fact is that
the Hₐ is not equal to 80 and it could be less than 80 or greater than 80. So we need to conduct a
two tail test.

Here we use Z-test because the number of sampled data is more than 30.
Example
- The confidence level (C) is equal to 95% .
- So the significant level (α) would be equal to: α= 1-C= 1- 0.95= 0.05. Based on this we could say
that the value of α for each side is equal to α/2. That means the area of shaded regions would be
equal to 0.025 or 2.5%.
Example
- Now we need to find the z value correspond to 95% confidence level from the table. Which is
equal to 1.96. This value separates the rejection region (shaded area) from the failed to rejection
region (unshaded region).

- Now to make a decision to accept or reject null hypothesis we need to calculate the z score of
sampled data and compare it with the critical z value which is 1.96.
Example
Example
- So the calculated z is equal to - 5.06 which is less than -1.96. This shows that calculated z is in
rejection area and we could reject null hypothesis.
Example
A company manufactures car batteries with an average life span of 2 or more years. An engineer believes
this value to be less. Using 10 samples , he measures the average lifespan to be 1.8 years with a standard
deviation of 0.15. (a) state the null and alternative hypothesis. (b) At a 99% confidence level, is there
enough evidence to discord the null hypothesis?
Example
a) H₀: μ >= 2, Hₐ: μ < 2
b) The first step is to determine the type of test. Is this a one tail test or two tail test? The fact is that
the Hₐ is less than to. So we need to conduct a one tail test.
- Here we use t-test because population mean and standard deviations are unknown and the
number of sampled data is less than 30.
Example
- The confidence level (C) is equal to 99% .
- So the significant level (α) would be equal to: α= 1-C= 1- 0.95= 0.01.
- Now we need to find the t value correspond to the degree of freedom (df) and α value from the
table of student t distribution.
Example
Example
Example
- So the calculated t is equal to - 4.22 which is less than -2.82. This shows that calculated t is in
rejection area and we could reject null hypothesis.
p-Value method

Conducting a hypothesis test typically proceeds in four steps:

- Step 1: Define the Null and Alternative Hypothesis

- Step 2: Construct the Test Statistic

- Step 3: Compute the p-Value

- Step 4: Decide Whether to Reject the Null Hypothesis
p-Value method

p-Value: The p-Value serves as a crucial metric, quantifying the likelihood that an observed difference is
a result of chance. As the p-Value decreases, the statistical significance of the observed difference
intensifies. Ultimately, a very low p-Value prompts the rejection of the null hypothesis.
Example
A factory manufactures cars with a warranty of 5 years on the engine and transmission. An engineer
believes that the engine or transmission will malfunction in less than 5 years. He tested a sample of 40
cars and find the average time to be 4.8 years with the standard deviation of 0.50. (a) State the null and
alternative hypothesis. (b) At 2% significant level, is there enough evidence to support the idea that the
warranty should be revised?
Example
- First we need to calculate z value correspond to sampled data.
Example
● So the p-Value is equal to 0.0057. In p-Value method if p-Value < α the the null hypothesis is
rejected and if p-Value >= α the null hypothesis is accepted.
Example
- Then we need to find the area that correspond to z value.
Type Ι and Type Ⅱ Errors

When conducting hypothesis testing on randomly selected data samples instead of the entire population,
it's essential to acknowledge that our conclusions may not be universally applicable. Two types of errors
can occur:

● Type Ι Error: Incorrectly rejecting the null hypothesis when it is true.

● Type Ⅱ Error: Incorrectly accepting the null hypothesis when it is false.
References
● https://www.youtube.com/watch?v=zJ8e_wAWUzE&t=91s
● https://www.youtube.com/watch?v=8Aw45HN5lnA
● James, G., Witten, D., Hastie, T., Tibshirani, R. and Taylor, J., 2023. An introduction to
statistical learning: With applications in python. Springer Nature.

C 17
No ratings yet
C 17
20 pages
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
From Everand
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
Joseph George Caldwell
No ratings yet
Statistical Analysis (T-Test)
No ratings yet
Statistical Analysis (T-Test)
61 pages
T - Test
100% (2)
T - Test
32 pages
Eda Research
No ratings yet
Eda Research
11 pages
Lecture 8 Hypothesis Testing
No ratings yet
Lecture 8 Hypothesis Testing
44 pages
Stat
67% (3)
Stat
70 pages
Last Competencies
No ratings yet
Last Competencies
34 pages
Research Methodology 22
No ratings yet
Research Methodology 22
28 pages
L1 QM06 High Yield Notes
No ratings yet
L1 QM06 High Yield Notes
9 pages
What Is A Hypothesis Test?: 1. Specify The Hypotheses
No ratings yet
What Is A Hypothesis Test?: 1. Specify The Hypotheses
5 pages
Week 14_15 Testing Claims About Means and Proportions
No ratings yet
Week 14_15 Testing Claims About Means and Proportions
74 pages
Ch10 and 11 Rev Answers
No ratings yet
Ch10 and 11 Rev Answers
16 pages
Lesson 5
No ratings yet
Lesson 5
5 pages
Hypothesis Testing 1
100% (1)
Hypothesis Testing 1
118 pages
ITM Chapter 6 On Testing of Hypothesis
No ratings yet
ITM Chapter 6 On Testing of Hypothesis
39 pages
Hypothesis Testing For Single Populations - Chapter Nine
No ratings yet
Hypothesis Testing For Single Populations - Chapter Nine
36 pages
PT Module5
No ratings yet
PT Module5
30 pages
QT Session 16 - 22 Hypothesis Testing
No ratings yet
QT Session 16 - 22 Hypothesis Testing
58 pages
Hypothesis-power-analysis
No ratings yet
Hypothesis-power-analysis
37 pages
statssss
No ratings yet
statssss
31 pages
HYPOTHESIS TESTING
No ratings yet
HYPOTHESIS TESTING
56 pages
An Introduction To Statistical Inference
No ratings yet
An Introduction To Statistical Inference
33 pages
Hypothesis Tesing
No ratings yet
Hypothesis Tesing
30 pages
Hipothesis Testing 2019 Dari Ralitsa
No ratings yet
Hipothesis Testing 2019 Dari Ralitsa
48 pages
Unit 8
No ratings yet
Unit 8
27 pages
Engineering Mathematics 2
No ratings yet
Engineering Mathematics 2
29 pages
LESSON-10-STEPS-IN-HYPOTHESIS-TESTING.pptx
No ratings yet
LESSON-10-STEPS-IN-HYPOTHESIS-TESTING.pptx
23 pages
Stat - Hypothesis Testing
No ratings yet
Stat - Hypothesis Testing
34 pages
STAT 1013 Statistics: Week 12
33% (3)
STAT 1013 Statistics: Week 12
48 pages
BS Group 4
No ratings yet
BS Group 4
36 pages
HYPOTHESIS TESTING AND ESTIMATION
No ratings yet
HYPOTHESIS TESTING AND ESTIMATION
7 pages
Hypothesis Testing G
No ratings yet
Hypothesis Testing G
28 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
7 pages
6.2 Hypothesis Testing v1
No ratings yet
6.2 Hypothesis Testing v1
34 pages
Point Estimation of Process Parameters
No ratings yet
Point Estimation of Process Parameters
64 pages
Statistical Hypothesis
No ratings yet
Statistical Hypothesis
13 pages
Module2 DS Ppt
No ratings yet
Module2 DS Ppt
46 pages
Introduction To Hypothesis Testing: Print Round
No ratings yet
Introduction To Hypothesis Testing: Print Round
2 pages
Analytics 2 e
No ratings yet
Analytics 2 e
55 pages
Eda Group5 Hypothesis Testing
No ratings yet
Eda Group5 Hypothesis Testing
32 pages
Testing Technique in Data Science
No ratings yet
Testing Technique in Data Science
65 pages
Theory of decision
No ratings yet
Theory of decision
9 pages
Week 5
No ratings yet
Week 5
26 pages
Hypothesis Test
100% (1)
Hypothesis Test
52 pages
Hypothesis Testing
100% (1)
Hypothesis Testing
16 pages
PSNM - Ch. 3
No ratings yet
PSNM - Ch. 3
32 pages
An Introduction To T-Tests: Statistical Test Means Hypothesis Testing
100% (1)
An Introduction To T-Tests: Statistical Test Means Hypothesis Testing
8 pages
Probability and Statistics - Asynch A.1
No ratings yet
Probability and Statistics - Asynch A.1
4 pages
Powerpoint Topik 8
No ratings yet
Powerpoint Topik 8
6 pages
Types_of_Hypothesis_testing
No ratings yet
Types_of_Hypothesis_testing
4 pages
What Is Hypothesis Testing
100% (1)
What Is Hypothesis Testing
32 pages
Ch7- Hypothesis Testing_077cd6dd09901b1a4975fb68e8e9f364
No ratings yet
Ch7- Hypothesis Testing_077cd6dd09901b1a4975fb68e8e9f364
36 pages
Hypothesis Testing For One Population Parameter - Samples
100% (1)
Hypothesis Testing For One Population Parameter - Samples
68 pages
Hypothesis Testing
100% (1)
Hypothesis Testing
60 pages
C22 P09 Chi Square Test
No ratings yet
C22 P09 Chi Square Test
33 pages
Test of Hypothesis
No ratings yet
Test of Hypothesis
8 pages
CH 09
No ratings yet
CH 09
10 pages
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
From Everand
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
SUJAUL CHOWDHURY
No ratings yet
ESGC Cost Performance Report 2022 PNNL-33283
No ratings yet
ESGC Cost Performance Report 2022 PNNL-33283
174 pages
Isotc 135sc 3 Ultrasonic Testing PDF Free
No ratings yet
Isotc 135sc 3 Ultrasonic Testing PDF Free
2 pages
MathModernWorld Syllabus Spring24 2 1
No ratings yet
MathModernWorld Syllabus Spring24 2 1
4 pages
2023 F5 UT Memo
No ratings yet
2023 F5 UT Memo
2 pages
MS2 Motion Wokrsheet2
No ratings yet
MS2 Motion Wokrsheet2
1 page
Articles Difference Between RCC and Prestressed Concrete
No ratings yet
Articles Difference Between RCC and Prestressed Concrete
6 pages
AMS-2404 - Plating, Electroless Nickel
No ratings yet
AMS-2404 - Plating, Electroless Nickel
10 pages
CV2C Labex07 Xxe1 Matamorosa
No ratings yet
CV2C Labex07 Xxe1 Matamorosa
9 pages
Y1 S&M Practice Paper 2
No ratings yet
Y1 S&M Practice Paper 2
7 pages
MATH QUIZ G9
No ratings yet
MATH QUIZ G9
3 pages
Design Analysis Procedures For Fixed Offshore Platform Jacket Structures-Ijaerdv05i0379025
No ratings yet
Design Analysis Procedures For Fixed Offshore Platform Jacket Structures-Ijaerdv05i0379025
8 pages
Fe-Gen-St-Xxxxx-Weld Repair Procedure
No ratings yet
Fe-Gen-St-Xxxxx-Weld Repair Procedure
3 pages
Answer ............................................... (1) : For Examiner's Use
No ratings yet
Answer ............................................... (1) : For Examiner's Use
34 pages
Detailed Explanations For Trends
No ratings yet
Detailed Explanations For Trends
2 pages
Sheet - 01
No ratings yet
Sheet - 01
25 pages
2022 Summer Model Answer Paper (Msbte Study Resources)
No ratings yet
2022 Summer Model Answer Paper (Msbte Study Resources)
12 pages
Year 5 BI Paper 1 SECTION A
No ratings yet
Year 5 BI Paper 1 SECTION A
6 pages
51 Notification-EC-18.08.22 (EducationandCIC)
No ratings yet
51 Notification-EC-18.08.22 (EducationandCIC)
38 pages
CMT Lesson 4
No ratings yet
CMT Lesson 4
8 pages
Properties: Chapter Two
No ratings yet
Properties: Chapter Two
15 pages
Introduction To TP
No ratings yet
Introduction To TP
47 pages
ME-102 Engineering Graphics
No ratings yet
ME-102 Engineering Graphics
20 pages
Exam Paper Hge and Geo
No ratings yet
Exam Paper Hge and Geo
5 pages
Xiameter Mem-1785 Emulsion
No ratings yet
Xiameter Mem-1785 Emulsion
2 pages
A Survey of Wireless Power Transfer and A Critical Comparison of Inductive and Capacitive Coupling For Small Gap Applications
No ratings yet
A Survey of Wireless Power Transfer and A Critical Comparison of Inductive and Capacitive Coupling For Small Gap Applications
13 pages
Data Sheet: Overload Relays, RMP-111D ANSI Code 32
No ratings yet
Data Sheet: Overload Relays, RMP-111D ANSI Code 32
7 pages
NEPHELOTURBIDOMETRY
No ratings yet
NEPHELOTURBIDOMETRY
19 pages
XII Maths Preboard 2020-21 SP-4
No ratings yet
XII Maths Preboard 2020-21 SP-4
13 pages
Mechanical ENG
No ratings yet
Mechanical ENG
5 pages
Multivariate Analysis
No ratings yet
Multivariate Analysis
25 pages