0% found this document useful (0 votes)

98 views53 pages

Unit 5 Hypothesis Testing-Compressed-1

The document discusses hypothesis testing, which is a statistical method used to make inferences about population parameters based on sample data. It explains that the rationale for hypothesis testing is to assess claims about a population based on empirical evidence rather than intuition. The document provides examples to illustrate key concepts of hypothesis testing such as forming hypotheses, selecting a significance level, calculating test statistics, making decisions, and interpreting results.

Uploaded by

egadydqmdctlfzhnkb

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

98 views53 pages

Unit 5 Hypothesis Testing-Compressed-1

Uploaded by

egadydqmdctlfzhnkb

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 53

What is hypothesis testing?

A hypothesis is an educated guess

about something in the world around you.
It should be testable, either by
Experiment or observation.

For example:

A new medicine you think might work.

A way of teaching you think might be better..
What is a Hypothesis Statement?
If you are going to propose a hypothesis, it’s customary to write a statement.
Your statement will look like this:
“If I…(do this to an independent variable)….then (this will happen to the
dependent variable).”
For example:

If I (decrease the amount of water given to herbs) then (the herbs will
increase in size).
If I (give patients counseling in addition to medication) then (their overall
depression scale will decrease).
If I (look in this certain location) then (I am more likely to find new
species).
14
15
16
17
18
19
Rationale for Hypothesis Testing

Hypothesis testing is a statistical method used to make inferences about

population parameters based on sample data. The rationale behind
hypothesis testing is to assess the validity of a claim or hypothesis about
the population. It helps researchers or analysts to draw conclusions and
make decisions based on empirical evidence rather than intuition or
guesswork.
Example: Suppose a pharmaceutical company develops a new drug for
lowering blood pressure. The company wants to determine whether the
new drug is more effective than the current standard treatment. In this
scenario, hypothesis testing can help the company evaluate whether there
is sufficient evidence to support the claim that the new drug is superior in
lowering blood pressure compared to the standard treatment.

20
Direction of Hypothesis Test
The direction of a hypothesis test refers to whether the alternative
hypothesis is one-tailed or two-tailed.
One-Tailed Test: The alternative hypothesis specifies a direction
for the effect (e.g., greater than, less than).
Two-Tailed Test: The alternative hypothesis does not specify a
direction for the effect; it only suggests that there is a difference.
Example: Suppose a manufacturer claims that the average weight
of their product is less than 10 kg. In this case, the alternative
hypothesis would be one-tailed, indicating a direction (less than).
On the other hand, if the manufacturer only claims that there is a
difference in weight, without specifying whether it is greater or less
than 10 kg, then the alternative hypothesis would be two-tailed.21
Errors in Hypothesis Testing

Type I Error: Rejecting the null hypothesis when it is actually true.

Type II Error: Failing to reject the null hypothesis when it is actually
false.

22
Errors in Hypothesis Testing

Type I Error: Rejecting the null hypothesis when it is actually true.

Type II Error: Failing to reject the null hypothesis when it is actually
false.
Example: Consider a medical diagnostic test for a rare disease.
Type I Error: The test indicates that a person has the disease when they
actually do not.
Type II Error: The test indicates that a person does not have the disease
when they actually do.

23
24
Correct decisions :
Type 1 error –
Type 2 error -

25
Formula Review
α = probability of a Type I error = P(Type I error) = probability of
rejecting the null hypothesis when the null hypothesis is true.
β = probability of a Type II error = P(Type II error) = probability of
not rejecting the null hypothesis when the null hypothesis is false.

26
27
General Procedure for Hypothesis Testing
Formulate Hypotheses: State the null hypothesis (H0) and the alternative
hypothesis (H1or Ha).
Select Significance Level: Choose the significance level (α) to specify the
probability of committing a Type I error.
Collect Data: Collect sample data relevant to the hypothesis being tested.
Calculate Test Statistic: Compute a test statistic based on the sample data and
the assumed distribution under the null hypothesis.
Make Decision: Compare the test statistic to the critical value or calculate the p-
value. Reject the null hypothesis if the test statistic falls in the rejection region, or
if the p-value is less than the significance level (αα).
Interpret Results: Draw conclusions based on the decision made in step 5,
considering the context of the problem and the consequences of potential errors.

28
Z Test statistics is a statistical procedure used to test an alternative
hypothesis against the null hypothesis. It is any statistical
hypothesis used to determine whether two samples means are
different when variances are known and the sample is large.
Z Test determines if there is a significant difference between sample
and population means.
Z Test normally used for dealing with problems relating to large
samples.
When the sample size is more than 30 units than in that case the z
test must be performed. Mathematically z test formula is
represented as,

29
Here,
x̄ = Mean of Sample
μ = Mean of Population
σ = Standard Deviation of
Population
n = Number of
Observation

30
31
32
33
34
Example 1: A teacher claims that the mean score of students in his class
is greater than 82 with a standard deviation of 20. If a sample of 81
students was selected with a mean score of 90 then check if there is
enough evidence to support this claim at a 0.05 significance level.

Solution: As the sample size is 81 and population standard deviation is

known, this is an example of a right-tailed one-sample z test.
H0: μ=82,H1: μ>82

From the z table the critical value at α= 1.645

35
36
Example 2: An online medicine shop claims that the mean delivery time for medicines is
less than 120 minutes with a standard deviation of 30 minutes. Is there enough evidence
to support this claim at a 0.05 significance level if 49 orders were examined with a mean
of 100 minutes?
Solution: As the sample size is 49 and population standard deviation is known, this is an
example of a left-tailed one-sample z test.
H0: μ=120,H1: μ<120
From the z table the critical value at α
= -1.645. A negative sign is used as this is a left tailed test.

37
38
39
We reject H0 because 2.38 > 1.645. We have statistically
significant evidence at a =0.05, to show that the mean weight
in men in 2006 is more than 191 pounds.

40
Example:
The National Center for Health Statistics (NCHS) published a report in
2005 entitled Health, United States, containing extensive information on
major trends in the health of Americans. Data are provided for the US
population as a whole and for specific ages, sexes and races. The NCHS
report indicated that in 2002 Americans paid an average of $3,302 per
year on health care and prescription drugs. An investigator hypothesizes
that in 2005 expenditures have decreased primarily due to the availability
of generic drugs. To test the hypothesis, a sample of 100 Americans are
selected and their expenditures on health care and prescription drugs in
2005 are measured. The sample data are summarized as follows: n=100,
x̄=$3,190 and s=$890. Is there statistical evidence of a reduction in
expenditures on health care and prescription drugs in 2005? Is the sample
mean of $3,190 evidence of a true reduction in the mean or is it within
chance fluctuation? We will run the test using the five-step approach.
41
Step 1. Set up hypotheses and determine level of significance
H0: μ = 3,302 H1: μ < 3,302 α =0.05
The research hypothesis is that expenditures have decreased, and
therefore a lower-tailed test is used.
Step 2. Select the appropriate test statistic.
Because the sample size is large (n> 30)

42
43
Example:
The NCHS reported that the mean total cholesterol level in 2002 for all adults
was 203. Total cholesterol levels in participants who attended the seventh
examination of the Offspring in the Framingham Heart Study are summarized as
follows: n=3,310, x̄ =200.3, and s=36.8. Is there statistical evidence of a
difference in mean cholesterol levels in the Framingham Offspring?
Here we want to assess whether the sample mean of 200.3 in the Framingham
sample is statistically significantly different from 203 (i.e., beyond what we would
expect by chance). We will run the test using the five-step approach.
Step 1. Set up hypotheses and determine level of significance
H0: μ= 203 H1: μ≠ 203 α=0.05
The research hypothesis is that cholesterol levels are different in the
Framingham Offspring, and therefore a two-tailed test is used.
44
45
An insurance company sells health insurance and motor insurance policies. Customers pay premiums for
these policies. The CEO of the insurance company wonders if premiums paid by either of the insurance
segments (health insurance and motor insurance) are more variable than another. He finds the following
data for premiums paid:

Afghan United Bank Statement S
100% (2)
Afghan United Bank Statement S
2 pages
Causal AI 1st Edition by Robert Osazuwa Ness download
100% (1)
Causal AI 1st Edition by Robert Osazuwa Ness download
85 pages
Lesson Presentation Ordering Numbers To 1000
No ratings yet
Lesson Presentation Ordering Numbers To 1000
22 pages
Federated Learning For Healthcare Informatics
100% (1)
Federated Learning For Healthcare Informatics
19 pages
Final Bank Exam
100% (13)
Final Bank Exam
14 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
78 pages
Naive - Bayes - Ipynb - Colab
No ratings yet
Naive - Bayes - Ipynb - Colab
3 pages
Atlas Copco Roto - Z Sds
No ratings yet
Atlas Copco Roto - Z Sds
14 pages
Infant Mortality in Brazil a Survival Analysis Using Machine Learning Models7
No ratings yet
Infant Mortality in Brazil a Survival Analysis Using Machine Learning Models7
47 pages
Year 7 Revision Checklist
50% (2)
Year 7 Revision Checklist
6 pages
Biomathematics JYT
100% (1)
Biomathematics JYT
195 pages
Drug Dosage Control System Using Reinforcement Learning
No ratings yet
Drug Dosage Control System Using Reinforcement Learning
8 pages
Psych1x03 Quiz
No ratings yet
Psych1x03 Quiz
20 pages
Maximum Likelihood Estimation
No ratings yet
Maximum Likelihood Estimation
22 pages
Correlation and Regression - The Simple Case
100% (2)
Correlation and Regression - The Simple Case
106 pages
RAG with math
No ratings yet
RAG with math
7 pages
Formulatinghypotheses 110911135920 Phpapp02
No ratings yet
Formulatinghypotheses 110911135920 Phpapp02
53 pages
Regression Analysis
No ratings yet
Regression Analysis
14 pages
Statistical Methods For Bioinformatics Lecture 5
No ratings yet
Statistical Methods For Bioinformatics Lecture 5
48 pages
ArabicOCR - Amazing OCR Library For Arabic PDF Documents - by Shekhar Khandelwal - Medium
No ratings yet
ArabicOCR - Amazing OCR Library For Arabic PDF Documents - by Shekhar Khandelwal - Medium
16 pages
UNIT IV - Inferential Statistics
No ratings yet
UNIT IV - Inferential Statistics
180 pages
Statistical Methods For Bioinformatics Lecture 4
No ratings yet
Statistical Methods For Bioinformatics Lecture 4
29 pages
Structure Factor: Textbook's Convention
No ratings yet
Structure Factor: Textbook's Convention
17 pages
Causal Inference and Stable Learning: Peng Cui Tong Zhang
No ratings yet
Causal Inference and Stable Learning: Peng Cui Tong Zhang
95 pages
GTU ICT Syllabus Reference Book
No ratings yet
GTU ICT Syllabus Reference Book
3 pages
06 - Natural Experiment (Part 1) PDF
No ratings yet
06 - Natural Experiment (Part 1) PDF
89 pages
Mathematical Economics
No ratings yet
Mathematical Economics
38 pages
Modeling With Differential Equations: A Lecture in ENGIANA
No ratings yet
Modeling With Differential Equations: A Lecture in ENGIANA
115 pages
Econometrics - MCQ Flashcards - Quizlet
No ratings yet
Econometrics - MCQ Flashcards - Quizlet
19 pages
07 - Natural Experiment (Part 2) PDF
No ratings yet
07 - Natural Experiment (Part 2) PDF
90 pages
Peter Dueben: Royal Society University Research Fellow & ECMWF's Coordinator For Machine Learning and AI Activities
100% (1)
Peter Dueben: Royal Society University Research Fellow & ECMWF's Coordinator For Machine Learning and AI Activities
33 pages
Chapter 11 Hilton - Solutions
60% (5)
Chapter 11 Hilton - Solutions
11 pages
Mathematical Modeling
100% (1)
Mathematical Modeling
24 pages
Deep Learning
No ratings yet
Deep Learning
49 pages
TCS Sell Sheet
No ratings yet
TCS Sell Sheet
2 pages
When Should You Adjust Standard Errors For Clustering?: Alberto Abadie, Susan Athey, Guido Imbens, & Jeffrey Wooldridge
No ratings yet
When Should You Adjust Standard Errors For Clustering?: Alberto Abadie, Susan Athey, Guido Imbens, & Jeffrey Wooldridge
33 pages
Chi Square and McNemar Test
No ratings yet
Chi Square and McNemar Test
45 pages
A Deep Learning Approach For Automated Diagnosis and Multi-Class Classification of Alzheimer's Disease Stages Using Resting-State fMRI and Residual Neural Networks
No ratings yet
A Deep Learning Approach For Automated Diagnosis and Multi-Class Classification of Alzheimer's Disease Stages Using Resting-State fMRI and Residual Neural Networks
16 pages
Paper 1-Bidirectional LSTM With Attention Mechanism and Convolutional Layer
100% (1)
Paper 1-Bidirectional LSTM With Attention Mechanism and Convolutional Layer
51 pages
04 Notes 6250 f13
0% (1)
04 Notes 6250 f13
16 pages
Time Series Lecture Notes
No ratings yet
Time Series Lecture Notes
97 pages
Doug Bates Mixed Models
No ratings yet
Doug Bates Mixed Models
75 pages
Types of Data (Qualitative and Quantitative)
No ratings yet
Types of Data (Qualitative and Quantitative)
89 pages
Econometric Project - Permanent Income Hypothesis
No ratings yet
Econometric Project - Permanent Income Hypothesis
9 pages
GR 10 History (English) Term 1 Controlled Test 1 Question Paper 2
No ratings yet
GR 10 History (English) Term 1 Controlled Test 1 Question Paper 2
4 pages
Understanding and Coding Neural Networks From Scratch in Python and R
100% (1)
Understanding and Coding Neural Networks From Scratch in Python and R
15 pages
System S3.18 - Epoxy Zinc Phosphate Primer, 200 Microns
No ratings yet
System S3.18 - Epoxy Zinc Phosphate Primer, 200 Microns
3 pages
Download (Original PDF) Categorical Data Analysis 3rd Edition by Alan Agresti ebook All Chapters PDF
100% (8)
Download (Original PDF) Categorical Data Analysis 3rd Edition by Alan Agresti ebook All Chapters PDF
46 pages
Funny in Farsi Comprehension Check Questions STUDENT
No ratings yet
Funny in Farsi Comprehension Check Questions STUDENT
5 pages
Dickey-Fuller Unit Root Test
No ratings yet
Dickey-Fuller Unit Root Test
13 pages
Hypotheses Testing
No ratings yet
Hypotheses Testing
5 pages
Understanding Risk Management and Hedging in Oil Trading: A Practitioner's Guide to Managing Risk 1st Edition Chris Heilpern download
No ratings yet
Understanding Risk Management and Hedging in Oil Trading: A Practitioner's Guide to Managing Risk 1st Edition Chris Heilpern download
54 pages
Spinal Nerves
No ratings yet
Spinal Nerves
53 pages
PDF Hands-on Time Series Analysis With Python: From Basics To Bleeding Edge Techniques B. V. Vishwas download
100% (1)
PDF Hands-on Time Series Analysis With Python: From Basics To Bleeding Edge Techniques B. V. Vishwas download
62 pages
Deciphering Cryptographic Messages, Containing Detailed Discussions On Statistics. (1) It
No ratings yet
Deciphering Cryptographic Messages, Containing Detailed Discussions On Statistics. (1) It
4 pages
A A Regression
No ratings yet
A A Regression
28 pages
Instant Ebooks Textbook Deep Generative Modeling Jakub M. Tomczak Download All Chapters
No ratings yet
Instant Ebooks Textbook Deep Generative Modeling Jakub M. Tomczak Download All Chapters
49 pages
Lecture1 IntroToMathModelling PDF
No ratings yet
Lecture1 IntroToMathModelling PDF
39 pages
CCST9017 (2023 24) L8 Cryptography
No ratings yet
CCST9017 (2023 24) L8 Cryptography
47 pages
Captura de Tela 2025-01-09 à(s) 15.22.14
No ratings yet
Captura de Tela 2025-01-09 à(s) 15.22.14
1 page
CSCE106 Lab Session 3
No ratings yet
CSCE106 Lab Session 3
1 page
Discrete and Continuous Simulation
No ratings yet
Discrete and Continuous Simulation
15 pages
Estimation and Hypothesis
100% (1)
Estimation and Hypothesis
32 pages
WS22418 SD
No ratings yet
WS22418 SD
7 pages
Statistical Models
No ratings yet
Statistical Models
35 pages
David - sm14 - Inppt01 - GE
No ratings yet
David - sm14 - Inppt01 - GE
41 pages
Prophet R
No ratings yet
Prophet R
18 pages
Lab 4: Logistic Regression: PSTAT 131/231, Winter 2019
No ratings yet
Lab 4: Logistic Regression: PSTAT 131/231, Winter 2019
10 pages
Besr Reviewer
No ratings yet
Besr Reviewer
41 pages
GAMS Getting Started
No ratings yet
GAMS Getting Started
31 pages
Guidelines On Clinical Management of Endometrial Hyperplasia
No ratings yet
Guidelines On Clinical Management of Endometrial Hyperplasia
14 pages
Orthodontic Elastic Wear Fact Sheet Diagrams
No ratings yet
Orthodontic Elastic Wear Fact Sheet Diagrams
4 pages
Al-Saadi - Demystifying Ontology and Epistemology in Research
No ratings yet
Al-Saadi - Demystifying Ontology and Epistemology in Research
11 pages
Beyond Prediction Using Big Data For Policy Problems
No ratings yet
Beyond Prediction Using Big Data For Policy Problems
4 pages
crowley2009 - вагинизм
No ratings yet
crowley2009 - вагинизм
6 pages
Student Academic Performance Prediction Under Various Machine Learning Classification Algorithms
No ratings yet
Student Academic Performance Prediction Under Various Machine Learning Classification Algorithms
19 pages
04
No ratings yet
04
7 pages
Mathematical Treatise On Linear Algebra
No ratings yet
Mathematical Treatise On Linear Algebra
7 pages
Darvas Box
100% (1)
Darvas Box
18 pages
Quadrimalleolar Fractures of The Ankle: Think 360°-A Step-By-Step Guide On Evaluation and Fixation
No ratings yet
Quadrimalleolar Fractures of The Ankle: Think 360°-A Step-By-Step Guide On Evaluation and Fixation
3 pages
150 5340 26a
No ratings yet
150 5340 26a
102 pages
Seminar Report Machine Learning
No ratings yet
Seminar Report Machine Learning
20 pages
Persian Ay - 5
No ratings yet
Persian Ay - 5
2 pages
Carding Shops Vs DNM Slides
No ratings yet
Carding Shops Vs DNM Slides
10 pages
Tales Family Handbook 2021-22
No ratings yet
Tales Family Handbook 2021-22
40 pages
TF Idf Algorithm
No ratings yet
TF Idf Algorithm
4 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
3 pages
Building Data-Driven Applications with LlamaIndex: A practical guide to retrieval-augmented generation (RAG) to enhance LLM applications
From Everand
Building Data-Driven Applications with LlamaIndex: A practical guide to retrieval-augmented generation (RAG) to enhance LLM applications
Andrei Gheorghiu
No ratings yet
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Hypothesis Testing: Six Sigma Thinking, #6
From Everand
Hypothesis Testing: Six Sigma Thinking, #6
Sumeet Savant
No ratings yet

Unit 5 Hypothesis Testing-Compressed-1

Uploaded by

Unit 5 Hypothesis Testing-Compressed-1

Uploaded by

What is hypothesis testing?

A hypothesis is an educated guess

A new medicine you think might work.

Hypothesis testing is a statistical method used to make inferences about

Type I Error: Rejecting the null hypothesis when it is actually true.

Type I Error: Rejecting the null hypothesis when it is actually true.

Solution: As the sample size is 81 and population standard deviation is

From the z table the critical value at α= 1.645

You might also like