
Statistics and Data Analysis I – IDC – 2017

Avner Halevy

Lecture 18 – Hypothesis Testing

Let’s summarize the steps used in the hypothesis test from the last lecture:
Step 1: State the hypotheses. The hypotheses always involve some unknown population parameter.
For us, this is the population proportion p of people who like chocolate.
H0 : p = 0.8
H1 : p < 0.8
Step 2: Set the criteria for a decision. This means dividing the set of all possible values of the
sample statistic (for us, P̂) into two kinds: values that are consistent with H0 and values that are
not. As we know, the latter set is known as the rejection region. If we ultimately observe a value
in this region, representing extremely unlikely outcomes, we reject H0. Before we can compute the
boundary of the rejection region, we need to decide on a significance level, denoted by α. Unless
otherwise stated, we shall always assume α = 0.05, which means, as we shall soon see, that the
rejection region will consist of all those values that fall in the most extreme 5%.
Under the null hypothesis we know
   
P̂ ∼ N(p, p(1 − p)/n) = N(0.8, 0.8(0.2)/100) = N(0.8, 0.04²)

To find the critical value separating rejection values from acceptance values, we compute:
−1.65 = Z0.05 = (X0.05 − 0.8) / 0.04
This leads to X0.05 = 0.734. Thus, any value to the left of 0.734 would be considered extreme and
lead us to reject H0 .
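As a side note, this critical value can be reproduced numerically. The following is a minimal Python sketch (the use of the scipy library is an addition of this write-up, not part of the lecture):

# Critical value for a left-tailed test at significance level alpha = 0.05,
# using the null sampling distribution P-hat ~ N(0.8, 0.04^2).
from scipy.stats import norm

alpha = 0.05
mu0, se = 0.8, 0.04                            # null mean and standard error sqrt(p0*(1-p0)/n)

x_crit = norm.ppf(alpha, loc=mu0, scale=se)    # left-tail cutoff
print(round(x_crit, 3))                        # ~0.734: reject H0 if P-hat falls below this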
Step 3: Collect data and compute the value of the test statistic. We collected a random
sample of n = 100 people and determined that 77 of them like chocolate. Our statistic is the sample
proportion:
P̂ = 77/100 = 0.77

Step 4: Make a decision. Since 0.734 < 0.77 (the value of our statistic does not fall in the rejection
region), we decide not to reject H0 . At the 0.05 significance level, we have insufficient evidence to
reject H0 .
It is common to also compute the p-value:
 
p-value = P(P̂ < 0.77) = P(Z < (0.77 − 0.8)/0.04) = P(Z < −0.75) = 0.2266

Comparing the p-value to the significance level α, we would reject if the p-value were smaller. Since
0.05 < 0.2266, we again decide not to reject. Using the p-value instead of the rejection region to make
a decision always leads to the same decision, but provides more information about how extreme (or
not) the value observed is.
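For completeness, the same p-value and decision can be obtained numerically; here is a minimal Python sketch (scipy is assumed to be available, which is not part of the lecture):

# One-sided (left-tail) p-value for the chocolate example.
from scipy.stats import norm

p_hat, mu0, se = 0.77, 0.8, 0.04
p_value = norm.cdf(p_hat, loc=mu0, scale=se)   # P(P-hat < 0.77 under H0)
print(round(p_value, 4))                       # ~0.2266
print("reject H0" if p_value < 0.05 else "do not reject H0")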
Given a parameter p and a fixed value p0, there are three models for hypothesis testing: the first two
are called one-sided and the third is called two-sided; each is described below.

Model I
This is the one we have already seen, where the rejection region is located on the left and the hypotheses
have the following form:
H0 : p = p0
H1 : p < p0
In this model, the p-value is the probability of observing a value of the statistic that is lower than the
observed value.
Model II
In this model, the alternative hypothesis states that the value of the parameter is higher than pre-
viously believed. The rejection region is therefore located on the right and the hypotheses have the
following form:
H0 : p = p0
H1 : p > p0

In this model, the p-value is the probability of observing a value of the statistic that is higher than
the observed value.
Model III
In this model, the alternative hypothesis states that the value of the parameter is simply different from
what was previously believed, without suggesting a particular direction for the change. The rejection
region is thus divided into two regions, one on the left and one on the right, each having an area of
α/2, and the hypotheses have the following form:
H0 : p = p0
H1 : p ≠ p0
In this model, the p-value is the probability of observing a value of the statistic that is more extreme
(in either direction) than the observed value. Thus, if the value observed was higher than p0 , the
p-value is twice the area to the right of it, and if the value observed was lower than p0 , the p-value is
twice the area to the left of it.
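The three p-value rules can be summarized compactly in code. The sketch below is an illustration added here (the function name and its arguments are hypothetical, not from the lecture), assuming the normal approximation P̂ ∼ N(p0, se²) under H0:

# p-value of an observed sample proportion under each of the three models.
from scipy.stats import norm

def p_value(p_hat, p0, se, model):
    if model == "less":        # Model I:   H1: p < p0
        return norm.cdf(p_hat, loc=p0, scale=se)
    if model == "greater":     # Model II:  H1: p > p0
        return norm.sf(p_hat, loc=p0, scale=se)
    if model == "two-sided":   # Model III: H1: p != p0, double the smaller tail
        return 2 * min(norm.cdf(p_hat, loc=p0, scale=se),
                       norm.sf(p_hat, loc=p0, scale=se))
    raise ValueError("model must be 'less', 'greater' or 'two-sided'")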
Example: we flip a coin 10,000 times and count 5167 H’s. Is this a fair coin? Conduct the test using
α = 0.05.
Step 1:
H0 : p = 0.5
H1 : p ≠ 0.5
Step 2:
Under the null hypothesis we know
   
P̂ ∼ N(p, p(1 − p)/n) = N(0.5, 0.5(0.5)/10000) = N(0.5, 0.005²)

To find the critical values, we compute:


1.96 = Z0.975 = (X0.975 − 0.5) / 0.005
This leads to X0.975 = 0.5098. Thus, any value to the right of 0.5098 or to the left of 0.4902 would be
considered extreme and lead us to reject H0 .
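Both two-sided cutoffs can be checked numerically with a short Python sketch (again, scipy is an assumption of this write-up):

# Two-sided critical values for the coin example at alpha = 0.05,
# under P-hat ~ N(0.5, 0.005^2).
from scipy.stats import norm

alpha = 0.05
mu0, se = 0.5, 0.005
lower = norm.ppf(alpha / 2, loc=mu0, scale=se)        # ~0.4902
upper = norm.ppf(1 - alpha / 2, loc=mu0, scale=se)    # ~0.5098
print(round(lower, 4), round(upper, 4))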
Step 3:
P̂ = 5167/10000 = 0.5167
Step 4:
Since 0.5098 < 0.5167, the value of our statistic falls in the rejection region, so we reject H0 . At the
0.05 significance level, we have sufficient evidence to reject H0 in favor of the alternative: the coin is
unfair.

We compute the p-value:
 
p-value = 2P(P̂ > 0.5167) = 2P(Z > (0.5167 − 0.5)/0.005) = 2P(Z > 3.34) = 2(0.0004) = 0.0008

Since this p-value is smaller than α = 0.05, it would once again lead us to reject H0 . Furthermore,
since the p-value is much smaller than α, we see that the observed value is quite extreme and thus
highly statistically significant.
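The two-sided p-value can be reproduced with a minimal Python sketch (scipy assumed available, not part of the lecture):

# Two-sided p-value for the coin example: double the upper tail beyond 0.5167.
from scipy.stats import norm

p_hat, mu0, se = 0.5167, 0.5, 0.005
p_value = 2 * norm.sf(p_hat, loc=mu0, scale=se)   # 2 * P(P-hat > 0.5167 under H0)
print(round(p_value, 4))                          # ~0.0008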
We note that the procedure we have described under Model III is equivalent to the following procedure
which uses the confidence interval we constructed in lecture 16. Given the significance level α, we
construct a 1 − α confidence interval for the parameter p and reject H0 if the value of p under H0 does
not belong to the interval. In our example, we have:

(P̂ − Z1−α/2 · √(P̂(1 − P̂)/n) , P̂ + Z1−α/2 · √(P̂(1 − P̂)/n))
= (0.5167 − 1.96 · √(0.5167(1 − 0.5167)/10000) , 0.5167 + 1.96 · √(0.5167(1 − 0.5167)/10000))
= (0.5069, 0.5265)
Since p = 0.5 does not belong to the interval, we would reject H0 , as before.
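The equivalent confidence-interval procedure is also easy to carry out numerically; the following Python sketch (scipy and the variable names are additions of this write-up) builds the interval and applies the decision rule:

# 95% confidence interval for p based on the observed proportion, and the
# equivalent decision rule: reject H0 if p0 = 0.5 lies outside the interval.
from math import sqrt
from scipy.stats import norm

p_hat, n, alpha = 0.5167, 10000, 0.05
z = norm.ppf(1 - alpha / 2)                   # ~1.96
half_width = z * sqrt(p_hat * (1 - p_hat) / n)
ci = (p_hat - half_width, p_hat + half_width)
print(ci)                                     # ~(0.5069, 0.5265)
print("reject H0" if not (ci[0] <= 0.5 <= ci[1]) else "do not reject H0")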
