0% found this document useful (0 votes)
55 views15 pages

Chi-Square: Reporters: Angelo John Roque Jomar Condicion Mark Paul Sabino Gerard Dela Pena Hallen Valerozo

The document discusses Chi-Square tests and their applications. Chi-Square tests can be used to test for independence between two variables, test if observed frequencies differ from expected frequencies, and test if two sample proportions are significantly different. Examples are provided to demonstrate how to set up contingency tables, calculate expected values, compute the Chi-Square statistic, determine degrees of freedom, and compare the computed value to critical values to determine if results are significant.

Uploaded by

Angelo Roque
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
55 views15 pages

Chi-Square: Reporters: Angelo John Roque Jomar Condicion Mark Paul Sabino Gerard Dela Pena Hallen Valerozo

The document discusses Chi-Square tests and their applications. Chi-Square tests can be used to test for independence between two variables, test if observed frequencies differ from expected frequencies, and test if two sample proportions are significantly different. Examples are provided to demonstrate how to set up contingency tables, calculate expected values, compute the Chi-Square statistic, determine degrees of freedom, and compare the computed value to critical values to determine if results are significant.

Uploaded by

Angelo Roque
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 15

Chi-Square

Reporters: Angelo John Roque


Jomar Condicion
Mark Paul Sabino
Gerard Dela Pena
Hallen Valerozo
Introduction

The Chi-Square test is used when treating ordinal data in the form of frequencies
or proportion.
  2
Formula: 2
𝑘
( 𝑂𝑖 − 𝐸 ˙𝑖)
𝑋 =∑
𝑖=1 𝐸𝑖
Where: Oi = observed frequencies
Ei = expected frequencies
Contingency Table

This table shows a cross-tabulation of the classes of observations with the


frequencies for each class shown. A one-way classification table is one where
only one row of observations is given. In a two-way classification table, there are
r number of rows and k number of columns of observed frequencies.
Test of Independence

In this test, two variables are involved. One variable will be tested for
independence to the other variable. A two-way classification table will be used
to tabulate the observed frequencies.
Example:

Survey Results

Group Strongly Approve Disapprov Strongly Total


Approve e Disapprov
e
I 12 5 12 11 40
II 5 14 14 11 44
III 5 6 12 11 34
IV 10 8 13 13 44
V 15 20 5 5 45
Total 47 53 56 51 207
Test 
 the significance of the difference between the observed frequencies and the expected
frequencies at 1% of significance.

Solution:
1. Ho: the responses of the five groups do not differ significantly.
2. a=1%
3. Use Chi-Square test.
4. Solve first for the expected values using the formula
For  example,

= sum of the observed values in the first column times the sum of the observed values in the first
row divided by the grand total.
=9.082126
The values if the expected frequencies are as follows:

=9.082126 =10.24155 =10.82126 =9.855072


=9.990338 =11.2657 =11.90338 =10.84058
=7.719807 =8.705314 =9.198068 =8.376812
=9.990338 =11.2657 =11.90338 =10.84058
=10.21739 =11.52174 =12.17391 =11.08696
Substitute
   these values to the formula for
  2
2
𝑘
( 𝑂𝑖 − 𝐸 ˙𝑖)
𝑋 =∑
𝑖=1 𝐸𝑖
=0.937444457+2.492755836+0.958229929+0.000009344
  
+2.238669407+2.682586757+0.663642427+0.84071911
+0.946660792+6.238718512+0.128397985+0.369191363
+0.85352956+0.101028063+4.227481942+0.133013754
+0.00234407+0.821447978+0.430151775+3.341861253
=28.40798465

5. df=(c-1)(r-1)=(4-1)(5-1)=(3)(4)=12
6. Tabular value=26.217
Goodness of Fit

A group of university deans rated 100 professors into three categories: excellent,
satisfactory and needs improvement. If teaching effectiveness is normally
distributed in the population of college professors, determine if the distribution
of ratings by the deans differ significantly from the expected ratings.

Classification of Professors

Excellent Satisfactory Needs Total


Improvement
Observed 10 85 5 100
Expected 1.587 58.021 0.7935
The  middle
 region represents 68.26% of the total population and the outer regions represent 31.74%
of the total population, half of which (15.87%) is found on the left side and the other half is found
on the right side.
2. Multiply the observed frequencies by the percentage values.
=(15.87%)(10)=1.587
=(68.26%)(85)=58.021
  2
=(15.87%)(5)=0.7935
2
𝑘
( 𝑂𝑖 − 𝐸 ˙𝑖)
𝑋 =∑
=44.59897227+12.54487929+22.29948614
=79.4433377
𝑖=1 𝐸𝑖
Solution:
1. Ho: The actual ratings are not significantly different from the expected ratings.
2. Set the level of significance at 5%.
3. Use the Chi-Square test.
4. Computed value = 79.4433377
5. df= 3-1=2
6. Tabular value=5.991
7. Since the computed value is greater than the tabular value, Ho will be rejected.
8. Therefore, there is a significant difference between the observed and the expected values.
Chi-Square in Testing the Significance of
the Difference Between Proportions
The  Chi-Square
 is more convenient to use when testing the significance of the
difference between two proportions as compared to the z-test.
The formula is:
=
Where N=total number of cases
A,B,C, and D are the observed frequencies.
Below is a sample table showing the positions of the frequencies A,B,C, and D.

A B

C D
Example:
A sample survey of a presidential candidate in the Philippines shows that 120 of 200 male voters
dislike candidate X and 175 of 250 female voters dislike the same candidate.
Determine whether the difference between the two sample proportions, 120/200 and 175/250 is
significant or not at 1% level of significance.
Gender Voters who Dislike the Voters Who Like the Total
Candidate Candidate
Male 120(A) 80(B) 200(A+B)
Female 175(C) 75(D) 250(C+D)
Total 295(A+C) 155(B+D) 450(N)

 
Computing for , we have:
=
=
=4.92

The degree of freedom is (2-1)(2-1)=1. The tabular value of is 3.841 at 5% level of significance.
Therefore, there is a significant difference between the opinions of male and female voters toward the
candidate and there are more voters who dislike the candidate than those who like him.

You might also like