Probablity and Statistics
Probablity and Statistics
Dy Hg (uy ~ ws) # Dy
| ores (a =
< Do)
G-H)-D
Test tative:
Ta
Vilna * i)
Rejection ion: T >t Reetion eon IT
tor <1
nee
(una + (m- Da |
| and the distribution of Tis based on m; + my ~ 2
tions from which the samples are selected both have
Assumptions: 1. The popu
approximately normal relative frequency distbutions,
| 2, The variances ofthe two populations are equal. i.e, 07 = a
3. The mom samples are selected inan independent manner trom i
the two populations
Warning: When the assomprion of normal populations is violated, the test may lead
to-ertoncos inferences. In his ease, use the nonparametric Wilcoxon test described
in Seoton 133.CC <$$L
360 | chapters Tests of Hypotheses,
(ETRE | 0st pone ars odie wc of ao pe ofa or ectic—wl racemes
Sey oe oe nc ean aa ten cd se
‘ap oes at aay rave no woe ns
fico aon vty he Soe tet at sat
Enea i matt en ot nine oan
Brame oro pune eco caionee pace See
} ‘an tet meen wesc ace sere aneg EES
Hit} thet aa over con aes = ceva see ee
i|}| ceca paste net ow apse aes
|
{) vauan | TABLE 8.4 Data on Plant investment/Quad, Example 8.14
Hate 205087 aS ae
| O82 91.84 720 66 6467 165.60 036
Sotaion | Let eset the mean invesimenga fr al plans witht an et
Hi) ss epesen! the men nese Tr al lane oh
1}
1s uilies. Then, we watt
to condct the test
Hs (oa — Hs) = 0 (hess = pa)
| He: (= m2 #0 (ieamy > ror my < py)
‘Summary satsics forthe two samples were proiced using SPS
SPSS printouts shown in Figure 818 Nove that
| and) = 49.05,
To obtain the tet statistics, we fst cake
The resting
F143) = 6243
»_ (= Ns 2
| p= GD +
| = UN = (6243)? + (16 1NK49.05)2
WF I6~2
2505131
] “13 = 5002.05
i ‘Then, if we can assume that the distributions of the investmentquad dat forthe €WO
| Plant types are both approximately normal with equal variances, the test statistic 8
24-3174 _ 14.09
) ema
68
i Nas at hist static andthe comeing pac he estat ot sted
! cute SPSS prinout in Figure 8.15. Since the wot pve or thes Yash
! case pave = S00 eteecisa = 5, there nulfcen endeoccroster ly
That is, we cannot conclnde (a a= 05) thatthe mean investnenetquad levels
\ for those plants with lets and asus ae differen
Recall ftom Section 7.5 that valid small-sample inferences about (yy — 13) a8
sill be made when the assumption of equal variances is viola, We cence 1
‘cetion by Eiving the modifications required to obzin approximate small sample eS
‘about (iy ~ a2) when of # oF forthe two cass described in Section 75! m1 = #2
and my # mp5.7 Testin the Difference Between Two Population Means: Independent Samples 361
OURE 815,
WSS prntat or Example 8
‘Modifications to Small-Sample Tests About (1. — yx.)
when 4 7 «Independent Samples
Tes sais
| Vat Ve
Del ype Ym nice
Test set
Degrees ofrecdom: y= (lm + my
ae a= |
[Notes The value of» will generally not be a integer. Round down to the ne
able 7 of Appendix B)
integer to use the Ta
“0plied Exercises
80 se,
8 Bote itary. Refer the Jour of Agr:
wa iota, and Evironment elences (oe 2005)
{Psi o the Inox of Bic Itenty (IBD, Exercise
736.291). Recall hat the TB messes he bialogical
the BI x ses oe in Oni er asin esr
an Hocking, seeped inthe ex abe, Cond et
of bytes t= 10) 0 cmpate the ean BT ace362 chapters. Tests of Hypotheses
ih te infrence derived fom he confidence nr
erGaen—_SamoSio wean Stndard beasion
Muskingum 53 23s 10
Hocking 51 340 6
832 Pertrmance of R&D. Does competion beeen spe
‘ate research and development (RAD) tame nthe US
Deyrtment of Defense working. indepen on the
same projet improve performance? To amuer tis ques
thom, perfermance rage were assigned 0 cach of 58
rmulisouce compte) ond 3 sole sauce RAD sow
truss IEEE Trrsoctons on Engineering Management
Feb 1990), With reapet equal orp and pec
the competitive conc had u mean pecormance rating
of 7.62, whereas the sole-sce contacts al neat
ofa
Set the al ae atetive hype fr determin
ing whether ube mean geal perfomance rating of
‘cptive RED cont exec themes for sok
Find he eecon region forthe test using «= 0,
© The pale for he est ws report he etwcen 2
and.08, Wha sth appropriate conc
‘SAS Output for Sanple Statistics
Exercise 8.34
MINITAB Output for Descriptive Statistics: HEATRATE
Exercise 8.33
Variable ENGINE MN Mean StDev Minimun Maxstin
HEATRATE Advanced 219764999108 11508
Aeroderiv 712332 2682 a7ie deze
fe wre si Expl hy te et wi age gasrunmIE
1538 Gooing meta er gs trtines. Refer othe Jounal
Engineering for Gas Taines and Power Gan, 205)
ly of gus tubes apmeate wih hires ik
fogging. Berne 4.1 (p.352) The escuela
0 urine ino thre eagle rafonal acre
san arodetvatt. Samay satin on eat rae (>
Jules per koma pr our foreach othe the spf
fm urins inthe sap are shown inthe MINITAB
Pritt at topo be page.
2 Ie the saint evidence ofa ierence beteen the
mean Ret aes of tdtncal augmented sce
tl arervtve mem gos urine? Tet see
w= 5
nthe sauce evidence diferenebetmeentte
mean heat res of anced apmenied as ies
om arodtvatve agent urine! Test sh
a= as
& vouace
834 Process votage readings. Rees 1 the Hans Coe
‘iw nisery of Fora comparison ofthe mean pees
‘lage readings tt lection, Excise 7.0 (p22)
The dia or 30 profiion eat Both he ld 2
ator ae sven he VOLTAGE file The SAS pnb!
fhe nis is epdee helo Fe anit
vale fo the es to compare the mean process YDS
‘wales, Wha do ou conchae? Does yor ans a=
wih Excie 740?
90% Conf idence Interval for the Difference between Two Neand
Lover Limit{7 Testing the Diference Between Two Population Means: Independent Samples | 363
35 wasteter reauront sth. In Kcoloical Enincering
{Feb 204), the pei of Hosting aquatic plate eae
hry mare wastes as investigated. For neyo
tie sds, 16 wrested wastewster samples were madon'y
Sed into wo groups—a conto algal was cle in
hal de samples and water hyacinth was cared io the
Cer lf Theat of ire nthe noua ot pe.
ploras was measured i eash water sample summary of
he esas iver inthe accompanying table. Cod
Yet dtermine i there 3 diferece in mean ates of
Insreise af ttl phonphors forthe two aguas plate
ew = 05,
‘ontor Algal Water Ryach
Hib Wier Samples ‘
Samp Men 36 026
Sed Deviation 08 06
$38 conpitesmedated communkation sty. Compu
eit communica (CME) i om af tection
nde compar reo!
say n people incrating va CME to people meeting
Sce4otce FTFL una of Computer eit! Ci
franca, Ae 2004 Pars wee M8 uaa
Xe sides, of wich a were andor anigct he
MC guy and half signed tote FTF pop. Each
up was ven tak hat eed communica ith
th smcp meer Tae nthe CMC weup corm
Se wig the “shat” moe of instances wf
‘a: hse in the ETF grup et in conference ron
The ale of interes, elie! ttimay sre,
enue on a in scl) fr sah paris
ech of te diesem mesing seo Sansnry ai
8 forthe fi meting veo ae piven ee The
Scacies ypotcnzd a, flr te fi meeting, the
Man lal ninacy soe fr pact the
{EME pep wou ewer han th ean eon
Ime wear puripns nthe FTF eo, Tex Be
‘sachets sing =
cee eee
femal A sty wane
ce FF
Nene oftaricpons 43K
Serve evs yas
Mes eae
17 fetes tnd xr mine Sue
Thole (Oct. 1993) repre on way of insecticides
ft on dormant ecards in the San Joaquin Valk.
Gitonis mbin sr samples were ceed and sulyzd
Sy tom creat ite daring the mst itm Pid
‘spraying. The tion and oon ves (ng) in the
Sie Samples ae recorded in the tle swell a the
xenon ation. Compa the mean oxen acs of
e8y and learcloudy conditions tte chad ing
testofhypeheis Use = 05.
Sjorcrano
Sale cancion Thon une ato
sais Fe 2103 20
Fe te 69 2a
Fe m2? 205
‘2 Fe BT ita 53
2 og 2S
0 Cer Ul a8 ae
mew 99 un
Dore oats x0
3 Fe eo 2
B cows mes 2
Bore 49 2
2 Cer M3 tas xs
corto incrsr tad Soma ents sr
88 stacted menu asa. One Heature of wserfcnly
‘Sompuicrinerface ia sched menu diay Each ine 2
‘men item is selctad, a submen is cplayed partly
lover he pret eno, thus ceaing sre of “Maced
menus. The Special Iiret Group on Compas lama
Imeraction Bullen aly 195) sported om «sty oe
termine he effects ofthe presence raven of sacked
‘meme sirctre on sear time Twent-t sujects Were
‘anny pce ino on of two groups and each wes
tsk to search mora-driven sofia package for pas
ticular tem. In the experimental group fn = 11) the
steked meme formet was Usd: the cool grup
(= 10) ony the caret mena was dphged
8. The researcher’ intl hypothesis is thatthe mean
time regied to find urzeitem does tot fr for
the two menu dispys, Describe te satel method
appropiate for estig ths hype
‘What assmpion ae neque for arenes derived
from the analyst be valid?
The mcan seach ines fr the tno groups were 11.02
second and 1.7 seconds respectively Ts this enough
{nfo 0 Gnd! the est? Expat
The observe significance level fr the test, part a
‘exces 0 Inept this est
1839 Perceptions of automaton problems. Acconting toa pop-
ae model of managerial hive, he eumen tt
aos: on ove vaati
dl
i:
2
a
s
:
t364 chapters Tests of Hypotheses
sutomation na manufac fr aaees mangers?
Perens of problem of abfomion. Tp investiga is
proposition. escarchors Cnc Users (Monta
Surveyed anager a fms with high evel of anome
tin anda fee ith Tow level of anomatin (EE
Transactions on Englcerng Management, Aig, 190)
Ech manager was asked giv his or her pecepon
the problems of autoraion athe fm. Responses were
meatted ona Soin sale: Noproblem, 5 Moor
problem. Sumeary state or he wo prope ofan
"ges povided in th tale, were sed oe the pate
sisof modern inthe mean perseption of atomaton
rblems between tatagers of hI tomate ad es
ome amacrine.
Sanpleske Mean Sandra Devan
ow Level wosae 782
High eset 43280 na
Conds the test fr he eaters, assuming ht he
peteepionvanances forthe two group of anaes
‘eel. Use = OL
Con the test for he escort
the pepo variances ifr for manages at
leteland highs ins
8.8 Testing the Difference Between Two Population Means: Matched Pairs
nay be possible wo acquire moce information onthe difference between two popu
tion means by using data collected in matched pur inscad of independent samples
Consider, for example, an experiment o investigate the effectiveness of cloud seeding
in the artifical prodvction of rainfall. Two farming areas with similar past meteor
Topical rconds were selected forthe experiment. One i ceded regulary heater
‘elt unseeded. The monthly precipitation at the Fas will be recorded fer 6 random
selec months. The resulting data, matched on months an be use ots pole
‘ss about the diffrence betwcen the mean monthly precipitation in the seeded ae
‘unseded areas. The appropriate procedres ae suniarize inthe bones,
Large-Sample Test of Hypothesis About (w; — ys): Matched Pairs
One Tailed Test
Hos (1s ~ a) = Dy
He (ia ~ ta) > Dy [oH (uy ~
Two-Taied Test
He (sty ~ ua) = Pv
a) < Dol He: (a ~ wa) # Do
Test uatiie z= 4— Dy ,, 1= Dy
cdVn” Vu
‘where dans reeset the mean and tna deviation othe sample of fren.
Rejection retion: Z> 2, [or2<
Rejection region: (Z| > saa
[ote: Do is our symbol forthe particular numerical value specified for(i — a)
in Ho. In many applications, we want to hypothesize that there is no difleren
betveen the population means: in sch cases, Dy = 0]
‘Small-Sample Test of Hypothesis About (ay ~ wi): Matched Pals
One Tailed Test Two-Taled Test
Hy Gu — mo Hy (uy ~ wa) = Do
He (n= 02) > Dy lorie (wv ~ ta) < Dg) te (un) # Dd
Dy‘88 Testing the Difference Between Two Population Means: Matched Pais | 365
Rejection region: > Rejection region: || >
(oT > ~1]
where the T-disribution is based on (x ~ 1) degrees of fecedom,
Note: Dy is ou syrbo forthe puticular numerical value specified for (1s ~ jg)
in te null hypothesis. In many practical applications, we want to hypothesize that
there is no diference between the populition means: in uch eases, Dy = 0)
Assumptions: 1. The eelative frequency distribution of the population of differ:
ences is approximately normal
2. The pire ffeences are randomly selected from the population
of aiterences
‘Warning: When the assumpeion of normality is grossly volte, the #test may lead
{w erroneous inferences. In tis case, use the nonparamtic Wilcoxon test described
sn Section 15
EEETIEERII | corer noc scans egeemen corse ory ciation tthe tw am cs 0
Y crovoseen
souton
thedataghennTabe as preva sate eusere toda the the ean emmy pecan
at ne sede fam wes exes re canespndr mon forte usesed lem ses fat ng
‘TABLE 8.5 Monthly Precipitation Data (In inches) for Example 8.15
‘Fait eee ested ae
Seeded ca ERS aD
anele e IAs igi ate
4 ig ate a ory eas EEO
‘Let uy andj represent the mean monthly precipitation values Fr the seeded and un
seeded farm areas, respectively. Since We want to be abl 1 detect ty > ey, we wil
ondoct the on tailed et
He: (w) ~ 5) = 0
He (a —
Assuming the differences in monthly precipitation values for the two areas ae fom
‘an approximately normal distribution, the test statistic wll have a distribution based
fon (— 1) = (6 ~ 1) = $ degrees of freedom, We wil reject the null hypothesis if
Tr
oy = 2015 (see Figure 8.16)
To conduct the test by hand, we must first calculate the difference din monthly
precipcation atthe two fam ares foreach month, These differences (where the ob
Servations forth unseeded farm areas subtracted rum the observation forthe seeded
area within each pair) are shown inthe las row of Table #5, Next, we would calculate
the mean dand standard deviations, for this sample of n = 6 differences to obtain the
Rasher than perform these calculations we will ely on the ouput from a compute,
‘The MINITAB printout for the analysis is shown in Figure 8.17. The test statistic
shaded in Figore 8.17, 7 ~ 3.01
‘Since this vale of the test static exceeds the eitical value ¢q5 = 2.015, theres
sufficient evidence at a = .05) to indicate that the mean monthly precipitation at the
seeded fam area exceeds the mean forthe unseeded farm ares,366 Chapter Tests of Hypotheses
FIGURE 8.16
Rejeton pon for
t trition wt
uy ~ 2015 | meee
bed va of ts sti
FIGURE 8.17
MINITAB prinout or Example 8
‘The same conclasion can be reached by examining the p-value ofthe test. The
‘e-taied p-value, shated onthe MINITAB printout, is 015, Since this vale is ss
than the eowem level (08), we reject Ho Infact, we wll reject Ha for any lat
than p-valve = 015.
In the experiment of Example 8.15, wy did we colette data in matched sis
father than use independent random samples of months, with some arsgned to oly
the seeded area and others to only the unsoded area? The answer i that we expected
some months to have mor rai than others, To cancel out his variation fom mth ©
‘month, the experiment was designe so tht precipitation at bath farm areas would DE
feconded during the same months, Then both farm areas would be subjected t0 te
same weather pater in a given month, By comparing precipitation within each
‘month, we were able o obtain more information onthe difference in mean moctlS
Precipitation than we could have obtained by independent random sampling.
Applied Exercises
8.40 computermadated communication study. Refer w the
Jour of Computer Meduted Conmunicaion (Age
2004) say te compre relational intimacy in peopl
teacting wa computermeditd commun (CMC)
to poople meeting fase Tae (FTF), Fxevise 8.36
(p. 503 Recall hat elation itimacy scone was meas
ured (om 37 poit scale foreach parca le each af
three diferem mowing sessions The rscrcher aso
hypothesized thatthe tean lations tia soe or
ftp nth CMC gry wl gia ios
beeen he fn nd hd macings tte oe
‘tween the fst an thind mewn wil not significa
tangs parc anne FT pop
4 Fore CMe pup compar, gee i an
Leman yates fee :
8 Themen ad th compron prt 2
‘sie test. Explain why te data sould be aaled
pepenlentin'88 Testing the oitference Between Two Population Mears: Matched Pars 367
6 Forthe CMC group comprison he rpaed est
fic was 13.04 nth prase O03 pe
these sls. Is the rescarche’hypoens suppor
4 Forte FTF group comparisn, gv he nll nae:
atv hypothe o interest
8. Forthe FTF group comparison th ep tes! sai
tiewas = 39 with p-value 70. Inert tene te.
vols Ihe researcher’ hyphens sppora?
141 pt tocar wanton of genes Symec
{yatohcterum that can grow and survive ina wide nage
of condoms. is used y sient wo moc! DNA beta
nthe one of Bacteriology ly 200), setts
thss and respiration ad iavesigted the seni of
the testo light. ach gene sampe was grown to mice
foveal phn in a growth incbatr in ll gh” The
Has were extinguished and growth assed er
24 hous inthe dank (ull atk’) The lights were then
{ume back en for 0 mines ramet ight) flowed
‘mediately ym additions) 90 mine in the dark
amen! dark). Stained growth mcaeerens in
och lphldark coniion were obtained for 103 gems
The complete dataset i sad in the GENEDARK fie
a forthe fis 10 genes are shown inthe acorpayig
ws, a ype of
vena
‘web Faloee rug woe
S67 —ommse L489 —28809
SURI ows L097 ~068728
SSRI osu — ores 0319
Sums wma 130601 =L.618
‘SLRO33S — —0.20620 171404 73029
SUR? -osia77 244156 ~033174
SUI oma Lassas 05802
SuRIn9 Ossi» aia oases
SI oss sane ~a1664
SUIRS 65866 i904 024820
a0 GT." Genae wae rai arin
FCO J Bota YoU N22
4 Treat the data fo the fin 10 genes adam supe
‘alles rom ue popultion of 108 pens and est the
!opotesis of no diference between the mean stoR-
tie growth of gene in flack ean ant
es in the rnsion ight cnditnn Use a= 01
sea sais software package t compute the mean
lferene n standart growth of he 108 genes in
‘he falar coniion andthe tovie-lght cod
on: Dd te test parca, det his erence?
®
© Repeat pa and fora comparison ofthe mean stan
ania grosthof gone in th ful-dack sono sd
ene nthe sien dak ono,
6 Repeat pats an fra comparison ofthe mean tan
died rom of genes in the taint ight con
ton an gone im th ansen dark sono,
2 Tasting etcronie cicus. Refer to the IEICE Trans
tans on Information & Systems Osh 2008)
tno metho of testing cleconi eivute Enerve 7.6
(9-297) Each I ircite was teed wing the stn
ted coding method, and he compression ato recedes
The dia ae repediced in the accompanying te Ie
theory the Hunan coding meth wl yd a smaller
new conpresiem ato
a. Testhe theory ining a = 05,
Does your concise, pall a, agr6 wit the inforene
erived fem the 95% confidence intra found in
Everie 76
Bewcuns
CreutStardird athod furan Coding Method
1 0 7
3 a 6
4 3 3
5 so si
6 96 8
7 » 82
’ 1 45
10 9s 79
u 9
183 conc pavement response to temperature Civil en
neers at West Virgins Unitesty have developed « 3D
‘model to pod he response of jomed concrete pave
ment 0 temperature vatatins. The Inert Jour
ral of Pavement Encincering, Spt 2008 To valde the
tod, mesel predictions were compu to fel mete
‘ements coke) conse ses variables taken ata hem
onstucied highs) One variable measured was lb op
transverse stain (change in length pr an length ot
uit tine) ata distance of | meter from te ogi
Joint The Shout changes (820 4, 41:20 in lab
top wansverse sai for 6 days ae listen the next ble,
[sth «sigan ference between the mean daily
transverse rain changes rr eld easements end the
3D model? Testusing @ = 08A
368 chapter § Tests of iypatheses
By siaasrRaN
- ‘Change nTanoenesran
charge in Fed
Dey Temperate) atastroment 20 od
Ont ~63 38 “2
Dec. 12 @ 2
Dels 33 as 2
Feb Isa 2
Moc? 17 0
May 26 “0
2) weveur
eaitne scheduing with rbot. Rescuers at Pande
Univesity compre aman reali sheng ipo
owing envroneat a atm opproic ht ils
compiz rts and sensing devices EEE Tras
sons. Ml. 193). The experiment coms of igh sim
lad scheduling pobleam Each ask was performed by
Ivan schedule! an hy the tomate! Ste, Pee
‘muce wis mse hy theta edt he
amber fps produced welhes by odit qty
The reaing ou rats ae shown i the accom
ning able Analye the data wing nto pene
Tack man Shear Aorta wethog
t as 1804
A 1463 ass
3 144 ass
4 ris 26a
s 2m0 2103
i ana 236
7 aes 20
Source: Yi, Y= Lag, Td Meno, HRD heding in
Siglo sin maze Merck Resch Labs conc an
experiment wo evaluate the eet of new dit ting he
Single Tsvim maze. Nineteen mpeg dar atte
apr ad allocated doags of 125 mlgrams of th
‘tu. One ale and one female pop wer anomly shee:
‘dom cach revuig liter to perm the swim mre
Esch ra pp i placed in water atone od the mare
ad allowed wo swim und it scsesilyeseapes at he
pote cd te pup io cp tar cea
do if ped te epg co
‘anand peat aneupioceape, Teepe
i ep i re newt ceapes a aes
play exh tap. The mer fis aid
chpupt pin ice etl eer ope
iether sic cneme n= ID tade
feeace tren he mea smb sins re
saan fra pu?
& mars "i
SSS, ea
sees as
ee sa
oe eee)
SENET 2 SET
aa a |g Te
ees
Sours: Reade, Thus & Merck Resch La BL
846 1200 conamnationTeacondenzp xin TDD)
Sea lgly toxic wince found in india mass A
study as conc to determine the amount of TCDD
Dyes in he tsscs of bale inhaiting the Rocky
Branch Crk in eur Aran an aen known © Be
‘omaminaed by TED (Chomorphere, Fa, 1980. Te
‘eel of TCDD Gn prs per ion was mesa i 5
ral apc sues of for fm bal gs th to
TCDD inthe se 0 TCDD inthe fg mule fhe
ves seared for each, The relive rion of conti
oxo times, the ler andthe ora, ee ive fort
fhe ou fg inthe sscompanying able Accs
{he researches, The dats et sugges thatthe ne]
sive eve of TEDD inthe varies female fg ie
‘hat the mean Heel in the lier ofthe fogs” Tet 8
shies uing a = 08
SH euurnocs
@ c °
2 46
M242
Sere Ketch WA Hinen Es Roe
“wou dsruten of 2338:7eDD ngs aed
42478-1COD canara! re” Chae, V1.
No.2 Fe 1986138 Rind wi yer open
1986 ergamn rest
i89 Testing.a Population Proportion | 369
a7 Modlin anspor of gases. I AUCH Journal Jan 28), a was acute with tbe new mol. The results ae
hei enginers published x aew method for modeling shown inthe table below. The chemical engineers com
subicomponenttaspor of gases Twelve gas mate led that heres “an excellent agreement hetwcen oa
coming of tn, rz, ad elim Wee pepe row calculation and experiments" Do you ope? Your
feo ts nt emperatres, The viet answer shoud ince iscuson of patel ver ta
och ni was eased spore sical infcane
Gwscosmy
scanty weasrorens Vstosny Measinerens
1 200 27% 7 2886 2910
: aan am | 9 a0 sm
‘ 30. 3.050 2 3a0 76
89 Testing a Population Proportion
In Section 8.2, we gave several examples ofa steistical est of hypothesis fra pop
lation proportion p. When the sample size is lage, the sample proportion of sucterces
is approsimately normal andthe ge
real formulas for condocting a large-sample 2
les (given in Section 8.2) canbe applied
Te procedure for testing a hypothesis abouts poplation proportion p based ona
large sample from the target population is deseribed in dhe box. (Recall that p repre
sents the probability of sucess in a binomial experiment.) For the procedure to be
vali, the sample size must be sufficiently large to guarantee approximate normality of
the sampling distribution of te sample proportion, p. AS with confidence interval. a
seneral rule of thumb for determining whether i “sufiiently lage” is that bath ip
ant nj are greater than or equal to 4
Large Sample Test of Hypothesis About @ Population Proportion
One-aited Test Tiro-Tiled Test
Me: P= ro He. P= mo |
Me p> po [oll p< po) Hep Po |
Teststaise: 2 = 2 — Po By
Vinal |
where gy = | — py |
Rejection region: Z > % Rejection region: 2) > san
(ord <—za]
“Assumption: The sample size mis slficienlly large so thatthe approximation is
Walid. As a rule of thumb, the condition of "suficinty large” will be satisfied
when np = 4nd nj 2 4,370 Chapters Tests of Hypotheses
ESRI convey serous ree of wether sien the construct of hima bd Cres he
‘wcrc seas coeae rotiems we Weenie ane cert png ses pe
disuse ge const. On me era tee crprtns dante se ops
ne exer andra that 3% oF al vest sl ergs mopesion show cops
ance wih ome cree crag tests Caen ateam et eng andstclnasy ce
eto 6 ancy sect weg Ses bls a tana to ey howe eld
‘ermane thece etre, n= 05 at su prpeton oh weiner sie! hehe BS
‘Yat snaw"Bro prance ss tha $5, ue qos Wesel caper?
|
station
The parameter of interests a population propoton,p. We want to tes
He p= 9s
Ne p< 95
{|| where pis the ve proportion ofall weathering steel highway’ bridges that show
\| “good” performance,
‘A significance level a = 05, he nll hypothesis will be rejected if
= 25 ta
that A wil be eect if
Z<-1615 (ee Figue 8.18)
‘The sample proportion of bridges tha show “goad performance is
Thus, the test tats asthe value
dm ___ 9-295
Voonin V95)(05)/00
‘The ml hypothesis canbe eee (atx
ith the reestion region Thee is sient ev
iil Propotion of weathering tec high
Zz
-178
05) snc the computed value of fills
dove to support the byposhese thatthe
hay vides that show “ood” performance i Ess
than 98. (Note that bt) = 6090) = $$ and nj = 6010) = 6 encwed 4, Ts
‘early lange enough io guarantee the valiity ofthe hypotess est]
the sample size
|
|
1
tt ROURES.18 4)
Although smallsample procedures are availa
le for testing hypotheses about *
PopulApplied Exercises
{14 unsuited compitr use tn. Comper Se is
tte (CSI survey of computer crime oa sample of 183
repre hat unui uses of busines aes cae
fm inside the company. (Computer Securify
Trends Ne. 7, Sping 2001) Let p represen the te
orton of untied uses of business websites in 20M
Same fom inside the company,
4. Cacao a pint extinae frp
2. Setupihe nul and aemativehypobes oes wh
Cale he ost istic forthe tent, pb
4 Find he ction region forthe test fa = 8.
Use the sat of prs ead to make te appoprie
{Fin the ale ofthe testa confi thatthe conc
sion ised onthe vane grees with the onlaion
9 Tone chemical cients. Reter to the Process Saf
Progress (Sept. 2004) stody of sh enetgeney eopene
!3sem for inde invlving tose chic Tawa,
ves 3.1 (p68). ha smpl of 250 toni chemical i
sides loged since the syste was implemented 5 0¢
aret in a school labortry. Supine you ant 10
Sodus a et of hype to detrei the ve Fe.
ettags fone chic nce nT hat oe a
schon! aba is es then 10%
4 Seep he nl and atertivehypotess or thts
1 Gre tbe jection esi era = UL
© Compete vale of thet statistic
4 Give the appropri conclusion fr the est
440 sar insects wth ow oxygen. A grup a Avalian em
‘ecingcalwoxicolgitnvesigaed the impact of exp
ft low oxygen o the molto mets, owl of
Arcs, Bilonleal. and Exvinmmental Sti
S91. 200.) Thousands of dal ce weevils were paced
'ochamber filed with seat rain ate shar 8
peo nitrogen gos for days, Inch mre seed
4 dead or alive 24 hours afer exposure. The res
31.386 dead weevils and 35 woes fond live, Previous
ates have shown 999° morality rte nad ce wee
vis xpos carbon dni fer # das the marty
{3 fo dul ice wevis expe to mirgen higher han
Teasing @ = 10,
55 Freon in paper ecng process. Reser a the Un
‘enity of Roches sted he econ that oc the
Berfeeding process af «phecopir Geum of igh
ering for Industry, May 1983), The expen
Voted monitoring the diplcement of india sheets
OF Paper na stack fl thr the sop fn ses x
{GP the op one moved more tan 2% of he tal soe
tance he fed was coniered sucess. na Sack
100s of pope. the feeding process was ceil 9¢
The success rato he feeder i denied 0 Be.
‘to detcrmine wher the tre sce rif three
eres 90, sea ~ 10
'89 Testinga Population Proportion | 374
852 PhO’ in engheerng The National Sine Fount, ina
survey of 2237 engincriag arate sets who ena
ther PRD degrees, oad that 07 were US. cen th ra
Font (1.490 ofthe PRD degrcs were war fri
onl (Scene Sep 24, 199), Conducts ett deere
tier he time opin of ening PAD sears
smd faci anal exces § Use w 0
853 Cul phone use by oriers Refer tothe US. Deparment ot
Transponation Gly 2001) stay fhe level of cll heme
se by chives while dey are in he at fdivng a motor
pmseget vehicle, Bree 752 (p90), Recall that fora
Fandom sample of 1.165 divers elected cross the con.
ty, 35 were sing ther al phone,
Conduct text (at = 08) 10 determine if, he te
shiver cel phone wert ifs fom 02
Does your concn pata pee wth he inference
UH derived from the 98% contdence inna orp
xetne 752
ty oF har So. Mesos (March 195) seper the
‘esas of say of lina sl volt, Daa were oained
from te Apoto 16 misicn wo hers, ring whi 2
2-emeue was extracted he snes the nse,
Monoainraie pans of lina el wet seated et an
amined fo cating with dan las
fran mas the clasife scte or nected. OF teres,
Sth “soa nde tht the propo f gras tha se
oa. Accu to sal evolution thom. the cet ink
silenced Sarthe top ofthe coe cpl inte deed
the core nd al eo he eto the cre Use te
summary aan te asonparying abet et cach pat
the wre pan theory Use = 05 reach et
‘ecsten (sth
Top wide Bottom
(280m gatem — (stsum
owber of Grains 84 ” a
Sampled
Number Coad o »
“Ses A. and MOG DS, “Lamar wl evn perc al
Alo 1core DOS Mess, Vl 0. No > Mr 338,
855 umanindiced disturbances of le. Asp fe eva
tuto fran environmental impact semen of «propio
Iecectc design on the Sine River Bish Colum
te esac candice primi insists of te
cfc of hamanndhceddittencs onthe bavi oF
"heed! mnt goa peplaion (Enronmenta Man
‘agement, Ma. 1983). Got response 0 explo set
tis, noi coso-Ayng Heep, fl-wing ata
nen bipedal moves oad Bat from pola
‘ling atts, ware record orn = 804 gts. The
teachers observe tht 255 eats played seve igh
rps tolocal ck or pl cove Test the hypothesis hat
‘rer 30% ofthe resent mown gas wil show a severe
ponte to bunatrindaced Ssttbances Use a = DSyy | i ss: 7
| 372| chapter Tests of hypotheses
8.10 Testing the Difference Between Two Population Proportions
The method for prfoming a argesaple tet of hypothesis aout (p1 — phe
| rence between two binomial proportions, is outlined in the accompanying bon.
‘When ing thrall hyped hat ( ~ p) eas sore spiel rea
$33, Dy_we make disiction between the ete Dy = O and the cise Dy 0 Fat
the social case Dy ~ O.¢, when we are testing He (p, ps) = Dens uae
Ho: Py = phe best esate of py ~ po ~ pis ound by dividing te el meas
‘of successes in the combined samples By the total number of ubservations in the twe &
sanples, Thi iif she nonber of sucess in sample | and
suezesse in sample 3, then
the numberof
In this case, the best estimate of the
i tndard deviation of te sampling distribution
of Gi ~ fa) is found by substituting for bath py and p,
For all cases in which D # 0 [for example, when testing Ho: (D1 ~ Ps)
‘we se and jin the formula fro, py, However in most priced sittegons
will want to test fora difference betteen poportions—that is, we will want te
| Holm p
i Large-Sample Test of Hypothesis About (p,
One Tailed Test
ii} Hs (Pi ~ pa) = Dp
Me. (Bi ~p) > Dy (ory
~ P2): Independent Samples
Two-Talled Test
Hs (D1 ~ pa) = Do
7D)
Rejection region: Z >
faren
i When Dy #0,
where dj = 1 — fy and js = 1 ~ pp
Wes Dos 0
id)
‘het he total umber of sccesses in the combined sample is (9,
owes
+ wand
| Assumption: The sani
fied if my = 4, mi,
Pe sien and are slfcinty lage. This will be sat
= fand mis & ds my = 458.10 Testing the offence Between Two Population Proportions | 373
he sample sizes mand must be sufficiently lage to ensure that the spl
dstributions of and fz. and hence ofthe difference (> — P). re approximately
normal. The rle of thumb used o determine if te sample sizes are suffice large’
isthe sume ws that given in Section 7.8, namely. thatthe quantities mPa mh
and pare al greater than or equal to 4. (Nove 1 the sample sizes ae not satiety
large, pan pean be compared wing a technique to be discussed in Chapter 9)
(EEDEREAII | verey sere rave neen terse campaians encoun pol to save ener by capt o
Solution
‘wk Sone does have treated an newer carpocng 2) Obata cram Rhy Hae es
{5 "tarool one, on cars wth oo rae pasar can ae Ces nl elute ee
fecveness of sl Dat persimmon cy mentees 200 radoM eco cars or
‘move passengers carpool ders) nthe “betore" ana after” sample, repectvoly Do the cats.
TABLE 8.6 Results of Carpooling Study, Example 8.17
‘Before Caron ‘ter Ca-Poo!
Les estattshes Lanes Established
Sample Sze = 2000 m= 1500
CanPoot Riders», = 652 += 576
If we define p and pas the tru proportion of curs with cat pol riders befone: and
| after establishing carpool lanes, respectively, the clement of our test are
Hy (1 ~ pa) = 0
eed 20
(Th tex is one-tiled since wea interested only in determining whether the propor-
tion of ears with carpool riders as increased, ie, whether ps ps.)
Test static: 2 = "P=?
Rejection region: a = 05
Z< ey megs = 14S (see Figure 8.19)
‘We now calelate the sample proportions of ears with ear pool riders:
62 576
= eb =
= 000 ~ iso0 ~
“The test statistic is
i-f)-0, iri,
where374 Chapter. Tests of Hypotheses
Ths,
= 058
aa ee Loe
VESD(6N 5500 * 7500)
Since Z ~ ~3:56 als inthe rejection rego, ther js sufcient evidence at we = 05
| te somelade tha the proportion of all ars with ear pool riers hes increased tere
| ablshine earpoot lanes, We could place a contidence interval on (py — ps) if we
‘wor interested in estimating the extent ofthe increase
FIGURE 8.19
Example 817
Applied Exercises
5 woe
36 Groundwater comination in Wels. Ref wo the En
onmental Science & Technology a 2008) sty of
methyl redugl ether MITRE) ‘contamination in
New Hampshire wells, Exercise 749 (p30, Recall
that 223 wee were clnifed aconing tp well cat
(public or private) and deacubl level of MTBE (hte
imi or detec. The MINITAH pitt blow ives the
number of Wels inthe spe witha dette lel of
BITBE for both the 120 puis wells nd the 108 pine
Conduct two tied et of hypothesis to copa he
trve proportion of public wells witha decal
(MINITAB Output for
Exercise 8.56
a |
ef MTBE tothe ie proponion of rivate well witha
detectable Jv of MIME: Use 0,
2 nFxccise 70, you cece the wo pope ih
498% cotdence ine Explain why the ie
Serv from he two-tailed et, pat, wil ae ih
the free dries rom the coin na
{857 Coverage of tt mecancs. The Jum of Profesional
‘ues Engicering Education an Pracice Ap 298)
"epoton theres 208 suey of cores eed
a undergraduate enginecringprgruns. OF the 90 £8
erin rotons that pug in the 2005 sae 8
‘sere Mid mechanics. na arvey taken 20 yen ci
‘Tabulated statistics: MTBE-Detect, WellClass
Rowe: MTBE-Detect colums: YeliCiase
Private Paplie att
so ass
Devese 22 48 70
an 102320 233,
Ceit content: coune{8.10 Testing the Difference Between Two Population Proportions | 375
(EaginceringEacaton, Ape 1986), 66 the 10 under
fradateensincring rostams covered Tid mechan.
Candi es deter wheter the ction of unr
rsdmteenzincering programs covering Mid mechanics
increased tom 1986 (02005, Use a= 01
{9 mace oan gas structures. Refer tothe Ol Ger
Joaral Yan 3 3008) sty FADD i and ga str
inthe Galf of Mexic, Exercise 3.179. 79, The accor
ing able beaks dwn these strats by type cal
Son, well protectin, rfid platform) and sts (ate
mete) Assume the 3400 nuctre ar aepresenttve
‘ple ofall land gs strates workwike
_—__Giiscn Well Protector Fived Pati Totals
fitwe 503s 1a 7s
ee $98 tT 40125
iia 0142 ian 3400
© Comite sts (ata ~ 10) wo determine if the propo
foe of eason sates hit ae inv een he
‘oponion ef wel pts site that ite
Const atest (ate = 10) to determi if the po
Pron of cisonsotres that ae inactive ences
fe proprion of Fixed platform scares tht ae
6. Conic tt (at — 10) determine i the prope
om of wel protector sutures hata nce ile
‘fom the roporon of fined plufomn siete that
$59 tng inscs wn lon cnygen Rel to the Loar of
Aue, Mog, ond Exner Sees
(See 200 sy ofthe monty of noe woe x
ed wow ongen Exec 950. 11) Reval at
3386 of 31421 ce wees were fund deal er
poi niogen ga for 4 days In a ex
‘ne. 23S16of 23676 ce wevh wee od del ae
tsgore wo nrgen ga for 83 aye Cont tt
ype wo compare the moray er of a nce
Seo exp mogen ae 0 ego Us
‘thoes pica diferente ~ 10) he mak
ign?
4 terri rates of tree gs. Slemiss ave inked ease
‘topic decine im ene numberof fogs ibsbig the
‘ett to uraviolt radiation fm Earth’ tated o7o0e
Laer Temps Pribune, Ma. 1, 1994), The Pai eo,
however, isnot elie tbe in decline Bees pro
ces enzyme th appears to prec exes Some
theo radon. Researcher at Oregon Stat Univer-
ty compared the hatching exes of vo grape of Pie
8 frog epes. One group of eggs wan shilled ith
rave oeking an shades, whee the second gop
was nt The number ofa stccestilly habe in each
‘rp is provide nthe tae Compare the aching aes
‘tthe too groups of Pai ee fing exes with a et of
Iypatess, sea = 01
‘Sun Shaded Eggs _ unsneded Ease
Test Ne 7
umber Hatched M
Bot Engeerng vs tchralagy agrees. nadine
dina! tacelor of engincering (BE) decent un
ersten worldwide afer a bachelor of echtsogy
(Bech degree for suaets who wish werk san eu
exting technic. There perception th BTech st
dents are mo a6 “academically rong” as BE students
“hiss was dressed inthe Fternational Journal
Cominung Engineering Education and Lfiong Learning
(1 13,2003) The reseatches compass Ban Bech
sudnis at an Atman univenty on a varet of
fcaiomieeaed outcomes, The following ble shes
the peceuages of BE and BTech side's who Sith
‘es from so waitonaly rigorous couse, cegiesing
smatherisice and engines graghis/CAD.
Tech Students
‘ngneorng Mememotcs BE Students
Number Ele sr 117
Percentsge Wahdraxn 788 w9%
Engneeing rapicsCAD BE Sucerts BToch Stents
Nbr kre m Aa
ws m6
iene Pale Sad Bi, "Compe dine potiemens of
‘ticrng ed icky sch ot Dein Uns, Aaa
lng Leming N13 Nor -5, 2008 abe a
4 Isha slfcom evidence of aitferenceheineen the
Percemage of BE student and percentage of BTexh
Stents who witha ttm engineering matheratics?
Testusng #05,
bs Isthere suiciat evidence of adfeence Between the
percentage of BE stuns and percetage of BTech
iden who withdrs fm enginecene graphic
CAD Tet ing o = 5
8.2 Active varus passive solar heating. Howe solar bing
system con be estegorized ino two group, pss sla
eating ystems anda solar heating tenn ps
sive solr heating sym the haus el or energy
Collec, wher an active Solar esting system, cla.
trate mechani equipment is used cover the Sus
rays imo heat Consider the diference hetween the
operon af pase solr nd activ soaring systens1: tt _ ii .—_
376 chapters. Tests of Hypotheses
sed 30 ce etree foes a ace a
dence ofa iference hetmeen the proprions of pairs 200 Gallons of Last Wear “6
and sctive solaheated bes tha eued les han 200 eoor_ oe
8.11 Testing a Population Variance
Recall rom Section 79 tha the pivotal stats for estimating « population variance
(0° doesnot posses anand noma (Z) distribution. Therefor, we anne apply the
‘Procedure outlined in Section 83 when testing hypatheses about o>
‘When the sample is selected from a normal population however the pivotal st
tstc possesses a chi-square % distibuion andthe test can he condted ss ound
in he box. Note thatthe assumption of normality is equiedrepardess of whether tbe
sample sizes arg or smal
Test of Hypothesis About a Population Variance a?
Me = 05 My a = a8
Ne o> of Ne a 40h
VY > x (ory < yh a) 2 < yiwr0ry? > Xe
here and 7-1 ae values of that locate an area of the igh anda he
et respectively, of chi square dsrituion based on (n — 1) degree of feed
(ote: a9 is our symbol for the panicular numerical valu specified for 0 in he
rll bypbesis,)
Assumption: The population from which the random sample is selected ha an
approximate noma dssibuion
Ean | toss bum 7.46. 30 arcn be vain ra off at cn SER
usu ec pi Ba he nad Govan stan oases
| Segetesiam cme sexier eaten ese
Solution | Since the nll and alteative hypotheses must be stated in tem of? (rather than
1g will waot wo test the null hypothesis that o? = OF apuint the stemaive thal
@ <1. Therefore, the elements ofthe test ae
Hy @= 01 (isso =.1)
He 0 < 01 (ie.a <1)
SD ruvwrs | TABLE 8.7 Fl Weights of cans
796790798 aoa oi =
796150198 an 79179 aanMean
Stanoard Evror
S\Mesan
Ficure 8.21
EXCELPHSTAT2 pinot for Eepe 8.18
alle:
8.11 Testing @ Population Variance | 377
FIGURE 8.20 -
Example 818
Assumprion: The population of fil amounts i approximately noel
Test state
‘Rejection region: The smaller the value of s* we observe, the stronger the evidence in
favor of Hy Thus, we reject Mg for “Small values” of the test satis
tic. With « = 0S and 9 df, the 2 value for rejection is found in
Table 8 of Appendix B and pictured in Figure 8.20. We will reject,
| Hoi x? < 3.32311. (Remember that the area given in Table 9 of
| Appendix B is the are to the right of the numerical yulue in the
table. Tus, to determine the lower-tal value that has = 05 1 its
deft, we use the 1s column in Table 8,)
‘To compute the test statisti, we need wo find the sample standard deviation, s
[Namercal descriptive statistics fer tho sample data ate provided inthe EXCEL printout
a A Bi
Tchi-Square Test of Variance
BZ
z Data
7809 4 Nall typatieais or 04
no1saie 5 Level of Significance (0.05)
1 sanpie ie “0
EL 7 [Sample SundardDevnion 63
oe gy Tmerrediatecalcusiens
o47e71 a
rear a ]oegees of Freecom @
aa Higjuat eo Te
Alon Square Sttstc 1.6841
2
| Lowa Tall Test Renu
Tower Critical Value 332511514
ayp-Valve 3521
Reject the null pothesis
ens
ones in Keats
i378 Chapter® Tests of ypotneses
shown in Figue 821. The value fs (shaded onthe printout) ss ~ 043, Substining
$= 043.1 = 10, and = 01 ito the frm forthe test statistic, we bain
2 _ (10— 160437
x om
[Note that this test statistic andthe comesponding p-value ofthe test 0043) are bah
‘given (shaded) atthe botom ofthe EXCEL print, Fie 8.21
= 1.664
Conetusion: Since the tet staisic, x2 = 1.65, is less than 3.32511 (or, snot
4 = 03 > p-value ~ 004), the supervisor ean contac (at = 5) tha the vt
‘ance ofthe population of all amounts of fils ess than 1 (@ ~ 1) this procedure
is repeatedly use, it will incorrectly reject Hy only 5% ofthe time. Thus, the quality
‘contol supervisors confident inthe decison tht the cannery is operaing within he
deste limit of vrai
Applied Exercises
“ronoice
1868 Albedo of ce meionds. Refer othe Nationa Sow and
Ice DataCenter (NSIDC) collection of dato he be
of melons, Exersse 770 409) The vibe beso
‘alae fora sample of Sb ice meld lene i he
Canadian Acti are ss inte PONDICE ie,
4. Condbet a west (a a = 10) t0 determin if the ve
‘ariange of the sible albedo values of all Canaan
Asc onde yom 0235. (hie To 308
2 Discus the practical sinfciace ofthe tes inp
(Win Use the 0% contence interval you foand fe
8.64 Deepa aig. Refer othe Journal for Engincering for
‘cdsry May 1993) stl of Sep oe ig ae il
hip congestion, Exercie 8.23 p 383), Tet to determine
Uwhchr the tme standard deviation of dil chip enh
Airs fom 75 om. Rec hat form ~ 50 dt chap,
= 502
868 Becca sgn! eon: Reering lst tivity of he
‘rai is imports in inka problems aswel an ocr
plysolgial research. Ty improve the signa trmase
{ato (SNR in the electra activity. ts necenary tore
‘Peay stimulate subjects and average the rexpemer =
4 proved tha ses Ua single espns ar hemo
neous A say was comdcted 10 fst fhe homogencan
Signal thor UEEE Brsinering i Madiine andBeogy
Magne Mat. 190) The nl hypetess ta the var.
ance ofthe SNR reais of ut equals the “expec
leseunier the homogeneous seal theory Fo th sa,
the “expected” level was tse tobe SH Mf the SNR.
variance exceeds this evel, the researcher will code
‘hat he signal ae nontomoencous
© Sec up the mull and stertive bypass fr the
researchers
0. SNRs recorded for sample of normal chien
‘anged fom, 10.30. Use thi information to ban
1m estimate of the sample standard deviation, in
16
Asime tht the dsb of SNRS is norma nd
"ant Fhe SNR inthe poplin wl al iin
64 eke, from w— 2e toy + 27, Noe tat he
range fhe nerval equals 4)
© Use te eine of sin pat fo cont the est ft
aTetuinga 10
Water decors: The mon common mtd of ie
Aetng war for pouble we fee rsa elon.
Preammoniaton ie thai of ammonia othe wit
rot to aplying fe chlorine) is reeved amir
‘eto as an altemiewesment. Ino ty, 44 Wl
‘pecinens tad with prearmoniticn were fan’ ©
Fave mean Maen cay of 1.8 anda tan >
tin of 16 American Wier Work Jour Jn 1988-1
‘here suticicnt evidenc to neta tre vain of te
event bidity in water specimens feted by 2
FPesmmoniston meth exceeds 0016? (Te vale £016
epee th known effet tril yan of malt
‘ecimea weed wit fe in, Test sig a = 0
fiber cement caine. cmpuny procs 2 fn
deiner cenentin ace minom asta
ity como inpecor fined a sng wt
‘ates of th amount of bbe comet dpe
‘bec toe thn3.I'enthe opr ech
‘eed of asin. Sie inspection fe cnt
bros resus hat the oping machins be
Sovr, ad snows ay ag pref
be company than are oes
sper ae cai rand saple ofo 10e
fo teing Aft messing the weight to one
‘be mectorcompues he fllonng says
Y= MSSoues 5 = s¥ ounce
8 Dees the sample evidence inde tat te dseasi
mca i mee of sme? Tet
ance evel a — 05,
'. What assumption inecesary forte hype testo
portato beta?
a418.12 Testing the Ratio of Two Population Variances 379
Pe ntcractsamcfinpomniatectecaaaee, | TOA
pics, are exremelyDivaows comin va ase
aed te envicnmet. The Exvioumensl Proscon —_ :
“i gieipeetond ieee tags Suppose the ERA requis an inramctthayics PCB
ee tee radish 4 varaoe of ls ha. Does the ew
tenon he same ish spe The ia are ec here
fps perl
8.12 Testing the Ratio of Two Population Variances
As in the one-sample case, the pivetal statistic for comparing two population vari
ances, and, as u noanormal sampling distribution. Recall from Section 7.10
thatthe ratio ofthe sample variances s/s} possesses, under certain condition, an
F distribution
__ The elemants of the hypothesis test forthe ratio of two population variances,
«ij ae given in the box.
‘Test of Hypothesis for the Ratio of Two Population Variances
Indepencent Samples
One-Taed Test Two Tailed Test,
Hs H 1
Hy Sn
[seme
Test stat: Test satis
Larger sample variance
~ Smaller sample variance
whens? >
when sd >
Rejection region Rejection region:
F> Pe F> Fy
Wwinee Fad Fa ae values that locate area and a2, respectively, inthe upper
tal of the F distabation with », = numerator degrees of feedom (ie. the for
the sample variance inthe numerator) and v3 = denominaror degrees of feeder
(ie, the f forthe sample variance inthe denominaton.
Assumptions: 1. Both ofthe populations fom which the samples ae electe have
relative fequency dstiutions that are apprimately nora.
2, The random samples are slected in an independent manner from
the to populationsSSE? —CC~é~ésrS ll
380 chapter & Tests of Hypotheses
Example 8.19 How doses of eee axe 7) nas have bean ehown to ae sipnicany he Ona se
{ue of cos Atoughitis known mutagen a suspected carregen E10 ae ute a,
‘nsteritnaosoealsuppies asi as conducted omits eect of 0 on hostage
sormeimoted wih te stetzan posse Throne suet wae tendon) selects a
Sipe 0 orw of wo Usk. Tee sects wer aid tho task of perng ane acaone
Sterier un led wth EO eas 1) The remseeg 8 sds wee sodbnes ean of ease
Steheatonpekage conaing eT (ask 2) ater els were pred Tenens eeee
tho aunt of ET (nga reer in tre atebeam cf och sect & stmny of ese
{ul gprs in Te. Do the cata rove seer ence onde aero tea
bility of ET even sites asned foe wo ast estas te
Solaton | Let
HHH} TABLE 8.8 Summary Data | o} ~ Population vasiance of ETO levee in ubjecs as
ill {for Example 8.19)
woe? | For his esto yield valid resus, we mast assume that bath smpes of ETO ies
i Sample sie 13.18 | come rom normal ppeaion andthe samples inne
Mean $00 so | Thebypotheses of meres are hen
Standard
Deviaton 30193 |
ed task 1
| 3 = Population variance of ETO levels in subjects assigned task 2
ki Al (oteod
The mature ofthe Fables
ven in Appendix B affects the form of the test statistic To
{i form thet
ection region fora two-tailed F test, we want to make certain tha the
{ ‘Value fora, since we double the probability ha the F
ait always placing the larger sample variance in the numerate, That
j ‘wo-tailed by pling the larger varane inthe numeatoe rather
is, we make the test
than establishing
jection regions in both tal
Thus for our example, we h
i ° ve a numerator sf with df = mj — 1 = 12 and 2
17, Therefore, the text statistic wil be
Smaller sample variance ~
Larger sample variance _
and we wil rejet 0)
fr c= 10 when the calculted value of F exceeds
| the abated value
}
Fos =
*
‘We an now calle the vale ft test statistic and complete the analysis
#_ G10?
i pa t_ G10? _ 961
4 (193)
tS Ne Soars this to the rejection region shown in Figure $22, we sce tat
| F = 2.88 falls interjection region. Therefore, the dat vide secon evs
on variances der.‘8.12 Testing the Ratio of Two Population Vanances 384
FIGURE 8.22 ae
Rejection pon for Example 819
Faas
Test for Equal Variances
Test stacistic = 2,58, p-valve = 0.073
FIGURE 8.23
MINITAD prinout for Fasepo £19,
[oe You can also use the p-value of the et to make the appropriate conclsion. The
| peal fortis wove Fests shown add} onthe MINITAB printout Figure 823,
Since pvae = 73% ss than ~ 10, here alien evens oj
What would you have concluded in Example 8.19 if the value of calculated
from the samples had no fallen in the rejection region? Would you conclude thatthe
null iypothesis of equal variances is true? No, because then you risk the possibility of
2 Type Il eror fling to reject Mo if i Wve) without knowing the value of. the
probability of fling to reject Ho: of = oii Fact ts false, Since we will not con
Sider the calculation off for specific akernatives, when the F statistic does no fll in
the rejection region, we simply conclude that insufficient sample evidence exists toe
fie the nl ypesis hat
Example &19illusrates he technique for calculating the test statistic and ejection
region fora two-tailed test to avoid the problem of locating an F value in the lower tai
tf the F distribution, Ina one-tiled est this is much ese to accomplish since we an
‘control how Wwe specify the ratio ofthe population variances in Hy and Hy, That is, we
am always make a oneal estan wppersufed est Foc example if We want est
tahether of is rear than then we write the aemative hypothesis as
|
|
|382 chapters Tests of Hypotheses
Al
and the approprite test stati ip F
ish. Conversely if we want to test whether
17s fess than (Le, whether is greater than), we write
a ,
ts > oi)
andthe comesponding test static is F = 3h
pplied Exercises
69 Ares o forest acess oth. Refer th ert
Weorsrorame
¥
ara of Fors Engine ly 1935) ty te
es finest ces ods Exe 1769 33)
Recalltht hansen sue dion illite) as
‘messed for independent rn spe of 32
sobyrade acess ads and 40 pest sabre asexe ma
The wats are eed in ie ascerparingTie
stay scot o compa tema oxen bs
2 & Ciena ecrd unr (80 ‘eater condone
{ogg a cleo. Te das ave nthe ORCHARD
‘i Tet th sumption of egal variants reed fo
cmp of meansto fe valid User = 8
1872 Accuracy of wet samples. Wet sampler re stand
2 LS ves wet esi the chee compton =
Pace Sige tion. The seaey of the we depsion realing
= weve ay depend onthe nuke of pls sates
Mineral Peat inte Fel. xpenncne in The Neier cellos et
Number of Roads a « epson mewurements using anywhere frm one cet
Mesias Ogee ee Mei wet samples (Amarpher: Esra. Nl. 3
1990) Far each sampler (or sampler combination,
Standard Deion 339 a ‘wee cole every 24 hos er an ete Seat, 18
Compare the surface detection variances ofthe two
avement types ith tweed et af hypothe
thing o = 08
mn Exevse 776, you wed 3 98% conden intra
1 compare the soe deflection varkances. Dem
state thatthe nfrences derived fromthe tent and on
fidence inerval ate eat Wil thi always ee
as? Epi
70 Cooling method for Bus turbine. Rest the Journal of
Ennering for Gas Turtines and Power (an, M9)
Sty of gs turbies augmented with hgh renee
fogging. Ents 8.3 (p42) Hest ate data ji
Pe kilowatt er hou) fr each of tee types of goer
ines advance, acrdervatve, tation ste saved fn
‘he GASTURBINE fil In orto compare the mean
hea rats of eno tes of g turbines you smd at
the Bea rate Yarances wer al.
readogs wee collected er sample or sample emis
thm, When oe wt sampler was we the ston er
Mon of the Iyxogen reins (eased spel
‘ate othe verge eda fom al eight seple) =
4, When he wetsanples were wed he salar
‘inion ofthe tydogenreadng ncasrel a perc
‘iin othe average eaing from ll eight simple) =
26% Conduct to compare he vation in oe
‘alas fr the two sung schemes (Leone wet
ler sera hee we spas) Tex ing = 08
{873 Cooking tron of-beams. An experiment nas ere
(0 sty the fc of mints flanges on the sia
‘spac of enferced eonerte Ehea (Jor of th
American Concrete Insite; Janes 1986) Sever
ferent type of beams wer ud in he experiment cc
‘ype having ieen ange with The beans wt
ed uader ‘combined torsion aad bending lf
(sacking) One variable of iret is the cracking Ka
Ione the top of the Mange of the be. Crk
fori moments for eight bess ith Zoem sab 0
And cight beams with Om sah wichita flow
Conducts text (at a= 05) for equality of eat rae
‘ariance for rational and aerodervative apm
10 urbines, Use the rest © make a satenet abt
the vali ofthe inference derived in Exerc
2 Conduct a test (at = 5) fr exit af ea rae
‘ariances for aivanced and aerderivavesugmeted
tubes Use the resi 0 make a satemett abot
the vay of the inference derive in Exerc 33
oy oncnaro
'E7% secretes used m orchards. Refer to Exerine 437
(0.49) Real haan Ensim Scece & Techy
SG voeams
Ria Pa Se
Sieh Wii: 600,720, 1020, 13.20, 1140, 13.60,920, 1120
100-0
Sth Wid 680,920,580, 13.29, 11.29, 1490, 1020, 11.50
Is thr evidence af itfeene inthe vation i BE
‘gactns terion mens ofthe totes of Team”
Usea= 10.
Whit asumpcions ar rogue fer he est to be vali?8.13 alternative Testing Procedures: Bootstrapping and Bayesian Methods (Optional) | 383
{74 peste water poten Rete the pene ute organ
study dssmcd in Estee 671 fp. ta). Beteet
cules coleced at wo sewage Metaet sis in 35 7
ge irene ss in w= 53 Ha 42
thre ado othe i sewage ft. Sn he Sire Fein MT "Aa of
‘Bhsicied dh! the TOC levels fund at Foxcote villhave
srt ataton ta the levels Bedlon.Does he sane
pubes? Tae = 05
Hleinforaton supp his
‘Theoretical Exercises
Bi Rapcec we ween siete] =f eime mse} Gd tee F depends on vj pumerte an oy denominator
Stow that he eect eon given by sand depends > numer and», denomin-
3 erat
use Fora 876 seth rests of Execse 875 to shom hat,
mew F depends ony = (ay ~ 1)d0 and 99 Lager sp vaiance
1) af, iseqivlenc othe jection eon given by ee =e
shee F depends om aumeruior d= (Sample size for
bere depends en»; numerator fan denominator
aor (ie
\ Stes variance * )
we depen mea an rin
Hint Use he fat rt mid) at Hise d en
itn A qe
Aa Then we the fit tat PF > Fyn) = a
8.13 Alternative Testing Procedures: Bootstrapping and
Bayesian Methods (Optional)
In optional Section 7.14, we inteduced two alternative methods for finding con
ence intervals: the bootstrapping metho apd a Bayesian method. These procedures
fan also be sed 0 conduct asatistical test of hypothesis. In certain sampling stun
tions the conclusions dawn from one or Both ofthese methous may be mote valid
than those produced using the classical tests of Sections 84-812, especially when the
data do not adhere the underlying assumptions,
Bootstrap Hypothesis Tests
Recall thatthe bootstrap is Monte Carl method that involves resampling —that is,
taking repeated samples of size n (with replacement) from the orginal sample data
fet. The bootstrap testing procedure uses resampling to find an approximation forthe
‘Observed significance level (p-value) of the test. The tes required to obtain the boot
sirap p-value estimate for atest on x population mean are listed inthe box.
annucis Kugsaal
ii
\ee
384 chapters Tests of Hypotheses
Ho
| Bootstrap p-Value for Testing a Population Mean, Ho:
Tels 93
> Fepreset a random sample of size froma population wih
sean £01) =p
Stop 1 Calcul the val of het sats forthe spl: fogs = (5 ~ uo) (wVA
| where isthe sample mean andthe sample standard dation
Step 2 Select, where jis te numberof times you will resample, (Usually jist
very large mmber sa, j= 1,000 or j= 3,000,)
is take each sample y value, subract the sample mean, then
step will generate sample
scan inf)
oy Th
Mp his
with a mean equal to the hypothesized
Step 4 Randomly sample, with replacement, m values of X from the transfomed
sample data st i. 80
| 99S Reve sep atoms
Step 6 Fer eah bowsap sample, compute the tes static, = CE
where and
strap sample
poylg VA)
ore the mean and standard deviation, respectively, of boot
‘Step 7 Find the bootstrep estimate! p-valve— dy (in: Use the posterior distin, g 5 He < bp int Use the posterior disbuon,
foind in Exec 794 (p38, ly). found in Exercise 795 (9 225)
Test sttisic 937
Theo-tiled statistical tet s
Type Lemur 338
“Type I eror
Uppersiled est 346
‘deena
None 348
‘Normal population 350
‘Normal popelaion 376Tm EES ZZ
390 Chapter 8 Tess of typotneses
| ‘Summary of Hypothesis Tests: Two-Sample Case
Parameter oy Hypothesis Test Statik
(md (am) =) =H
Independent dt we saat
between and
then Dy = 09,
whore Tis bane on
P= +m— 2aand
(w= DAH
Doe d= Bydin p= aD
det sane where “
a then
=e Fords = 0
Awe wae nee
seu
diterence
etree py amt
Psthen Dy = 0)
- 4 Fortzet > ok Fe
Focitg
"ona
Samiesio _Assumotons
23m 230 Nowe "
Fitern, < 300¢ Bath pops 399
‘mvs 30 orth normal vith eq
which # 02.88
the moicatons
sen the bon
op. 36)
Allng(ltmy > 30, Population of 364
ttenthestandid erences,
myaninglage Independent
tough so tbat samples
mina 4
Ally adng Independent
ocmalpoulsionsLANGUAGE LAB.
%
z aa
A eta
4 theta mht
a naught
a Siem-squed naught
Chapter Summary Notes
+ Blements of a test of hypothesis: m
"ejection region, p-value, and conclusion
* Twotypes of errors in a hypothesis test: ype I error (reject Hy when pis ru), Type H error (a
fae,
* Pohablies of enors: = P(Type Terror) =
fake
Supplementary Exercises 394
Descreton
Nall types
Probability of Type Tere
Prob of Type I ene
Hyped eof popsltion pm in Hy
Hypxesiad ah
Population man in
Hypoxia vole of popsation diference in Hy
Hypodiein vale of population tae in
hypothesis, alternative hypothesis, test statistic, significance level (a,
pt Hy when Hg
Reject Hol Hy tue), B= P(Type M error) = P(Accept lM
* The forms ofthe altesativ hypothesis: Howertiled test () upper test (>), two-tailed test ()
* Observed significance level (p-values he sles alc of tha can be wc cjet heal hypothe
* Dechion rue for rejecting My 1)! tats rejection regen of 2) ovale < a
* Power ofthe test = 1 — fi = P(Reject Hy |My fe)
* Key words for denying asthe parameter of interest: mean, averaxe
+ Key
re rmo means using independent samples
* Key wordy
ears using matched pairs
* Key words for identifying p a the parameter a intrest
* Key words/phrases for identifying py
fompare ovo proportions using independent saps.
3s for identifying aya the parameter of interest mean or average of
¥ wordsphraes for identifying xy ~ ply the parameter of interest diference between means or averages. com
sired differences, compare
proportion, percemage, rate
‘Pa the parameter of interest difference berween proportions o percentages
* ey words for identiving os the parameter of interest variance, spread, variation
* Kes wordvphrases for identifying er/er os the parameter of interes.
‘populations using independent samples
Swoplementary Exercises
"4 Morgolon dese ams. The Journal of Biogeography
{Bs 203 pnd wy of a Mags en
a Asi). Botanists pce sce has ative sites in he
Dry Steppe region and sx sites in the Gobi Desert and b-
feral he uber ofan species atte 1 ec se
rence between variances, compare variation
These data are sted in the ext table Is thee evidence
to cmchie tha diference exists Bawsen the average
amber of ant species fou st ses 0 the two Fxions
‘of Mongols? Draw the appropite cochasion using
¥
i
2392 chopter Tests of Hypotheses
D comnts
Ste ‘Resin Nambr of Spades
1 Dry Seppe a
3 Dry Sterne 2
4 Dry Spe 7
5 DeySteppe
* Gobi Des 8
7 Gti Dee 5
° Goi Dee 4
10 Gobi Desert s
u Gobi Desert 4
{8.85 morgotan desert ants conte), Refer to he Jounal of
Biogeagraphy (Dee. 2008) sty ‘ol atin Monga
(Cental Asi). Eerie $84, ete you compel the
‘mean numberof ana eo dese ites Sine th sample
sires were small the vrtanss of the populate
{wo tes amit he eau 6 od fer te inference be
vali
Setup fy and for deermling wheter the vriaces
sre the same,
Us th din the
tn forthe test
€. Give the scestion region forthe est if = 05
Fil he apron pale oft ten
OBIANTS hl finde ests.
© Make the appeopit conchae in he won of the
1 What coon are eure for te tes resus to be
vali
‘886 suening majors in colege. Whos female undergratntes
sich from cence, mathematics and eninerng (SME)
"aor dicpins hat ae at ska, a te
‘eason ileea fom the oftheir male soups?
This questo was investigated in Scene Edeanen ely
1995). sample of 335 junusenir undergrads
172 females and 163 inlet tw lane esearch iver
ties were Menifed as “vwitches” that they eta
Asean SME nj fora non-SME mje Each etdene
ised one oe more its that combed the suicing
2 OF he 172 females inthe sample. 74 ised ak oan
of ierest ia SME (e, “tumed off by sins) a a
major factor, compare to 72 ofthe 168 miles Com
dct a et (ata = 10) to determine whether he po
Ppton of Female witches who ive “ack of cee
‘in SMI" as a major reson for switching differ fom
the cotespning proportion of mules
Thiny-tee of the 172 females inthe sample ins
"at tey were ascouagd or lo confience Bec
of ow ges in SME dung their carly yeas com
pare 6 44 oF 16 makes. Cons 804 snide
inal forthe diference between the propor o
feral and mal sicher who fst onde
low gas in SME, Inert the esl
'257 comparing word processors Suppose you want odes
mine whether usr of dita proessrs have pefomnet
Detveen word process AaB if ases hoe no
ce frie of be wo word pressor ie fie to
‘ysems ae ec then the probably phat 2 wet
‘ef sytem Ai p = 3. Let Fe the number of
{sample of 10 who preter sytem Ay and suppose yo
ant ots fy 9 = Sapo Hp 28. One pie
tet procedure tore fit = Vor Y= 8
2. Finda forthe.
1 Find itp =. Whaisthe power fhe te?
Ral itp ~ 8. What isthe power ofthe tent?
8.8 Mensng paper tress The quay con depat
set fa puper company mess the gts ames
te of reflectance) of fins paper oma pero tas
hough the dy. Two intents hata anal
‘meat paper specimens ae mjc! oro, at
0 be asd so that te mean readings far cnt
oper specimen are the same for both insrments, Sa
[oy ar concerned al th precision eh tn
Stumennares tht nse 2 is ess pecs a
Inu To check thie five mesure
sine paper sample are ae om ho ramet The
la are shown in the bl the at roi uit
{sides to ndcte ht insane presi a
instrament 1? Test ning = 08,
D swcurwess
insur) rstamert2
» %
20 a0
8 2
0 28
889 Schodutns te dabiny tests. The texting deparme of
4 te ad aber company schedules ick ad pose
rs or dab es Carey, ties ae bse
doe weekly Mleilegroesors (machines thi Ct
hance cite cor pasenger tie) wing the srs
!mocesing time (SPT) approach Under SPT the ie i
the stot proesing fief chal ft. Com Po
‘searcher bave developed « new scheduling rales!
they believe will ede the average ow tine (se,
serge compleion time af ts and led 104 ed
inthe average urtines of sched tes. To comps
‘he wo scheduling res ties were andoly sce)
__
es scheduled using SP,
fie provided a theta
sul size Oo softs
= arenes
ow
wr 1s28 8SI280 S26 48209
Is torsuliciem cvdece at = 1S to conchae that
the average fo ime sks unde he proposed sched
ling ul than der the SPT epncach™
[there stint videnee at = 05 wo omclde at
the proposed scheduling rae wil lea oa redo
he wera
190 tos capacity of Foams. Refer to the einfred cone
Team cracking experiment describe in Exerie 873
(p82) The expednoral rcs were Compare he
Metical resals obsned wing he fale sare
red of predicting ultimate lad capaci. The atl
‘1 cortical lat tron moment for sx Thsuns
thse sab with te given inthe table, Conduct
sto determine whether the experi ean wkimite
‘erin moment der rm he theoretical es wate
9 mone —
ee a a se se
BowinenalRewh 470 520 S40 $40 430 80
Mevetsal Result 463 465 S440 S40 S62 363
See ie ere ec
19% atone water problem that cut with cei
pet mining sere y-pednd oe m
‘Tadioactive and these pr ioe metimes get into our
fesh wate spy Te EPA his se eli or
Cen ini ne sna chatty ps
ting water Parca the ms vel oa
way
cca tation 3 ios per eo wae,
Aandi of 24 war sce fom 4's
{es py profuced he mpl sani j= 48 pe
Seas per cs 87 peur ie
4D these data provide scent eidene tn indlcate
thatthe moan evel of alton sf (blow the ma
um eve ey tho EPAY? Test wing = 01
Why stould you want wo ne sal eof forthe
testi prt a?
Calculate the value of Yor te test if ay = 45 pi
‘ois pr her of wate.
o£ {ate stp he pe fr te
12 Overeating ina spa. Saying 00 log in a $s pool can
esl in overheating, which in the case ofa preset
Supplementary Exercsos | 393
woman, may cause fetal malformation But how Jong is
too long? Based on thie werk in ths area, severl
searcher hype that 75% of women, mes in
spa with water tompeatre of 40°C, wil become unc
Feetabiy ot when heir ear cna (cor) emer reaces
PC. Arrest aubjective dicot is suggested a 2
posible seguir a
Apparently contacted by an Austulian sly of 24
(46%) were uncomfortably hot when their core tempera
ture eae 40°C. Test the Bipot ht the te po
enage of hesthy,neapregniat women who become
uncmnfertably fot when their ore trperatre teaches
a0°C iow than 75%, Var =
4 vor
898 Contamination of sh. Refer Wo the US. Amy Comps of
Engineer: ay of corterid! fishin the Tenner
2 Use arc number table abe 1
generne a random sample of n=
DDT concenttom in fh fo te BD
1 The Road ad Drog Administration (FDA) sets the
Appendix Bye
lit fr DDT coment im indivi ih at Spar ot
len pm). Dos te sume of par a provi sa
‘et evidence o conc tht the sverige DD con
Ten of individu! fish inhabiting the Tennessee River
and is cock bates exces 5 pp
Sigifane level of a ~ 01
Sipps he fst of hypoth, ptt mas bse on &
ramlone sample of only n = 8 inh, Wha ae te dia
anges ending this sls est
1. Reed per using only the stormation on the DDT
seis of sample of fish andy selected from
the 40 observations of tw, Compre he resis of
1298 Bal bearng specieauons I the mamuacta of machin
eit esetal tole path conform pec
tins nthe pst, haters ofthe hall easing pred
bya certain mamaactrec ad varanse f 4136. cat
esi the manufacturers lesseapeive poae
thon methid. Te yatnce of the meters of 10 a
domly simpled bearings produced bythe ne press as
2 Doe ape ic ne ne
‘fe more variable thus tone produced ty the old proces?
Teste = 05
1195 Atmasphericln bance A high conceattin of positive
jm in a oom can induce fig, sem and respiratory
lems inthe roms osapants, However sec as
‘heroom's amosphere (ough a negative ion genera,
tn combination wit constant vention restores the a
tal balance of fons th is conducive to human beak
(One experimen! war cdc as follows, One andre394 chapters Tests of hypotheses
employees of «lage Factory were radon elected and
divided imo two proupeo 3 each. Both paps were tld
‘hat they woul be working ian espe wth a on
‘ulancs contol thrgh negative on Eentar: How
ver, anknown to th
nly in th experinenal grays woe aes
Aithe en ofthe das the numberof pox epg
migraine, nausea fatigues, o some ed ys
sal discomfort was corded foreach group. The Reis
are summed inthe ube
ontroiGreup
fon gens
Experimental Group
on genes ot
‘Namibern Sample my = 50
i Patel Discomort 3 a
4. Perform atest oF hype wo detenine wheter he
Proportion of empioyect in he expeimenal gry
wvho experience some type of physical contort
the en ofthe dey i szniicim es than the sone
sponding proportion a te ent gop. Use
1. Compt the phe fo this test
'376 mores of WaSDs. Refer 1 te Scene (No 1988)
‘uy of inbreeding in tpi swarm founding waa,
Exercise 798 p.329) Asample of 197 waapr opera,
frozen, and subjected series gente eyed
Sadar deviation of «= 84, Recall hati the way bay
to tendency to ined, the tre mean itveeing cect
‘Teste hypxhess ht the te mean indice
deny ori specs of wap ences 0 Use = 08,
Bvaron
2. Compare he infeens, pat 8, 10 the inne ob
une in Esersse 798 sing confidence intra Db
thelnfrences age’ Exp
897 Strnats of uty poses The meus snd sandal de
fens shown in the table summarize Infrmaton on he
sents modules frp at rund in pup
‘qa inc) far 80 types of wooden pcs wed ye
lity indy Do the data provide suicem evden
indie a diference in the variance of the sot
‘ween ols made fron coil Cols ian cet
Poe! Test wing = 02,
= = sage
Sample sample Sandee
Soeces Sze Mean Devatin
ComlDougcre 118 ano ket
Souter Pine a7 got
898 Coa tar apo pressure. The le below proves isn
‘he theoreti (cakulaa) an expesimentlyales of 8
‘gor pressts for dbenthiphene hee
"atc Compound sins to hoe found in cal a He
tert made fo vapor pres sa good model of
aly, the te mean deren between the expe
sn called valtes of Yapor pres fr BER Et
ee Wl ual.
4 Do the dst prove sutcent evidence wo into
‘ht the mean difleence dies frm 0? Test ie
a= 0s
1, Caleta interpret the pale forthe est
wore |
Temperatue(°C) __exprtmentai—_Calcuated
10060 an 21%
10136 34 sr
1040 as 330
1044 08 90
10870 4 a
0, sa. ses
nae ss ss
ust 2 0
rn Dts N36, 1S, 21-128 Crp 5 Aso
Sine: Eda DR an Pras 1M." psa fa pa
anor Presse
Temperature rc) Experimental calcul
669 69 ws
93 a3 ss
io 350 2
2361 tor 101
12490 107 108
ian 126 s
13024 ta 8
Bias ss Ls
Seated panda of Cael
Cenc Soe Repent wit eran
or
iStatistics in aevion | 395
yt Bates pans icc i i aac wc or
tae igh ofa Bae ee os pel ocims Tm ome sees tee
erie ee menos enti noee Seeccnatmiecmnl mer eainace
be ole To mci probing to ey es ‘evel Atma edo oad mers
ce pend yt unalone: ed oes pagua woh, rv niet ones (ec =
ie el estado chek to decent a Sas te be cram maa of tg goss
(Rane epenee prety Le proche Le ok tng ed acer eee ess So
sci egh te Sippel ane oa acy ttt thn te concpning sear et
fit 100 iter d's = Otc Does Sein
‘plein nde ta tt mace pecs fase
ba ih ama np ot eo nsf ie tacos Cone ess
prance ono oneal Tes = ae nee
t000 Hess nace, Rescue iat FAS Day : =
Recah Unit and he Depart of Area ee
Engen te Uninesyo Plane Seve nats ae3
desien ere forthe comsraction of shade structs Sear: Relea 0.1 “Boe aio
‘hat they believe will help alleviate eat ste for dairy foraging ind its impact on seed deities
lrmeserof pregnancy were divided ino 80 groupe AS ep. R092 Copp
the remaining 15 cows wee deed
cored the 10-day ilk il (in pounds of
de. Researchers
8.102 Sola traction stay Rese a
i compan le eeu eee Ee eat erat lo
lee to indica a itference between the mean milk Tho data, repraduced inthe table, were wed to Compare
ells of cows sven acces jo shade and cows de the mean, day-long, leash solr rion levels (in
sae? User = 10 (assume the andar deviations of
Ucar = 10 : BTUng fst two midwest sites
shade No state eee
Simple sce 16 is 2 es
‘Sted chnsty strewn stay. A field expesimeo was
‘adit 1 ascertain the impact of dose runes eee ce -
(sete) onthe dest amd dition of 80 in a cc
the sol (oles, Des. 197), Sine ome desert mes ay 2 in6
'e mown ou sees in sre caches, the tly March? 33 ae
spect designed to Jetennine wher thse op igs -
ches eventually palace mote sei 1698
#0. tha an ajacet contol ares. Foy small aes
cael by mens were laced and covered ith
Dt capes to prevent mde rom esi he caches.
STATISTICS IN ACTION
Comparing Methods for Dissolving Drug Tablets—issolution Method Equivalence Testing
Inthe pharmaceutical industry, quality engineers are responsibe for maintaining the quality of drug prod
cts produced inthe manufacturing process. The key to qual isan assessment of product characteristics
‘trough repeated measurements oft varable of interest When the vriadle iste concentration ofa par
ticular constituent in a miture, the process Is called an assay For this Statistics in Acton, we focus on a
‘hemical assay to determine how fast @ Sold-dosage pharmaceutical product (eg, an aspirin tablet or| 396 Chapter # Tess of ypotheses
tient, quality inspectors requirea test that accurately measures visoluton,
'n "Dissolution Method Equivalence” (Chapter 4, Staistcal Case Studies: A Collaboration betwom
‘Academe and Industry, ASA-SIAM Seres on Statistics and Apoled Probabilty, 1998), statisticians Russet
| Reeve and Francis Giesbrecht explored the dissolution characteristics of a new immediate release Ong
| Product manufactured by a wel-know pharmaceutical company. An immediate-elease products dexgres
\ \} to dissolve and enter the bloodstream as fast as possibie To test for dsolution ofthe sold-dosage ug
the Company uses an apparatus with si vessels or tubes, each tube containing a dssoWing solution, Out
{ables or capsules re cropped inthe tubes. Then, at predetermined times, a small amount ofthe solos
's witharavm fom each tube and analyzed using high-gerformance liquid chromatography (HPLC) Te
HPLC device quantifies How much ofthe drug sin the solution; this value is expressed as percent of abel
strength (5)
i i]
| Cepsule) ssohes. ce varaton nthe soliton ofthe dg can have harm ie eects one
‘duce results equivalent to those
verfles that any two sites using the dissolution test produce equivalent assay resets
‘Dissolution test data for an analgesic in Tabet form conducted at two manufacturing sites (New lesey
and Puerto Rico) are listed in Table SIAB.1. (These data are saved in te DISSOLVE fle) Note that MLS
les were obtained at fur different pins in teeter 20 minutos, after 4D minute sites eo
i ‘ter 120 minutesfor each of the Sx vessels Based on the sample data, do the to sites prod,
| lent assay results?
| An intially appealing approach to answer
ng tis question sto conduct atest of hypothesis onthe ot
ference between the mean SL measurements tthe two ses Let ay represent the ppuion ean AS
tor tests conducted at the New Jersey site and let represent the population mean see for tects cot,
| srerag atte Puerto ico ste. the testresuts ath tw sits ar equivalent then ag The nl
alternative nypotheses can be stated:
| He (1 ~ 42) = 0 (ie, dssolition equivalence)
) Hi (us ~ m2) #0 (ie, nonequivatence)
] ZoammpHly the analysis the statisticians suggested conducting ths test at each of the four time pets
St Bac time point, withthe resits shown in Figure SUB.
TrePalues forte two talled test gle onthe prntoun fo 20,4060, and Lateetrires cre
{me are 1528, 0395, 3499, and 4956, respectialy. wa select type arteritis heen
h ‘oralectha (p-vakve > 05) fr tree ofthe four tie points: only when css oe tok so roes
& ossowve
TABLE SIA8.1 Dissolution Test Data (Percent Label Strength)
Ste i ee
ee a ; ; °
|| © Fe " e s - »
a © 96 » a 95 9% ”
ii) tan ean ce Pade
| o 6 “ a re °
9s *” 98 os ~~ 2
= Ng ole
SSR REE hap ns ae AS es pe eae Gan a a
iy. 198 (Chipeta
iStatistics in Acton | 397
Bonple Statistics
Group N Neon Std. Dev. Std. Error
Neo Jersey ‘Ss777 Tagen
Poort fice & 85 4.3000 tire
Null bypothesie: — flean 1 = Mean 2 = 0
Aiternat ive Mean 1 > Mean 2 ==
57
Ter a3 Octsar
802 Conf idence Interval for the Difference between Two Means
Lover Linit Upper Limit
Tike
oor) N Neon Std. Dov. Std. Error
Neverssy 6 vas a.atsa Taaze
Puorte Rico 8 eaiener atte ivase
Hypothesis Test
Nutt hypothesis: — Mean 1 = Mean 2 = 0
Alternative! Hoan 1 = Han 2 >= 0
It Variances fret statiatic oF Prot
Equal
ot Equal Par 9.27 oloata
30% Contidence Interval for the Difference between Two Neans
Lover Limit Unger Limit
tas sa
FIGURE SIA8.1a |
|
or
New Jersey 6 96.8999 2.3381 0.9545
Porto Rico 8 Seleeser 5.9440 14088
Wypothesis Test
MoI hypothesis: ean 1 = °
Aitarnat ve’ ean
Semple Stat istics
Group eon Std. Error
Nev Jersey 6 aaveseay
Porte Rico § 9g S558
Wypothesis Tost
Mal hypothesis
ttt bop fea 1 - Neen 2 = 0
Mean 1 > Rean 2 == 9
If Variances fre
ereerg
4360
FIGURE SIAB.1 (Continued)Statistes n con | 399°
TABLE SIAB.2 Determining Dissolution
Equivalence
ean 4S Ossouon Equivalence Oca
mre =18% nd 15%
is tere suficient evidence to conclude thatthe mean XLS values forthe two sites die. nother words,
‘one might ressonably conclude from the hypothesis tests thatthe two sites produce equivalent results at
Aissoving thmes of 20,60, and 120 minutes, but do not produce equivalent results at a csso\ng time of
40 minutes
Tete are several caveats to ths hypotiesis-testing approach, a the statisticians warned in thelr
chapter. First the idea of equivalence in te testis established by “accepting Ho” Recal that & measure of
Felabilty for the conclusion ‘accept Hs" is 8 = P(Type terror) ~ P( Accept Ha His false), For tis apo,
Cation, Bis the probabity of saying wz wen, fact, the means oe Since the sampling dstrbuvon
Ot 41 ~ is unknown when the ateative condition, + sz, Wue, the exact value of pf unknown
Second, the notion of “practical significance is ignored in the hypothesis test Thats, although the popula
tion means may be statistically diferent ato = 05, the true difference may be small and not considered 8
Imeaninglul difference in practice. Finally, the test may have the unforunate effect of penalizing a testing
site wth small smaller than average) ELS variance. You can see this by examining te formula forthe test
Statistic in Section 8.7. when the diference in sample meansis divided by a small standard error which wil
ely occur fone site as a smal variance}, the resulting value wl De large (and likely to be signicant)
TD overcome these problems, parmaceutical companies have developed alternative approaches to
the equivalence problem, One method, suggested bythe statstcans in their chapter requires that you frst
fn @ 90% confidence interval for 1 — uo Ifthe confidence interval forthe diflrence between mean SL
ues les within equivalence limits established by the company, then accept the assays ofthe two sites as
being equivalent. the compary in this application uses the equivalence limits in Table SIAB.2. Note that the
"mits depend on the magnitude ofthe mean SLs.
Using the equivalence limits of Table SIAB.2, we wll accept the assays ofthe two sites as being equiv
lent ifthe 90% confidence interval for yy ~ jg: (a) les between ~15 and 75 when the mean ®LS Is less
‘han 9, o (0 ies between ~7 and 7 when the mean LS is greater than or equal ¥ 90, Note that this ep
roach is equivalent to testing the following hypatheses or those assays with mean < 90%)
He (uy ~ ua) < ~180F (us ~ ma) > 18. (Le. nonequivalence)
Hai —15 < (uy ~ wa) < 15- (12, dissolution equivalence)
For ths reason, this methodology is referred to asthe two one-sided t test (TOST).
To apply TOST tothe data of Table SUAB.1, we find the 90% confidence intervals for yy ~ ya. These con-
fidence intervals, as weil as the mean SLs values, re also Sneded inthe SAS printout Figure SIAB 1. The
Confidence intervals for each of the four time points are all within thelr respective equivalence limits Ge.
between — I and 15 for times 20 and 40 minutes, and between ~7 and 7 for tie points 60 and 120 min
tes) consequently the data support dissolution assay equivalence between the two sites for all four dis.
soliton times.
TOST is now considered the standard method for bioequivalence testing of pharmaceutical products
{and is becoming widely accepted in process engineering. chemistry, and environmental science. An excel
lent tutorial on TOST is given in “Beyond the Test: Statistical Equivalence Testing.” Analytical Chemisty
Lune 1, 2005). There, the authors provide insight into TOST sample size determination and on how to
hocse the allimportant equivalence limits.
a fern
Se AM
maa TeCHAPTER
Categorical Data Analysis
OBJECTIVE
To show how to analyze count data obtained by
the classification of experimental observations from |
a multinomial experiment
CONTENTS |
9.1. Categorical Data and Multinomial Probabilities
9.2 _ Estimating Category Probabilities in a One-Way Table
9.3 Testing Category Probabilities in a One-Way Table
9.4 inferences About Category Probabilities in a Two-Way
(Contingency) Table
| %5 Contingency Tabies with Fixed Marginal Totals
9.6 Exact Tests for Independence in a Contingency Table
Analysis (Optional)
STATISTICS IN ACTION
‘The Public's Perception of Engineers and Engineeringble
Nay
Ne
92 Estmatng Category Probabities in 8 OnesWay Table | 404
9.1 Categorical Data and Multinomial Probabilities
In Chapters 7 and 8, we discus
nale population. Recall tha
how ro make inferences about a propetion from a
Population propetion pi the probability of "ste
«e35"in binomial experient—an experiment that resis n one of two possible ot
comes on any one trial In this chapter we ae interested in making inferences about
the unknown probabilities (or proportions) fom a multinomial experiment with k
possible outcomes. Tha is, we want to make inferences about pyspa-- py, where p
1s the probability ofthe th outcome and py + ps + (See Section 47
for a detailed disession of multinomial experiments.)
Toillsrate, conser motor fan blade company tha manufactures impellers on
‘one of ive praduction lines, A, B,C, D, or E. Assume that the lines proce nnpellons
ue rate an volume. Ina sample of » = 103 impellers found to be detective
manufactured online A, 27 on line B, 31 on line C.19.0n lite D, and U1 on
line (see Table 9.1) For this mulinomial experimen, there ae five outcomes, or
ach defective can be classified, one corresponding to cach of
TABLE 9.1 Classification of
= 103 Defective impellers
According to Production Line
reuse
ractcal question to be answered inthe sti is whether the proportions of
defective impellers der among the five production lines, Do the data provide ev
dence 1 contradict the null hypothesis Hp = = ps, whore pis the pro
portion of defectives manufactured on the ah production line? If the data in Table 91
‘ontradict this hypothesis, the manufacturer would want to Know why the at of pro
Auction of defectives is greater on some production lines than others and would ake
‘ounermeasune to redace the production of defectives,
This chapter s concemed withthe analysis of eat
that represent the counts for each category of multinomial experiment. In Sections
9.2 and 9.3, we will arm how to make inferences about the category probabilities For
data classified according 10 a single qualitative (or categorical) variable. In Sections
8.4 and 9.5, we consider inferences about the category probabilities For data clawed
according to two qualitative variables. The sltistiewsed for most ofthese inferences
is one that posesss, approximately, the familiar eh-square distribution. although
the proof of he sdaquacy ofthis approximation is beyond the scope ofthis tex some
specs ofthe theory can be deduced from what we have learned in earlier chapters,
orca data specifically, data
°2 Estimating category Probabilities in a One-Way Table
‘Consider a multinomial experiment with outcomes that correspond 10 cateyories of a
Single qualitative variable, The data (ie, category coun) for such an experiment
‘would appeae similar to that of Table 9.2 where nis... «mu represent the category
TSLE9.2 One-way Table wns an m = my m + * + me. Such a abe is often called 2 ome-way table
fategory Counts sine only one qalave variable swe form the categories, or outcomes
eters Toestinte category robbs na one-way table, conse hat ¢ maim
——_ Seam experiment can always be reduced to a binomial experiment by isolating oe cat
al thers. Since we know that in a bina
ns epory St, category i and then combining
experiment with number of successes, ¥.) = ¥/n isa good estimator ofthe binomial
parameter pit follows that
aa
Sen gone ofthe probly soci wih carn ina mune exper
iment. solv th wl poses the same roe as ean, ea whens
ee, lb apex oma ctr (by he conta tit thee wh
Fi)=(iy lee ne
| aa
| vo
‘Consequcrly, a lrge-sample confidence interval fr , may be constructed ss shova
| in the box
|
A Large-Sample (1 ~ a) 100% Confidence Interval fr pin a One-Way Table
Table $ of Appendix ae
We
edimate the difference between a pir of category probabilities, 7
= stories and ( # ), using (iy ~ f,). This linear funtion of and fj wl Be
approximately normaly distributed with
BA pie
and
PB) = Vi) + 1) ~ 2CovtG.5)
Since the covariange of two category counts, say and my (i +), given by
Counc) = ~ may
(the proofs eta an exercise atthe end ofthis section, it flows thatthe covariance
Derwoen the conesponding eximatrs, and
) (n,m; |
Cov (i d)) = AUG, ~ ple - A {
1 1
2 Corte m) = (apy)
‘Therefore
| Vai ~ B) = WA) + VG) ~ 200A
PAL=p) Rm) any
and lrg sample (1 ~ 100% confidence interval for () —
the bax
is as indicated
A Large-Sample (1 — «)100% Confidence interval for (p, — p) ina One-Way
| eases
. [oi =F) * Bap + 2A
w aa OE RT
vi‘9.2 Estimating Category robabiities in @ One-Way Table 403
CSET wre nna 54 ts rel icon 9 cc nes
incon be tra ocucton A Noe et Psat prenTton cimpelae pode
roduc fea mat ae eleva, athe operon a dace Mle ey
ue pretkcon ins
Seluion | From Table 81,we haven, = 1S anda = 103, Thera, 295% confidence interval
forp
ia 5 -m 1s
fs wher fy = = 1S as
= a6 + 199, [CONES
eS ey
| 0°46 4 068. Tere. ournterval estimate for pi fom.078 to 214
‘That i, we are 95% confide tht the true proportion ofall defective impellers
that are produced on ine A falls between (78 and. 214,
CET | Pio srr 91 ts 2 cdc inal ~,t oen bencen
Proportions eet mses arouse to craducton ines ae esoecnry
Soiton | From Tale 9.1, we have m= 27 an 7/103 = 262, Then a 95%
confidence intra for (pp)
_ +R A) i
(i -b) dey oe
(CraaKasay = (a
= (146 209) + 196) “HODES *
=.116 4 121
‘Therefore, our interval estimate of (7 ~ pa). he difference inthe proportions of the
defective impli atibutble to production lines Aan Bis ~ 237 OOF Sloe
thisimterval icles 0, theres inset evidence at the 98 confidence vel is
conclae that the to proportions df
Applied Exercises
28 ew asincuon study ceo on dental pases with
emooroandbulr (jam) jit dysficton (IMD) was
‘alia in Genera Denitry an, 200) Aram
fame f 60 patens was selected for an experiment
Letowit of TMD. Pir to warmest, the pte ied
8 survey on two nonfunctional jw fists
(eth rinding) and ec clenching that have en
‘inked wo TMD. OF to 6 patents, 3 ated o brie,
[Hadiined oth clenching, 3 adie o both abi,
2416 claimed they bad ether babit.
* Desctibe the qualiative varahle of interes in the
‘tay. Give te level (categorie) snc wih the
anal
AG
sri one-way tal for the sample data
& Find and inert $3 confide aera forthe
propeton of deta patents wo admit wo bth
Find and interpret 95% confidence interval fr the
Ailes between te tr proprio of dental ps
‘ents who admit to hot habits and the tra prope
tion of deta pens who elim they have oer
hab
92 Turns a ierseetors. A alc study found that of 972
automobiles entering aus ntseton ding he pete
fom at 107 387 tre Te “21 ered gh nd
234 drove sight tough he intersection
3. Comsrct sene-vay tbl forte say
Find» 955 confdence inva rth tse proportion
‘of suomotiles tha ve suai ough the merc
‘ion dig this peri
© Find a 95% confidence interval fe the difference be-
‘ween the proportions of eons htt leftard
turn righ especvely, ding his pes. Inept the
interval.404 chapter 9 categorical bata Anais.
99. cADtectnolay Each month Mechanical Eusering wr
ey is readers with an cline “Questo af the Month
The uy 204 se reported on the resent the ques
Yin, "Do you fel you know enough sb te ae on
pteraded design (CAD) techacogies wo do Jour jon?
‘The ress 44% anwwered “yen” 1% ante "hs at
mot wowed abot,” 38% anvwered "na cr
eros me.” and 9% answered“ don eed to kaww CAD
inmy job" Assume 1,00 reades respond to the lng
se
Find and ites 98% confidence interval for the
Proportion of ears who fe ty know endogh abt
EAD todo hiro
1 ind and interpt 95% confidence itr for he
sieence beeen the proper of ads Mh a
ho answered no, ad i concerns ne
94 Dial slr ae image process. Dig inal and
age processing (DSIP hk wide varity of epic,
inching entertainment (eco on demand lansing,
sccuitysrveilance, military taret cot, wiles
‘ommuniatios, and inteligent transportation sym
CConseucaty, dee iss rapidly prowing need ft eng
‘ers tied in DSI. The International Jeural of ee
trical Engincerng Eduction (Ape 31) repre oa
‘vastion ofthe experimen DSIP ondopradate te
QVM). sample of 80 student repunded to heat
ment “I hebeve at his esac experience i vey vale
able t my profess fare” The resis 47 set
agreed 3 students were seul, ad 0 students sagen
‘ih he tere,
4. Estimate the popoton of WMU stants wh ace
"hate DSIP research experience i alae thot
Professional ftir. Use 99% confide ima
1b Entimate the difference betncen the propervons of
‘WMU studems who agree and who ae mal abut
the stsemeat Use 3 9% confaene imerea
2 Ponce
9S Chumetersties of ice motwonds. Refer to the Nail
Show and Ke Dut Comer (NSIDC) sllston of dats on
S01 ee metpons io the Cioadan Arte, Example 21
p16) The tare sanedin he PONDICE file One var
ae of ire 0 eovionil engiesrs tying the
Imeionds isthe pe ofc observe fa ach po. Real
that type ic hase Ft year ees mlgear eo
‘SAS Output for Exercise 9.5
lant ice The SAS summary ile othe ce ype el
$ mekponds sori te ton th pope
12, Use a 90% eofidene itera to exinat the propa
thon of mein inthe Casadlan Arctic ht ae
Use 090% confidence iteral to estimate the die
ene etwcen th roportion of rtd nthe Can
in Arctic tat ave fst yea ee and the prop
ha bave mates ice
94 Onontaton cus experiment. Refer tothe Han Fatt
‘se 198)sndy of eolortrphness os nbd sito:
slow, Exiie 7.9 (p. 301). Ninety college dee
lining cm the tacks inthe dr mere Genented whet
Prsitoned ons ring platform unde lowly tating
dk ta bloke thei fel of vision, The subfcs wee
shed to sy “sop” when they fl as if they wer he
side wo. The potion of the brighines pte the ds
‘in ean o each tue’ body elation was ed
record Sujet lei oly thre sk ges pt
tems as subjective yer clues: (1) tiger sie
(2h darter side up and) trig and are ie ald
miter sid ofthe subjecshead. Te fesncy Sms
forthe experiment are given in the accompany ble
‘Consus 495 confidence tral forthe diferent
‘wee he proportion of sbjects who sect brs sie
'upand the poerion who select darker ide yp et
‘hes Inert the esa
soovewe
= isk orentaion or
rr se Up_ Darker Ste Up rgnt and Darkside AS
ss is
97 Gastoentsrts outros. vatetoene ponbcteil 3
‘ocneitsoutueak ecu in Corde sa ro
Tonestandng her deieoey and malfunction of *
aps teatment plat, A sty wa conde © dete
‘ine whether the incense of gastointesia diet
ing the epidemic was vested 0 water consuming
(American Wier Works Jounal, tn, 196) A telepote
sary of houscholdsyieked the accompanying iB
‘io om dy consumption of ounce pace of wate
8 stool of 40 resents who exhited gto
‘ympioms during the epidemic
The FREQ Procedure
leervPe Frequency Percent
First-year
Londrast. ye
natttyear 2 ales93 Testing Category Probables na One-Way Table | 405
Socm ‘ental Plerids.Arandom sample of 10 agricul nh
= ee mes cers was imerviewed and asked which of the strategies
Daly Consummation of he or she believes would yield maximum productivity
8-ounce asses of Water A summary of hei reaposs is shown nh ble
© 1-2 34 Sermow Tort oy mmoare
sitsympuo| 6 Bp om 4 = Fe tee
om 2 Frequency Ut so»
saan Wer Ws arc NL TB Nn
{Be Copan © 1986, American Wats Anociaon
@ Finds
aleogacers who rsermend sttety C
a for the diternce =
ween the tm proprions uf agricul engineers
es who dink 1-2 glass of :
who recommend sess End 8.
8. Use 199% confidence interval to estima the pect
ae of gates
ho ink 1-2 and glasses of water per dy.
°% isting crepend. Because of cmsic mifll paterm Theoretical Exercise
{2 low water olding capacities of soils in Fluids, 9.9. Fe the muinonial ebability distribution show tat,
Spree imgation is required for pring most
Stops. & research eam his develope five aleroatve
°3 Testing Category Probabilities in a One-Way Table
‘Suppose we want to test hypothesis abou the category probabilities forthe defective
impeller study using the data given in Table 9. Specifically, we might want to teat
the (aul) hypothesis that the proportions of defectives atribuable to the five produc
tion lines are equal, Lew Hospi = P2 =~" = ps = 2. against the altematve by
pothesis that at east woof the probabilities ure unequal. Intuitively, we would shooxe
a test statistic hased onthe deviations ofthe observed category counts.
‘ns rom thee expected values, or expected eategory counts
E(n) = ni = (103)(
Lange deviations between the observed and expected cutegory counts would provide
evidence t indicate thatthe hypothesized category probabilities are incorect
The statistic used to test hypotheses about the eatery probabilities of a category
mokinomial experiment, one hased on the weighted sum of squared deviations be
tween observed and expected cell counts, is
Substituting apy for Ein) and expanding the numerator, it can be shown (proot
‘omitted that: |
| acre 8
\
[oegeuanghae
the statistic x will possess
(prof omited) approximately a chi-square’ sampling
distbution.® The val of 42 wi
ib larger thn expected he deviations [n; ~ E(,))
‘arg lage. Therefore the ejeton region forthe testis a* > 2, where isthe val
2° tha loses anaes ain the uper tai of te chi-sjare distribution (ose Figur 9.0,
The numberof degrees of freedom forthe approximating chi-square dstbution
always equal less {degre of freedom forever linearly independent rests
placed onthe category counts. For example, we always have atleast one Tne
tition on the category counts because their um mst egal the sample nee
A Test of a Hypothesis About multinomial Probab
Hi
ies: One Way Tale
ates = P20 k= Pa Where Py Pay. Pag Tpmesel HE
!hypshesied ales ofthe multinomial robbie
Me: Atleast on ofthe mukiomial probable doesnot guts ypbind
value
where £(m) = pos the expe
ed numberof outcomes of type 4 assuming H i+
true, The total sample size isn
Rejection resion: x? > 2, where x2 has (k ~ 1) at
Assumption: Fo the chisquate approximation oe vali E(,)
| Leon: Sire cisare approximation to be
S forall
very Probables assume specified Vales el
in only single near resticton om the clegry county mac mr ao fe
1 = 1. No category probabilities need to be estimated besause this value are opt
| Sed in Ho, The test procedure is described inthe preceding box, We will vst is
simple application ofthe chi-square test in Exae 94
‘ease splot expected coun cnbe eth Mo ts jected
‘per Cth 08 iene owner orcasCE | ea:
Solon
{9.3 Testing Category Probabilities in & One-Way 7
407
he cats poten Table 91. Teste hypothe ta the proprio fl deface im
We wantiotest Ho: py = pa = ++ = ps = 2 agunst the altermative hypothesis
Atleast to ofthe cotepory probabilities are unequal. We ave uieady cucusted
rt 8 (= 1.2...,5)
TABLE 9.3 Observed and Expected Category
Counts for the Data of Table 9.1
Ome 8
‘The observed andthe expected category count (in pare
Substitting the observed ad expected val
for x we oan
eves) are shown in Table 9.3.
sof the category counts into the formula
206)?
206
= hum
(1 = 206)?
206
336
The rejection region for the west is 42> ys. where xs ie hased on k — 1
= 14 degrees of freedom, This value, found in Table § of Appendix B, i
ats = 948773, Since the observed valve of 2° excoeds this value. ther is suRerent
{evidence (at a= 05) to reject Hy, It appears that atleast one production line we
spoasibe fora higher proportion of defective impellers than the ther lines, Wore
The test fora one-way table can be conducted with slatstical software, The SPSS
| primout ofthe anaysis is shown in Figure 9.2 Sine the p-value, 010 (highlighted)
Isless than @ = 0S, we eject Hy)
une
Dasenean | Bpeceon [Rena
x me] 2s] 86
8 Z| (eaten vce 6a
fe a1] 208| 104
D a] ms) 46
: u| as] “98
Teal 103 | |
Tost statistics
c TRE
THOTT] Ts
a 4
ee
3 0 ces 8) have expected trequencies less than
5, The minimum expected cel frequency is 20.8
FIGURE 9.2
SPSS analy of drain Tale 9.3
ie eoT
airr
408 chapter9 categerical Data Analysis
Applied Exercises
910 Scaming mtemet messages: Ine: Tchalogy (M18,
1997) reported thers of on Equitas Consumer
Privacy Survey in which 328 meme srs nated et
level of spreement withthe following stem "The
government eed 1 be able to scan Inert mxagee
and vser communications to proven fad and ate
rime.” The rumber f wet it ach sponse cap
summarized
se to dteine If the opinion of
Cos the et
bess yo woul
arc ney ae
egos
pata using U5.
© In the content of th robe. wha i Type Lenoe?
A Type emer
sumptions ust old in ert eur the
ality ofthe est prt?
B scan
“agree ‘agree Oisaaree Okage”
Storey somewhat somewhat Soa
en) 2 »
Ponce
P51 Characters of joe moezend. Retr the sy of ke
Imeyo inthe Canadian Art, Enecne 9p.) The
SAS summary il forthe ce Ope the Sn
‘wpe atte bow ofthe age Spent
engines hypethei tit 19% of Caan Acie net
pm have freer ie, 405 hae lanl and 986
ave miyearice. Tenth engine thaey ing er
942 Menagement stm atures. Refer to he P
Bsr.
anagenen syaem
Cause cxtewory
and US. Chemical Safty ond
umber of insets
Engineering & Design a
Process Pactces 2
“Management & Orersighe 2
Training &€ Communication 0
Tott s
‘SAS Output for Exercise 9.11
Cadtest” 136
nie 20
Hara nvestigsion Boa sady
caused by manngcrocat stem
flues, Exere 24
(0.1 The accompanying ale gives breakdown et
soot causes of sample of 49 incidents. A thee
ant diereces inthe pecenige
case ctgorien? Tung 0S,
‘Spaces hotspots. Refer tthe Nae (Sep. 1993) ny
animal and plant species “hotguts”in ret Ba
Everise 275 (pI) A hase if defied as kn
ae areata ispecies hay popu
by the species of mere Simi olspt i He
‘pare ara that specs por. The fllwog tae ge
the numb of butety heaps and numberof te
olds ina ample 0288 IO-km squire ess Inthe.
oy $% ofthe eas shuld be utery hoapts, 32
sould be baer cosets, withthe remaining
(29%) new Tx he hry wing = 1
J worseors
Bawey op
ona
Bary Colypas 7
eat reas a3
TOTAL 88
914 Ts at iersctons Refer. the wfc stay, Exece
92 ay,
4. Dothe data disagree withthe hy phos tht the afc
5 sully divided among the tee iets? Tt
‘nga = 08
4 Do the dat provide sutcent evidence to indica at
moe than enti fal atoms elerng
Yeection tun ef? Tet ng a= 05
ostoentents cutreak. Refer wo the American Wie
orks Joural Uae 1986) sy of a gsroees
Esetie 9.7 (p40) Condit esto Stine whe
the incidence of gastrin sexe during th CF
‘demic seated to water cnsumpsion, Use =
96 Onertaion cue experiment Rete the Human Fst
Pee 198) uy of ristao ls. Exrche 68.10)on
Iv ac square sampling
Bmuss
Conic et compare the proper
Whether th dee proportion if «
op lana. Refer othe survey of asculra en
94 Inferences About Category Probables ina Two-Way (Contingency) Table 409
1m Optional Exercise 6.118
aed that 2 s+ anda ona ands
n. Deavie the two cleo for bien
9 ~Y) Ther fore
fires, Exercise 94 pi) te date se sf esperimemasm) = Yan
Theoretical Exercise manda ty a
938 A gna pet of the at tat 7” press apron wtih | degre of som, Show cea hat
94 Inferences About Category Probabilities in a Two-Way (Contingency) Table
‘ne-vay) classification ofthe data Fo example, the categories fr the defective im.
peller data of Example 9.3 correspond the “Values” assumed bythe qualitative vt.
able, prduction line. Often, we may want to lasiy data aoonding to two disctions
Te methods presented in Seotion 9.2 and 9.3 are app
of elassifcation—tha is secon 10 0 guaitative va
a classification wsualy ito determine whether thet
ies. The abjestiveof sch
ddenendent
‘To llurate, consider questionnaire that was mulled to 4 sample of 180 house.
holds within 2 weeks ater a nuclear mishap oecired in 1979 on Thuee Mile Island
near Harrisburg, Peonsslvania, One question cone
full evacuation: “Should there have been full evacuation ofthe immediate area?”
Residents were classitid according to the distance (in miles) ofthe camtmunty in
ned residents atitades toward a
reside ftom Three Mile Island ann their opinion on a fall evacuation,
which the
A summary of the responses forthe 150 households randomly selected is show in
the two-way table shown in Table 94. This table i called a contingeney table it
presenis multinomial count data clasiied on two scales, or dimensions. of cansifce
Mile Island and responses to te fll evscuation
tion, namely, distance fom Thee
question
Fach coll of Table 94, located in a specific row and column, represents one of
the £ = (2)(3) = 6 categories of a twordircetonal classification of the m ~ 150
observations. The symbols representing the cell coats fo the experiment in Table 9
are shown in Table 95; the comesponding cell, row, and column probabilities ate
TABLE 9.4 Contingency Table for Three Mile island Survey
ve 72 z Tomas
fall i is 5 3
acuation No 20 » 4s x
‘TOTALS 8410 chapter 9 categercal Data Anas.
TABLE 9.50 Observed Counts for Contingency Table
‘asanco anne eine es
6 748 2. Toms
rit Ye a ms ms me
action No Ss m ry ns
TABLE 9 5b Probabilities for contingency Table
Distance ftom Three le tslna, res
eae Tos
alt Yo Pu Pa Pas m
Pvacuation No Pa ba P
TOTALS me
Shown in Table 9.5. Thus reeset the numberof residents wh lve wii 6
ails ofthe acident and suport ful evcuation, and py represents Un corey
ing cll probably. The ow tetas (designated atm, and ng) cel colomn coos er
lenated at, a, and na) ate shown in Table Sa, The corresponding row an
column probability tals re shown in Table 9b. The probably ls Rete somt |
sn columns ae called marginal probailies, For example the sarin po
is the pecabilty tht a esident favored ful evacuation, ate ve ea oral
“ste probably that a espndent lives 1-6 miles rom The he feed, Tes
Pie = Pu + Pa + ps
and
P(fovor fll evacuation)
P= Pui + Px = Plive L-6miles fom Three Mie Island)
Yon cons thatthe expernem we have desrited is x mutinomil expres
sia otal of 150 nls and (2)(3) ~ Seateporice See te Ink eordaes cee
‘kam chose, be tls ae considered independent end he potataions we veel
a remaining constant from wal al
Tey Stes fhe study iso determine whether the two classifetions
Sziistam Tee Mile stand an opinion on ll evaciaton se tapeedens rat
te knew the dstace fs Tice Mile nd dics dat areca rove oct
about the resden’s opinion vaca
ation? Th a probabilistic cence, we KoO¥
Sham 2) tha independence of events A and tnplice Pla C18) = Pea) PC)
Snr, ate condageney tbl aly if the to cain atons oo issco
2 erebity that an tem is clasified in any panicle ell eb he po
co he comesponding marina probes Th, unr ins nen of ose
Pendence, in Table 9.55, we mat he
P= Pe
1 PAT PERE Bae pips
tim fonh Therefor, he oul hypothesis thatthe detions of clsifcton
Independent is quan othe hypehsisht every ell pobase eal B!
wile{24 Inforences About Category Probabilities ina Two-Way (Contingency Table 414
Dreduct of is respective tow and column marginal probabilities. If the data disagree
With the expected cell counts computed fom these probuiliis, there i evidence wo
indicat hat the two directions of classification are dependent.
i we were wo calculate the expected cel counts for our example, you would im.
mediately perecivea dificult. The marginal probabilities are unknown and must be
extmated, The best estimate of te ith 0 minal probability call pis
Row itt
timate of Ue th marginal coluran probability is
Colum total
Therefore, tbe estimatéd expected cll count fr the cll inthe ih ow and th column
‘ofthe contingency table s
(5)(2)-
ag
‘The general form of an Xe contingency tale (one contain
columns) is shown in Table 96, When nis
rows and ¢
“555
will possess approximately a chi-square distribution, The rejection regio for he test
willbe 2 > 472 (See Figure 9.
‘Todetermine the number of degrees of freedom forthe approximating chi-square
Aisribution, note that = re, From this we must subtaet degree of freedom be
cause the sum of al cell counts must equal n. We also subtract (r ~ 1) ease we
Inust estimate the (r— 1) row marginal probabilities, (The last row probability will
then be detemined because the sum ofthe row probabilities must equal.) Similarly i
TABLE 9.6 General x c Contingency Table
2 6 Rw
Tals
tom meme
ow:
Column Totals ” nen412 chapter9 categorical Date Anais.
FiGuRE 9.3,
Rejeeon region for the hi-squte
te for dependence
Ae
eile
‘we must subtract (€ ~ 1) because we most estimate (c ~ 1) column marginal po
abilities. Therefore he degres of freedom fr chi-square wil be
a
(Scinet Tinany independent
e-W-(-)-(-1)
wrr-e 4
(= e~ 0)
‘The chi-square tess summarized inthe box; its ws sisted in Example 94
General Form of a Contingency Table Analysis: A Test for Independence
‘The two classifications are independcat
1g. The two classification are dependent
es lng — Atay?
AR Bm
Test statisti
where
Bing) Me = total for tow ¢
ij = al fr column
Rejection region: yx? > x2, where x2 has (r — 1)(e — 1) at,
Assumptions: 1. The n observed counts ae a random sample from the population
| of interest. We may then consider this wo be a multinomial expet
| ‘memt with r X possible outcomes,
2. Forthe 7 approxi
expects
ET | 0: ctr ade 9 oon
eter rit resents orion on a evcunton of
‘le anc oecencsonrow er es te eet Ne fom Goer an
Soluton | The first stop in the analysis of « contingency table is to calafate the estimated
expected cell counts. For example,
tion oe valid, we require that the estima’
be greater than or equal to $n all ells94 inferences About Category Provabites ina Two-Way (Contingency) Table | 413
ln
(84)(78)
150
4368
The cel counts (op number in el) andthe corresponding estimated expected values
(ettom number in cell are shown in the SAS printout ofthe contingency table analy
sis Figure 94 .
oe this study, the 2? wes.
taste i computed a follows;
2 lu = BoP tm ~ Ena)? = Hoy
e- + be ST
| Elms) Evins)
3, ong
Aa Em)
Table of EUAC by DISTANCE
evnc —ISTAMCE,
Froquency|
Expected’ [1-6 [7-12 [Ovent2_| Total
cy 20 13 5 oy
21.2 | 19.08 | 43.66
res 18 1s 33 66
were | 14.36 | 94.32
Total 30 ™ an)
Statistics for Table of EUAC by DISTANCE
Seatiatic oF Velue Prob
Ch) Square 2 8.2558 Oars
Etter iheod at io Chi-quare 3 «Oakes Gere
Fontelagenexe! Chi-Squore | 0.2687 0.8110
Pri toeftieient otoa2i
Contingency Coefficient coat
Cranes oroaat
FIGURE 9.4
‘SAS contingency ale ali for
Sonple Size = 150
Example 94414 | chapter 9 categorical Date Anaysis
Substinsing the dats of Figure 94 into this expression, we obtain
(181679
672
Note tht this value, x = 2688, i shaded atthe bottom of the SAS prix,
Figure 9.4. The rejection region for the test is x2 > yg = 599147, where xp 6
based on (¢ ~ 1)(e ~ 1) = (1)(2) = 2 depres of freedom. Since the compl
‘value of 1°26, falls helow this ctl valve We fil to reject Hy here sins
nt evidence to conclude tha the two diestions of data clasificatton are depend
1 appears tat opinion on fll evacuation is independent of eistanve from Thrce Mik
Islan,
We cam ave
shaded in Figur
athe
same conclusion by observing thatthe p-vav for the est
4, exceeds = 5
Suppose we conclude thatthe two directions of elssifcation in a comtingensy
table are dependent Practically
esntages of observations
‘depends om the level ofthe other
sean that the proportion
three dstaee groups, Tod
confidence interval for the diferences, (p,
peaking, this implies thatthe distribution of he por
falling in the categories foe one ofthe qualitative variables
arabe. Inthe 2 3 table of Example 9, ths
of residents tht favored fll evacuitiondifered forthe
ccrmins the magnitude ofthe differences, we coud ons
= Pah (Pa = Pes) al (pa ~ Bs)
sing the method of Section 710,
Applied Exercises
9419 Suuty of ereine development In Teco (x. 2008,
_2solgiss published thi ecrch on he formato a et
‘lines (ured moun ls) nthe eal Appalachian
‘mountains. comparison was mide oo nppes che,
frock tht hive moved oer 4 ge ert dstane)
‘nen Petsyvaia and he tern Maran Rock sam
les atthe mourn rn of bo aos wee called
an the foto intersection ates (FIA) preserved ihn
lage mineral rains was messin for each Te socom,
yng able shows the namber of rock saplec ine or
ferent FIA measurement capris atte two leans
‘The seoloxns teil whether the nibaion of FA
trends wee the same forthe Panmayhania Naga
Maryland ape sing a chinquae et of ndepemoce,
LY onocune
Reverie wand
Topo nanpe
one 20 6
ra we v 0
150.179 0 "
‘es i. eal Apaches, N23,
{Give the nal and aerate hypotess rth test
1 The searchers opr thts ems? 157,
Do you are’
Find te ection eg forthe sing a = 8.
4 Make the epropate concen in tne mes 0 86
rote
9:2 Hh Sect sty Acct tse eprint
ural of he Neral Cancer Inia (Ape 1916
int fs igh in ber may Help ete sist
sue Te ce oy die 30a
fs fa gps Meas Al as mere et
4g tat cr tent ue er ook tae fd
sit ffs tiber or 5 wos However he veh
{atm Sher vate om group go td 8
feline ete saner ort wha ts
deemine for exh goa. The ua a sana
‘he accompunyingconingeay be
5 race
age | > ee
eros
Wah Fay ‘High Fat’ Low Fat) Low Fot!
NOFiber Fiber NoFiber Fiber TOTALS
Cancer Yes 27 ~» 9 4 a
Tumors No 3 10 n 16 =
roms Sieber
& Docs te sampling speat io satis the ssumptons
for s multinomial experimen? Explain,
1 Cate the expe cel counts forthe comings
tableAi
9.4 inferences About Category Probebilis in aTWoay (Contingency) Table 448
Cleat the 2? ati
4. Ts hee even to inde that dit and presence!
abuence f cancer a independent Test using = 8
Compa the percentage of rts 00 high fa her
faufiber dict with canoer using a 95% confidence ine.
al inerpet the esl,
921 nagoing gran motns tn on experiment desea inthe
ural of Agricaliral, Biological and Errored
Sis (Dee: 200, bins cor wer socked with var
fous parsits Ce. grain et) in le wine ta eaty
summer (Jan), hee bowl shaped taps were pede
‘8 pan face im oder to cape the ots Al ice
taps vere hited wih ase pherorone ge Bower
ap wed an unmarked sky aubesve, ome as mired
witha Murescent ed powcr amd one was maed witha
Suorescet ue powder The aps wete set om a Wednes
hy andthe catch callie he flowing Thurdsy and
Fay. The uble shows the muber o mots cared
exh ap om cach day. Cos es (at a ~ 10) 4 de
mine if the percentages of maths cought by the tee
"ap depen on yo the week,
Gworstear
‘Achesie—Wo Mark ReciMark Blue Mark
136 a 0
Sore Wo EP, wal SAakig vapor welch ke
22 Bat soutaon risk order eval ti station
arom phe raps att in atl
lows. At a eandom pst in te tithe itor 5
fromen nd data on situation goes are immediatly
colecs The sination ster contin ntl mot,
Pesmance (e.g umber of kill) s mead. A sty
Speedin Haman Factors (Met. 1998) investigated
‘mete: emperaiy sopping the smaion resus in any
‘ngs in plo pertonance, Ti ete designed 0 tht
‘ome simulations wee stopped to oles stain are
es data wile ethers were not sapped Each tal ws
then clasifie according tothe mute of Kills made by
te plo The dat foe TA rss ae summiod in the
Sontngeny table below Conde! a coniageny table
ims and fly interpret the es.
Gomeus
‘Number OKs
a oe
Pei SS
NoSwps 24 3688
A a
923 Grounawatercontainston i ws. Refer 19 the En
‘nmerial Since & Technology Jan 38) stay
meh ere NRE) contrition in publi
rivet Nev Hampsie wells, Exes 210 19), Recall
tha date on wel cas pu rp egies ether
or usconsaida), nd dese level of MIB low
limit or dee wore collected for «sane of 233 wel
‘These aa are sed in the NTE il (Da othe Fi 10
seicted wel a shows inthe acompanyng able)
mre
WellClass _aauler Detect MTBE Status
Private Badrock Below Lie
Prive Badnck Below Limit
Public ——Uneumsaliited Detect.
Pub Unconoidated Below Lim
Publis Uncemoliduted Below Lint
Public Uncemoliuted Below Limit
Public ——Uncemolited Dees
Public ——Uncomsolidted Below Limit
Public Unconoidated Below Limit
Public Bedrock Deiet
Public Back Detect
‘erst eet oceutense ad elie acces puis an
thet New Hamp
4, Use the dts inthe MBE feo rate seningeney
befor wel clan ad detectable MTB stats
b, Conducta test astern if dette MITE stats
eponds on wel class. Tet wing = 08.
© Use th dat ithe MBE file creates cotingency
able fr ager an detectable ME sc
6. Conduct test determine if detectable MTBE sus
pens on auier Test using a= 0S,
92
Fight response of geese. Oflhire vil dling estan
Alaskan estuary bas let ines it tien
luge heticopters—i th ares. The U.S Fi and Wile
Service commisioned 8 stuly 1 invetgate the impact
thes hebcopters have onthe lacks of Pace brat geese
that inhabit estar nal ene migrating (Sal
Case Stier: A Collaboration between Academe and
Instr, 1996) Two lage blicopters wee flown pet
lly over the estuary ot ferent ies an aera!
tances trom the flock. The Might espe ofthe geese
{eorded ion” oe “igh ade neo mes),
and later! cstance (handed of mics) for each of 464
Ielicoperovelshs were revere and ae saved in the
PACGEESE fie. The dts forthe fist 10 orig ate
shown inthe nex table)416 chapter 9 categorical Data Analysis
LD pacctese
(rt 1Ooberations shown)
Overnert —Atude ota a
i os 499 a
2 st ica
3 ost 338 ce
‘ oie 2k Low
s re
‘ a1 ah
? 305 als
8 610 338 OH
9 a
10 iy 6 oat
8. The reseurtes categorie ale as fas: ks
thon 300 ters. 300-500 mses ad 0 oe me
seers, Summarize the dat inthe PACES ey
‘reaing a contingency table orate category and
3s 0 determine if ight response of the
re depentsonaltade othe heer Tew eit
The escarchas categorie lal stance slows
tes thin 1.000 meter, 100-2040 matey, 2000
31000 meters, and 3.00 oF move meen. Suminatze
"he data inthe PACGEESE fl by cca ome
BEOCY ube for lateral distance catepry and Mie
Conduct ates © determine if fight respunse ofthe
ese depend on later distance of kecope ne
Mock Testusing a= 01
The curat Federal Aviation Autocity (FAA) min
‘mum aie stndard fr ying over te etn
21000 fet (appronimately 610 mes. Based wy the
‘ous, pats a, hat changes tothe FAA ropa
2 you recommend in onder mii he
Pace ban geese?
fst to
{ seeouns
925 Subarcue pant study. The ts of ced beating pans
Inlgenous 10 sobantc Finland! way suid in Arey
Amnrete. and Alpine Reeurch (My 20M). Pots wom
‘ategvizedaccrking 1 peat ab, ho eh
bundance of seedings (no seclings, tate sedge, og
‘shunt seeding) generate owen vegas
rodction,vegeutiverepnaton poe, veptaie te
Produc infec or vegtative reproduction eft,
sec wg eas (0
S.5.10,1.0-80, and 280
gras, and aspre morpho (i ec ap
is, wngs, fleshy frat, or smnuhoks. The da ra
sample of 7 plants are sane nthe SEEDLING fe
8A comingacy table fr pln type aa ceing sae
snes, produced ty MINITAB, follows (Noe NS =
to scalings. SA = scilngssbundane, and SR
‘tedings we) Suppne you want to perform ci
uate tet of independence to deine. neh
seeing abundance deperih on plat type, Fd he
expect cell coum forthe comtingeny tale. Ae he
‘seuss aired othe es sats?
Reform the conigeny table ty combining th
NS and SR cages ein sburdance. Find he
expected cell ours Yor ths new contingency
Ale the aston egied forthe et sai
© Reform he comingency ae ofp bby cumbi
ine the dart si and rans caters plat
Find the eset cel ous forth contingent
Are the assumptions rue forte tt sified?
a
Carry out te chi-square tes fr ndepemicnce oh
otingeney tale, pate. sing «=. Wha
onl
IMINITAB Output for Exercise 9.25
Tabulated statistics: Abundance, Type
Columns: Type
us a 1 as
3A 5 “ug &
se 5 230 17
an 3 yo 7
uate cust umes. Fugive dust plans seer
‘ac eqipmeat can hazardous oman eal 1a
dourel of Agricul, Biological, and Eminnrsl
Setences (Mat 200), environmen gies developed
eel for dust puicle concentrations i pues Pe
ced by a ractr operating in 8 whet Fel. The Hae
‘vee song it pall eitensth paths in he fl
Aen sensing israment witha laser fea, pas
the eds ofthe el, measured the porate mater he
thst every 5 seco. Unformaeya ew of he TE
‘ces Were cerned. hgh than he signal ee
ofthe nse. This say cece when the
‘ea sortase rm the isomer ner bea, ThE
ext table stows the mumber of ceasred eum
foreach ofthese atone
‘+ Calelate and compre the sple proportion of c=
sored nesurements fir the i ct Hes.
Do he data provide sficient cvidence onda
the proportion of cesond measurements fle: ft
‘he sx waco lines? Tes sing a =
95ngency Tales with Fixed Marginal Totals 417
et swocrecrs
te 927 Satvnre defects. The PROMISE Software Enginsing
Bocce Reni. sed ye Sh fin Tec
a suche with ds st fr baking pede sofa
ir ae Stoel (See Sts in Aron, Chapter 3) Das oh
! : spas isen ved be SWDEFECTS Se
2 1456 Recall hat each module was analyzed for detects snd
a elasied as “oe” if contained fective cee and
B i ake if One alg fo edicting wc ot
8 mie a detects ope” (ented
5 aa ENG), whore 4 molule with atlas IS blow gro
aan te mn ‘han te mtd rete sdk the ped ENG
Tors san r sue yer odie, on Acomapoey ole
Speier ey forthe two vale acta defective sts ad peice
apenas VG is shown nthe acompring SPSS prima Inter:
Mac 201 or dail provided by saa Bt Helen ‘modules? Explain . “
‘SPSS Output for Exercise 9.27
DEFECT” PRED_EVG Crosstabulation
coum
[eee as
a 3] a
tue a 2] a
Total aa] | ae
chi-square Tests
T Age So] Badse [baat
vave | «| “Goin _| Graces) | (es
Perm ON SRE | Te 7 2
Contnuy coreenore| 306 1 00
Leineod rato 28 1 a0
Fane's Seat ret a7 | a57
NotVaie Cases «|
‘Computed cniyfor a D2 table
©. | cells (26.0%) have expected count less than 5, The minimum ewpectea court is
98
+8 Contingency Tables with Fixed Marginal Totals
Inthe analysis of contingency table data, one or more ofthe categories may contain
tn insufficient number of observations. To illustrate, we will consider the study
(deserted in Setion 94) ofthe relationship between a resident's opinion of fll evac
sation of the area surounding a nuclear accident sn the distance the resient ives from
‘Thre Mile Islnd. Ifthe random sample contains only small number of residents at
ny
ty
cannasin fueiaa
iBT NINA418 chapter9 catogorical ose anaiysis
TABLE 9.7 Distance-Evacuation Contingency Table with Column
‘otal Fixed
aa Distarce Fan The Wie Wend ries
1% 7 oe Tos
Fett te Ee eae 6
Fracuaton No 385 208
TOTALS. to 10 00
live a certain distance awe; this may cause the expected ell counts for tha istanse
to be small—perhaps less than the equired 5, To guard agaist this possibility, exper
{imenters often fix citer the row or columa totals. For our example, we would fit te
column totals by randomly and independently sampling a fixed numberof residents in
each distance group. Ths would increase the likeli
cell counts would be of adequate size,
For example, suppose we obtain the evacuation opinion of random sample of (0
residents in each dance prow. The rest
ils might appeae as shown in Table 07. Noe
the diferece between this sampling procedure and the one describe in Sesto Sc
Where we assumed tht a singe rundom sample of = 150 residents wae sceced fom
‘among th population ofall people residing near Thtce Mile Ink In his section, we
ave randomly and independemly selected te samples, 100 residents Irom each ds
tance. Therefore, the data of Table 9.7 result from thge
With k = 2 ells (apport ord nx suppor fll evacosy
Aistances, 1+ ils, 7-12 mils,
A chi-square
nod! that the estimated expected
uitinomial experiment exh
ion) corresponding othe hee
and 13 or more mils trom Three Mile Isl
est fo detet dependence between row and column classification,
‘when citer the column othe 1 tots are fied. is conducted in exactly te sate
way asthe test of Sction 94 tan be shown prof omited tha! the sais will
Fosse a Sampling distribution that is approximately chi-square diebuton with
(r © (6 ~ 1) degrees of freedom. The test procedure is summarized in the box. AX
‘pplication ofthe txt the comparison of two or mur Binomial proportion fl
trated in Example 9,
A Test for Independence with Row* Totals Fi
| General Form of contingency Table Analysis:
I row tous ae fixed
‘Hg: The row proportions in each e
ll donot depend on the row; dat ste ist
bw
ons of observations in the column categories are the sme for cach
‘1: The row proportions in some (oF all) ofthe ells depend on the rows that
|e
sey ~ Bog
Para
Test statistics
where
ug) =
Rejection region: 32> x2, where xis (e~ 1)(e — 1) at
=e ta btn ie pete fe contig
i i er 4.x amy dol, i nny oo
dp95 Contingency Tables with Fixed Marginal Totals 449
Assumptions: 4. random sample is selected from each population for which the
row totals are fined.
2, The samples are independently selected
EEE | 1 conpre nepreparins of deface Inporsprodued by tee prodcton Ines, a aut eon
ra engneerranaom samples roars om each ne The rumours eis te tace
H marer.en | ines were tun woe 2,17, a7, respect, bo te da prove asf endencee oles
dezences inte proprtons cf decor pts proceed y te he pret Ines? oer
Solution | The da were entered as a contingency able in MINITAB, withthe resulting pintout
shown in Figue 9.5. The objective ofthis experiment is to compare tree bona
proportions of defectives. pp, and ps, based en thre independent binnmialexper-
ments, each ontaning 50 observations
The nll hypothess i that the proportions of defectives for she three production
lines ae identical
My m= m=m
ost the alternative hypothesis
1g Atleast wo ofthe proportions, pp and cfr
[Note tha the null hypothesis we have specified implies thatthe numbers of detec
tives and nodefetives ae independent of te production line. There, we et Ho: py
Pa Ps using the chi-square est fora contingency table analysis
‘Tabulated statistics: STATUS, LINE
Rover STATUS Columnes LINE
prrect 1217 736
mmm i
fe 398 408 $05 1363
aun 500 500 500 1500
‘$00 S00 500 1800
cell concent
Pearson'Chivsquare ="4.269) DF = 2, Fovaiue = 0.116
Likelihood Ratio Chi-square = 4.999, DF = 2, P-Value = 0.111,
FIGURE 9.5
[MINTTAB coatingeny thle analy for Example 95A et ed ie a
420 cChapter9 categorical Data Analysis
The estimated expected cell counts are computed usin the formula
Therefore
im) =
| ana
Etna)
(36)(500)
(26)(s00)
7300
1500
These long with he remaining estimated expected cll counts ae show (ighlihed)
‘on the MINITAB printout, Figure 9.5
The computed value of x (also shown on the printout) is
ere
PRS Bap
02 2F 7-19 (asa — aa0y?
a @ =
= ass
The lesion son ote? > yh whee yy 58017 is ted v0
( Nfe= = (NQ)
oes not exceed 2s (a
2 degrees of freedom. Since the computed valve of
nd, since the p-value shown on the primo, 118, exceeds
«© 05) there is insufficient evidence to indicate differen in the proportions of
efestive impellers produced by th three production lines Nove that we donot aePt
Heth is, we donot concioe that py
bout the posi
Applied Exercises
928 Performance of soter joke inspectors. Westinghowse
Eletic Company has experimented with ferent nes
of evaluating he porormance of slder join nacre
‘One aoroach involves comparing an individ! ape
‘ors lsifistons wit thse of the group of expe he
comprise Wesinghoe's Work Stns Commitee In
‘ne experiment 183 solder snoestions were eal ty
‘he commitee and 111 were clasifed as aeepale: As
inspector evaluated the Same 1S connections and hn
fed 124 as accepuble OF the tes rejected bye me
specter, he commie aged wth 19,
8 Comirct contingency table that summzes the
‘lasifiations of the commits el he inspector
Based on visual examination of the table you con
struc part dees it appa hat hee lao
Ship betwecn the inspects clsiteations and the
that if iferences exis, the
{oom each prodvction line
>—because we would be concerned
‘of making Type I eri, fling to detet differences inthe r=
Dertions of defectives ifn fit, differences exist, The text concise simply
Were (00 small to detect using samples of 500 impels
sommitce's? Explain. (A bar graph ofthe pero
‘sje hy commites an inspector il a 901
© Conduct chisqute test of independence for Bes?
ita. Use a= 05. Ctl interpre he eas of
Ur test inthe context ofthe proble
929 Anew cena bonding agent. When boing ith ot
lente ms macs a dy fed. Anew boding 3B
sive alld Smartbond) has heen developed to cise
the acest ofa dy Hel. However there is coe
Ut the new bonding abesve may not sch othe wo
as wll the curent standard, composite ase
{Trends n loraterials& Arid Organs, fn. 200.)
‘Tests were conc 2 sample of 10 exacted ttt
‘ded withthe new adhesive anda sample of 10 ex
‘cath bonded withthe compute adhete. The Aes
aeRenan dex (ARN, which messes the es be
sive of bonded
oth on a scle of 1 5, was deter
mined for cach of the 20 honed Leth ater 1 hour of
shying. (New: An ARE score of | implies al adhesive
rami on the teh, an acne ofS means none ofthe
adesive resins en the oth bk ofthe mn
terof bonded tein inthe fe ARI categories ash fo
thee
Browns
~ AaesveRemnantndex Score
iL 2 9 s «
2) ero
je Bea
4 Expain why
urging,
1 Conuct an analysis to dete ifthe dntibaton of
ART scores ler forthe two pe of boing ae
© Are the astumptions ofthe tt sated? Ino, how
es his impact the aly of he inferene dvired
Fre the est?
comigncy able is one with fe
{0 Manganese i tne arts crust, Te ssurce and esetal
etl, manganese, has been fd aba inno
es on the deep sella To vestige the elticanip
esweon the magnetic age of Er’ crust onthe cca
Noo andthe pana of nding manganese nls in
tht lcution. cust specimens were sled fiom seven
Iagnstic ape cans and the peentge of species
emiing manganese cles was send foreach, The
state shoe in the aocrpsaying able Is here sl
sit cridence wo nate tt te probity of ding
Manganese nodules in the deep-sea Fan's crs is de
Pendent om the magnetic age of the erst? Tet sing
SG mancanese
= Es
ve Specmons. Manganese Nodes
Miocene resent 389) 38
lass M0 v9
Bie 214 6a
— 4 24
eeCitaceom aa aut
Evo Midi Crewseous 1.120 a2
# no
Ieee H.W. “Tin, hn, anne in fess
"AL" Ancrcn Stet Sep-Ost 197
95 Contingency Tables with Fixed Marghnal Totals | 424
9.31 Doubled crest Seldane-D,pocod by Marion
Merrell Dow Ine san oven coumer dug designed to
relive steering aa congestion ad other smpoM
allege sini. General avers sects of Seldane-D
‘vee investigated ins double-blind, somtolled sty of
‘ver 0 piers sateing om llc hs sample
of 374 patiems were given Sele D, whereas second
Sample of 195 potets were given placebo tno di)
‘The numberof potentsrpeting ima in each of the
‘v0 groups are gives in dh table. Tet to determine
whee the proportion of painstaking Sekane-D sho
expense isomers Ina the comesponing pre
bon or patients recsving the placeho a= 10.
2 sewoaneo
care
Tasca ”
Nobeemia 7 1st
Tons 5 193
9.32 Bvauanng ngn.evel programs: High-level programing
languages (eg, C"7, Fate Algo and Pascal ae ee
ical 0 1 computes’ opcating sytem In one sty a
searcher developed smeasarement ester for eating
‘so bih-evel programming languages, Algol and Pascal
‘The eprid rel include a isbn of he relative
frequency af cceurene ofthe flerene pes of sae
mens wed in typical Algol and Pasa programs of ap
oxime the sme size. The reported percentages ete
sed to tabulate the
nation gen it he able
womever
= al Pascal
Tpeef oF 2s 204s
Stoners FOR 96380
0 Bs ae
Asigoment —$92)——4768,
Other ror es
rons toi. 9470
“Swe: Aap tom De Prskor "Oaths dcop f=
Assuming fied magia forthe to programming
Tmguages conduct atest 10 detemine whether the pe
etiages of the citeren ype of programming ste
mens ier fer the to languages. Tes sing a ~ 05,
be. Coosuust a 98% confsence irl forthe iflerence
Into percentages of aeigment sates sedi the
to langues,
¥
f
SSRI WOT422 chapter9 Categorical Dat
ta Analysis
9.6 Exact Tests for Independence in a Contingency Table Analysis (Optional)
‘The procedure fr testing independence ina contingency table in Sections 9.4 and 95
is an “approximate” est due to the fact thatthe y° tet statistic has an approwmae