Statpro Long Test 2 Notes
His running times are normally distributed with $\sigma = 10$ s.
$z = -0.44$
→ $-0.44 = \frac{120.6 - \mu}{10}$
→ $120.6 - \mu = -4.4$
→ $\mu = 120.6 + 4.4 = 125$ sec.
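A quick way to check the arithmetic above, as a minimal Python sketch. It assumes the notes' values are x = 120.6, z = -0.44 and σ = 10 s; the function name `mean_from_z` is made up for illustration.

```python
def mean_from_z(x, z, sigma):
    """Solve z = (x - mu) / sigma for mu."""
    return x - z * sigma


# Values as read from the worked example (sigma = 10 s is an assumption
# recovered from the step 120.6 - mu = -4.4).
mu = mean_from_z(x=120.6, z=-0.44, sigma=10)
print(round(mu, 1))
```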
EMPIRICAL RULE
- Very useful rule of thumb for normally distributed sets of data values.
1) approx. 68% of the observations lie in $[\mu - \sigma,\ \mu + \sigma]$.
2) approx. 95% of the observations lie in $[\mu - 2\sigma,\ \mu + 2\sigma]$.
3) approx. 99.7% of the observations lie in $[\mu - 3\sigma,\ \mu + 3\sigma]$.
[Figure: normal curve with the 95% and 99.7% intervals marked]
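The three percentages in the empirical rule can be checked against the standard normal CDF. This is only a sketch; `coverage_within` is a hypothetical helper name, and it uses `statistics.NormalDist` from the Python standard library.

```python
from statistics import NormalDist


def coverage_within(k):
    """Probability that a normal value lies within k standard deviations of the mean."""
    std_normal = NormalDist()  # mean 0, standard deviation 1
    return std_normal.cdf(k) - std_normal.cdf(-k)


for k in (1, 2, 3):
    print(f"within {k} s.d.: {coverage_within(k):.4f}")
# within 1 s.d.: 0.6827
# within 2 s.d.: 0.9545
# within 3 s.d.: 0.9973
```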
SAMPLING DISTRIBUTIONS
WHAT ARE SAMPLING DISTRIBUTIONS?
[Figure: diagram of several samples]
If the sample statistic is the sample mean, then the distribution is the
sampling distribution of the sample means ($\bar{x}_1, \bar{x}_2, \bar{x}_3, \bar{x}_4, \bar{x}_5, \ldots$).
SAMPLING DISTRIBUTION MEAN & VARIANCE
[Table: samples of size 2 from the population and their sample means $\bar{x}$, e.g. (15, 10) → 12.5, (15, 15) → 15, (20, 5) → 12.5, (10, 10) → 10]
Create a probability distribution & find the
mean & standard error of the sample mean.
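$\bar{x}$ | f | Probability
5 | 1 | 0.0625
7.5 | 2 | 0.1250
10 | 3 | 0.1875
12.5 | 4 | 0.2500
15 | 3 | 0.1875
17.5 | 2 | 0.1250
20 | 1 | 0.0625
$\mu_{\bar{x}} = \sum \bar{x}\,P(\bar{x}) = 5(0.0625) + 7.5(0.1250) + 10(0.1875) + 12.5(0.2500) + 15(0.1875) + 17.5(0.1250) + 20(0.0625) = 12.5$
$\sigma_{\bar{x}}^2 = \sum \bar{x}^2 P(\bar{x}) - \mu_{\bar{x}}^2 = 5^2(0.0625) + 7.5^2(0.1250) + 10^2(0.1875) + 12.5^2(0.2500) + 15^2(0.1875) + 17.5^2(0.1250) + 20^2(0.0625) - 12.5^2 = 15.625$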
or
- for either case
$\sigma_{\bar{x}}$
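The sampling distribution above can be rebuilt by listing every possible sample. This sketch assumes the population is {5, 10, 15, 20} and that samples of size 2 are drawn with replacement; both are inferred from the legible sample means and probabilities, not stated outright in the notes.

```python
from collections import Counter
from itertools import product
from math import sqrt

population = [5, 10, 15, 20]  # assumption: inferred from the sample means in the notes
n = 2                         # assumption: samples of size 2, drawn with replacement

# Enumerate all ordered samples and tabulate their means.
samples = list(product(population, repeat=n))
counts = Counter(sum(s) / n for s in samples)
dist = {xbar: c / len(samples) for xbar, c in sorted(counts.items())}

mean_xbar = sum(x * p for x, p in dist.items())
var_xbar = sum(x ** 2 * p for x, p in dist.items()) - mean_xbar ** 2
std_error = sqrt(var_xbar)

for xbar, p in dist.items():
    print(f"{xbar:>5} {p:.4f}")
print(mean_xbar, var_xbar, round(std_error, 3))
```

Under these assumptions the mean of the sample means equals the population mean, and the variance of the sample means equals the population variance divided by n.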
INFERENTIAL STATISTICS
- point estimation
↳ computes a single approximate value from a random sample to represent
an unknown population measure.
• mean: $\bar{x} = \frac{1}{n}\sum x_i$
• variance: $s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1}$
• probability: $\hat{p} = \frac{x}{n}$, where $x$ = no. of successes
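The three point estimators can be tried on a small sample. The data below are hypothetical and only for illustration; `statistics.variance` uses the n − 1 divisor, matching the sample variance formula.

```python
from statistics import mean, variance

# Hypothetical sample, not from the notes.
sample = [12, 15, 11, 18, 14]
x_bar = mean(sample)   # point estimate of the population mean
s2 = variance(sample)  # point estimate of the population variance (n - 1 divisor)

# Hypothetical count of successes out of n trials.
successes, trials = 3, 5
p_hat = successes / trials  # point estimate of the population proportion

print(x_bar, s2, p_hat)
```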
- interval estimation
↳ constructs an interval of possible values from a random sample to
estimate an unknown parameter of interest.
↳ the interval is called the interval estimate.
↳ the rule/formula used to get it is called the interval estimator.
↳ When a degree of confidence is attached to an interval estimate, the
interval is called a confidence interval estimate.
denoted by $\alpha$.
deviation of ₱600.
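Find the point estimate of money spent per
visitor.
$\bar{x}$ = ₱3100
Construct a 95% confidence interval for it.
$\sigma$ unknown
$\alpha = 1 - 0.95 = 0.05$, df $= 49 - 1 = 48$
$t_{\alpha/2,\ \text{df}} = 2.009$ (based on another table)
$e = t_{\alpha/2,\ \text{df}} \cdot \frac{s}{\sqrt{n}} = 2.009 \cdot \frac{600}{\sqrt{49}} = 172.2$
→ $(3100 - 172.2,\ 3100 + 172.2)$
→ $(2927.8,\ 3272.2)$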
* $\sigma$ is the s.d. of a population & $s$ is the s.d. of a sample.
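The interval for the mean can be reproduced in a few lines. This sketch assumes the values read from the example (sample mean 3100, s = 600, n = 49) and takes the critical value 2.009 directly from the notes' table instead of computing it; `t_interval` is a hypothetical helper name.

```python
from math import sqrt


def t_interval(x_bar, s, n, t_crit):
    """Confidence interval for the mean when sigma is unknown.

    t_crit is looked up from a t table by the caller.
    """
    margin = t_crit * s / sqrt(n)
    return x_bar - margin, x_bar + margin, margin


# Values as read from the worked example; t_crit = 2.009 is the table value
# used in the notes.
low, high, e = t_interval(x_bar=3100, s=600, n=49, t_crit=2.009)
print(round(e, 1), round(low, 1), round(high, 1))
```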
$e = z_{0.05/2}\sqrt{\frac{(0.66)(0.34)}{n}} = 0.0415$
(supposedly $-1.96$, but just get the positive value)
→ $(0.66 - 0.0415,\ 0.66 + 0.0415)$
→ $(0.6185,\ 0.7015)$
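A sketch of the same proportion interval in Python. The sample size is not legible in the notes; n = 500 is an assumption chosen only because it reproduces the 0.0415 margin with a sample proportion of 0.66. `proportion_interval` is a hypothetical helper name.

```python
from math import sqrt
from statistics import NormalDist


def proportion_interval(p_hat, n, confidence):
    """Large-sample confidence interval for a population proportion."""
    alpha = 1 - confidence
    z = NormalDist().inv_cdf(1 - alpha / 2)  # positive critical value
    margin = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - margin, p_hat + margin, margin


# n = 500 is an assumption (see the lead-in), not a value from the notes.
low, high, e = proportion_interval(p_hat=0.66, n=500, confidence=0.95)
print(round(e, 4), round(low, 4), round(high, 4))
```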
SAMPLE SIZE DETERMINATION
- sample size determination for the mean
$n = \left(\frac{z_{\alpha/2}\,\sigma}{e}\right)^2$, where $e$ = margin of error
ex. How large a sample should you take to be a
$\alpha = 1 - 0.95 = 0.05$
$z_{\alpha/2} = z_{0.05/2} = 1.96$
$n = \left(\frac{(1.96)(62.3\ldots)}{e}\right)^2$ → round up for the sample size
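- sample size determination for the proportion
$n = \frac{z_{\alpha/2}^2\,p(1 - p)}{e^2}$
ex. How large a sample size should you take to
be 95% confident that […]% of teens have
$\alpha = 1 - 0.95 = 0.05$
$z_{0.05/2} = 1.96$
$n = \frac{(1.96)^2}{4(0.02)^2} = 2401$ (taking $p = 0.5$, so $p(1 - p) = \frac{1}{4}$)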
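- sample size determination with known population size & sample standard deviation
$n = \frac{z_{\alpha/2}^2\,s^2}{e^2 + \frac{z_{\alpha/2}^2\,s^2}{N}}$
$\alpha = 1 - 0.95 = 0.05$
$z_{0.05/2} = 1.96$
$n = \frac{(1.96)^2(0.5)^2}{(0.10)^2 + \frac{(1.96)^2(0.5)^2}{2000}} \approx 92$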
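The three sample-size formulas can be sketched as small functions. The function names are made up for illustration. The proportion and finite-population calls use the numbers legible in the notes (z = 1.96, e = 0.02; z = 1.96, s = 0.5, e = 0.10, N = 2000); the call for the mean uses hypothetical inputs because that example is not fully legible.

```python
from math import ceil


def round_up(n):
    """Round a computed sample size up, ignoring floating-point noise."""
    return ceil(round(n, 6))


def n_for_mean(z, sigma, e):
    return round_up((z * sigma / e) ** 2)


def n_for_proportion(z, e, p=0.5):
    # p = 0.5 is the conservative choice when no estimate of p is available.
    return round_up(z ** 2 * p * (1 - p) / e ** 2)


def n_finite_population(z, s, e, N):
    zs2 = (z * s) ** 2
    return round_up(zs2 / (e ** 2 + zs2 / N))


print(n_for_proportion(z=1.96, e=0.02))                    # 2401
print(n_finite_population(z=1.96, s=0.5, e=0.10, N=2000))  # 92
print(n_for_mean(z=1.96, sigma=15, e=3))                   # hypothetical inputs
```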
HYPOTHESIS TESTING
TEST OF HYPOTHESIS
- Statistical hypothesis
↳ Null hypothesis ($H_0$)
- One-tailed test
• tests a null hypothesis against a directional alternative
hypothesis, e.g. $H_a: \theta > \theta_0$.
- Two-tailed test
• tests a null hypothesis against a nondirectional alternative
hypothesis.
$H_a: \mu \neq 10$
(contradicts $H_0$)
- Type I error
↳ an error of rejecting a null hypothesis that is true.
↳ (Reject $H_0$ | $H_0$ is true)
- Type II error
↳ an error of failing to reject (AKA accepting) a null hypothesis that is
false.
↳ (Fail to reject $H_0$ | $H_0$ is false)
ex. $H_0$: The defendant is not guilty
$H_a$: The defendant is guilty
guilty.
innocent.
- unless population data is available, you can't determine the truth or falsity of the
null & alternative hypotheses. However, the probabilities of committing errors can be
evaluated.
↳ level of significance ($\alpha$)
• measures the risk of rejecting a true null hypothesis
• P(Type I error) = $\alpha$
↳ level of confidence
• probability of making the correct decision of accepting a true
null hypothesis.
- $\alpha$ & $\beta$ (the probability of a Type II error) are inversely related. Both can be reduced by increasing the sample size.
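To see that P(Type I error) is the level of significance, one can simulate many samples for which the null hypothesis is true and count how often a two-tailed z test rejects it. This is a rough sketch with made-up settings (standard normal population, n = 30, 2000 trials, fixed seed); the observed rate should land near α but will not equal it exactly.

```python
import random
from math import sqrt
from statistics import NormalDist


def type_i_error_rate(alpha, n, trials, seed=0):
    """Share of true-null samples that a two-tailed z test wrongly rejects."""
    rng = random.Random(seed)
    mu0, sigma = 0.0, 1.0  # H0 is true: the data really come from N(mu0, sigma)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(mu0, sigma) for _ in range(n)]
        x_bar = sum(sample) / n
        z = (x_bar - mu0) / (sigma / sqrt(n))
        if abs(z) > z_crit:
            rejections += 1
    return rejections / trials


rate = type_i_error_rate(alpha=0.05, n=30, trials=2000)
print(rate)
```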
TESTING HYPOTHESES ABOUT PARAMETERS OF ONE POPULATION
$H_a$ | Decision rule: reject $H_0$ if
$\mu > \mu_0$ | $z > z_\alpha$
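ex. A manufacturer of sports equipment developed
$z = \frac{\bar{x} - \mu_0}{\sigma/\sqrt{50}} = -2.83$
Using $H_a: \mu \neq \mu_0$ → $-z_{\alpha/2} = -z_{0.01/2} = -z_{0.005} = -2.575$
Since $-2.83 < -2.575$, reject $H_0$.
Conclusion: at $\alpha = 1\%$, there is enough evidence to reject $H_0$.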
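The decision rules for a z test can be written as one small function. This is a sketch only; `reject_h0` and its `tail` labels are hypothetical names, and the critical values come from `statistics.NormalDist` instead of a printed table, so they may differ from table values in the third decimal.

```python
from statistics import NormalDist


def reject_h0(z, alpha, tail):
    """Apply the z-test decision rule; tail is 'two', 'right' or 'left'."""
    std_normal = NormalDist()
    if tail == "two":
        return abs(z) > std_normal.inv_cdf(1 - alpha / 2)
    if tail == "right":
        return z > std_normal.inv_cdf(1 - alpha)
    if tail == "left":
        return z < -std_normal.inv_cdf(1 - alpha)
    raise ValueError("tail must be 'two', 'right' or 'left'")


# z = -2.83 at the 1% level, two-tailed, as in the example above.
decision = reject_h0(z=-2.83, alpha=0.01, tail="two")
print(decision)  # True
```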
$t < -t_{\alpha/2,\ n-1}$ or $t > t_{\alpha/2,\ n-1}$
$t > t_{\alpha,\ n-1}$
$t < -t_{\alpha,\ n-1}$
ex. It is recommended that children 1-3 yrs old have
a 1070 kcal energy intake. A sample of 78
1070 kcal.
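- test hypothesis about the population proportion
of significance.
$H_a: p < p_0$, $n = 600$
$p_0 = 0.10$, $\hat{p} = \frac{30}{600}$, $\alpha = 0.05$
$z = \frac{\hat{p} - p_0}{\sqrt{\frac{p_0(1 - p_0)}{n}}} = \frac{0.05 - 0.10}{\sqrt{\frac{0.10(0.90)}{600}}} = -4.08$
Using $H_a: p < p_0$ → $-z_\alpha = -z_{0.05} = -1.645$
Since $-4.08 < -1.645$, reject $H_0$.
Conclusion: at $\alpha = 5\%$, there is enough
evidence to claim that overseas
Filipinos comprise less than
10% of tourists.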
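The proportion test above can be recomputed as follows. The inputs (30 successes out of n = 600, hypothesized proportion 0.10) are the values this example appears to use; they are recovered from partly illegible lines, so treat them as assumptions. `proportion_z` is a hypothetical helper name.

```python
from math import sqrt
from statistics import NormalDist


def proportion_z(x, n, p0):
    """z statistic for testing a single population proportion against p0."""
    p_hat = x / n
    return (p_hat - p0) / sqrt(p0 * (1 - p0) / n)


z = proportion_z(x=30, n=600, p0=0.10)       # assumed inputs, see lead-in
z_crit = -NormalDist().inv_cdf(1 - 0.05)     # left-tailed test at alpha = 0.05
reject = z < z_crit
print(round(z, 2), round(z_crit, 3), reject)
```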
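TESTING HYPOTHESES ABOUT PARAMETERS OF TWO POPULATIONS
• $z = \frac{(\bar{x}_1 - \bar{x}_2) - d_0}{\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}}$
$\mu_1 - \mu_2 \neq d_0$ | $z < -z_{\alpha/2}$ or $z > z_{\alpha/2}$
$\mu_1 - \mu_2 < d_0$ | $z < -z_\alpha$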
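↳ testing hypothesis about the difference of two population means $\mu_1 - \mu_2$, &
$\mu_1 - \mu_2 \neq d_0$ | $t < -t_{\alpha/2,\ n_1+n_2-2}$ or $t > t_{\alpha/2,\ n_1+n_2-2}$
$\mu_1 - \mu_2 > d_0$ | $t > t_{\alpha,\ n_1+n_2-2}$
$\mu_1 - \mu_2 < d_0$ | $t < -t_{\alpha,\ n_1+n_2-2}$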
ex. A random sample of 100 women from rural & urban
areas each have these statistics regarding the age
area | mean | s.d.
rural | 21 | 4.5
urban | 23 | 5.1
Does this provide evidence that rural women marry
earlier than urban women? Use 5% level of significance.
women.
* Another way to check a claim is through the p-value. The null hypothesis is rejected if
the p-value is less than the significance level (use an online p-value calculator).
p-value = 0.001830
Since $0.001830 < 0.05$, reject $H_0$.
Conclusion: at $\alpha = 5\%$, there is enough evidence
to support the claim that rural
women marry earlier than urban women.
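The rural/urban comparison can be checked with a two-sample z statistic and a normal-based p-value. This sketch treats the sample standard deviations as if they were the population values, which is a simplifying assumption for n = 100 per group; a calculator based on the t distribution, as the notes may have used, gives a slightly different p-value. `two_sample_z` is a hypothetical helper name.

```python
from math import sqrt
from statistics import NormalDist


def two_sample_z(mean1, sd1, n1, mean2, sd2, n2, d0=0.0):
    """z statistic for the difference of two means."""
    return ((mean1 - mean2) - d0) / sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)


# Group 1 = rural, group 2 = urban; Ha: mu1 - mu2 < 0 (rural women marry earlier).
z = two_sample_z(mean1=21, sd1=4.5, n1=100, mean2=23, sd2=5.1, n2=100)
p_value = NormalDist().cdf(z)  # left-tail area
print(round(z, 2), p_value < 0.05)
```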
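* example for two populations: $\mu_1 - \mu_2$ with known $\sigma_1$, $\sigma_2$
floors are assigned to test each wax. Does the
data indicate wax 1 is more effective than wax 2?
wax | mean | s.d.
1 | 3 | 0.33
2 | 2.9 | 0.36
$z_\alpha = z_{0.05} = 1.645$
Since $z < 1.645$, fail to reject $H_0$.
Conclusion: at $\alpha = 5\%$, there is not enough
evidence to support the claim that
wax 1 is more efficient than
wax 2.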
Case 2: related or paired samples
↳ independent samples taken from a normal population with size $n$;
Student | 1 | 2 | 3 | …
day 1 | 1905 | 2237 | 1831 | …
p-value = 0.144832
Since $0.144832 > 0.10$, fail to reject $H_0$.
Conclusion: at $\alpha = 10\%$, there is not enough
evidence to support the claim
that higher dietary intake is
observed on day 1.
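$z = \frac{(\hat{p}_1 - \hat{p}_2) - d_0}{\sqrt{\hat{p}\hat{q}\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}}$, with $\hat{q} = 1 - \hat{p}$
$\hat{p} = \frac{x_1 + x_2}{n_1 + n_2}$
↳ pooled estimate of $p$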
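$H_a$ | Decision rule: reject $H_0$ if
$p_1 - p_2 \neq d_0$ | $z < -z_{\alpha/2}$ or $z > z_{\alpha/2}$
$p_1 - p_2 > d_0$ | $z > z_\alpha$
$p_1 - p_2 < d_0$ | $z < -z_\alpha$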
hypertensive | 35 | 90 | 125
normal | 38 | 112 | 150
total | 73 | 202 | 275
$H_0: p_1 - p_2 = 0$, $H_a: p_1 - p_2 \neq 0$, $\alpha = 0.05$
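A sketch of the pooled two-proportion z test for this example. It rests on a guess about the table: the two unlabeled columns are taken as the two groups (totals 73 and 202), with 35 and 90 hypertensive cases in them. The column labels are not legible in the notes, so this reading is an assumption. `two_proportion_z` is a hypothetical helper name.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z(x1, n1, x2, n2):
    """Pooled z statistic for H0: p1 - p2 = 0."""
    p1, p2 = x1 / n1, x2 / n2
    p_pool = (x1 + x2) / (n1 + n2)  # pooled estimate of p
    se = sqrt(p_pool * (1 - p_pool) * (1 / n1 + 1 / n2))
    return (p1 - p2) / se


# Assumed reading of the table: hypertensive counts 35 of 73 and 90 of 202.
z = two_proportion_z(x1=35, n1=73, x2=90, n2=202)
z_crit = NormalDist().inv_cdf(1 - 0.05 / 2)  # two-tailed, alpha = 0.05
print(round(z, 3), abs(z) > z_crit)
```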
CORRELATION COEFFICIENT (r OR RHO)
CORRELATION COEFFICIENT ($\rho$)
made of "co" (together) & relation.
- correlation is positive if values increase together.
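A small sketch of a correlation coefficient that comes out positive when two variables tend to increase together. The data are hypothetical, and the formula used is the usual Pearson sample correlation, which the notes have not yet written out at this point; `pearson_r` is a made-up name.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson sample correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical data: y tends to rise as x rises.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
r = pearson_r(x, y)
print(round(r, 4))
```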