REVIEW OF BASIC STATISTICS
REVIEW OF BASIC STATISTICS
Statistics
Singular sense: the science that deals with
collection, organization, presentation,
analysis and interpretation of data
Plural sense: a set of numerical information, a
processed data. Some examples are:
Population statistics, Statistics on Births,
Statistics on Enrollment.
Two phases of statistics:
Descriptive statistics
Inferential statistics
Measures of Location
Measures of Dispersion
Measure of Skewness
Measure of Kurtosis
Measures of Location
1. Minimum
2. Maximum
3. Measures of Central Tendency
- Mean, Median, Mode
4. Quantiles
- Percentiles, Deciles, Quartiles
MCTs
Arithmetic Mean
∑ Xi
µ= i
=1
N
Median
X N +1 , if N is odd
2
Md = X N + X N
2 +1
2
, if N is even
2
Mode
Mo is the most popular value in the data.
Diameter 8 10 12 16 20 21
Frequency 5 8 8 4 3 2
MCTs:
Mean = 12.73
Median = 0.5(12+12)=12
j
location of P j = ×N
100
X j + X j
100 N N +1
100
, if location of P j is a whole number
2
Pj =
data value in the next higher whole number position,
if location of P is not a whole number
j
For Deciles and Quartiles
P42 = X13 = 10
D8 = P80 = (X24+X25)/2 = (16+16)/2 = 16
Q2 = P50 = (X15 + X16)/2 = (12 + 12)/2 = 12
Measures of Dispersion
1. Range ( R )
2. Variance (σ2)
Range = 21 – 8 = 13
Variance = 17.1289
CV = 32.5029
Measure of Skewness
3 ( µ − Md )
SK =
σ
Possible values of SK:
SK = 0 symmetric distribution
SK < 0 negatively skewed (skewed to the left)
SK > 0 positively skewed (skewed to the right)
What is SYMMETRY?
µ = Md = Mo µ = Md = Mo
Likewise, a symmetric distribution has SK=0 since its
mean is equal to its median and its mode.
MEASURE OF SKEWNESS
SK > 0
positively skewed
Mo < Md < µ
SK < 0
negatively skewed
µ < Md < Mo
Measure of Kurtosis
∑ (X i − µ)
4
=K i=1
−3
Nσ 4
Possible values of K:
K = 0 mesokurtic (bell-shaped) distribution
K < 0 platykurtic (flat with thick tails) distribution
K > 0 leptokurtic (peaked with thin tails)
distribution
MEASURE OF KURTOSIS
K=0
mesokurtic
K<0
K>0
platykurtic
leptokurtic
Example:
SK = 0.5291
K = 4.0174
Module 3. Probability
Random Experiment
Sample Space
Event
Probability
Example:
A Priori Approach
(Theoretical Approach)
A Posteriori Approach
(Empirical Approach)
Subjective Approach
Illustration: A Priori Approach
Addition Principle
Suppose an operation can be done in n1
ways, a second operation in n2 ways and so
on to a kth operation that can be done in nk
ways. If these operations cannot be done at
the same time or simultaneously, then there
is a total of n1 + n2 + … + nk ways to
perform these tasks
Example:
n!
n P r=
(n − r )!
n P n= n !
Examples:
n! k
n1! n 2! n 3! ... n k !
where: n = ∑
i
ni
=1
= n1 + n 2 + ... + n k
n n!
n C r=
r = (n − r )! r !
n n!
=
n1 n2 ... n r n1! n 2! ... n r !
N(S) = 4 x 4 = 16
2. In how many ways can eight problems be
assigned to four groups if no two groups will
be assigned the same problem?
N(S) = 21 x 3 x 2 x 3 x 5 = 1890
Event Relations and Probability
A. Complement of an Event (E C)
P(E C) = 1 – P(E).
B. Conditional Probability
P (E 1 ∩ E 2 )
P (E 1 | E 2 ) =
P (E 2 )
C. Independent Events
P(E1 | E2) = P(E1) or P(E2 | E1) = P(E2)
D. Mutually Exclusive Events
P(E1 ∩ E2) = 0.
Theorems on Probability
Addition Theorem
P(E1 ∪ E2) = P(E1) + P(E2) – P(E1 ∩ E2)
If E1 and E2 are mutually exclusive then
P(E1 ∪ E2) = P(E1) + P(E2)
Multiplication Theorem
P(E1 ∩ E2) = P(E1 | E2) * P(E2)
If E1 and E2 are independent
then P(E1 ∩ E2) = P(E1) * P(E2)
Examples:
I nferences and
Generalizations
Two Broad Classification of
Sampling
1. Non-probability sampling
– No objective procedure in sample
selection.
– Probabilities of selection of units are not
known.
– Sampling errors cannot be computed.
– No inferences about the
universe/population can be made.
Some Non-probability Sampling
Methods
Convenience sampling
Accidental sampling
Quota sampling
Purposive sampling
2. Probability sampling
– Uses an objective procedure in sampling
selection (randomization procedure)
– Probabilities of selection of units are
known.
– Sampling errors can be computed.
– Valid inferences about the population can
be made.
Some Probability Sampling
Methods
Two Types:
SRS without replacement (SRSWOR)
– NCn number of possible samples
– Probability of selecting a sample: 1/NCn
SRS with replacement (SRSWR)
– Nn number of possible samples
– Probability of selecting a sample: 1/Nn
Randomization Procedures
Lottery method
Use of the table of random numbers
Random number generator (computer
programs that generate pseudo-random
numbers, e.g. calculator)
Stratified Sampling
L
N = ∑
i
=1
Ni = N 1 + N 2 + ... + N L
L
n = ∑
i
=1
ni = n 1 + n 2 + ... + n L
n1 n5
N1
N5 N4 n4
n2 N2
N3
n3
Systematic Sampling:
1. non-probability sampling
2. stratified
Estimation:
∑X i
Point Estimator: Sample Mean X= i =1
Interval Estimator:
σ
Known σ, X Z α 2
n
s
Unknown σ, X tα 2 ( n −1)
n
Population Variance σ2
( )
n 2
∑ Xi − x
Point Estimator: s 2 = i =1
n −1
Population Proportion P
a
Point Estimator: Sample Proportion p=
n
pq
p zα 2 , q = 1 − p
n
STRS Point Estimation:
Population Mean µ
L Ni
∑ wX
i =1
i
where
i wi =
N
Population Variance σ2
L
∑ ii
w s
i =1
2
Population Proportion P
L
∑w p
i =1
i i
Example:
1. Analysis of Variance
2. Regression Analysis
3. T-test on one population mean
4. Binomial test
5. Binomial test
THANK YOU!