Statistics Formula Sheet
Statistics Formula Sheet
C
R
Stay updated with the latest FREE lectures: https://www.youtube.com/@ekagrataca
A
U M
T K
A N
IS H
A N
C CA NISHANT KUMAR 1
Chapter 13 – Statistical Description
A R
of Data U M
T K
A N
IS H
A N
C CA NISHANT KUMAR 2
1. Frequency Density of a Class Interval =
Class Frequency
Class Length
A R
2. Relative Frequency of a Class Interval =
Class Frequency
Total Frequency
U M
T K
3. Percentage Frequency of a Class Interval =
Class Frequency
Total Frequency
100
A N
IS H
A N
C CA NISHANT KUMAR 3
Chapter 14 – Measures of Central
A R
Tendency and Dispersion U M
T K
A N
IS H
A N
C CA NISHANT KUMAR 4
Topic 1 – Arithmetic Mean
n
x A R
1. Mean of Individual Series x = i =1
i
U M
K
n
2. Mean of Discrete/Continuous Series x =
fx
NT f
A
(In continuous series, the mid-point of the class interval is taken to be x)
3. If two variables x and y are related as y = a + bx , and x is known, then y = a + bx .
H
IS
n x +n x
4. Combined Mean x = 1 1 2 2
n1 + n2
A N
C CA NISHANT KUMAR 5
Topic 2 – Geometric Mean
1. GM of Individual Series = nth root of the product of n numbers
A R
(
2. GM of Discrete/Continuous Series = x1 x2 ... xn
f1 f2 fn
)
1
N
U M
T K
(In continuous series, the mid-point of the class interval is taken to be x)
3. If z = xy, then GM of z = GM of x × GM of y
x
N
4. If z = , then GM of z =
y
A
GM of x
GM of y
H 1
5. For a set of r observations, log G = log x
IS r
A N
C CA NISHANT KUMAR 6
Topic 3 – Harmonic Mean
n
A R
1. HM of individual series HM =
1
x
U M
T K
2. HM of Discrete/Continuous Series HM =
N
f
x
A N
H
(In continuous series, the mid-point of the class interval is taken to be x)
n +n
N IS
3. Combined HM = 1 2
n1 n2
+
H1 H 2
C A CA NISHANT KUMAR 7
4. The harmonic mean of 1, ½, 1/3, …, 1/n is given by:
2
n +1
A R
5. Average Speed =
2xy
x+ y
U M
T K
6. Relationship between AM, GM, HM for constant observations: AM = GM = HM
7. Relationship between AM, GM, HM for distinct observations: AM > GM > HM
N
8. For positive observations, GM 2 = AM HM
A
IS H
A N
C CA NISHANT KUMAR 8
Topic 4 – Median
A
n +1R
U M
1. For odd number of items, Rank of Median of Individual Series:
2
th
K n
2. For even number of items, Median is given by the average of term and
T
2
N
th
n
+ 1 term.
2
HA N +1
IS
3. Rank for Discrete Series:
2
N
4. Rank for Continuous Series:
A
N
2
C CA NISHANT KUMAR 9
Rank − c
5. Median of Continuous Series: l +
f
i
A R
yme = a + bxme .
U M
6. If two variables x and y are related as y = a + bx , and xme is known, then
T K
A N
IS H
A N
C CA NISHANT KUMAR 10
Topic 5 – Quartile
n +1
A R
1. Rank of Q1 of Individual Series:
N +1
4
U M
T K
2. Rank of Q1 of Discrete Series:
4
N
A N
3. Rank of Q1 of Continuous Series:
4
4. Rank of Q2 & Q3: 2 × Rank of Q1, or 3 × Rank of Q1
IS H
5. Quartile of Continuous Series: l +
Rank − c
f
i
A N
C CA NISHANT KUMAR 11
Topic 6 – Decile
n +1
A R
1. Rank of D1 of Individual Series:
N +1
10
U M
T K
2. Rank of D1 of Discrete Series:
10
N
A N
3. Rank of D1 of Continuous Series:
10
4. Rank of D2 & D3: 2 × Rank of D1, or 3 × Rank of D1
IS H
5. Decile of Continuous Series: l +
Rank − c
f
i
A N
C CA NISHANT KUMAR 12
Topic 7 – Percentile
n +1
A R
1. Rank of P1 of Individual Series:
100
N +1
U M
T K
2. Rank of P1 of Discrete Series:
100
N
A N
3. Rank of P1 of Continuous Series:
100
4. Rank of P2 & P3: 2 × Rank of P1, or 3 × Rank of P1
IS H
5. Percentile of Continuous Series: l +
Rank − c
f
i
A N
C CA NISHANT KUMAR 13
Topic 8 – Mode
f1 − f 0
A R
1. Mode of Continuous Series = l +
2 f1 − f 0 − f 2
i
U M
2. If two variables x and y are related as y = a + bx , and xmo is known, then
ymo = a + bxmo .
T K
N
3. Relationship between Mean, Median, Mode
A
a. For Symmetric Data: Mean = Median = Mode
b. For Skew Symmetric Data: Mode = 3Median – 2Mean
H
IS
c. For Positively Skewed Data: Mean > Median > Mode
d. For Negatively Skewed Data: Mean < Median < Mode
A N
C CA NISHANT KUMAR 14
Topic 9 – Range
1. Range of Individual Series = Largest Observation – Smallest Observation
A R
𝐿𝑎𝑟𝑔𝑒𝑠𝑡 𝑂𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛−𝑆𝑚𝑎𝑙𝑙𝑒𝑠𝑡 𝑂𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛
2. Coefficient of Range = 𝐿𝑎𝑟𝑔𝑒𝑠𝑡 𝑂𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛+𝑆𝑚𝑎𝑙𝑙𝑒𝑠𝑡 𝑂𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛
U M
Boundary
T K
3. Range of Continuous Series = Upper Most Class Boundary – Lower Most Class
N
𝑈𝑀𝐶𝐵 −𝐿𝑀𝐶𝐵
4. Coefficient of Range = 𝑈𝑀𝐶𝐵 +𝐿𝑀𝐶𝐵
A
5. If two variables x and y are related as y = a + bx , and Rx is known, then Ry = b Rx .
H
N IS
C A CA NISHANT KUMAR 15
Topic 10 – Mean Deviation
x− A
A R
1. Mean Deviation of Individual Series =
n
f x− A
U M
T K
2. Mean Deviation of Discrete/Continuous Series =
𝑀𝑒𝑎𝑛 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 𝐴𝑏𝑜𝑢𝑡 𝐴
N
A N
3. Coefficient of Mean Deviation 𝐴
× 100
4. If two variables x and y are related as y = a + bx , and MDx is known, then
MDy = b MDx .
IS H
A N
C CA NISHANT KUMAR 16
Topic 11 – Standard Deviation
( x − x ) A R
M
2
K
2. Standard Deviation of Individual Series =
n
x −(x)
2
2 U
NT n
f (x − x)
2
HA
3. Standard Deviation of Discrete/Continuous Series =
N
IS fx − ( x )
2
2
4. Standard Deviation of Discrete/Continuous Series =
A N 𝑆𝐷
5. Coefficient of Variation = 𝐴𝑀 × 100
N
C CA NISHANT KUMAR 17
6. Variance = Square of Standard Deviation
n2 − 1
A R
M
7. Standard Deviation of first n natural numbers =
12
K
8. Standard Deviation of only two numbers a and b =
a −b
2 U
SDy = b SDx .
NT
9. If two variables x and y are related as y = a + bx , and SDx is known, then
10.
HA
Combined Standard Deviation =
n1s12 + n2 s2 2 + n1d12 + n2 d 2 2
IS
n1 + n2
A N
Here, d1 = x1 − x ; d2 = x2 − x
C CA NISHANT KUMAR 18
Topic 12 – Quartile Deviation
Q − Q1
A R
1. Quartile Deviation = 3
2
Q − Q1
U M
T K
2. Coefficient of Quartile Deviation = 3
Q3 + Q1
100
A N
IS H
A N
C CA NISHANT KUMAR 19
Chapter 15 – Probability
A R
1. Probability =
No. of Favourable Cases/Events/Outcomes
Total No. of Cases/Events/Outcomes
U M
T K
2. When an experiment with total number of events a is repeated b number of times,
the total number of outcomes is given by a b .
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝐹𝑎𝑣𝑜𝑢𝑟𝑎𝑏𝑙𝑒 𝑂𝑢𝑡𝑐𝑜𝑚𝑒𝑠
IS H
5. If two events A and B are
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝐹𝑎𝑣𝑜𝑢𝑟𝑎𝑏𝑙𝑒 𝑂𝑢𝑡𝑐𝑜𝑚𝑒𝑠
P ( A B ) = P ( A) + P ( B ) − P ( A B )
not mutually exclusive, then,
A N
C CA NISHANT KUMAR 20
6. Two events A and B are mutually exclusive, if A B = Therefore, P ( A B ) = 0,
or P ( A B ) = P ( A ) + P ( B ) .
A R
7. If three events A, B, and C are not mutually
U M exclusive, then
P ( A B C ) = P ( A) + P ( B ) + P (C ) − P ( A B ) − P ( A C ) − P ( B C ) + P ( A B C )
8. Three events A,
T K
B, and
P ( A B C ) = P ( A) + P ( B ) + P (C ).
C are mutually exclusive, if
A N
9. Probability that only event A occurs: P ( A − B ) = P ( A ) − P ( A B )
Probability that only event B occurs: P ( B − A ) = P ( B ) − P ( A B )
10.
11.
IS H
P ( A B ) = P ( A) P ( B )
12.
C CA NISHANT KUMAR 21
P( A / B) =
P( A B)
P( B)
A R
13.
M
Probability of event B given that event A has already occurred is given by
P(B/A):
U
P ( B / A) =
P ( B A)
P ( A)
T K
14. P ( A '/ B ) =
A N
P ( A ' B ) P ( B ) − P ( A B )
P( B)
=
P( B)
15.
IS
P ( A / B ') = H
P ( A B ') P ( A) − P ( A B )
P ( B ')
=
1− P( B)
A N
C CA NISHANT KUMAR 22
16. P ( A '/ B ') =
P ( A ' B ') P ( A B ) ' 1 − P ( A B )
P ( B ')
=
1− P( B)
=
1− P( B)
A R
17. Probability that only event A or only event B occurs:
P ( A) + P ( B ) − 2 P ( A B )
U M
18.
T K
Expected value ( ) of a random variable ( x ) is given by: = E ( x ) = pi xi
( ) is given by: E ( x ) = p ( x )
N
2 2 2
19. Expected value of x i i
20. Variance
HA( )
2
of a
V ( x ) = 2 = E ( x − ) = E ( x2 ) − 2
random variable ( x) is given by:
IS
2
21.
A N If a and b are two constants related with two random variables x and y as
y = a + bx, then the mean, i.e., the expected value of y is given by: y = a + b x .
C CA NISHANT KUMAR 23
22.
R
If a and b are two constants related with two random variables x and y as
A
y = a + bx, then the standard deviation of y is given by: y = b x .
23.
M
Expectation of sum of two random variables is the sum of their expectations,
U
i.e., E ( x + y ) = E ( x ) + E ( y ) , for any two random variables x and y.
24.
K
Expectation of the product of two random variables is the product of the
T
expectation of the two random variables, provided the two variables are
N
independent, i.e., E ( x y ) = E ( x ) E ( y ) .
A
IS H
A N
C CA NISHANT KUMAR 24
Chapter 16 – Theoretical
A R
Distributions U M
T
Topic 1 – Binomial DistributionsK
N
1. P ( x ) = nCx p x q n− x , for x = 0, 1, 2, 3, …, n
A
IS H
2. The mean of the binomial distribution is given by = np .
3. A binominal distribution is symmetrical when p = q.
N
4. Mode of a Binomial Distribution is given by largest integer contained in
0 = ( n + 1) p , OR 0 = ( n + 1) p and ( n + 1) p − 1
C A CA NISHANT KUMAR 25
5. The variance of the binomial distribution is given by 2 = npq .
6. If p = q = 0.5, variance is the maximum, and is given by .
n
A R
U M
4
7. Standard Deviation of a binomial distribution is given by = npq .
T K
8. Let x and y be two independent binomial distributions where x has the parameters
n1 and p, and y has the parameters n2 and p. Then ( x + y ) will be a binomial
N
distribution with parameters ( n1 + n2 ) and p.
A
IS H
A N
C CA NISHANT KUMAR 26
Topic 2 – Poisson Distribution
e− m m x
A R
1. P ( x ) =
x!
, for x = 0, 1, 2, 3, …, n
U M
2. The mean of Poisson distribution is given by m, i.e., = m = np.
T K
3. The variance of Poisson distribution is given by 2 = m = np.
4. The standard deviation of Poisson distribution is given by = m = np .
A N
5. Mode of a Poisson Distribution is given by largest integer contained in m, OR m and
m–1
IS H
6. Let x and y be two independent poisson distributions where x has the parameter m1 ,
and y has the parameter m2 . Then ( x + y ) will be a poisson distribution with
N
parameter ( m1 + m2 ) .
A
C CA NISHANT KUMAR 27
Topic 3 – Normal Distribution
−( x − )2
A R
M
1 2 2
1. P ( x ) = f ( x ) =
, for –∞ < x < ∞
U
.e
2
It is given by z =
x−
. K
2. If μ = 0, and σ = 1, the variable is known as the standard normal variate (or variable).
T
A N
3. Relationship between MD, SD, and QD → 4SD = 5MD = 6QD
4. Mean Deviation = 0.8σ.
IS H
5. Quartile Deviation = 0.675σ.
6. Q1 = − 0.675
A N
7. Q3 = + 0.675
8. Median − Q1 = Q3 − Median.
C CA NISHANT KUMAR 28
9. Points of inflexion are given by x = − and x = +
10. If there are two Independent Normal Distributions x ~ N 1 , 12 (
A R) and
M
y ~ N ( 2 , 2 2 ) , then z = x + y follows normal distribution with mean ( 1 + 2 ) and
U
SD = + 2 respectively.
1
2 2
T K
A N
IS H
A N
C CA NISHANT KUMAR 29
Chapter 17 – Correlation and
A R
Regression U M
T K
A N
IS H
A N
C CA NISHANT KUMAR 30
Topic 1 – Karl Pearson’s Product Moment Correlation
Coefficient A R
1. r = rxy =
Cov ( x, y ) Cov ( x, y )
=
U M
Sx S y
a. Cov ( x, y ) =
T K
x y
( x − x )( y − y ) = xy − x . y
A
( x − x )N 2
n
x
n
H
2
b. S x = = − x2
IS
n n
( y − y ) y
N
2 2
c. S y = = − y2
C A n n
CA NISHANT KUMAR 31
2. r =
n xy − x y
n x 2 − ( x )
2
n y 2 − ( y )
2
A R
U M
3. Let there be two variables x and y. Let the correlation coefficient between them be
T K
rxy . Now, if they are changed to another set of variables, say, u and v, then,
ruv = rxy , if b and d have the same sign, or
Here,
A N
ruv = −rxy , if b and d have opposite signs.
b=
IS H
−Coefficient of u
Coefficient of x
, and d =
−Coefficient of v
Coefficient of y
A N
4. rxy =
bd
b d
.ruv
C CA NISHANT KUMAR 32
Topic 2 – Spearman’s Rank Correlation Coefficient
6 d 2
A R
1. r = 1 −
R
n ( n 2 − 1)
U M
T K
6 d +
2 ( tj −tj )
12
3
A N
2. When ranks repeat, then rR = 1 −
n ( n 2 − 1)
IS H
A N
C CA NISHANT KUMAR 33
Topic 3 – Coefficient of Concurrent Deviations
( 2c − m ) A R
1. If ( 2c − m ) 0 , then rc =
m
U M
K
2. If ( 2c − m ) 0 , then rc = − −
T
( 2c − m )
m
A N
IS H
A N
C CA NISHANT KUMAR 34
Topic 4 – Regression
1. if y depends on x, then the regression line of y on x is given by:
A R
a. either y = a + bx , where,
n xy − ( x )( y )
U M Cov ( x, y )
i. b = byx =
T K
n x − ( x )
2 2
, or, byx = r y , or, byx =
x ( x )
2
, and
N
ii. a = a yx = y − ( x byx )
A
H
byx is known as the regression coefficient.
b. or, ( y − y ) = byx ( x − x )
N IS
2. if x depends on y, then the regression line of x on y is given by:
a. either x = a + by , where,
C A CA NISHANT KUMAR 35
i. b = bxy =
n xy − ( x )( y )
n y 2 − ( y )
2
, or, bxy = r
x
y
, or, bxy =
Cov ( x, y )
( )
2
A R
, and
T
or, ( x − x ) = bxy ( y − y )K
bxy is known as the regression coefficient.
3. bvu =
Scale of v
A
byx N
H
Scale of u
IS
4. r = byx bxy
A N
C CA NISHANT KUMAR 36
Topic 5 – Probable Error and Standard Error
1. Probable Error (P.E.) is given by
A R
a. P.E. = 0.674
1− r2
, or
U M
b. P.E. = 0.6745
N
T K
1− r2
, or
N
N
c. P.E. = 0.675
HA
1− r2
N
IS
2. Limits of the correlation coefficient of the population is given by p = r P.E.
1− r2
A N
3. Standard Error (S.E.) is given by S .E. =
N
C CA NISHANT KUMAR 37
2
4. PE = SE
3
A R
U M
T K
A N
IS H
A N
C CA NISHANT KUMAR 38
Topic 6 – Coefficient of Determination and Non-
Determination A R
U M
1. Coefficient of Determination (Also known as “Percentage of Variation Accounted
( )
for”) r 2 =
T K
Explained Variance
Total Variance
A N
2. Coefficient of Non-Determination (Also known as “Percentage of Variation
Unaccounted for”) = 1 − r 2
IS H
A N
C CA NISHANT KUMAR 39
Chapter 18 – Index Numbers
A R
Topic 1 – Price Relative
U M
1. Price Relative =
Pn
P0
100.
T K
A N
2. Simple Aggregative Price Index =
P 100
P
n
IS H 0
P1
P 100
A N
3. Simple Average of Price Relatives = 0
N
C CA NISHANT KUMAR 40
4. 𝑊𝑒𝑖𝑔ℎ𝑡𝑒𝑑 𝐴𝑣𝑒𝑟𝑎𝑔𝑒 𝐺𝑒𝑛𝑒𝑟𝑎𝑙 𝐼𝑛𝑑𝑒𝑥 =
𝑆𝑢𝑚 𝑜𝑓 𝑃𝑟𝑜𝑑𝑢𝑐𝑡𝑠
𝑆𝑢𝑚 𝑜𝑓 𝑊𝑒𝑖𝑔ℎ𝑡𝑠
A R
U M
T K
A N
IS H
A N
C CA NISHANT KUMAR 41
Topic 2 – Weighted Aggregative Index
PnQ0
A R
1. Laspeyres Index =
0 0
P Q
100
U M
2. Passche's Index =
PnQn
0 n
P Q
T K
100
A N
3. Marshall-Edgeworth Index =
Pn ( Q0 + Qn )
0 0 n
P ( Q + Q )
100
4. Bowley’s Index =
IS H
Laspeyres' + Paasche's
2
A N
5. Fisher's Index =
PnQ0 PnQn 100
P0Q0 P0Qn
C CA NISHANT KUMAR 42
A R
U M
T K
A N
IS H
A N
C CA NISHANT KUMAR 43
Topic 3 – Weighted Average of Price Relatives
Pn
A R
P ( P0Q0 )
1. Weighted Average = 0
P0Q0
100
U M
T K
A N
IS H
A N
C CA NISHANT KUMAR 44
Topic 4 – The Chain Index Numbers
A R
𝐿𝑖𝑛𝑘 𝑅𝑒𝑙𝑎𝑡𝑖𝑣𝑒 𝑜𝑓 𝑡ℎ𝑒 𝐶𝑢𝑟𝑟𝑒𝑛𝑡 𝑌𝑒𝑎𝑟 × 𝐶ℎ𝑎𝑖𝑛 𝐼𝑛𝑑𝑒𝑥 𝑜𝑓 𝑡ℎ𝑒 𝑃𝑟𝑒𝑣𝑖𝑜𝑢𝑠 𝑌𝑒𝑎𝑟
M
1. 𝐶ℎ𝑎𝑖𝑛 𝐼𝑛𝑑𝑒𝑥 = 100
K U
NT
HA
N IS
C A CA NISHANT KUMAR 45
Topic 5 – Quantity Index Numbers
Qn
A R
1. Simple Aggregate of Quantities =
0
Q
100
Qn U M
T K
2. Simple Average of Quantity Relatives = 0 100
Q
3. Laspeyre’s Index =
ANQP
100
n
H
n 0
Q P
IS
0 0
QP
N
6. Paasche’s Index = 100
n n
Q P
A
0 n
C CA NISHANT KUMAR 46
7. Fisher’s Index =
Q P Q P
n 0
Q P Q P
n n
100
A R
M
0 0 0 n
Qn
K
8. Base-year weighted average of quantity relatives = 0 U
Q ( P0Q0 )
P0Q0
100
NT
HA
N IS
C A CA NISHANT KUMAR 47
Topic 6 – Value Index Numbers
Vn PnQn
A R
Value Index = =
0 P0Q0
V
U M
T K
A N
IS H
A N
C CA NISHANT KUMAR 48
Topic 7 – Deflating Time Series Using Index Numbers
𝐶𝑢𝑟𝑟𝑒𝑛𝑡 𝑉𝑎𝑙𝑢𝑒
A R
M
1. 𝐷𝑒𝑓𝑙𝑎𝑡𝑒𝑑 𝑉𝑎𝑙𝑢𝑒 = 𝑃𝑟𝑖𝑐𝑒 𝐼𝑛𝑑𝑒𝑥 𝑜𝑓 𝑡ℎ𝑒 𝐶𝑢𝑟𝑟𝑒𝑛𝑡 𝑌𝑒𝑎𝑟
2. 𝐶𝑢𝑟𝑟𝑒𝑛𝑡 𝑉𝑎𝑙𝑢𝑒 =
3. 𝑅𝑒𝑎𝑙 𝑊𝑎𝑔𝑒𝑠 =
𝐵𝑎𝑠𝑒 𝑃𝑟𝑖𝑐𝑒 (𝑃0 )
K
𝐶𝑢𝑟𝑟𝑒𝑛𝑡 𝑃𝑟𝑖𝑐𝑒 (𝑃𝑛 )
𝐴𝑐𝑡𝑢𝑎𝑙 𝑊𝑎𝑔𝑒𝑠
× 100
U
T
𝐶𝑜𝑠𝑡 𝑜𝑓 𝐿𝑖𝑣𝑖𝑛𝑔 𝐼𝑛𝑑𝑒𝑥
N
HA
N IS
C A CA NISHANT KUMAR 49
Topic 8 – Shifting and Splicing of Index Numbers
𝑂𝑟𝑖𝑔𝑖𝑛𝑎𝑙 𝑃𝑟𝑖𝑐𝑒 𝐼𝑛𝑑𝑒𝑥
A R
M
1. 𝑆ℎ𝑖𝑓𝑡𝑒𝑑 𝑃𝑟𝑖𝑐𝑒 𝐼𝑛𝑑𝑒𝑥 = × 100
𝑃𝑟𝑖𝑐𝑒 𝐼𝑛𝑑𝑒𝑥 𝑜𝑓 𝑡ℎ𝑒 𝑌𝑒𝑎𝑟 𝑜𝑛 𝑤ℎ𝑖𝑐ℎ 𝑖𝑡 ℎ𝑎𝑠 𝑡𝑜 𝑏𝑒 𝑠ℎ𝑖𝑓𝑡𝑒𝑑
K U
NT
HA
N IS
C A CA NISHANT KUMAR 50
Topic 9 – Test of Adequacy
1. Time Reversal Test P01 P10 = 1
A R
2. Factor Reversal Test P01 Q01 = V01
3. Circular Test P01 P12 P20 = 1
U M
T K
A N
IS H
A N
C CA NISHANT KUMAR 51