Statistics Formula
Statistics Formula
This web page presents statistics formulas described in the Stat Trek tutorials. Each formula links to a
web page that explains how to use the formula.
Parameters
Population mean = = ( Xi ) / N
Population variance = 2 = ( Xi - )2 / N
Standardized score = Z = (X - ) /
Statistics
Unless otherwise noted, these formulas assume simple random sampling.
Sample mean = x = ( xi ) / n
Sample variance = s2 = ( xi - x )2 / ( n - 1 )
Pooled sample standard deviation = sp = sqrt [ (n1 - 1) * s12 + (n2 - 1) * s22 ] / (n1 + n2 - 2) ]
Correlation
Standard error of regression slope = s b1 = sqrt [ (yi - i)2 / (n - 2) ] / sqrt [ (xi - x)2 ]
Counting
Probability
Random Variables
In the following formulas, X and Y are random variables, and a and b are constants.
Chi-square statistic = 2 = [ ( n - 1 ) * s2 ] / 2
Variance of the difference between independent random variables = Var(X - Y) = Var(X) + Var(Y)
Sampling Distributions
Standard deviation of difference of sample means = d = sqrt[ (12 / n1) + (22 / n2) ]
Standard deviation of difference of sample proportions = d = sqrt{ [P1(1 - P1) / n1] + [P2(1 - P2) /
n2 ] }
Standard Error
Standard error of difference of sample means = SE d = sd = sqrt[ (s12 / n1) + (s22 / n2) ]
Pooled sample standard error = s pooled = sqrt [ (n1 - 1) * s12 + (n2 - 1) * s22 ] / (n1 + n2 - 2) ]
Standard error of difference of sample proportions = s d = sqrt{ [p1(1 - p1) / n1] + [p2(1 - p2) / n2] }
Cr-1 * Pr * (1 - P)x - r
x-1
N-k
Cn-x ] / [ NCn ]
Linear Transformations
For the following formulas, assume that Y is a linear transformation of the random variable X, defined by
the equation: Y = aX + b.
Standardized score = z = (x - x) / x.
t-score = t = (x - x) / [ s/sqrt(n) ].
Estimation
Hypothesis Testing
Degrees of Freedom
The correct formula for degrees of freedom (DF) depends on the situation (the nature of the test
statistic, the number of samples, underlying assumptions, etc.).
One-sample t-test: DF = n - 1
Two-sample t-test: DF = (s12/n1 + s22/n2)2 / { [ (s12 / n1)2 / (n1 - 1) ] + [ (s22 / n2)2 / (n2 - 1) ] }
Sample Size
Below, the first two formulas find the smallest sample sizes required to achieve a fixed margin of error,
using simple random sampling. The third formula assigns sample to strata, based on a proportionate
design. The fourth formula, Neyman allocation, uses stratified sampling to minimize variance, given a
fixed sample size. And the last formula, optimum allocation, uses stratified sampling to minimize
variance, given a fixed budget.