0% found this document useful (0 votes)
19 views

Formula Sheet Final Exam

Uploaded by

anushad.free
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
19 views

Formula Sheet Final Exam

Uploaded by

anushad.free
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

Stat-S 301 Formula Sheet

Excel Commands
=AVERAGE(number1, [number2], ...)
Returns the average (arithmetic mean) of its arguments, which can be numbers or names, arrays, or
references that contain numbers
=BINOM.DIST(number s, trials, probability s, cumulative)
Returns the individual term binomial distribution probability
=BINOM.DIST.RANGE(trials, probability s, number s, [number s2])
Returns the probability of a trial result using a binomial distribution
=CHISQ.DIST(x,deg freedom,cumulative)
Returns the chi-squared distribution
=CHISQ.DIST.RT(x,deg freedom)
Returns the right-tailed probability of the chi-squared distribution
=CHISQ.INV(probability,deg freedom)
Returns the inverse of the left-tailed probability of the chi-squared distribution
=CHISQ.INV.RT(probability,deg freedom)
Returns the inverse of the right-tailed probability of the chi-squared distribution
=CHISQ.TEST(actual range,expected range)
Returns the test for independence
=COMBIN(number, number chosen)
Returns the number of combinations for a given number of items
=CORREL(array1, array2)
Returns the correlation coefficient between two data sets
=COUNTIF(range, criteria)
Counts the number of cells within a range that meet the given condition
=COUNTIFS(criteria range1, criteria1, [criteria range2, criteria2]. . . )
Applies criteria to cells across multiple ranges and counts the number of times all criteria are met.
=COVARIANCE.S(array1, array2)
Returns sample covariance, the average of the products of deviations for each data point pair in two data
sets
=FREQUENCY(data array, bins array)
Calculates how often values occur within a range of values and then returns a vertical array of numbers
having one more element than bins array
=INTERCEPT(data y, data x)
Calculates the point at which a line will intersect the y-axis by using a best-fit regression line plotted
through the known x-values and the y-values
=MAX(number1, [number2], ...)
Returns the largest value in a set of values. Ignores logical values and text

1
=MEDIAN(number1, [number2], ...)
Returns the median, or the number in the middle of the set of given numbers
=MIN(number1, [number2], ...)
Returns the smallest value in a set of values. Ignores logical values and text
=MODE.SNGL(number1, [number2], ...)
Returns the frequently occurring, or repetitive, value in an array or range of data
=NORM.DIST(x, mean, standard dev, cumulative)
Returns the normal distribution for the specified mean and standard deviation
=NORM.INV(probability, mean, standard dev)
Returns the inverse of the normal cumulative distribution for the specified mean and standard deviation
=NORM.S.DIST(z, cumulative)
Returns the standard normal distribution (has a mean of zero and a standard deviation of one)
=NORM.S.INV(probability)
Returns the inverse of the standard normal cumulative distribution (has a mean of zero and a standard
deviation of one)
=PERCENTILE.INC(array,k)
Returns the k-th percentile of values in a range, where k is in the range 0..1, inclusive.
=PERCENTRANK.INC(array,x,[significance])
Returns the rank of a value in a data set as a percentage (0..1, inclusive) of the data set.
=PERMUT(number, number chosen)
Returns the number of permutations for a given number of objects that can be selected from number
objects.
=QUARTILE.INC(array, quart)
Returns the quartile of the data set, based on percentile values from 0..1, inclusive.
=SLOPE(data y, data x)
Returns the slope of the linear regression line through the given data points
=SQRT(number)
Returns the square root of a number
=STANDARDIZE(x, mean, standard dev)
Returns a normalized value from a distribution characterized by a mean and standard deviation
=STDEV.S(number1, [number2], ...)
Estimates standard deviation based on a sample (ignores logical values and text in the sample)
=SUM(number1, [number2], ...)
Adds all the numbers in a range of cells
=SUMPRODUCT(array1, [array2], [array3], ...)
Returns the sum of the products of corresponding ranges or arrays
=T.DIST(x,deg freedom, cumulative)
Returns the Student’s left-tailed t-distribution
=T.DIST.2T(x,deg freedom)
Returns the two-tailed Student’s t-distribution.
=T.DIST.RT(x,deg freedom)
Returns the right-tailed Student’s t-distribution.

2
=T.INV(probability,deg freedom)
Returns the left-tailed inverse of the Student’s t-distribution.
=T.INV.2T(probability,deg freedom)
Returns the two-tailed inverse of the Student’s t-distribution.
=T.TEST(array1,array2,tails,type)
Returns the probability associated with a Student’s t-Test
=VAR.S(number1, [number2], ...)
Estimates variance based on a sample (ignores logical values and text in the sample)

Mathematical Formulas
This is just a list of formulas. You have to choose which one is appropriate for any given question.

Description Formula Description Formula


Pn Pn
xi w i xi
Average x̄ = i=1 Weighted x̄ = Pi=1
n
n i=1 wi
Average

Pn
2 i=1(xi − x̄)2
Sample Variance s = Sample Standard s= s2
n−1
Deviation

s σ x − x̄ x−µ
Coefficient of or Z-Score or
x̄ µ s σ
Variation

# outcomes in A
Equally Likely P (A) = Law of Total 1 = P (A) + P (A′ )
Total # outcomes
Outcomes Probability

P (A ∩ B)
Unions P (A ∪ B) = P (A) + P (B) Conditional P (A|B) =
P (B)
−P (A ∩ B) Probability

Independence P (A|B) = P (A) Independence P (A ∩ B) = P (A)P (B)

Expected Value of n
Variance of a n
a Probability Probability
X X
2
E(X) = µ = xi P (xi ) σ = (xi − µ)2 P (xi )
Distribution i=1
Distribution i=1

n!
Combinations n Cx = Binomial P (x, n) = n Cx px (1 − p)n−x
x!(n − x)!
Probabilities

Binomial Mean µ = np Binomial Variance σ 2 = npq

σ x̄ − µ
Standard Error S.E.(x̄) = σx̄ = √ Central Limit Z= √ under conditions
n σ/ n
of the Mean Theorem

3
Description Formula Description Formula

x p̄ − p0
Proportion p̄ = Test Statistic Zp = r
n p0 (1 − p0 )
r r n
p̄(1 − p̄) p̄(1 − p̄)
Lower Confidence p̄ − zα/2 Upper Confidence p̄ + zα/2
n n
Limit Limit
Pn
i=1 xi x̄ − µ0
Mean x̄ = Test Statistic t= √ d.f. = n − 1
n s/ n
s s
Lower Confidence x̄ − tα/2,n−1 √ Upper Confidence x̄ + tα/2,n−1 √
n n
Limit Limit

Two Dependent Dependent √


Samples Test d¯ − µd0 Samples S.E.(x̄1 − x̄2 ) = sd / n
t= √ d.f. = n − 1
Statistic sd / n Standard Error

x̄1 − x̄2 − (µ1 − µ2 )H0 (n1 − 1)s21 + (n2 − 1)s22


Independent t= Pooled Variance s2p =
S.E.(x̄1 − x̄2 ) n1 + n2 − 2
Samples Test
r
1 1
Pooled Variance S.E.(x̄1 − x̄2 ) = sp + Pooled Variance d.f. = n1 + n2 − 2
n1 n2
Standard Error Degrees of Free-
dom
s  2 2
Unequal Variance s21 s2 Unequal Variance s1 s22
S.E.(x̄1 − x̄2 ) = + 2 +
Standard Error n1 n2 Degrees of Free- n1 n2
d.f. =  2 2  2 2
dom s1 s2
n1 n2
+
n1 − 1 n2 − 1

Lower Confidence x̄1 − x̄2 − tα/2,df S.E.(x̄1 − x̄2 ) Upper Confidence x̄1 − x̄2 + tα/2,df S.E.(x̄1 − x̄2 )
Limit Limit

4
Description Formula Description Formula

(Row Total)(Column Total) X (Observed − Expected)2


Expected Counts Chi Square
Table Total Statistic, χ2 Expected

Degrees of (Number of Rows -1)×


Freedom (Number of Columns -1)

Regression yi = b0 + b1 xi + ϵi Prediction ŷ = b0 + b1 x
Xn
Residual ei = yi − ŷi Sums of Squares SSR = (ŷi − ȳ)2
due to Regression i=1

n
X n
X
2
Sums of Squares SSE = (yi − ŷi ) Sums of Squares SST = (yi − ȳ)2
due to Error i=1 Total i=1

SSR
Sums of Squares SST = SSR + SSE Coefficient of R2 = r 2 =
SST
Formula Determination
r
SSE
Standard Error se = Confidence Inter- CI : b1 ± tα/2 sb1
n−2
val for the Slope

You might also like