STAT2225 Module 6. Intro To Inferential Statistics
STAT2225 Module 6. Intro To Inferential Statistics
Inferential Statistics
Jomel R. Alanzalon
STAT 2225: Elementary Statistics & Probability
2nd Semester, A.Y. 2023-2024
2. Interval Estimation
• we are producing interval or range of values that is likely to
contain the true value of the parameter.
Point Estimator
• the rule or formula that describes the calculation of a single
value estimate
Point Estimate
• the calculated single value used to estimate
ഥ = 𝟐𝟓. 𝟖
𝑿
Point Estimator Point Estimate
Interval Estimator
• the rule or formula that describes the calculation of interval
of values
Interval Estimate
• the calculated interval or range of values used to estimate
• also known as confidence interval (CI)
1
Introduction to Inferential Statistics |
0
DEPARTMENT of
STATISTICS Interval Estimation
Important Concepts in Interval Estimation
Confidence Interval (CI)
• range/interval of values used to estimate the true value of a
population parameter
• It gives us a much better sense of how good an estimate is
Confidence Level (𝟏 − 𝜶)
• probability or percentage that the confidence interval actually does
contain the population parameter
• also known as degree of confidence or confidence coefficient
• 𝛼 is the significance level
1
Introduction to Inferential Statistics |
1
DEPARTMENT of
STATISTICS Interval Estimation
Commonly used confidence level and its level of significance
Confidence Level Significance Level
𝟏−𝜶 𝜶
90% 10%
95% 5%
99% 1%
1
Introduction to Inferential Statistics |
2
DEPARTMENT of
STATISTICS Interval Estimation
Important Concepts in Interval Estimation
Critical Value
• number on the borderline separating sample statistics that are likely
to occur from those that are unlikely to occur
• In interval estimation the critical value can be find using
- Z-distribution (positive 𝑍𝛼 ) if 𝝈 is known
2
1
Introduction to Inferential Statistics |
4
DEPARTMENT of
STATISTICS
𝜶
Student t-distribution
- t-table
- used when 𝝈 is unknown
or not given
Example:
𝒅𝒇
𝒕𝟎.𝟎𝟓, 𝟏𝟖 = 𝟏.
𝟏. 𝟕𝟑𝟒
𝟕𝟑𝟒
1
Introduction to Inferential Statistics |
5
DEPARTMENT of
STATISTICS Interval Estimation
Important Concepts in Interval Estimation
Margin of Error (E)
• maximum error of the estimate; maximum likely difference between the
point estimate of a parameter and the actual value of the parameter
• computed as the product of critical value and standard error
1
Introduction to Inferential Statistics |
6
DEPARTMENT of
STATISTICS Interval Estimation for Pop’n Mean 𝝁
Confidence Interval for the Population Mean 𝝁
ഥ−𝑬<𝝁<𝑿
𝑿 ഥ+𝑬
lower limit of the interval upper limit of the interval
Requirements/Assumptions
a. The sample is a random sample.
b. The population is normally distributed or the sample size is large
(𝑛 > 30).
1
Introduction to Inferential Statistics |
7
DEPARTMENT of
STATISTICS Interval Estimation for Pop’n Mean 𝝁
Confidence Interval for the Population Mean 𝝁
Steps in Constructing CI for the Population Mean 𝝁
1. Verify if the requirements are met.
2. Evaluate the margin of error (𝐸).
a. Case 1: When 𝜎 is known
𝝈
𝑬 = 𝒁𝜶 ⋅
𝟐 𝒏
b. Case 2: When 𝜎 is unknown
𝒔
𝑬 = 𝒕𝜶, 𝒏−𝟏 ⋅
𝟐 𝒏
ഥ−𝑬<𝝁<𝑿
3. Solve for the limits: 𝑿 ഥ+𝑬
1
Introduction to Inferential Statistics |
8
DEPARTMENT of
STATISTICS Interval Estimation for Pop’n Mean 𝝁
Confidence Interval for the Population Mean 𝝁
4. Round-off the confidence interval limits.
• When the given is the original set of data, round the limits to one
more decimal place than used in data.
• When the given is only the summary statistics (mean and standard
deviation), round the limits to the same number of decimal places
used for the sample mean.
5. Interpret the confidence interval.
“We are (𝟏 − 𝜶)% confident that the interval from 𝑿 ഥ − 𝑬 to 𝑿 ഥ + 𝑬 actually
does contain the true value of the statistic.”
1
Introduction to Inferential Statistics |
9
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
Example 1.
A random sample of 100 fish caught at Taal Lake has a mean
length of 35.5 cm. Assuming that it is known that the lengths of
the population of fish in Taal Lake follows a normal distribution
with a population standard deviation of 5 cm, construct a 95%
confidence interval for the mean length of all fish in Taal Lake.
Solution:
Given: 𝑛 = 100, 𝑋ത = 35.5, 𝜎=5
1 − 𝛼 = 0.95 → 𝛼 = 0.05
2
Introduction to Inferential Statistics |
0
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
We are 95% confident that the interval from 34.5 cm to 36.5 cm actually
does contain the true value of the mean length of all fish in Taal lake.
2
Introduction to Inferential Statistics |
2
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
Example 2.
A random sample of 100 fish caught at Taal Lake has a mean
length of 35.5 cm. Assuming that it is known that the lengths of
the population of fish in Taal Lake follows a normal distribution
with a population standard deviation of 5 cm, construct a 99%
confidence interval for the mean length of all fish in Taal Lake.
Solution:
Given: 𝑛 = 100, 𝑋ത = 35.5, 𝜎=5
1 − 𝛼 = 0.99 → 𝛼 = 0.01
2
Introduction to Inferential Statistics |
3
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
Note: As the confidence level increases, the confidence interval goes wider.
2
Introduction to Inferential Statistics |
5
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
Example 3.
A study found that the body temperatures of healthy adults are
normally distributed. A random sample of 15 adults have a
sample mean of 98.20 degrees Fahrenheit and the sample
standard deviation was 0.62 degrees Fahrenheit. At 95% level of
confidence, construct a CI for the population mean of all body
temperatures of healthy adults.
Solution:
Given: 𝑛 = 15, 𝑋ത = 98.20, 𝑠 = 0.62
1 − 𝛼 = 0.95 → 𝛼 = 0.05
2
Introduction to Inferential Statistics |
6
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
𝑠
Since 𝜎 is unknown, we will use Case 𝐸 = 𝑡𝛼 , 𝑛−1 ⋅
2 𝑛
2 to calculate the margin of error. 0.62
= 𝑡0.05 ⋅
, 15−1 15
2
0.62
= 𝑡0.025, 14 ⋅
15
2
Introduction to Inferential Statistics |
7
DEPARTMENT of
STATISTICS
Standard t-distribution
- t-table
- used when 𝝈 is unknown
or not given
Critical Value
𝒕𝟎.𝟎𝟐𝟓, 𝟏𝟒 = 𝟐.
𝟏. 𝟏𝟒𝟓
𝟕𝟑𝟒
2
Introduction to Inferential Statistics |
8
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
𝑠
Since 𝜎 is unknown, we will use Case 𝐸 = 𝑡𝛼 , 𝑛−1 ⋅
2 𝑛
2 to calculate the margin of error. 0.62
= 𝑡0.05 ⋅
, 15−1 15
2
0.62
= 𝑡0.025, 14 ⋅
15
0.62
= 2.145 ⋅
15
𝐸 = 0.3434
2
Introduction to Inferential Statistics |
9
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
We are 95% confident that the interval from 97.86⁰F to 98.54⁰F actually
does contain the true value of the mean body temperature of healthy
adults.
3
Introduction to Inferential Statistics |
0
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
Example 4.
The football coach randomly selected nine players and timed how long
each player took to perform a certain drill. Assume that the data are
normally distributed. Times (in minutes) were as follows:
3
Introduction to Inferential Statistics |
1
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
ഥ , thus
The point estimator of the population mean 𝜇 is the sample mean 𝑿
3
Introduction to Inferential Statistics |
2
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
3
Introduction to Inferential Statistics |
3
DEPARTMENT of
STATISTICS
Standard t-distribution
- t-table
- used when 𝝈 is unknown
or not given
Critical Value
𝒕𝟎.𝟎𝟓, 𝟖 = 𝟏.𝟏.𝟕𝟑𝟒
𝟖𝟔𝟎
3
Introduction to Inferential Statistics |
4
Interval Estimation for Pop’n Mean 𝝁
DEPARTMENT of
STATISTICS
We are 90% confident that the interval from 7.28 minutes to 17.24
minutes actually does contain the true value of the mean length of time
a certain drill can be performed.
3
Introduction to Inferential Statistics |
6
DEPARTMENT of
STATISTICS Estimation of the Population Proportion
Approximating the Population Proportion 𝒑
• Point Estimate: Sample Proportion 𝒑
ෝ
𝒙
ෝ=
𝒑
𝒏
where: 𝑥 = number of sample units that possess the characteristics
of interest
𝑛 = sample size
• Moreover, 𝒒 ෝ is the proportion that do not possess the characteristics of
interest, where 𝒒ෝ=𝟏−𝒑 ෝ.
ෝ𝒒
𝒑 ෝ
• The variance of population proportion 𝑝 is
𝒏
3
Introduction to Inferential Statistics |
7
DEPARTMENT of
STATISTICS Estimation of the Population Proportion
Example 5. A survey was conducted from a certain hospital. The results
shows that out of 1,500 randomly selected operations, 1,432 are successful.
Find the point estimate for the proportion of successful operations in the
hospital.
Solution:
The characteristic of interest here is the successful operations, hence
𝑥 1432
𝑝Ƹ = = = 𝟎. 𝟗𝟓 or 𝟗𝟓%
𝑛 1500
In addition, we can also compute for 𝑞, ො which is the sample proportion of
unsuccessful operations that is
𝑞ො = 1 − 𝑝Ƹ = 1 − 0.95 = 0.05 or 5%
3
Introduction to Inferential Statistics |
8
DEPARTMENT of
STATISTICS Interval Estimation for Pop’n Proportion 𝒑
Confidence Interval for the Population Proportion 𝒑
ෝ−𝑬<𝒑<𝒑
𝒑 ෝ+𝑬
ෝ𝒒
𝒑 ෝ
𝑬 = 𝒁𝜶
𝟐 𝒏
2. The confidence interval can only be used if we can assume the sample
proportions follow a normal distribution.
𝒏ෝ
𝒑 ≥ 𝟓 and 𝒏ෝ
𝒒≥𝟓
We are 90% confident that the interval from 21.15% to 24.85% actually
does contain the true proportion of students who paid for their
education in student loans.
We are 95% confident that the interval from 80.79% to 87.21% actually
does contain the true proportion of adult residents who have cellphones.
4
Introduction to Inferential Statistics |
5
DEPARTMENT of
STATISTICS
Interval Estimation for 𝝈𝟐 and 𝝈
CI for the Population Variance and Standard Deviation
Formula for the Confidence Interval for a Variance:
𝒏 − 𝟏 𝒔𝟐 𝒏 − 𝟏 𝒔𝟐
< 𝝈𝟐 <
𝝌𝟐𝜶 𝝌𝟐 𝜶
𝟐 𝟏− 𝟐
𝒏 − 𝟏 𝒔𝟐 𝒏 − 𝟏 𝒔𝟐
<𝝈<
𝝌𝟐𝜶 𝝌𝟐 𝜶
𝟏−
𝟐 𝟐
Example:
Answer: 24.996
2
The critical value for 𝜒1− 𝛼 with df=19 is
2
2
𝜒1− 𝛼 = 𝜒 2 0.05 2
= 𝜒0.975 = 8.9066
2 1−
2
𝑛−1 𝑠2 𝑛−1 𝑠2
2
𝜒𝛼
<𝜎< 𝜒2 𝛼
2 1− 2
𝟏. 𝟐𝟏 < 𝝈 < 𝟐. 𝟑𝟒
Therefore, the 95% confidence interval for the
population standard deviation for the nicotine
content is between 1.21 mg to 2.34 mg.
Solution:
Given: 𝛼 = 1 − 0.99 = 0.01 → Z0.01Τ2 = 𝑍0.005 = 2.575, σ = 4.33, E = 2
𝑍𝛼ൗ ∙𝜎 2 2.575 4.33 2
2
𝑛= = = 31.08 ≈ 𝟑𝟐
𝐸 2
Therefore, to be 99% confident that the estimate is within 2 feet of the true
mean depth, the scientist needs a sample of at least 32 measurements.
Note: If there is any fraction or decimal portion in the answer, use the next
whole number for sample size 𝑛.
Introduction to Inferential Statistics | 56
DEPARTMENT of
STATISTICS
Sample Size Determination (Proportion)
Example 10. A researcher wishes to estimate, with 95% confidence, the
proportion of people who own a home computer. A previous study shows
that 40% of those interviewed had a computer at home. The researcher
wishes to be accurate within 2% of the true proportion. Find the minimum
sample size necessary.
Solution: Given: 𝛼 = 0.05 → 𝑍0.05Τ2 = 1.96, 𝑝Ƹ = 0.40 , 𝑞ො = 0.60 and
𝐸 = 2% = 0.02
𝑍𝛼ൗ 2 1.96
2
2
𝑛 = 𝑝Ƹ 𝑞ො = (0.40)(0.60) = 2304.96 ≈ 𝟐𝟑𝟎𝟓
𝐸 0.02
Therefore, to be 95% confident that the estimate is within 2% of the true
proportion, the researcher must interview 2305 people.