EXAM (1)
EXAM (1)
Problem 1:
Data Set:
Sorted
data: 5395,5538,5561,5612,5710,5816,6045,6106,6265,6320,6341,6632,6908,7220,7361,743953
95,5538,5561,5612,5710,5816,6045,6106,6265,6320,6341,6632,6908,7220,7361,7439
Step 2: Calculate the Five-Number Summary
• Minimum: 5395
• Maximum: 7439
Median (Q2): The median is the middle value of the dataset. Since there are 16
data points (an even number), the median is the average of the 8th and 9th
values:
!"#!$!%!&
Median = =6185.5
%
First Quartile (Q1): The median of the lower half of the data (first 8 values):
&!"%$&'"#
Q1= =5661
%
Third Quartile (Q3): The median of the upper half of the data (last 8 values):
!(%#$&'"#
Q3= =6330.5
%
So the five-number summary is:
• Minimum = 5395
• Q1 = 5661
• Median (Q2) = 6185.5
• Q3 = 6330.5
• Maximum = 7439
• Mean:
The mean is the sum of all the values divided by the number of data points:
Mean=
5395+5538+5561+5612+5710+5816+6045+6106+6265+6320+6341+6632+6908+7220+
7361+7439/16= 5946.94
• Mode:
Since no values repeat in the dataset, there is no mode.
• Range:
The range is the difference between the maximum and minimum values:
Range=7439−5395=2044
• Variance:
Variance is the average of the squared differences from the mean:
Variance=1/16∑!"
#$! ( × 𝑖 − 𝜇)^2
• Where xi are the data points and μ=5946.94. After calculating the squared differences and
averaging, we get:
Variance≈478501.12
• Standard Deviation:
The standard deviation is the square root of the variance:
Standard Deviation≈478501.12≈691.96
• Skewness:
The distribution is slightly skewed to the right (positively skewed) because the mean
(5946.94) is greater than the median (6185.5), indicating that there are a few higher
values that pull the mean to the right.
• Reasoning:
In a right-skewed distribution, the tail on the right side is longer or fatter than the left
side. In this case, the values towards the higher end (such as 7361, 7439) contribute to the
right skew.
To construct a 99% confidence interval for the population mean number of days the car model
sits on the dealership’s lot, we'll use the following information:
The formula for a confidence interval for the population mean when the sample size is less than
30 and we use the t-distribution is:
• xˉ=9.75
• t=2.724 (for 99% confidence and df = 35)
• s=2.39
• n=36
)
Standard Error = = 2.39/√6=0.3983
√+
Step 3: Calculate the Margin of Error
So:
Lower limit = 9.75−1.085=8.665
Final Answer:
The 99% confidence interval for the population mean number of days the car model sits on the
dealership’s lot is:
(8.665,10.835)
This means we are 99% confident that the true population mean for the number of days the car
model sits on the dealership’s lot falls between 8.665 and 10.835 days.
Given Information:
At α=0.10 the significance level is split between the two tails of the normal distribution
(α/2=0.05 in each tail). Using a Z-table or standard normal critical values:
𝐱¯#𝛍
Z =
𝛔/√𝐧
"(,'-&."&,#&#
Z=
%/'&/√&##
"(,'-&."2,#&# .%&&
Z= = ≈-1,98
"%/,!( "%/,!(
Step 4: Decision Rule
• Reject H0 if the test statistic falls outside the critical region (Z<−1.645 or Z>1.645).
• In this case, Z=−1.98, which is less than −1.645
At the α=0.10 significance level, there is sufficient evidence to reject the claim that the mean
annual cost of raising a child by married-couple families in the U.S. is $14,050. Instead, the
sample suggests that the actual mean cost differs from $14,050.
Problem 5:
Given Information:
• The data for vertical jump heights before and after training is as follows:
• Null Hypothesis (H0): The mean difference in jump heights is zero (μD=0).
There is no significant improvement in vertical jump heights.
• Alternative Hypothesis (Ha): The mean difference in jump heights is greater than zero
(μD>0).
The training shoes improve vertical jump heights.
tcritical=1.415
𝐃¯
t=
𝐬𝐃/√𝒏
Where:
∑4 %$($#$#.%$%$&$(
Dˉ= = =1.75
+ /
Step 3.2: Calculate sDsD (Standard Deviation of Differences)
∑(47.8¯)^%
sD=%
+."
= [0.25,1.25,−1.75,−0.75,−3.75,0.25,3.25,1.25]
(Di−Dˉ)^2=[0.0625,1.5625,3.0625,0.5625,14.0625,0.0625,10.5625,1.5625]
∑(Di − D¯)^2=31.4375
Calculate sD:
(".2('&
sD=√ ≈ √4,491 ≈ 2.12
/."
Step 3.3: Calculate the Test Statistic
8¯ ".'& ",'&
t= = = ≈ 2,34
𝐬𝐃/√𝒏 %,"%√/ #,'-2%
Step 4: Decision
Compare the test statistic t=2.34 with the critical value t critical=1.415:
Step 5: Interpretation
At the α=0.10significance level, there is enough evidence to support the claim that the training
shoes significantly increase athletes' vertical jump heights.
Problem:6
The data for hours studied (x) and test scores (y) is:
1. Plot hours studied (x) on the x-axis and test scores (y) on the y-axis.
2. Mark the data points (e.g., (0,40),(2,51),..).
3. Look for a trend.
) ∑ *+,∑ * ∑ +
R=
-() ∑ *^/,(∑ *)^/() ∑ + , ,(∑ +)^/)
9 97 81 9409 873
Step 2.2: Calculate r
!"($%&$)(()!)(*$*)
r=
+(!"(,%))(()!)! )(!"($-!-))((*$*)! )
$)$,
r=
√%%"%,""-
=0.968
Hypotheses:
∑ 0#5 ∑ /
• b=
-
67(8498)#(:6)(;8;) 8:8<
M= = ≈7.0
67(<4:)#(:6)^4 98=
;8;#(;.7)(:6)
B=
67
=39.0
Regression Equation:
y=7.0x+39.0
7. Chi-Square Test for Independence
Given Data:
Total
Gender Cup Cone Sundae Sandwich Other Contribu
tion
1069 × 978
𝟏𝟎𝟔𝟗×𝟔𝟖𝟖 𝟏𝟎𝟔𝟗×𝟑𝟒𝟎 𝟏𝟎𝟔𝟗×𝟖𝟖 𝟏𝟎𝟔𝟗×𝟏𝟎𝟑
𝟐𝟏𝟗𝟕 = = = =
Male =
𝟐𝟏𝟗𝟕 𝟐𝟏𝟗𝟕 𝟐𝟏𝟗𝟕 𝟐𝟏𝟗𝟕 1069
334.63 165.33 42.83 50.07
475.86
χ2= ∑ (Oij−Eij)2/Eij
Where Oijare the observed frequencies, and Eij are the expected
frequencies. Calculate for each cell:
Total
Gender Cup Cone Sundae Sandwich Other Contributio
n
Degrees of Freedom:
df=(Rows−1)(Columns−1)=(2−1)(5−1)=4
Critical Value:
Interpretation:
Given Data:
(1) Hypotheses
Mean Variance
Group Values n
(xˉ) (s2)
1. Between-Group Variance:
SSWithin=∑(ni − 1)s^2i
After calculations:
• SSBetween=0.0361
•
• SSWithin=0.0527
Where:
After calculations:
(3) Decision