Team Assignment 2
Hypothesis Testing
Team Members:
Asif Khan Mohammed – 2230254
Amer Uddin Afsar – 2208331
Sainty – 2210903
Krishna Moe Thu - 2130179
Parthasarthy PV – 2216761
The mean listing price of a home from the Surrey and Guilford area in 2022 is 1313551.067
1. b) Generate appropriate numerical summaries for the listing price of a home, the
sample standard deviation, etc.
Answer:
1. Mean = 1313551.067
Mean shows the average listing price of a home from the Surrey and Guilford area in 2022.
2. Median = 1348500
The median shows that half of the listing prices are below 1348500, and half are above
1348500. The median is often used to measure central tendency when the data is skewed or
has outliers, as it is less affected by extreme values than the mean.
3. Mode = 1450000
In this case, we can see that the value 1450000 appears three times, which is more than any
other value. Therefore, the mode of the listing prices is 1450000.
4. Standard Deviation = 164832.1338
This indicates that listing prices typically deviate from the mean of 1313551.067 by about
164832, or roughly 12.5% of the mean. The standard deviation gives an idea of the variability
in the listing prices and of how representative the mean is of the dataset.
5. Sample Variance = 27169632347
For the dataset of listing prices, the sample variance is approximately 27169632347, which is
the square of the sample standard deviation. It measures the average squared difference
between each listing price and the mean, so it is expressed in squared dollars; the standard
deviation is the easier of the two to interpret.
6. Skewness = -0.40315359
This shows that the distribution of listing prices is slightly skewed to the left, with a longer
tail on the left-hand side. A few lower listing prices pull the mean (1313551.067) below the
median (1348500).
7. Range = 450000
The range is the difference between the largest and smallest listing prices in the dataset. In
this case, the largest value is 1499900 and the smallest value is 1049900, so the range is
1499900 - 1049900 = 450000.
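The summaries above can be reproduced with a short script. The sketch below is illustrative only: the `sample` list holds hypothetical prices, not the team's 30 listings, and the skewness formula is the adjusted sample skewness that spreadsheet tools commonly report, which is an assumption about how the value above was obtained.

```python
import statistics

def summarize(prices):
    """Return the descriptive statistics discussed above for a list of prices."""
    n = len(prices)
    mean = statistics.mean(prices)
    s = statistics.stdev(prices)  # sample standard deviation (divides by n - 1)
    # Adjusted sample skewness; assumed to match the spreadsheet figure.
    skew = n / ((n - 1) * (n - 2)) * sum(((x - mean) / s) ** 3 for x in prices)
    return {
        "mean": mean,
        "median": statistics.median(prices),
        "mode": statistics.mode(prices),
        "stdev": s,
        "variance": statistics.variance(prices),  # sample variance
        "range": max(prices) - min(prices),
        "skewness": skew,
    }

# Hypothetical listing prices for illustration (NOT the assignment data).
sample = [1049900, 1250000, 1348500, 1450000, 1450000, 1499900]
for name, value in summarize(sample).items():
    print(f"{name}: {value:,.4f}")
```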
1. c) Use these summaries to develop a 95% confidence interval to estimate the mean listing
price of a house in your selected city. That is, you will be using the sampling data you collected
to estimate the population parameter (listing price). Make sure to show both your calculations
and a final statement of its meaning.
Answer:
$$\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}$$
Where: σ = Population standard deviation, estimated here by the sample standard deviation = 164832.13
n = Sample size = 30
By substituting the above values in the formula, we get the standard error of the mean:
σx̄ = 164832.13 / √30 = 30094.09264
If α = 0.05, then zα/2 = z0.025 = 1.96 is the value that encloses 95% of the area under the
normal distribution and leaves 2.5% in each tail.
The Margin of Error (ME) is the amount added to and subtracted from the point estimate to
form the confidence interval:
ME = zα/2 × σx̄ = 1.96 × 30094.09264 = 58984.42
Confidence interval = x̄ ± ME = 1313551.067 ± 58984.42
Conclusion:
Based on our sample mean of 1313551.067, we are 95% confident that the mean listing price
of a house in the Surrey and Guilford area is between 1254566.65 and 1372535.49.
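As a check on the arithmetic, the following minimal sketch computes the standard error, margin of error, and interval with z = 1.96, using the summary values reported in this section. The function name `z_interval` is hypothetical. Substituting a t critical value with 29 degrees of freedom (about 2.045) for 1.96 would give a wider margin of roughly 61,549.

```python
from math import sqrt
from statistics import NormalDist

def z_interval(sample_mean, sd, n, confidence=0.95):
    """Return (lower, upper, standard_error, margin_of_error) for a z-based interval."""
    standard_error = sd / sqrt(n)
    # Two-tailed critical value, rounded to 2 decimals to mirror a z-table (1.96 for 95%).
    z_crit = round(NormalDist().inv_cdf(1 - (1 - confidence) / 2), 2)
    margin = z_crit * standard_error
    return sample_mean - margin, sample_mean + margin, standard_error, margin

# Values taken from the numerical summaries in this report.
lower, upper, se, me = z_interval(1313551.067, 164832.1338, 30)
print(f"SE = {se:.2f}, ME = {me:.2f}, CI = ({lower:.2f}, {upper:.2f})")
```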
1. d) The mean selling price of a house in the sample area is said to be $1,400,000. To determine if
this is true in your sample data, conduct a hypothesis test using a critical value approach using
α = 0.05 level of significance. Make sure to state the null and alternative hypotheses. Write a
concluding statement about your results.
Answer:
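H0: μ = 1,400,000
H1: μ ≠ 1,400,000
Given, α = 0.05
$$z_{\bar{x}} = \frac{\bar{x} - \mu_{H_0}}{\sigma_{\bar{x}}}$$
Where: x̄ = Sample mean = 1313551.067
µH0 = The mean of the sampling distribution, which is assumed to be true for the null
hypothesis = 1,400,000
σx̄ = Standard error of the mean = 30094.09264
Calculating the test statistic from the above formula, we get zx̄ = -2.87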
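Step 5: Comparing the z-test statistic (zx̄) with the critical z-scores (±zα/2).
For a two-tailed test with α = 0.05, the critical z-scores are ±1.96. Because zx̄ = -2.87 is
less than -1.96, it falls in the rejection region, so we reject H0.
Conclusion:
According to the sample of 30 listing prices, there is enough evidence at the 0.05 level of
significance to reject the claim that the mean selling price of a house in the sample area is
$1,400,000.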
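The two-tailed critical value comparison can be sketched as follows. This is an illustration under the assumptions of this section (a z-test with the standard error of 30094.09264 computed earlier); the function name is hypothetical.

```python
from statistics import NormalDist

def two_tailed_z_test(sample_mean, mu_0, standard_error, alpha=0.05):
    """Return (z, critical value, reject H0?) for H0: mu = mu_0 vs H1: mu != mu_0."""
    z = (sample_mean - mu_0) / standard_error
    # Positive critical value that leaves alpha / 2 in each tail (about 1.96 for alpha = 0.05).
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    return z, z_crit, abs(z) > z_crit

z, z_crit, reject = two_tailed_z_test(1313551.067, 1_400_000, 30094.09264)
print(f"z = {z:.2f}, critical values = ±{z_crit:.2f}, reject H0: {reject}")
```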
Q2. Grace is the manager at a local restaurant and has implemented changes to her operation to
reduce the average wait time for a table between 6 pm and 8 pm on Saturday night to less than
20 minutes. A random sample of 45 wait times on Saturday night had a mean of 18.3 minutes.
Grace knows from past experience that the standard deviation of wait times is 5 minutes.
Conduct a hypothesis test using p-value and critical value approach to determine if Grace’s
changes have reduced average wait time to less than 20 minutes using α = 0.01
Answer:
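H0: μ ≥ 20
H1: μ < 20
Given, α = 0.01
Given: x̄ = 18.3 minutes, σ = 5 minutes, n = 45
$$\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}} = \frac{5}{\sqrt{45}} = 0.7454$$
$$z_{\bar{x}} = \frac{\bar{x} - \mu_{H_0}}{\sigma_{\bar{x}}}$$
µH0 = The mean of the sampling distribution, which is assumed to be true for the null
hypothesis = 20
Calculating the test statistic from the above formulae, we get zx̄ = (18.3 - 20) / 0.7454 = -2.28.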
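Step 5: Comparing the z-test statistic (zx̄) with the critical z-score (zα).
For a left-tailed test with α = 0.01, the critical z-score is zα = -2.33. Because zx̄ = -2.28 is
not less than zα = -2.33, it does not fall in the rejection region. So, do not reject H0.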
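[Figure: normal curve for the left-tailed test. x scale: μH0 = 20; z scale: test statistic zx̄ = -2.28 and critical z-score zα = -2.33, with the "Reject H0" region in the left tail.]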
Step 6: Stating the conclusion.
Conclusion:
According to the sample of 45 wait times on Saturday night, there is not enough
evidence at the 0.01 level of significance to support the claim that Grace's changes have
reduced the average wait time to less than 20 minutes.
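The critical value decision for this left-tailed test can be sketched as below, using the values given in the question. The function name is hypothetical, and the critical value is computed without rounding, so it prints as about -2.33.

```python
from math import sqrt
from statistics import NormalDist

def left_tailed_z_test(sample_mean, mu_0, sigma, n, alpha):
    """Return (z, critical z, reject H0?) for H0: mu >= mu_0 vs H1: mu < mu_0."""
    z = (sample_mean - mu_0) / (sigma / sqrt(n))
    # For a left-tailed test the critical z-score is negative.
    z_crit = NormalDist().inv_cdf(alpha)
    return z, z_crit, z < z_crit

z, z_crit, reject = left_tailed_z_test(18.3, 20, 5, 45, alpha=0.01)
print(f"z = {z:.2f}, critical z = {z_crit:.2f}, reject H0: {reject}")
```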
p- Value Approach:
Given, α = 0.01
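In Excel, the area under the standard normal curve to the left of a z-score is given by:
=NORM.S.DIST(z, cumulative)
Input the test statistic value for zx̄ (-2.28) and set cumulative = TRUE to get the area to the
left of the test statistic:
=NORM.S.DIST(-2.28, TRUE) = 0.011303844
Because this is a left-tailed test, the p-value is the area to the left of the test statistic. By
symmetry, this equals 1.0 minus the area to the left of +2.28 (0.9887).
Therefore, p-value > α-value, i.e., 0.011303844 > 0.01. So, we do not reject H0.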
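[Figure: normal curve for the p-value approach. x scale: x̄ = 18.3 and μH0 = 20 min; z scale: zx̄ = -2.28; the tail area beyond the test statistic is the p-value = 0.0113 and the remaining area is 0.9886, with the "Reject H0" region marked in the tail.]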
Conclusion:
According to the sample of 45 wait times on Saturday night, there is not enough
evidence at the 0.01 level of significance to support the claim that Grace's changes have
reduced the average wait time to less than 20 minutes.
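The p-value can also be checked outside a spreadsheet. In this minimal sketch, `NormalDist().cdf` from Python's standard library plays the role of the cumulative spreadsheet function; the helper name is hypothetical. Because the test statistic is not rounded to -2.28 first, the result differs from 0.0113 only in later decimal places.

```python
from math import sqrt
from statistics import NormalDist

def left_tail_p_value(z):
    """p-value of a left-tailed z-test: area under the standard normal curve left of z."""
    return NormalDist().cdf(z)

# Test statistic for Grace's sample: x-bar = 18.3, mu_0 = 20, sigma = 5, n = 45.
z = (18.3 - 20) / (5 / sqrt(45))
p_value = left_tail_p_value(z)
alpha = 0.01
print(f"z = {z:.2f}, p-value = {p_value:.4f}, reject H0: {p_value < alpha}")
```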
Q3. Business organizations are relying more and more on social media to search for job
candidates. According to one survey, 70% of employers use social media sites to screen job
candidates during the hiring process. A random sample of 160 firms found that 103 of them use
social media sites to screen candidates during the hiring process. Test the hypothesis that less
than 70% of organizations use social media to screen job candidates during the hiring process
with α = 0.01.
a. Use the critical value approach to test this hypothesis
b. Use the p-value approach to test this hypothesis
Answer:
a. Critical value approach:
Step 1: Identify the null and alternative hypotheses
H0: p ≥ 0.7
H1: p < 0.7
α = 0.01
Given:
The number of organizations that use social media sites to hire candidates (x) = 103
The number of organizations in the sample (n) = 160
By substituting the given values into p̂ = x/n, we get the sample proportion (p̂) = 0.643
$$z_p = \frac{\hat{p} - p_{H_0}}{\sqrt{\frac{p_{H_0}(1 - p_{H_0})}{n}}}$$
Where: p̂ = Sample proportion.
pH0 = The population proportion, which is assumed to be true in the null hypothesis.
n = Sample size
By substituting the above values, we get the z-test statistic for the proportion (zp) = -1.56
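Step 5: Compare the z-test statistic (zp) with the critical z-score (zα).
For a left-tailed test with α = 0.01, the critical z-score is zα = -2.33. The z-test statistic
zp = -1.56 is not less than zα = -2.33, so it falls outside the rejection region and we do not
reject H0.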
[Figure: normal curve for the left-tailed test with α = 0.01. p̂ scale: pH0 = 0.7; z scale: test statistic zp = -1.56 and critical z-score zα = -2.33.]
Step 6: Stating the conclusion.
Conclusion:
According to the sample of 160 firms, at the 0.01 level of significance, there is not
enough evidence to support the claim that less than 70% of organizations use social media to
screen job candidates during the hiring process.
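A minimal sketch of the one-proportion z-test with the critical value approach is shown below; the function name is hypothetical. Working from the unrounded proportion 103/160 = 0.64375 gives a test statistic of about -1.55, slightly different from the -1.56 reported here because of rounding in intermediate steps. The decision is the same either way.

```python
from math import sqrt
from statistics import NormalDist

def left_tailed_proportion_test(x, n, p_0, alpha):
    """Return (p_hat, z, critical z, reject H0?) for H0: p >= p_0 vs H1: p < p_0."""
    p_hat = x / n
    # Standard error of the proportion under the null hypothesis.
    standard_error = sqrt(p_0 * (1 - p_0) / n)
    z = (p_hat - p_0) / standard_error
    z_crit = NormalDist().inv_cdf(alpha)  # negative for a left-tailed test
    return p_hat, z, z_crit, z < z_crit

p_hat, z, z_crit, reject = left_tailed_proportion_test(103, 160, 0.7, alpha=0.01)
print(f"p_hat = {p_hat:.5f}, z = {z:.2f}, critical z = {z_crit:.2f}, reject H0: {reject}")
```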
b. p-value approach:
α = 0.01
The number of organizations that use social media sites to hire candidates (x) = 103
By substituting the given values, we get the sample proportion (p̂ ) = 0.643
p̂ = Sample Proportion.
pH0 = The population proportion, which is assumed to be true in the null hypothesis.
n = Sample size
By substituting the above values, we get the z-test statistic for the proportion (z p) = -1.56
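=NORM.S.DIST(z, cumulative)
Input the test statistic value for zp (-1.56) and set cumulative = TRUE to get the area to the
left of the test statistic:
=NORM.S.DIST(-1.56, TRUE) = 0.05938
Because this is a left-tailed test, the p-value is the area to the left of the test statistic, which
by symmetry equals 1.0 - 0.94062.
Therefore, p-value > α-value, i.e., 0.05938 > 0.01. So, we do not reject H0.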
[Figure: normal curve for the p-value approach. p̂ scale: p̂ = 0.643 and pH0 = 0.7; z scale: zp = -1.56 and zα = -2.33; p-value = 0.059.]
Step 6: Stating the conclusion.
Conclusion:
According to the sample of 160 firms, at the 0.01 level of significance, there
is not enough evidence to support the claim that less than 70% of organizations use social
media to screen job candidates during the hiring process.
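The p-value for the proportion test can be computed directly from the counts, as in the sketch below (the function name is hypothetical). Using the unrounded test statistic, the p-value comes out near 0.060 rather than 0.059; both are well above α = 0.01.

```python
from math import sqrt
from statistics import NormalDist

def proportion_left_tail_p_value(x, n, p_0):
    """p-value for H0: p >= p_0 vs H1: p < p_0, using the normal approximation."""
    z = (x / n - p_0) / sqrt(p_0 * (1 - p_0) / n)
    # Left-tailed test: the p-value is the area to the left of z.
    return NormalDist().cdf(z)

alpha = 0.01
p_value = proportion_left_tail_p_value(103, 160, 0.7)
print(f"p-value = {p_value:.4f}, reject H0: {p_value < alpha}")
```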