Advanced Statistics Project - Business Report
Table of Contents
1.1 What is the probability that a randomly chosen player would suffer an injury?
1.3 What is the probability that a randomly chosen player plays in a striker position and has a foot
injury?
1.4 What is the probability that a randomly chosen injured player is a striker?
1.5 What is the probability that a randomly chosen injured player is either a forward or an attacking
midfielder?
2.1 What are the probabilities of a fire, a mechanical failure, and a human error respectively?
2.3 Suppose there has been a radiation leak in the reactor for which the definite cause is not known.
What is the probability that it has been caused by:
A Fire.
A Mechanical Failure.
A Human Error.
3.1 What proportion of the gunny bags have a breaking strength less than 3.17 kg per sq cm?
3.2 What proportion of the gunny bags have a breaking strength of at least 3.6 kg per sq cm?
3.3 What proportion of the gunny bags have a breaking strength between 5 and 5.5 kg per sq cm?
3.4 What proportion of the gunny bags have a breaking strength NOT between 3 and 7.5 kg per sq
cm?
4.1 What is the probability that a randomly chosen student gets a grade below 85 on this exam?
4.2 What is the probability that a randomly selected student scores between 65 and 87?
4.3 What should be the passing cut-off so that 75% of the students clear the exam?
5.1 Earlier experience of Zingaro with this particular client is favorable as the stone surface was
found to be of adequate hardness. However, Zingaro has reason to believe now that the unpolished
stones may not be suitable for printing. Do you think Zingaro is justified in thinking so?
5.2 Is the mean hardness of the polished and unpolished stones the same?
6. Aquarius health club, one of the largest and most popular cross-fit gyms in the country has been
advertising a rigorous program for body conditioning. The program is considered successful if the
candidate is able to do more than 5 push-ups, as compared to when he/she enrolled in the program.
Using the sample data provided can you conclude whether the program is successful? (Consider the
level of Significance as 5%)
Note that this is a problem of the paired-t-test. Since the claim is that the training will make a
difference of more than 5, the null and alternative hypotheses must be formed accordingly.
7.1) Test whether there is any difference among the dentists on the implant hardness. State the null
and alternative hypotheses. Note that both types of alloys cannot be considered together. You
must state the null and alternative hypotheses separately for the two types of alloys.
7.2) Before the hypotheses may be tested, state the required assumptions. Are the assumptions
fulfilled? Comment separately on both alloy types.
7.3) Irrespective of your conclusion in 2, we will continue with the testing procedure. What do you
conclude regarding whether implant hardness depends on dentists? Clearly state your
conclusion. If the null hypothesis is rejected, is it possible to identify which pairs of dentists
differ?
7.4) Now test whether there is any difference among the methods on the hardness of dental implant,
separately for the two types of alloys. What are your conclusions? If the null hypothesis is
rejected, is it possible to identify which pairs of methods differ?
7.5) Now test whether there is any difference among the temperature levels on the hardness of
dental implant, separately for the two types of alloys. What are your conclusions? If the null
hypothesis is rejected, is it possible to identify which levels of temperatures differ?
7.6) Consider the interaction effect of dentist and method and comment on the interaction plot,
separately for the two types of alloys.
7.7) Now consider the effect of both factors, dentist, and method, separately on each alloy. What do
you conclude? Is it possible to identify which dentists are different, which methods are different,
and which interaction levels are different?
Problem 1:
A physiotherapist with a male football team is interested in studying the relationship between
foot injuries and the positions at which the players play, using the data collected:
         Striker  Forward  Attacking Midfielder  Winger  Total
Total         77       94                    35      29    235
1.1 What is the probability that a randomly chosen player would suffer an injury?
Ans: P(a randomly chosen player suffers an injury) = total players injured / total players
= 145/235 = 0.617
The probability that a randomly chosen player would suffer an injury is 61.7%.
1.3 What is the probability that a randomly chosen player plays in a striker position and has a foot
injury?
Ans: P(a randomly chosen player plays in a striker position and has a foot injury) =
P(Striker ∩ Foot Injury) = 45/235 = 0.191
The probability that a randomly chosen player plays in a striker position and has a foot injury
is 19.1%.
1.4 What is the probability that a randomly chosen injured player is a striker?
1.5 What is the probability that a randomly chosen injured player is either a forward or an
attacking midfielder?
Ans: P(a randomly chosen injured player is either a forward or an attacking midfielder)
= P(Forward | Injured) + P(Attacking Midfielder | Injured)
= 56/145 + 24/145 = 80/145 = 0.5517
The probability that a randomly chosen injured player is either a forward or an attacking
midfielder is 55.2%.
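The arithmetic in Problem 1 can be re-checked with a short script. This is a minimal sketch that uses only the counts quoted in the answers above (235 players in total, 145 injured, of whom 45 are strikers, 56 forwards and 24 attacking midfielders); the variable names are illustrative and not taken from the original analysis.

```python
from fractions import Fraction

# Counts quoted in the answers above.
total_players = 235
injured_total = 145
injured = {"striker": 45, "forward": 56, "attacking midfielder": 24}

# 1.1: P(injured) over all players
p_injury = Fraction(injured_total, total_players)

# 1.3: joint probability P(striker and injured) over all players
p_striker_and_injured = Fraction(injured["striker"], total_players)

# 1.5: conditional probability, so the denominator is the injured players only
p_fwd_or_am_given_injured = Fraction(
    injured["forward"] + injured["attacking midfielder"], injured_total
)

for label, p in [
    ("1.1 P(injured)", p_injury),
    ("1.3 P(striker and injured)", p_striker_and_injured),
    ("1.5 P(forward or attacking midfielder | injured)", p_fwd_or_am_given_injured),
]:
    print(f"{label} = {float(p):.4f}")
```

Keeping the fractions exact until the final print makes it easy to see which denominator each question uses: all players for 1.1 and 1.3, injured players only for 1.5.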
Problem 2:
An independent research organization is trying to estimate the probability that an accident at a
nuclear power plant will result in radiation leakage. The types of accidents possible at the plant
are, fire hazards, mechanical failure, or human error. The research organization also knows that
two or more types of accidents cannot occur simultaneously.
According to the studies carried out by the organization, the probability of a radiation leak in case
of a fire is 20%, the probability of a radiation leak in case of a mechanical failure is 50%, and the
probability of a radiation leak in case of a human error is 10%. The studies also showed the following:
The probability of a radiation leak occurring simultaneously with a mechanical failure is 0.15%.
The probability of a radiation leak occurring simultaneously with a human error is 0.12%.
2.1 What are the probabilities of a fire, a mechanical failure, and a human error respectively?
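One way to approach 2.1 is to rearrange the multiplication rule, P(cause) = P(leak and cause) / P(leak | cause). The sketch below applies it only to the two causes whose joint probabilities are quoted in the problem text above (mechanical failure and human error); the corresponding figure for fire does not appear in this text, so it is left out rather than guessed. Variable names are illustrative.

```python
# P(leak | cause) and P(leak and cause), as quoted in the problem statement.
p_leak_given_cause = {"mechanical failure": 0.50, "human error": 0.10}
p_leak_and_cause = {"mechanical failure": 0.0015, "human error": 0.0012}

# Multiplication rule rearranged: P(cause) = P(leak and cause) / P(leak | cause)
p_cause = {
    cause: p_leak_and_cause[cause] / p_leak_given_cause[cause]
    for cause in p_leak_and_cause
}

for cause, p in p_cause.items():
    print(f"P({cause}) = {p:.4f}")
```

With the quoted figures this gives 0.003 (0.3%) for a mechanical failure and 0.012 (1.2%) for a human error.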
2.3 Suppose there has been a radiation leak in the reactor for which the definite cause is not
known. What is the probability that it has been caused by:
A Fire.
A Mechanical Failure.
A Human Error.
Ans: Using Bayes' theorem, we can calculate the probability of each cause given that a leak has occurred.
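Because the accident types are mutually exclusive, Bayes' theorem here reduces to dividing each joint probability P(leak and cause) by their sum, which is the total probability of a leak. The helper below is a hedged sketch of that step; the function name is hypothetical, and the joint probability for fire has to be supplied by the reader because it is not stated in the text above.

```python
def leak_cause_posteriors(joint):
    """Return P(cause | leak) for each cause.

    `joint` maps each cause to P(leak and cause). The causes are assumed to be
    mutually exclusive and to cover every way a leak can arise, so P(leak) is
    simply the sum of the joint probabilities.
    """
    p_leak = sum(joint.values())
    return {cause: p / p_leak for cause, p in joint.items()}


# Usage (the fire value is not given in this text and must be filled in):
# leak_cause_posteriors({
#     "fire": p_leak_and_fire,
#     "mechanical failure": 0.0015,
#     "human error": 0.0012,
# })
```

The three posteriors always sum to 1, which is a quick sanity check on the result.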
Problem 3:
The breaking strength of gunny bags used for packaging cement is normally distributed with a
mean of 5 kg per sq. centimeter and a standard deviation of 1.5 kg per sq. centimeter. The quality
team of the cement company wants to know the following about the packaging material to better
understand wastage or pilferage within the supply chain. Answer the questions below based on
the given information. (Provide an appropriate visual representation of your answers, without
which marks will be deducted.)
3.1 What proportion of the gunny bags have a breaking strength less than 3.17 kg per sq cm?
Ans: The proportion of gunny bags with a breaking strength less than 3.17 kg per sq cm
is 0.1112 (about 11.12%).
3.2 What proportion of the gunny bags have a breaking strength of at least 3.6 kg per sq cm?
Ans: The proportion of gunny bags with a breaking strength of at least 3.6 kg per sq cm
is 0.8247 (about 82.47%).
3.3 What proportion of the gunny bags have a breaking strength between 5 and 5.5 kg per sq cm?
Ans: The proportion of gunny bags with a breaking strength between 5 and 5.5 kg per sq cm is
0.1306 (about 13.06%).
3.4 What proportion of the gunny bags have a breaking strength NOT between 3 and 7.5 kg per
sq cm?
Ans: The proportion of gunny bags with a breaking strength NOT between 3 and 7.5 kg per sq
cm is 0.1390 (about 13.90%).
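The four proportions in Problem 3 follow from the cumulative distribution function of a normal distribution with mean 5 and standard deviation 1.5. The report does not say which tool produced them; the sketch below uses Python's standard-library `statistics.NormalDist` as one possible way to reproduce the figures, with illustrative variable names.

```python
from statistics import NormalDist

# Breaking strength ~ Normal(mean=5, sd=1.5), in kg per sq cm.
strength = NormalDist(mu=5, sigma=1.5)

# 3.1: P(X < 3.17)
p_below_317 = strength.cdf(3.17)

# 3.2: P(X >= 3.6) is the complement of the CDF at 3.6
p_at_least_36 = 1 - strength.cdf(3.6)

# 3.3: P(5 < X < 5.5) is a difference of two CDF values
p_between_5_and_55 = strength.cdf(5.5) - strength.cdf(5)

# 3.4: P(X < 3 or X > 7.5) adds the two tails
p_not_between_3_and_75 = strength.cdf(3) + (1 - strength.cdf(7.5))

for label, p in [
    ("3.1", p_below_317),
    ("3.2", p_at_least_36),
    ("3.3", p_between_5_and_55),
    ("3.4", p_not_between_3_and_75),
]:
    print(f"{label}: {p:.4f}")
```

Each answer is either a single CDF value, its complement, or a sum or difference of two CDF values, which is also how the shaded regions in the required plots would be defined.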
Problem 4:
Grades of the final examination in a training course are found to be normally distributed, with a
mean of 77 and a standard deviation of 8.5. Based on the given information answer the questions
below.
4.1 What is the probability that a randomly chosen student gets a grade below 85 on this exam?
Ans: The probability that a randomly chosen student gets a grade below 85 on this exam is
0.8267 (about 82.67%).
4.2 What is the probability that a randomly selected student scores between 65 and 87?
Ans: The probability that a randomly selected student scores between 65 and 87 is
0.8013 (about 80.13%).
4.3 What should be the passing cut-off so that 75% of the students clear the exam?
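Ans: For 75% of the students to clear the exam, the cut-off must be the score that 75% of students
exceed, i.e. the 25th percentile of the grade distribution, which is approximately 71.27. (The 75th
percentile, 82.73, is the score that 75% of students fall below, so only 25% would clear it.)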
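The Problem 4 figures can be checked the same way, assuming grades follow a normal distribution with mean 77 and standard deviation 8.5. This sketch again uses the standard-library `statistics.NormalDist` (the report does not name its tool). For 4.3 it reads "75% of the students clear the exam" as P(X >= cut-off) = 0.75, so the cut-off is the 25th percentile; the 75th percentile is computed alongside for comparison.

```python
from statistics import NormalDist

# Grades ~ Normal(mean=77, sd=8.5).
grades = NormalDist(mu=77, sigma=8.5)

# 4.1: P(X < 85)
p_below_85 = grades.cdf(85)

# 4.2: P(65 < X < 87)
p_65_to_87 = grades.cdf(87) - grades.cdf(65)

# 4.3: cut-off c with P(X >= c) = 0.75, i.e. P(X < c) = 0.25
cutoff_75_pass = grades.inv_cdf(0.25)

# For comparison: the 75th percentile, which only 25% of students exceed
percentile_75 = grades.inv_cdf(0.75)

print(f"4.1: {p_below_85:.4f}")
print(f"4.2: {p_65_to_87:.4f}")
print(f"4.3: cut-off = {cutoff_75_pass:.2f} (75th percentile = {percentile_75:.2f})")
```

The two percentiles are symmetric about the mean of 77, so confusing them shifts the cut-off by about 11.5 marks.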
Problem 5:
Zingaro stone printing is a company that specializes in printing images or patterns on polished or
unpolished stones. However, for the optimum level of printing of the image the stone surface has
to have a Brinell's hardness index of at least 150. Recently, Zingaro has received a batch of
polished and unpolished stones from its clients. Use the data provided to answer the following
(assuming a 5% significance level):
5.1 Earlier experience of Zingaro with this particular client is favorable as the stone surface was
found to be of adequate hardness. However, Zingaro has reason to believe now that the
unpolished stones may not be suitable for printing. Do you think Zingaro is justified in thinking so?
5.2 Is the mean hardness of the polished and unpolished stones the same?
Ans: Null Hypothesis (H0): The mean hardness of “Unpolished Stones” and “Polished Stones” is the same.
Alternative Hypothesis (H1): The mean hardness of “Unpolished Stones” and “Polished Stones” is not
the same.
Level of significance: α = 0.05
The t-test gives a p-value of 0.0014655150194628353 (about 0.0015), which is less than the
level of significance.
Hence, we reject the null hypothesis and conclude that the mean hardness of “Unpolished Stones”
and “Polished Stones” is not the same.
Problem 6:
Aquarius health club, one of the largest and most popular cross-fit gyms in the country has been
advertising a rigorous program for body conditioning. The program is considered successful if the
candidate is able to do more than 5 push-ups, as compared to when he/she enrolled in the
program. Using the sample data provided can you conclude whether the program is successful?
(Consider the level of Significance as 5%)
Note that this is a problem of the paired-t-test. Since the claim is that the training will make a
difference of more than 5, the null and alternative hypotheses must be formed accordingly.
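Following the note above, the hypotheses for the paired test can be written as H0: mean difference <= 5 against H1: mean difference > 5, where the difference is push-ups after the program minus push-ups before. The function below is a hedged sketch of the test statistic only; its name and arguments are hypothetical, and it does not use the sample data, which is not reproduced in this text.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t_statistic(before, after, margin=5):
    """Paired t statistic and degrees of freedom.

    Tests H0: mean(after - before) <= margin
    against H1: mean(after - before) > margin.
    """
    diffs = [a - b for b, a in zip(before, after)]
    n = len(diffs)
    standard_error = stdev(diffs) / sqrt(n)  # stdev uses the n - 1 denominator
    t = (mean(diffs) - margin) / standard_error
    return t, n - 1
```

The one-sided p-value is the upper-tail probability of this statistic under a t distribution with n - 1 degrees of freedom, and H0 is rejected at the 5% level when that p-value is below 0.05. A statistics library can supply the tail probability; for example, SciPy's `scipy.stats.ttest_1samp` applied to the differences with `popmean=5` and `alternative="greater"` performs the same test.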
Problem 7:
Dental implant data: The hardness of metal implant in dental cavities depends on multiple factors,
such as the method of implant, the temperature at which the metal is treated, the alloy used as
well as on the dentists who may favour one method above another and may work better in
his/her favourite method. The response is the variable of interest.
1. Test whether there is any difference among the dentists on the implant hardness. State
the null and alternative hypotheses. Note that both types of alloys cannot be considered
together. You must state the null and alternative hypotheses separately for the two types
of alloys.
Ans: The data are split by alloy type into two datasets, DF1 (Alloy 1) and DF2 (Alloy 2), so that
each alloy can be tested separately.
H0 for DF1: The mean implant hardness is the same for all dentists for Alloy 1.
H0 for DF2: The mean implant hardness is the same for all dentists for Alloy 2.
H1 for DF1: The mean implant hardness differs for at least one dentist for Alloy 1.
H1 for DF2: The mean implant hardness differs for at least one dentist for Alloy 2.
As per the ANOVA test, we fail to reject H0 for both alloys, since the p-value in both cases is
greater than the significance level of 0.05.
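For reference, the one-way ANOVA behind this kind of conclusion compares the variation between group means with the variation within groups. The sketch below computes the F statistic from plain lists, one list of hardness values per dentist within a single alloy. It is an illustration of the technique, not the original analysis code, and the function name is hypothetical.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of samples, one per group."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = mean([x for g in groups for x in g])

    # Variation of the group means around the grand mean
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of the observations around their own group mean
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between, df_within = k - 1, n_total - k
    f = (ss_between / df_between) / (ss_within / df_within)
    return f, df_between, df_within
```

The p-value is the upper-tail probability of F under an F distribution with (df_between, df_within) degrees of freedom. A value above 0.05, as reported here for both alloys, means the observed differences between dentists are within what chance alone would plausibly produce.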
2. Before the hypotheses may be tested, state the required assumptions. Are the
assumptions fulfilled? Comment separately on both alloy types.
3. Irrespective of your conclusion in 2, we will continue with the testing procedure. What do
you conclude regarding whether implant hardness depends on dentists? Clearly state your
conclusion. If the null hypothesis is rejected, is it possible to identify which pairs of
dentists differ?
The p-value is greater than alpha (0.05); hence, we fail to reject the null hypothesis.
Alloy1:
Alloy2:
For both alloys, the corresponding p-value is greater than alpha (0.05). Thus, we fail to reject
the null hypothesis.
Thus, for both Alloy 1 and Alloy 2, there is no significant evidence that the mean hardness differs
across dentists.
4. Now test whether there is any difference among the methods on the hardness of dental
implant, separately for the two types of alloys. What are your conclusions? If the null
hypothesis is rejected, is it possible to identify which pairs of methods differ?
Ans: