Advanced Statistics Project - Business Report

By: Ketan Sawalkar


PGP-Data Science and Business Analytics
(PGPDSBA.O.OCT22.B)

Table of Contents
1.1 What is the probability that a randomly chosen player would suffer an injury?

1.2 What is the probability that a player is a forward or a winger?

1.3 What is the probability that a randomly chosen player plays in a striker position and has a foot
injury?

1.4 What is the probability that a randomly chosen injured player is a striker?

1.5 What is the probability that a randomly chosen injured player is either a forward or an attacking
midfielder? 

2.1 What are the probabilities of a fire, a mechanical failure, and a human error respectively?

2.2 What is the probability of a radiation leak?

2.3 Suppose there has been a radiation leak in the reactor for which the definite cause is not known.
What is the probability that it has been caused by:

 A Fire.
 A Mechanical Failure.
 A Human Error.

3.1 What proportion of the gunny bags have a breaking strength less than 3.17 kg per sq cm?

3.2 What proportion of the gunny bags have a breaking strength at least 3.6 kg per sq cm.?

3.3 What proportion of the gunny bags have a breaking strength between 5 and 5.5 kg per sq cm.?

3.4 What proportion of the gunny bags have a breaking strength NOT between 3 and 7.5 kg per sq
cm.?

4.1 What is the probability that a randomly chosen student gets a grade below 85 on this exam?

4.2 What is the probability that a randomly selected student scores between 65 and 87?

4.3 What should be the passing cut-off so that 75% of the students clear the exam?

5.1 Earlier experience of Zingaro with this particular client is favorable as the stone surface was
found to be of adequate hardness. However, Zingaro has reason to believe now that the unpolished
stones may not be suitable for printing. Do you think Zingaro is justified in thinking so?

5.2 Is the mean hardness of the polished and unpolished stones the same?

6. Aquarius health club, one of the largest and most popular cross-fit gyms in the country has been
advertising a rigorous program for body conditioning. The program is considered successful if the
candidate is able to do more than 5 push-ups, as compared to when he/she enrolled in the program.
Using the sample data provided can you conclude whether the program is successful? (Consider the
level of Significance as 5%)

Note that this is a problem of the paired-t-test. Since the claim is that the training will make a
difference of more than 5, the null and alternative hypotheses must be formed accordingly.

7.1)Test whether there is any difference among the dentists on the implant hardness. State the null
and alternative hypotheses. Note that both types of alloys cannot be considered together. You
must state the null and alternative hypotheses separately for the two types of alloys.?

7.2)Before the hypotheses may be tested, state the required assumptions. Are the assumptions
fulfilled? Comment separately on both alloy types.? 
7.3)Irrespective of your conclusion in 2, we will continue with the testing procedure. What do you
conclude regarding whether implant hardness depends on dentists? Clearly state your
conclusion. If the null hypothesis is rejected, is it possible to identify which pairs of dentists
differ?
7.4)Now test whether there is any difference among the methods on the hardness of dental implant,
separately for the two types of alloys. What are your conclusions? If the null hypothesis is
rejected, is it possible to identify which pairs of methods differ?
7.5)Now test whether there is any difference among the temperature levels on the hardness of
dental implant, separately for the two types of alloys. What are your conclusions? If the null
hypothesis is rejected, is it possible to identify which levels of temperatures differ?
7.6)Consider the interaction effect of dentist and method and comment on the interaction plot,
separately for the two types of alloys?
7.7)Now consider the effect of both factors, dentist, and method, separately on each alloy. What do
you conclude? Is it possible to identify which dentists are different, which methods are different,
and which interaction levels are different?

Problem 1:
A physiotherapist with a male football team is interested in studying the relationship between
foot injuries and the positions at which the players play. The data collected are summarized below:

                     Striker  Forward  Attacking Midfielder  Winger  Total
Players Injured           45       56                    24      20    145
Players Not Injured       32       38                    11       9     90
Total                     77       94                    35      29    235

1.1 What is the probability that a randomly chosen player would suffer an injury?

Ans: P(Randomly chosen player would suffer an injury) = Total Players Injured / Total Players
P(Randomly chosen player would suffer an injury) = 145/235 = 0.617
The probability that a randomly chosen player would suffer an injury is 61.7%.

1.2 What is the probability that a player is a forward or a winger?

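Ans: A player has exactly one position, so the two events are mutually exclusive and their probabilities add:
P(Player is a forward or a winger) = P(Forward) + P(Winger)
P(Player is a forward or a winger) = (94/235 + 29/235) = 123/235 = 0.523
The probability that a player is a forward or a winger is 52.3%.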

1.3 What is the probability that a randomly chosen player plays in a striker position and has a foot
injury?

Ans: P(A randomly chosen player plays in a striker position and has a foot injury) =
P(Striker ∩ Foot Injury) = 45/235 = 0.191
The probability that a randomly chosen player plays in a striker position and has a foot injury
is 19.1%.

1.4 What is the probability that a randomly chosen injured player is a striker?

Ans: P(A randomly chosen injured player is a striker) = P(Striker | Injured)
= P(Striker ∩ Injured) / P(Injured)
= (45/235) / (145/235)
= 45/145 = 0.310
The probability that a randomly chosen injured player is a striker is 31.0%.

1.5 What is the probability that a randomly chosen injured player is either a forward or an
attacking midfielder? 

Ans: P(A randomly chosen injured player is either a forward or an attacking midfielder)
= P(Forward | Injured) + P(Attacking Midfielder | Injured)
= 56/145 + 24/145 = 80/145 = 0.5517

The probability that a randomly chosen injured player is either a forward or an attacking
midfielder is 55.2%.
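
The five answers to Problem 1 can be cross-checked with a short script. This is only a sketch: the counts are copied from the table above, while the variable names are illustrative and not part of the original report.

```python
from fractions import Fraction

# Counts copied from the injury table in Problem 1.
injured = {"Striker": 45, "Forward": 56, "Attacking Midfielder": 24, "Winger": 20}
not_injured = {"Striker": 32, "Forward": 38, "Attacking Midfielder": 11, "Winger": 9}

total_injured = sum(injured.values())
total_players = total_injured + sum(not_injured.values())
position_total = {pos: injured[pos] + not_injured[pos] for pos in injured}

# 1.1 P(Injured)
p_injured = Fraction(total_injured, total_players)
# 1.2 P(Forward or Winger): positions are mutually exclusive, so counts add
p_forward_or_winger = Fraction(
    position_total["Forward"] + position_total["Winger"], total_players
)
# 1.3 P(Striker and Injured)
p_striker_and_injured = Fraction(injured["Striker"], total_players)
# 1.4 P(Striker | Injured)
p_striker_given_injured = Fraction(injured["Striker"], total_injured)
# 1.5 P(Forward or Attacking Midfielder | Injured)
p_fwd_or_am_given_injured = Fraction(
    injured["Forward"] + injured["Attacking Midfielder"], total_injured
)

answers = {
    "1.1": p_injured,
    "1.2": p_forward_or_winger,
    "1.3": p_striker_and_injured,
    "1.4": p_striker_given_injured,
    "1.5": p_fwd_or_am_given_injured,
}
for question, prob in answers.items():
    print(f"{question}: {prob} = {float(prob):.4f}")
```

Working with Fraction keeps the ratios exact until the final conversion to a decimal, which avoids rounding slips in the intermediate steps.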

Problem 2:
An independent research organization is trying to estimate the probability that an accident at a
nuclear power plant will result in radiation leakage. The types of accidents possible at the plant
are, fire hazards, mechanical failure, or human error. The research organization also knows that
two or more types of accidents cannot occur simultaneously.

According to the studies carried out by the organization, the probability of a radiation leak in case
of a fire is 20%, the probability of a radiation leak in case of a mechanical failure is 50%, and the
probability of a radiation leak in case of a human error is 10%. The studies also showed the following:

The probability of a radiation leak occurring simultaneously with a fire is 0.1%.

The probability of a radiation leak occurring simultaneously with a mechanical failure is 0.15%.

The probability of a radiation leak occurring simultaneously with a human error is 0.12%.

On the basis of the information available, answer the questions below:

2.1 What are the probabilities of a fire, a mechanical failure, and a human error respectively?

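Ans: Each probability follows from P(Cause) = P(R ∩ Cause) / P(R | Cause), where R denotes a radiation leak:
Probability of Fire: P(F) = 0.001/0.20 = 0.005
Probability of Mechanical Failure: P(M) = 0.0015/0.50 = 0.003
Probability of Human Error: P(H) = 0.0012/0.10 = 0.012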

2.2 What is the probability of a radiation leak?

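Ans: Since the three types of accidents cannot occur simultaneously, the probability of a radiation leak is the sum of the three joint probabilities:
P(R) = P(R∩F) + P(R∩M) + P(R∩H) = 0.001 + 0.0015 + 0.0012 = 0.0037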

2.3 Suppose there has been a radiation leak in the reactor for which the definite cause is not
known. What is the probability that it has been caused by:
A Fire.
A Mechanical Failure.
A Human Error.
Ans: Using Bayes' Theorem, we can calculate:
- Probability that the leak was caused by a Fire: P(F|R) = P(R∩F)/P(R) = 0.001/0.0037 = 0.2703
- Probability that the leak was caused by a Mechanical Failure: P(M|R) = P(R∩M)/P(R) = 0.0015/0.0037 = 0.4054
- Probability that the leak was caused by a Human Error: P(H|R) = P(R∩H)/P(R) = 0.0012/0.0037 = 0.3243
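
The whole of Problem 2 reduces to a few divisions, sketched below. The input probabilities are taken from the problem statement. The dictionary names are illustrative assumptions.

```python
# P(R | cause): probability of a radiation leak given each type of accident.
p_leak_given = {"Fire": 0.20, "Mechanical Failure": 0.50, "Human Error": 0.10}
# P(R and cause): probability of a leak occurring together with each accident.
p_leak_and = {"Fire": 0.001, "Mechanical Failure": 0.0015, "Human Error": 0.0012}

# 2.1: P(cause) = P(R and cause) / P(R | cause)
p_cause = {c: p_leak_and[c] / p_leak_given[c] for c in p_leak_and}

# 2.2: the accident types are mutually exclusive, so P(R) is the sum of the joints.
p_leak = sum(p_leak_and.values())

# 2.3: Bayes' theorem, P(cause | R) = P(R and cause) / P(R)
p_cause_given_leak = {c: p_leak_and[c] / p_leak for c in p_leak_and}

for cause in p_leak_and:
    print(
        f"{cause}: P(cause) = {p_cause[cause]:.4f}, "
        f"P(cause | leak) = {p_cause_given_leak[cause]:.4f}"
    )
print(f"P(leak) = {p_leak:.4f}")
```

The three posterior probabilities sum to 1, which is a quick sanity check on the arithmetic.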

Problem 3:
The breaking strength of gunny bags used for packaging cement is normally distributed with a
mean of 5 kg per sq. centimeter and a standard deviation of 1.5 kg per sq. centimeter. The quality
team of the cement company wants to know the following about the packaging material to better
understand wastage or pilferage within the supply chain; Answer the questions below based on
the given information; (Provide an appropriate visual representation of your answers, without
which marks will be deducted)

3.1 What proportion of the gunny bags have a breaking strength less than 3.17 kg per sq cm?

Ans: The proportion of gunny bags with a breaking strength less than 3.17 kg per sq cm
is 0.1112 (about 11.1%).

Visual Representation of Q.3.1

3.2 What proportion of the gunny bags have a breaking strength at least 3.6 kg per sq cm.?

Ans: The proportion of gunny bags with a breaking strength of at least 3.6 kg per sq cm
is 0.8247 (about 82.5%).

Visual Representation of Q.3.2

3.3 What proportion of the gunny bags have a breaking strength between 5 and 5.5 kg per sq cm.?

Ans: The proportion of gunny bags with a breaking strength between 5 and 5.5 kg per sq cm is
0.1306 (about 13.1%).

Visual Representation of Q.3.3

3.4 What proportion of the gunny bags have a breaking strength NOT between 3 and 7.5 kg per
sq cm.?

Ans: The proportion of gunny bags with a breaking strength NOT between 3 and 7.5 kg per sq
cm is 0.1390 (about 13.9%).

Visual Representation of Q.3.4
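
The four proportions in Problem 3 can be reproduced from the stated mean and standard deviation. The sketch below uses `statistics.NormalDist` from the Python standard library. The report does not say which tool produced its numbers, so this choice is an assumption made to keep the example self-contained.

```python
from statistics import NormalDist

# Breaking strength ~ Normal(mean = 5, sd = 1.5), in kg per sq cm.
strength = NormalDist(mu=5, sigma=1.5)

# 3.1 P(X < 3.17)
p_below_317 = strength.cdf(3.17)
# 3.2 P(X >= 3.6) = 1 - P(X < 3.6)
p_at_least_36 = 1 - strength.cdf(3.6)
# 3.3 P(5 < X < 5.5)
p_5_to_55 = strength.cdf(5.5) - strength.cdf(5)
# 3.4 P(X < 3 or X > 7.5): the two tails outside the interval
p_not_3_to_75 = strength.cdf(3) + (1 - strength.cdf(7.5))

print(f"3.1: {p_below_317:.4f}")
print(f"3.2: {p_at_least_36:.4f}")
print(f"3.3: {p_5_to_55:.4f}")
print(f"3.4: {p_not_3_to_75:.4f}")
```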

Problem 4:
Grades of the final examination in a training course are found to be normally distributed, with a
mean of 77 and a standard deviation of 8.5. Based on the given information answer the questions
below. 

4.1 What is the probability that a randomly chosen student gets a grade below 85 on this exam?

Ans: The probability that a randomly chosen student gets a grade below 85 on this exam is
0.8267 (about 82.7%).

4.2 What is the probability that a randomly selected student scores between 65 and 87?

Ans: The probability that a randomly selected student scores between 65 and 87 is
0.8013 (about 80.1%).

4.3 What should be the passing cut-off so that 75% of the students clear the exam?

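Ans: For 75% of the students to clear the exam, the cut-off must be the score that only 25% of the
students fall below, i.e. the 25th percentile of the grade distribution:
77 - 0.6745 × 8.5 = 71.27
The passing cut-off so that 75% of the students clear the exam is approximately 71.27.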
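
A minimal sketch of the Problem 4 calculations, again using `statistics.NormalDist` from the standard library (an assumption, as the report does not name its tool). Note that a cut-off which 75% of students clear is the 25th percentile of the grades. The 75th percentile is also computed purely for comparison, since it is the score that only 25% of students exceed.

```python
from statistics import NormalDist

# Exam grades ~ Normal(mean = 77, sd = 8.5).
grades = NormalDist(mu=77, sigma=8.5)

# 4.1 P(grade < 85)
p_below_85 = grades.cdf(85)
# 4.2 P(65 < grade < 87)
p_65_to_87 = grades.cdf(87) - grades.cdf(65)
# 4.3 Cut-off exceeded by 75% of students: the 25th percentile.
cutoff_75_pass = grades.inv_cdf(0.25)
# For comparison: the 75th percentile, exceeded by only 25% of students.
percentile_75 = grades.inv_cdf(0.75)

print(f"4.1: {p_below_85:.4f}")
print(f"4.2: {p_65_to_87:.4f}")
print(f"4.3: cut-off = {cutoff_75_pass:.2f} (75th percentile = {percentile_75:.2f})")
```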

Problem 5:
Zingaro stone printing is a company that specializes in printing images or patterns on polished or
unpolished stones. However, for the optimum level of printing of the image the stone surface has
to have a Brinell's hardness index of at least 150. Recently, Zingaro has received a batch of
polished and unpolished stones from its clients. Use the data provided to answer the following
(assuming a 5% significance level);

 5.1 Earlier experience of Zingaro with this particular client is favorable as the stone surface was
found to be of adequate hardness. However, Zingaro has reason to believe now that the
unpolished stones may not be suitable for printing. Do you think Zingaro is justified in thinking so?

Ans: Let μ be the mean Brinell hardness index of the unpolished stones.
Null Hypothesis (H0): μ >= 150
Alternative Hypothesis (H1): μ < 150
Level of Significance: α is 0.05
As per the one-sample t-test, the p-value is 8.342573994839285e-05, which is less than the 5%
level of significance.
Hence, the null hypothesis is rejected in favour of the alternative hypothesis.
So, Zingaro is justified in thinking that unpolished stones may not be suitable for printing.
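
The mechanics of the left-tailed one-sample t-test in 5.1 can be sketched as follows. The hardness values are hypothetical, since the report's dataset is not reproduced here, and the critical value 1.833 is the tabulated one-tailed 5% point of the t distribution with 9 degrees of freedom, chosen to match the made-up sample size of 10. With the real data, a library routine such as `scipy.stats.ttest_1samp` would also return the p-value directly.

```python
from math import sqrt
from statistics import mean, stdev

# HYPOTHETICAL hardness readings for 10 unpolished stones (not the report's data).
unpolished = [148, 152, 139, 141, 135, 146, 150, 143, 138, 144]

MU_0 = 150      # required Brinell hardness index from the problem statement
T_CRIT = 1.833  # tabulated t(0.05, df = 9), one-tailed

n = len(unpolished)
standard_error = stdev(unpolished) / sqrt(n)  # sample sd uses n - 1
t_stat = (mean(unpolished) - MU_0) / standard_error

# Left-tailed test: reject H0 (mu >= 150) when t falls below -T_CRIT.
reject_h0 = t_stat < -T_CRIT

print(f"t = {t_stat:.3f}, reject H0: {reject_h0}")
```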

5.2 Is the mean hardness of the polished and unpolished stones the same?

Ans: Null Hypothesis (H0): The mean hardness of the unpolished stones and the polished stones is the same.
Alternative Hypothesis (H1): The mean hardness of the unpolished stones and the polished stones is
not the same.
Level of Significance: α is 0.05
As per the two-sample t-test, the p-value is 0.0014655150194628353, which is less than the
level of significance.
Hence, we reject the null hypothesis and conclude that the mean hardness of the unpolished stones
and the polished stones is not the same.

Problem 6:
Aquarius health club, one of the largest and most popular cross-fit gyms in the country has been
advertising a rigorous program for body conditioning. The program is considered successful if the
candidate is able to do more than 5 push-ups, as compared to when he/she enrolled in the
program. Using the sample data provided can you conclude whether the program is successful?
(Consider the level of Significance as 5%)

Note that this is a problem of the paired-t-test. Since the claim is that the training will make a
difference of more than 5, the null and alternative hypotheses must be formed accordingly.

Ans: Let μd be the mean improvement in the number of push-ups for candidates of Aquarius club.
Null Hypothesis (H0): μd >= 5, i.e. candidates improve by at least 5 push-ups.
Alternative Hypothesis (H1): μd < 5, i.e. candidates fail to improve by 5 push-ups.
Level of Significance: α is 0.05
t-statistic: -19.323
One-tailed p-value of the paired t-test: 1.1460209626255983e-35
The p-value is less than 0.05, so we have enough evidence to reject the null hypothesis in favour
of the alternative hypothesis.
Hence, we conclude that the Aquarius club training program is unsuccessful.
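
A paired t-test against a required improvement of 5 is a one-sample t-test on the per-candidate differences. The sketch below uses hypothetical before and after counts for 10 candidates, so its outcome says nothing about the report's actual result. The critical value 1.833 is the tabulated one-tailed 5% point of the t distribution with 9 degrees of freedom. The sign of the statistic decides which tail applies: a value above +1.833 supports μd > 5, while a value below -1.833 supports μd < 5.

```python
from math import sqrt
from statistics import mean, stdev

# HYPOTHETICAL push-up counts for 10 candidates (not the report's data).
before = [20, 22, 18, 25, 30, 27, 19, 24, 21, 26]
after = [27, 28, 26, 31, 38, 33, 25, 32, 28, 33]

REQUIRED_GAIN = 5  # improvement threshold from the problem statement
T_CRIT = 1.833     # tabulated t(0.05, df = 9), one-tailed

# Pairing: one difference per candidate, after minus before.
diffs = [a - b for a, b in zip(after, before)]
n = len(diffs)

standard_error = stdev(diffs) / sqrt(n)
t_stat = (mean(diffs) - REQUIRED_GAIN) / standard_error

supports_gain_above_5 = t_stat > T_CRIT
supports_gain_below_5 = t_stat < -T_CRIT

print(f"mean gain = {mean(diffs):.2f}, t = {t_stat:.3f}")
print(f"evidence for gain > 5: {supports_gain_above_5}")
print(f"evidence for gain < 5: {supports_gain_below_5}")
```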

Problem 7:
Dental implant data: The hardness of metal implant in dental cavities depends on multiple factors,
such as the method of implant, the temperature at which the metal is treated, the alloy used as
well as on the dentists who may favour one method above another and may work better in
his/her favourite method. The response is the variable of interest.

1. Test whether there is any difference among the dentists on the implant hardness. State
the null and alternative hypotheses. Note that both types of alloys cannot be considered
together. You must state the null and alternative hypotheses separately for the two types
of alloys.?
Ans: The data are split by alloy type into two datasets, DF1 (Alloy 1) and DF2 (Alloy 2).
H0 for DF1: The mean implant hardness is the same for all dentists for Alloy 1.
H0 for DF2: The mean implant hardness is the same for all dentists for Alloy 2.
H1 for DF1: The mean implant hardness differs for at least one dentist for Alloy 1.
H1 for DF2: The mean implant hardness differs for at least one dentist for Alloy 2.
As per the ANOVA test, the p-value is greater than the 0.05 significance level in both cases, so
we fail to reject H0 for both alloys.

2. Before the hypotheses may be tested, state the required assumptions. Are the
assumptions fulfilled? Comment separately on both alloy types.? 

3. Irrespective of your conclusion in 2, we will continue with the testing procedure. What do
you conclude regarding whether implant hardness depends on dentists? Clearly state your
conclusion. If the null hypothesis is rejected, is it possible to identify which pairs of
dentists differ?

Ans: We perform a one-way ANOVA of the response variable on dentist.

The p-value is greater than alpha (0.05); hence, we fail to reject the null hypothesis.

Thus, there is no evidence that the mean hardness differs across dentists.

Now let us perform the test separately for Alloy1 and Alloy2.

Alloy1:

Alloy2:

In both cases the corresponding p-value is greater than alpha (0.05). Thus, we fail to reject the
null hypothesis.
Thus, for both Alloy1 and Alloy2, there is no evidence that the mean hardness differs across dentists.
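
The F statistic behind a one-way ANOVA can be computed by hand, as sketched below. The hardness values are hypothetical (three dentists with five implants each, for a single alloy) and are not taken from the report's dataset. The critical value 3.89 is the tabulated 5% point of the F distribution with (2, 12) degrees of freedom, which matches this made-up layout only.

```python
from statistics import mean

# HYPOTHETICAL implant hardness values for one alloy (not the report's data).
hardness_by_dentist = {
    "Dentist 1": [800, 790, 810, 805, 795],
    "Dentist 2": [798, 802, 788, 812, 800],
    "Dentist 3": [806, 796, 791, 809, 803],
}
F_CRIT = 3.89  # tabulated F(0.05; 2, 12)

groups = list(hardness_by_dentist.values())
all_values = [x for g in groups for x in g]
grand_mean = mean(all_values)

# Between-group variation: how far each dentist's mean is from the grand mean.
ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
# Within-group variation: spread of readings around their own dentist's mean.
ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

df_between = len(groups) - 1
df_within = len(all_values) - len(groups)

f_stat = (ss_between / df_between) / (ss_within / df_within)
reject_h0 = f_stat > F_CRIT

print(f"F({df_between}, {df_within}) = {f_stat:.4f}, reject H0: {reject_h0}")
```

Running the same computation once per alloy keeps the two alloy types separate, as the question requires.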

4. Now test whether there is any difference among the methods on the hardness of dental
implant, separately for the two types of alloys. What are your conclusions? If the null
hypothesis is rejected, is it possible to identify which pairs of methods differ?
Ans:

As the p-values are less than alpha (0.05), we reject the null hypothesis.

The pairs of methods that differ are Method 1 and Method 3, and Method 2 and Method 3.


5. Now test whether there is any difference among the temperature levels on the hardness
of dental implant, separately for the two types of alloys. What are your conclusions? If the
null hypothesis is rejected, is it possible to identify which levels of temperatures differ?
6. Consider the interaction effect of dentist and method and comment on the interaction
plot, separately for the two types of alloys?
7. Now consider the effect of both factors, dentist, and method, separately on each alloy.
What do you conclude? Is it possible to identify which dentists are different, which
methods are different, and which interaction levels are different?
