Statistics and Probability
Statistics and Probability
Q1) Calculate probability from the given dataset for the below cases
Data_set: Cars.csv
Calculate the probability of MPG of Cars for the below cases.
MPG <- Cars$MPG
a. P(MPG>38)
[Answer] : 0.327
b. P(MPG<40)
[Answer] : 0.290
c. P (20<MPG<50)
[Answer] : 0.038
[Answer] :
Adipose Tissue Normally distributed
Waist Normally distributed
Q4) Calculate the t scores of 95% confidence interval, 96% confidence interval, 99% confidence
interval for sample size of 25
Confidence Interval T-Score
95% 2.063899
96% 2.171545
99% 2.79694
Q5) A Government company claims that an average light bulb lasts 270 days. A researcher
randomly selects 18 bulbs for testing. The sampled bulbs last an average of 260 days,
with a standard deviation of 90 days. If the CEO's claim were true, what is the probability
that 18 randomly selected bulbs would have an average life of no more than 260 days
Hint:
rcode → pt(tscore,df)
df → degrees of freedom
To find:
Solution:
t = - 0.471
For probability calculations, the number of degrees of freedom is n - 1, so here you need the t-
distribution with 17 degrees of freedom.
The probability that t < - 0.471 with 17 degrees of freedom assuming the population mean is
true, the t-value is less than the t-value obtained With 17 degrees of freedom and a t score of -
0.471, the probability of the bulbs lasting less than 260 days on average of 0.3218 assuming the
mean life of the bulbs is 300 days.
Q6) The time required for servicing transmissions is normally distributed with = 45
minutes and = 8 minutes. The service manager plans to have work begin on the
transmission of a customer’s car 10 minutes after the car is dropped off and the
customer is told that the car will be ready within 1 hour from drop-off. What is the
probability that the service manager cannot meet his commitment?
A. 0.3875
B. 0.2676
C. 0.5
Thus the question can be answered by using the normal table to find
Pr(X<=50) = Pr(Z<=(50-45)/8.0) = Pr(Z<=0.625) = 73.4%
Probability that the Service Manager will not meet his demand will be 100-73.4 = 26.6% =
0.2676
Q7) The current age (in years) of 400 clerical employees at an insurance claims
processing center is normally distributed with mean = 38 and Standard deviation
=6. For each statement below, please specify True/False. If false, briefly explain why.
A. More employees at the processing center are older than 44 than between
38 and 44.
B. A training program for employees under the age of 30 at the center would
be expected to attract about 36 employees.
[Answer] :
We have a normal distribution with mean = 38 and std= 6. Let X be the number of
employees. So according to question
a)Probabilty of employees greater than age of 44= Pr(X>44)
Pr(X > 44) = 1 - Pr(X ≤ 44).
Z = (X - mean)/ std = (X - 38)/6
Thus the question can be answered by using the normal table to find
Pr(X ≤ 44) = Pr(Z ≤ (44 - 38)/6) = Pr(Z ≤ 1)=84.1345%
Probabilty that the employee will be greater than age of 44 = 100-84.1345=15.86%
So the probability of number of employees between 38-44 years of age = Pr(X<44)-
0.5=84.1345-0.5= 34.1345%
Therefore the statement that “More employees at the processing center are older than 44
than between 38 and 44” is TRUE.
Q8) If X1 ~ N(μ, σ2) and X2 ~ N(μ, σ2) are iid normal random variables, then what is the
difference between 2 X1 and X1 + X2? Discuss both their distributions and parameters.
[Answer]:
As we know that if X ∼ N(µ1, σ1^2 ), and Y ∼ N(µ2, σ2^2 ) are two independent random
variables then X + Y ∼ N(µ1 + µ2, σ1^2 + σ2^2 ) , and X − Y ∼ N(µ1 − µ2, σ1^2 + σ2^2 ) .
Similarly if Z = aX + bY , where X and Y are as defined above, i.e Z is linear combination of X and
Y , then Z ∼ N(aµ1 + bµ2, a^2σ1^2 + b^2σ2^2 ).
Therefore in the question
2X1~ N(2 u,4 σ^2) and
X1+X2 ~ N(µ + µ, σ^2 + σ^2 ) ~ N(2 u, 2σ^2 )
2X1-(X1+X2) = N( 4µ,6 σ^2)
Q9) Let X ~ N(100, 20^2) its (100, 20 square).Find two values, a and b, symmetric about
the mean, such that the probability of the random variable taking a value between
them is 0.99.
A. 90.5, 105.9
B. 80.2, 119.8
C. 22, 78
D. 48.5, 151.5
E. 90.1, 109.9
Answer.
Since we need to find out the values of a and b, which are symmetric about the mean, such
that the probability of random variable taking a value between them is 0.99, we have to work
out in reverse order.
The Probability of getting value between a and b should be 0.99.
So the Probability of going wrong, or the Probability outside the a and b area is 0.01 (ie. 1-
0.99).
The Probability towards left from a = -0.005 (ie. 0.01/2).
The Probability towards right from b = +0.005 (ie. 0.01/2).
So since we have the probabilities of a and b, we need to calculate X, the random variable at a
and b which has got these probabilities.
By finding the Standard Normal Variable Z (Z Value), we can calculate the X values.
Z=(X- μ) / σ
For Probability 0.005 the Z Value is -2.57 (from Z Table).
Z*σ+μ=X
Z(-0.005)*20+100 = -(-2.57)*20+100 = 151.4
Q10) Consider a company that has two different divisions. The annual profits from the
two divisions are independent and have distributions Profit1 ~ N(5, 3^2) and Profit2 ~
N(7, 4^2) respectively. Both the profits are in $ Million. Answer the following questions
about the total profit of the company in Rupees. Assume that $1 = Rs. 45
A. Specify a Rupee range (centered on the mean) such that it contains
95% probability for the annual profit of the company.
B. Specify the 5th percentile of profit (in Rupees) for the company
C. Which of the two divisions has a larger probability of making a loss in a
given year?
[Answer] :
Suppose a certain type of small data processing firm is so specialized that some have difficulty
making a profit in their first year of operation. The probability density function that
characterizes the proportion YY that makes a profit is given by
f(y)={ky2(1−y)3,0,0≤y≤1, elsewhere f(y)={ky2(1−y)3,0≤y≤1,0, elsewhere
(a) What is the value of kk that renders the above a valid density function? (b) Find the
probability that at most 50%50% of the firms make a profit in the first year. (c) Find the
probability that at least 80%80% of the firms make a profit in the first year.