0% found this document useful (0 votes)
14 views6 pages

Mba Data Analytics

Uploaded by

setalurikushal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views6 pages

Mba Data Analytics

Uploaded by

setalurikushal
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Model Question Paper

Programme : Online MBA Semester : I


Course Title : Data Analytics Course Code : OLMBA504
Faculty : Class Nbr :
Time : 2.5 hours Max. Marks : 100

PART – A

Answer All the Questions (10 X 1 Marks = 10 Marks)

Table or critical ‘Z’ value at 0.05 level of significance is 1.96


Table or critical ‘t’ value at 0.05 level of significance is 2.08
Table or critical chi-square value at 0.05 level of significance and 6 degrees of freedom is 1.635
Table or critical chi-square value at 0.05 level of significance and 2 degrees of freedom is 0.103
Table or critical ‘F’ value at 0.05 level of significance is 1.96

Q.No. Question Description Marks


1 Why is it usually better to calculate a mode from grouped, rather than 1
ungrouped data?

a) The ungrouped data tend to be bimodal


b) The mode for the grouped data will be the same, regardless of the
skewness of the distribution
c) extreme values have less effect on grouped data
d) the chance of an unrepresentative value being chosen as the mode is
reduced
2 Which of the following is the first step in calculating the median of a 1
data set?

a)Average the middle two values of the data set


b) Array the data
c) Determine the relative weights of the data values in terms of
importance
d) None of these
3 Paired sample t test is a test of significance difference between 1

a) Assumed mean and sample mean b) two dependent sample means


c) two paired sample means d) b and c but not a
4 Which of the following statements is NOT correct? 1
a)Some data sets do not have means b) Calculation of a mean is
affected by extreme data values c) all these statements are true

5 The measure of dispersion most often used by statisticians is the 1


standard deviation
a) It considers all the values in the distribution for its estimation
b) It is a standard formula which accounts for the error
c) It considers the sample size (n) for its estimation
d) None of these
6 Samples sizes in analysis of variance need not be equal 1
a) No b) Yes c) It depends on the number of samples

7 A chi-square value can never be negative because 1


a) Difference between expected and observed frequencies are squared
b) a negative value would mean that the observed frequencies were zero
c) the absolute value of the differences is computed
d) None of these
e) a and b but not c

8 Independent sample z test may be used to test whether the means of 1


_____________ populations can be considered equal

a) Exactly 2 b) More than 2


b) c) Not means of populations but samples d) none of these

9 The sample probability of an occurrence of an event is called the 1


a) Bayesian probability b) Joint probability c) Marginal
probability d) Conditional probability
10 Correlation coefficient explains 1
a) The strength and sample size sample
b) The strength and direction of the relationship between two
variables
c) The relationship between two samples

Answer all the questions (15 x 2 marks = 30 marks)


1. If the number of independent variables = 5 2
r² value is 0.40.
If r² has to improve to 0.70, what should be done

a) Find out the variable that brings down the r² and drop them from the
model
b) Improve the sample size and examine if it produces sufficient change
in the r²
c) Include more variables and examine if there is any improvement in r²
d) All of these
2. What is the difference between r² and r 2

a) r² is the amount of variance explained by the independent variables


on the dependent variable whereas r explains the strength and the
direction of association between the two variables.
b) r is the amount of variance explained by the independent variables
on the dependent variable whereas r² explains the strength and the
direction of association between the two variables
c) r² captures significant difference between the two samples and r the
difference between the two sample means
d) None of these

3. If r is closer to 1 and positive, it indicates that the relationship between 2


the two variables
a) tends to be stronger and varies in the same direction
b) tends to be moderate
c) tends to be stronger
d) none of these
4 Every parametric test has an alternate non-parametric test. For 2
example, the non-parametric Mann-Whitney U test is the alternate for
the parametric test for ____(i)________ and Kruskal Wallis it the
alternate for parametric test for ____(ii)_________
a. i) 2-independent sample z or t test and ii) One-Way ANOVA
b. i) Chi-square test and ii) one-way ANOVA
c. i) paired sample t test and ii) one way ANOVA
d. None of these
5 Assume that a χ² test is to be performed on a contingency table with 2
four rows and four columns. How many degrees of freedom should be
used?
a) 16 b) 8 c) 9 d) 6
6 Suppose you are comparing five groups exposed to different methods of 2
treatment and have taken a sample of size 10 from each group. You
have calculated 𝑥̅ for each sample. How could you now calculate the
grand mean?
a) Multiply each sample mean by 1/5 and add these values. Then
divide this sum by 50
b) Add the 5 sample means and divide by 50
c) Add the 5 sample means and divide by 5
d) Add the 5 sample means
e) None of these
7 Suppose you are told that there is a direct relationship between the price 2
of artichokes and the amount of rain that fell during the growing season.
It can be calculated that:
a) Prices tend to be high when rainfall is high
b) prices tend to be low when rainfall is high
c) a large amount of rain causes prices to rise
d) a and c
8 The value of r² for a particular situation is 0.49. what is the coefficient 2
of correlation?
a) 0.49 b) 0.7 d) 0.07 e) cannot be determined
from the information given
9 If sample 1 has 13 elements with s1= 17 and sample 2 has 9 elements 2
with s2 = 22, then sp² =
a) 19 b) 361 c) 367 d) 19.5
10 In the Kruskal-Wallis test of k samples, the appropriate number of 2
degrees of freedom is
a) K b) k-1 c) nk-1 d) n –k
11 If the number of rows and columns in a contingency table in chi-square 2
test is 4 and 4 respectively, what is the degrees of freedom
a) 4 b) 16 c) 9 d) 12
12 If the sum of the square of the difference between the corresponding 2
values in two variables (d²) is 9, what is rank correlation. The sample
size is 10.
a) 60 b) 0.6 c) 0.4 d) 40
13 T-distribution is a sub-set or variant of normal distribution which 2
attributes to
a) Sample size less than 30 b) variance is less than 30
b) median is less than 30 d) none of these
14 A casting company is selecting a group of extras for a movie. The ages 2
of the first 20 men to be interviewed are:

50 56 55 49 52 57 56 57 56 59
54
55 61 60 51 59 62 52 54 49

The director of the movie wants men whose ages are fairly tightly
grouped around 55 years. Being a statistics bluff of sorts, the director
suggests that a standard deviation of 3 years would be acceptable. Does
this group of extras qualify?

a) Yes, because the standard deviation of the sample is within the


acceptable range
b) No, because the standard deviation of the sample is more than
acceptable range
c) It cannot be done because the data are insufficient
d) None of these
15 If we find that the amount spent on movies correlates 0.70 with family 2
income, that seems like a fairly strong correlation (0.70 is closer to 1.00
than it is to zero). But when you square 0.7, you see that is accounts for
only 0.7 * 0.7 = 0.49 or 49% of the variation (r²) in the amount of
money family spend on movies. If you designed a marketing strategy to
appeal only to families with high incomes, you’d miss a lot of potential
customers. How do you describe this situation
a) Try to find what else is influencing family movie decisions
b) Amount spent on movies cannot be explained
c) 49% fairly explains the variation in the amount spent on movies
and so we can use this output to make marketing decisions
d) None of these
Part C
(Marks 6 * 10 = 60 marks)
1 It is advertised that the average braking distance for a small car 10
travelling at 65 kms per hour equals 120 feet. A transportation
researcher wants to determine if the statement made in the
advertisement is false. She randomly test drives 36 small cars at 65 kms
per hour and records the braking distance. The sample average braking
distance is computed as 114 feet. Assume that the population standard
deviation is 22 feet. Using one sample test, determine if the claim made
by the advertising company is true at 0.05 level of significance.
2 Marketers know that tastes differ in various regions of the country. In 10
the rental car business, an industry expert has given the opinion that
there are strong regional preferences for size and car and quotes the
following data in support of that view. Does the data support the
expert’s opinion at 0.05 level of significance?

Region of country
Preferred Northeast Southeast Northwest Southwest
car type
Full-size 105 120 105 70
Intermediate 120 100 130 150
All other 25 30 15 30
3 Fuel costs are important to profitability in the airline business. A small
regional carrier has been operating three types of aircraft and has
collected the following cost data from its 14 planes, expressed as fuel
cost (in cents) per available seat mile. At 0.05 level of significance,
can we conclude that there is no true difference between plane types in
fuel costs? Use a non-parametric test.

Type A 7.3 8.3 7.6 6.8 8


Type B 5.6 7.6 7.2
Type C 7.9 9.5 8.7 8.3 9.4 8.4

4 10
The typical household income and typical home price for a sample of 18
cities are given below in thousands of Rupees

City Income Home price

1 74 114
2 82 126
3 71 130
4 62 92
5 79 135
6 66 116
7 82 161
8 85 145
9 75 145
10 89 162
Determine whether there is a relationship between the income of the
typical household and typical home price. If the typical household
income is Rs. 95,000 for a city, estimate the typical home price.
5 The number of cheques cashed each day at the five branches of a bank 10
during the past month had the following frequency distribution.

Class Frequency
0-200 10
200-400 13
400-600 17
600-800 42
800-1000 18

The director of operations for the bank, knows that a standard deviation
in cheque cashing of more than 200 cheques per day creates staffing and
organizational problems at the branches because of the uneven
workload. Should the director worry about staffing about next month
6 10
A supervisor in an assembly line at a production company, recently
subscribed to an easy-listening music service at its factory, hoping that
this would relax the workers and lead to greater productivity. The
supervisor is skeptical about this hypothesis and fears the music will be
distracting, leading to lower productivity. She sampled weekly
production for the same six workers before the music was installed and
after it was installed. His data are given below and determine whether
the production has changed at all? Use paired sample test and test if
there is any difference in the productivity

Employee 1 2 3 4 5 6

Week without music 219 205 226 198 209 216

Week with music 235 186 240 203 221 205

7 A consumer research organization routinely selects several car models 10


each year and evaluates their fuel efficiency. In this year’s study of two
similar subcompact models from two different automakers, the average
gas mileage for 12 cars of brand A was 27.2 kms per gallon and the
standard deviation was 3.8 mpg. The nine brand B cars that were tested
averaged 32.4 mpg and the standard deviation was 3.8 mpg. At 0.01
level of significance, should it conclude that brand A cars have lower
average gas mileage than do brand B cars?

You might also like