0% found this document useful (0 votes)
33 views11 pages

Statistics Pyq for Qualifiers (2)

The document contains previous year questions for the IIT Madras BS in Data Science and Applications program, focusing on statistics. It includes various statistical concepts such as descriptive and inferential statistics, measures of central tendency, and data representation techniques. Additionally, it provides answers to the questions, along with links to join groups for notes and resources.

Uploaded by

amishra33879
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
33 views11 pages

Statistics Pyq for Qualifiers (2)

The document contains previous year questions for the IIT Madras BS in Data Science and Applications program, focusing on statistics. It includes various statistical concepts such as descriptive and inferential statistics, measures of central tendency, and data representation techniques. Additionally, it provides answers to the questions, along with links to join groups for notes and resources.

Uploaded by

amishra33879
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

[Type here]

PREVIOUS YEAR QUESTIONS.


IIT MADRAS – BS in DATA SCIENCE and APPLICATIONS

STATISTICS
OCTOBER TERM ( 27TH OCTOBER 2024)

1. Based on the data collected from an organization, an analyst made a statement that
the average salary of an employee in 50,000 rupees in different organizations in the
city. The given statement of the analyst is based on which kind of statistical analysis ?
A. Descriptive statistics
B. Inferential statistics
2. What is the sample standard deviation of salary (in thousand rupees) ? (Enter the
answer correct to 2 decimal accuracy)

JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40
[Type here]

3. What is the mode of the dataset ?


A. Strawberry
B. 40
C. Chocolate
D. 70
4. What percentage of the total votes is represented by Butterscotch and Strawberry ice
creams combined ?
A. 31.81%
B. 20%
C. 22.72%
D. 50%

5. What is the median of the data set represented by the stem-and-leaf plot?
6. Calcualte the range of the data set.

JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40
[Type here]

7. What is the value of y ( frequency of Thai cuisine) ?


8. What is the value of x (relative frequency of Mexican cuisine)? ( Write correct upto 2
decimal places).

9. Find the sample covariance between X and Y for the dataset given in TABLE 2.
10.Find the sample correlation coefficient( r ) between X and Y for the dataset given in
Table 2. (Write correct upto 3 digits after the decimal)

11. Which of the following is/are true ?


A. A sample is the subset of a population.
B. Numerical variables can have all the properties of ordinal and nominal scales of
measurements.
C. Descriptive measures like Mean, Median, and Mode all of them can be used for
summarize the categorical variable.
D. The correlation coefficient measures the strength of the linear association between
two numerical variables.
12. If a categorical variable is measured on an ordinal scale, which of the following
statistical measures is(are) appropriate?
A. Mean
B. Median
C. Mode

JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40
[Type here]

D. Variance
13. Given bar chart represent the T-Shirt sizes worn by the members of a sports club.
Which of the following option(s) is (are) the best way to represent the data ?

A.

B.

C.

D.
JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40
[Type here]

14. In an exam, student’s scores have an interquartile range (IQR) of 20. The teacher
decides to first add 5 marks to each student’s score and then multiply each adjusted
score by 2. What will be the interquartile range now ?

ANSWERS
1. Inferential statistics
2. 12.22 to 12.28
3. Chocolate
4. 31.81%
5. 28
6. 34
7. 15
8. 0.15 to 0.17
9. 48
10. 0.921 to 0.925
11. A sample is the subset of a population. Numerical variables can have all the
properties of ordinal and nominal scales of measurements. The correlation
coefficient measures the strength of the linear association between two numerical
variables.
12. Median and Mode
13. OPTION C and D
14. 40

MAY TERM (7th JULY 2024)

JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40
[Type here]

1. Which of the following option(s) is/are true ?


A. Median of the data will be either “Mountain dew” or “Mirinda”.
B. The data is bimodal.
C. Mode is not defined for the given data.
D. Median is not defined for the given data.
2. Which of the following statement(s) is/are true ?
A. Structured data doesn’t follow a predefined format, whereas unstructured data
does.
B. Recording of the data over time comes under Cross Sectional data.
C. Time (in minutes) taken by student to reach school from his home is a continuous
variable.
D. Comments on a youtube video comes under the unstructured data.

3. What is the value of x ? Enter the answer correct of two decimal places.

JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40
[Type here]

4. If the number of books read by Prateek is same as the number of books read by
Sonakshi, then find the value of y+z.

5. Create a two-way contingency table and find out the number of males in this group
who own a smartphone ?
A. 0
B. 2
C. 3
D. 4
6. Choose the correct option(s) after making a two-way contingency table.
A. There are 40% of the males who do not own a smartphone.
B. There are 14.81% of the females who own a smartphone.
C. 18.75% of the total students own a smartphone.
D. We can calculate covariance to find the association between ‘Gender’ and
‘Ownership of the smartphone’.
7. Consider the following three statements :
I. Election symbol is a categorical variable.
II. Election symbol has a nominal scale of measurement.
III. Number of votes received by a candidate is a continuous variable.
A. Statement 2 and Statement 3 both are correct.
B. Statement 1 and Statement 3 both are correct.
C. Statement 1 and Statement 2 both are correct.
D. All statements are correct.
8. Choose the correct statements form the following :
A. Descriptive statistics is concerned with drawing of conclusions from the sample
data.
B. Inferential statistics is concerned with describing and summarizing the data.
C. Inferential statistics doesn’t require sample data.
JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40
[Type here]

D. All statements are incorrect.

9.
A. A<B<C<D
B. B<C<D<A
C. B<A<C<D
D. A<D<C<B

10.
11.

The marks (out of 100) scored by Manoj in a semester exam are given as
60,70,65,75,80. If Nitin has scored 5 marks more than Manoj in each subject.
Based on the given information, answer the following subquestions.
12. Find the mean of the marks scored by Nitin.
13.

JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40
[Type here]

A. 25
B. 50
C. 12.5
D. Cannot determined
14. Calculate the correlation coefficient between the marks scored by Manoj and Nitin.

15.What will be the median age for this group ?


16. How many people are above 23 years of Age in this given stem and leaf plot ?

17.
18. Choose the correct option(s) :
A. 25th percentile is known as the first quartile.
B. Median is the 60th percentile of any data.
C. Inter-quartile is defined as the difference between third quartile and second
quartile.
D. We need to arrange the data in ascending order to calculate the percentile.

ANSWER
JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40
[Type here]

1. The data is bimodal. Median is not defied for the given data.
2. Times (in minutes) taken by a student to reach school from his home is a continuous
variable. Comments on a youtube video comes under the unstructured data.
3. 0.23 to 0.27
4. 0.4
5. 2
6. There are 14.81% of the females who own a smartphone. 18.75% of the total
students own a smartphone.
7. Statement 1 and Statement 2 both are correct.
8. All statements are incorrect.
9. B < C < D < A
10. 9.90 to 9.96
11. 0
12. 75
13. 12.5
14.1
15. 31
16.8
17.90
18. 25th percentile is known as the first quartile. We need to arrange the data in
ascending order to calculate the percentile.

JOIN GROUP THROUGH THIS LINK :


https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40

JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40
[Type here]

JOIN THIS GROUP FOR NOTES, GRADED ASSIGNMENT SOLUTIONs and OTHER RESOURCES ..
https://chat.whatsapp.com/J0R07YcJfYc8dKZdaHYs40

You might also like