0% found this document useful (0 votes)
6 views

Module 3 Numericals

Uploaded by

agent47msr
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
6 views

Module 3 Numericals

Uploaded by

agent47msr
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

MODULE 3

NUMERICALS

1. The following are the exam scores of 10 students: 55, 60, 65, 70, 75, 80, 85, 90,
95, 100. Find the mean, median, and mode of the scores.
2. A dataset has the following values: 2, 4, 6, 8, 10. Calculate the variance and
standard deviation of the dataset.
3. In a deck of 52 cards, what is the probability of drawing a king or a red card?
4. A box contains 4 red balls, 6 green balls, and 10 blue balls. If a ball is drawn
randomly, what is the probability that it will be either green or red?
5. How many different ways can a committee of 3 people be selected from a group
of 8 individuals?
6. A student needs to arrange 5 books on a shelf. How many different ways can the
books be arranged if two of them are identical?
7. The relationship between hours studied and marks obtained is given by the
following data: Hours: [1, 2, 3, 4, 5], Marks: [50, 60, 65, 70, 80]. Using simple
linear regression, find the equation of the line (Y = aX + b), where Y is the marks
and X is the hours studied.
8. In a dataset of sales figures, the following values are observed: 15, 16, 17, 18,
200, 19, 20, 21, 22. Determine if 200 is an outlier.
9. A random sample of 50 students' exam scores has a mean of 78 and a standard
deviation of 10. Find the 95% confidence interval for the mean exam score of all
students.
10. A study shows that out of 500 patients, 80 are suffering from a particular disease.
Use this data to estimate the proportion of the population with the disease with a
95% confidence interval.
11. A dataset contains columns for age, income, and education level. Using feature
engineering, create a new column called 'Income per Age' by dividing the Income
by Age.
12. In a sales dataset, there are negative values for the total sales column, which is
unrealistic. What steps would you take to identify and correct this bad data?
13. The following are the ages of 10 students in a class: 15, 16, 18, 19, 20, 22, 23, 24,
25, 30. Find the mean, median, and mode of the ages.
14. A dataset contains the following values: 5, 8, 10, 15, 20. Calculate the range,
variance, and standard deviation.
15. In a deck of 52 cards, what is the probability of drawing a heart or a face card?
16. A box contains 3 red, 5 green, and 8 yellow balls. If a ball is drawn randomly,
what is the probability that it is neither red nor yellow?
17. A team has 7 men and 5 women. How many different ways can a committee of 4
people be selected, including at least 1 woman?

18. How many different ways can you arrange the letters of the word
"EXAMINATION"?
19. A student needs to arrange 6 books on a shelf, where 2 books are identical and 4
are different. How many different ways can the books be arranged?
20. The relationship between years of experience and salary (in thousand dollars) is
given by the following data:

Experience: [1, 2, 3, 4, 5],

Salary: [30, 35, 40, 45, 50].

21. Using simple linear regression, find the equation of the line (Y = aX + b), where
Y is the salary and X is the years of experience.
22. In a dataset of 100 sales transactions, the following values are observed:

20, 22, 23, 25, 100, 27, 28, 30, 32, 35.

Determine if 100 is an outlier.

23. A random sample of 200 customers has an average spending of $50 with a
standard deviation of $10. Find the 99% confidence interval for the average
spending of all customers.
24. A study shows that out of 800 students, 120 are overweight. Estimate the
proportion of the student population that is overweight with a 95% confidence
interval.
25. A dataset contains the following columns: Age, Height, and Weight. Using
feature engineering, create a new column called 'BMI' (Body Mass Index) using
the formula:
BMI = Weight / (Height^2).
26. In a sales dataset, there are missing values in the "Amount Sold" column. What
steps would you take to handle these missing values?
27. The average temperature of 10 cities is 25°C. The temperatures of 5 cities are as
follows: 22, 24, 26, 28, and 30°C. What is the mean temperature of the other 5
cities?
28. In a deck of cards, what is the probability of drawing a queen or a spade?
29. A box contains 2 defective and 8 non-defective light bulbs. If 4 bulbs are selected
at random, what is the probability that none of the selected bulbs is defective?
30. How many different ways can you arrange the letters of the word
"MISSISSIPPI"?
31. The following data represents the daily sales (in dollars) of a store for 7 days:

200, 220, 250, 300, 350, 400, 450. Find the mean and standard deviation of the sales.

32. A dataset contains the following values: 10, 20, 30, 40, 50. Find the 95%
confidence interval for the population mean.
33. A dataset contains the following values: 18, 20, 21, 25, 26, 30. Calculate the
variance and standard deviation.
34. A company has 3 locations, and each location has 10 employees. In how many
ways can a manager be selected from the 30 employees?
35. A student needs to arrange 7 different books and 3 identical books on a shelf.
How many different ways can the books be arranged?
36. A class of 30 students took a test. The mean score was 75, and the standard
deviation was 10. What is the probability that a randomly selected student scored
more than 85?
37. In a random sample of 50 students, the average number of hours spent on
homework is 2.5 hours with a standard deviation of 0.8 hours. Find the 95%
confidence interval for the mean number of hours spent on homework for all
students.
38. A survey of 500 individuals shows that 60% prefer online shopping. Estimate the
proportion of the population that prefers online shopping with a 99% confidence
interval.
39. A dataset contains information about individuals’ ages, incomes, and education
levels. Create a new feature called “Income per Education Level” by dividing the
income by the education level.
40. The following data represents the number of hours studied and the exam scores of
5 students:

Hours: [3, 4, 5, 6, 7],

Scores: [60, 65, 70, 75, 80].

41. Using linear regression, find the regression line equation and predict the score for
a student who studies for 8 hours.
42. A dataset contains the following values: 5, 6, 7, 8, 9, 100. Determine if 100 is an
outlier using the IQR method.

You might also like