0% found this document useful (0 votes)
4 views

ds quiz

Data science quiz

Uploaded by

priyajenat
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views

ds quiz

Data science quiz

Uploaded by

priyajenat
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 9

### Set 1: Benefits and Uses of Data Science

1. **What is a primary benefit of data science in business?**

A) Increased data storage

**B) Improved decision-making based on data insights**

C) More complex algorithms

2. **Which of the following is an application of data science in healthcare?**

**A) Predicting patient readmission rates**

B) Managing hospital inventory

C) Scheduling staff shifts

3. **What does 'structured data' refer to?**

**A) Data that is organized in a fixed format**

B) Data that is unorganized and free-form

C) Data that can only be read by humans

4. **Which type of data is characterized by its ability to change over time?**

A) Static data

**B) Dynamic data**

C) Historical data

5. **Which step comes first in the data science process?**

A) Data preparation
**B) Defining research goals**

C) Presenting findings

6. **What is the purpose of exploratory data analysis (EDA)?**

A) To clean data

**B) To visualize data and uncover patterns**

C) To build predictive models

7. **Why is it important to define research goals in data science?**

A) To collect as much data as possible

**B) To guide the data collection and analysis process**

C) To increase computational power

8. **Which of the following is a common method for retrieving data?**

**A) Data scraping**

B) Data cleaning

C) Data visualization

9. **What is data normalization?**

A) Removing duplicate records

**B) Adjusting values to a common scale**

C) Changing data types

10. **Which visualization is commonly used in EDA to display the distribution of a dataset?**

**A) Box plot**


B) Bar chart

C) Pie chart

11. **What is the primary goal of building a predictive model?**

A) To understand past data

**B) To predict future outcomes based on historical data**

C) To clean the data

12. **What is an effective way to present data findings to stakeholders?**

A) Using technical jargon

**B) Creating clear visualizations and summaries**

C) Providing raw data without context

13. **Which of the following is a key technique used in data mining?**

**A) Clustering**

B) Data entry

C) Data formatting

14. **What is the primary purpose of a data warehouse?**

A) To store operational data for daily transactions

**B) To consolidate and analyze large amounts of historical data**

C) To perform real-time data processing

15. **What does the term 'mean' refer to in statistics?**

A) The most frequently occurring value


**B) The average of a dataset**

C) The middle value in a dataset

16. **Which of the following is an example of unstructured data?**

A) Customer names

**B) Social media posts**

C) Product prices

17. **What is a common challenge in data science?**

A) Excessive data cleaning

B) Too few algorithms

**C) Data privacy concerns**

18. **What role does data visualization play in data science?**

A) Only for aesthetic purposes

**B) To communicate insights effectively**

C) To complicate data analysis

19. **Which of the following techniques is often used for predictive modeling?**

A) Data entry

**B) Regression analysis**

C) Data storage

20. **What is the significance of data quality in data science?**

A) It does not matter if data is large


B) It only affects visualizations

**C) It directly impacts analysis results and decisions**

---

### Set 2: Types of Data and Descriptive Statistics

1. **Which of the following is an example of qualitative data?**

A) Height of individuals

**B) Colors of cars**

C) Temperature readings

2. **What type of data is represented by numerical values?**

A) Categorical data

B) Ordinal data

**C) Quantitative data**

3. **What is a continuous variable?**

**A) A variable that can take on any value within a range**

B) A variable that has a fixed number of categories

C) A variable that can only take on whole numbers

4. **Which type of variable represents categories with a meaningful order?**

A) Nominal variable

**B) Ordinal variable**


C) Discrete variable

5. **Which of the following is a common graphical representation of categorical data?**

A) Histogram

**B) Bar chart**

C) Line graph

6. **What is a frequency table used for?**

**A) To summarize data values and their counts**

B) To display data trends over time

C) To show the relationship between two variables

7. **What is the median?**

A) The sum of all values divided by the number of values

**B) The middle value when data is ordered**

C) The most frequently occurring value

8. **Which measure of central tendency is most affected by outliers?**

**A) Mean**

B) Median

C) Mode

9. **What does standard deviation measure?**

A) The average of a dataset

**B) The spread of data points around the mean**


C) The maximum value in a dataset

10. **Which of the following indicates less variability in a dataset?**

A) A high standard deviation

**B) A low standard deviation**

C) A high range

11. **What characterizes a normal distribution?**

A) Data is skewed to the left

**B) Data is symmetrically distributed around the mean**

C) Data has multiple peaks

12. **What does a z-score represent?**

A) The percentage of data below a certain value

**B) The number of standard deviations a data point is from the mean**

C) The average of a dataset

13. **In a standard normal distribution, what is the mean and standard deviation?**

**A) Mean = 0, Standard Deviation = 1**

B) Mean = 1, Standard Deviation = 0

C) Mean = 0, Standard Deviation = 0

14. **If a z-score is positive, what does that indicate?**

A) The value is below the mean

B) The value is equal to the mean


**C) The value is above the mean**

15. **What type of graph is typically used to display the frequency distribution of a continuous variable?
**

**A) Histogram**

B) Bar chart

C) Pie chart

16. **Which of the following is an example of a discrete variable?**

A) Height of a person

**B) Number of students in a class**

C) Temperature

17. **What is the range of a dataset?**

A) The sum of all values

B) The difference between the highest and lowest values

**C) The average of the values**

18. **Which of the following best describes a bimodal distribution?**

A) One peak

B) No peaks

**C) Two distinct peaks**

19. **What does it mean if data is positively skewed?**

A) Most values are on the right side of the distribution

**B) Most values are on the left side of the distribution**


C) The mean is greater than the median

20. **What is the mode of a dataset?**

A) The middle value when ordered

**B) The most frequently occurring value**

C) The average of the dataset

Feel free to use or modify these questions as needed!

You might also like