Data Analyst Multiple Choice Questions

The document contains multiple-choice questions covering three main topics: Statistics, SQL, and Excel, with a total of 60 questions. Each question includes four answer options, with the correct answers marked. Additionally, there are 20 questions related to Power BI, focusing on its features and functionalities.

Uploaded by

complex web

Statistics (20 Questions)

1. What is the measure of central tendency that is most affected by extreme values?
o A. Mean *
o B. Median
o C. Mode
o D. Range
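Illustration for question 1: a minimal sketch, using invented salary figures, of how a single extreme value pulls the mean while the median barely moves. The numbers are assumptions chosen only for the example.

```python
from statistics import mean, median

# Hypothetical salaries (in thousands); the second list adds one extreme value.
salaries = [40, 42, 45, 47, 50]
with_outlier = salaries + [500]

mean_before, mean_after = mean(salaries), mean(with_outlier)
median_before, median_after = median(salaries), median(with_outlier)

print(mean_before, mean_after)      # the mean jumps sharply
print(median_before, median_after)  # the median shifts only slightly
```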
2. What does a p-value less than 0.05 indicate in hypothesis testing?
o A. The null hypothesis should be accepted
o B. The test is invalid
o C. There is strong evidence against the null hypothesis *
o D. The alternative hypothesis is false
3. Which of the following distributions is typically used for inference about a mean when the sample size is small (n < 30) and the population standard deviation is unknown?
o A. Normal distribution
o B. Poisson distribution
o C. Student’s t-distribution *
o D. Chi-square distribution
4. In regression analysis, what does a high R-squared value indicate?
o A. The model explains most of the variability in the dependent variable *
o B. The model is not statistically significant
o C. The model has multicollinearity issues
o D. The independent variables are not correlated
5. What is the purpose of standard deviation in statistics?
o A. To measure the central tendency
o B. To measure the dispersion of data points from the mean *
o C. To calculate probability
o D. To determine correlation strength
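Illustration for question 5: a short sketch, with two made-up datasets, showing that standard deviation captures spread around the mean even when the means are identical. It uses the population standard deviation (`pstdev`); a sample would use `stdev` instead.

```python
from statistics import mean, pstdev

# Two invented datasets with the same mean but very different spread.
tight = [48, 49, 50, 51, 52]
wide = [10, 30, 50, 70, 90]

print(mean(tight), mean(wide))      # both means are 50
print(pstdev(tight), pstdev(wide))  # the second dataset is far more dispersed
```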
6. What type of data is used in a chi-square test?
o A. Continuous data
o B. Categorical data *
o C. Interval data
o D. Ordinal data
7. In hypothesis testing, what is the null hypothesis?
o A. A hypothesis that assumes no effect or difference *
o B. A hypothesis that assumes a significant effect
o C. The main research hypothesis
o D. An untestable assumption
8. What does a confidence interval of 95% mean?
o A. 95% of the data falls within the mean
o B. About 95% of intervals constructed this way from repeated samples would contain the true parameter *
o C. The interval is always accurate
o D. It means the hypothesis is 95% correct
9. What type of correlation exists when both variables increase together?
o A. Negative correlation
o B. Zero correlation
o C. Positive correlation *
o D. No correlation
10. What is the main goal of descriptive statistics?

• A. To infer conclusions from data
• B. To summarize and describe data *
• C. To test hypotheses
• D. To predict future trends

11. What is an outlier in statistics?

• A. A missing data point
• B. A value significantly different from others in a dataset *
• C. A value within one standard deviation of the mean
• D. The most frequent value in a dataset

12. In a normal distribution, what percentage of data falls within one standard deviation of the mean?

• A. 50%
• B. 68% *
• C. 90%
• D. 99%
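Illustration for question 12: the 68% figure can be checked from the normal distribution itself. This sketch relies on the standard identity that the probability of falling within k standard deviations of the mean is erf(k / sqrt(2)); the function name `share_within` is made up for the example.

```python
import math

def share_within(k: float) -> float:
    """Probability that a normal variable lies within k standard deviations of its mean."""
    return math.erf(k / math.sqrt(2))

print(round(share_within(1), 4))  # roughly 0.68
print(round(share_within(2), 4))  # roughly 0.95
```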

13. What is a Type I error in hypothesis testing?

• A. Accepting the null hypothesis when it is false
• B. Rejecting the null hypothesis when it is true *
• C. Accepting the alternative hypothesis when it is false
• D. Failing to reject the alternative hypothesis

14. What does a z-score represent in statistics?

• A. The number of standard deviations a data point is from the mean *
• B. The average of a dataset
• C. The total sum of all data points
• D. The percentage of missing values in a dataset
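Illustration for question 14: a minimal sketch of the z-score formula, z = (x - mean) / standard deviation, applied to invented exam scores. It assumes the data are treated as a whole population, hence `pstdev`.

```python
from statistics import mean, pstdev

def z_score(x, data):
    """Number of population standard deviations that x lies from the mean of data."""
    return (x - mean(data)) / pstdev(data)

scores = [60, 70, 80, 90, 100]  # made-up exam scores
print(z_score(100, scores))     # positive: above the mean
print(z_score(80, scores))      # zero: exactly at the mean
```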

15. What type of test is used to compare the means of two independent groups?

• A. ANOVA
• B. Chi-square test
• C. Independent t-test *
• D. Correlation analysis
16. What is the purpose of the central limit theorem?

• A. To explain why the distribution of sample means approaches a normal distribution as the sample size grows *
• B. To measure correlation between variables
• C. To describe the relationship between probability and statistics
• D. To determine if data is normally distributed
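Illustration for question 16: a small simulation sketch of the central limit theorem. It draws repeated samples from a uniform population (clearly not normal) and checks that the sample means cluster around the population mean with spread close to sigma / sqrt(n). The sample size, trial count, and seed are arbitrary choices for the example.

```python
import random
from statistics import mean, pstdev

random.seed(0)  # fixed seed so the run is repeatable

def sample_means(n, trials=2000):
    """Means of `trials` samples of size n drawn from a uniform(0, 1) population."""
    return [mean([random.random() for _ in range(n)]) for _ in range(trials)]

means = sample_means(50)
expected_se = (1 / 12) ** 0.5 / 50 ** 0.5  # sigma / sqrt(n) for uniform(0, 1)

print(mean(means))    # close to the population mean 0.5
print(pstdev(means))  # close to expected_se
```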

17. What is multicollinearity in regression analysis?

• A. A situation where independent variables are highly correlated *
• B. A situation where dependent variables are unrelated
• C. A method for normalizing data
• D. A technique for data transformation

18. What does heteroscedasticity refer to in regression analysis?

• A. Constant variance of residuals
• B. Non-constant variance of residuals *
• C. High correlation between predictors
• D. A measure of model accuracy

19. Which statistical test is used to compare the means of three or more groups?

• A. T-test
• B. ANOVA *
• C. Chi-square test
• D. Regression analysis

20. What is the primary purpose of inferential statistics?

• A. To collect data
• B. To summarize data
• C. To make predictions or generalizations about a population *
• D. To clean raw data

SQL (20 Questions)

21. Which SQL statement is used to retrieve data from a database?

• A. SELECT *
• B. UPDATE
• C. INSERT
• D. DELETE

22. What SQL clause is used to filter results based on conditions?

• A. ORDER BY
• B. GROUP BY
• C. WHERE *
• D. HAVING

23. Which SQL clause is used to sort the result set?
A. GROUP BY
B. ORDER BY *
C. WHERE
D. HAVING
24. What does the GROUP BY clause do in SQL?
A. Filters rows based on a condition
B. Groups rows that have the same values into summary rows *
C. Sorts the result set in ascending order
D. Combines two tables
25. Which SQL function is used to count the number of rows in a table?
A. SUM()
B. AVG()
C. COUNT() *
D. MAX()
26. What is the purpose of the HAVING clause in SQL?
A. To filter rows before grouping
B. To filter groups after the GROUP BY clause *
C. To sort the result set
D. To join tables
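Illustration for questions 22 to 26: a hedged sketch that runs one query through Python's built-in `sqlite3` module to show WHERE filtering rows before grouping and HAVING filtering groups afterwards. The `orders` table and its values are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount REAL)")  # hypothetical table
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("ann", 50), ("ann", 70), ("bob", 20), ("cat", 200), ("bob", 15)],
)

# WHERE filters rows before grouping; HAVING filters the groups afterwards.
rows = conn.execute(
    """
    SELECT customer, COUNT(*) AS n_orders, SUM(amount) AS total
    FROM orders
    WHERE amount > 10
    GROUP BY customer
    HAVING SUM(amount) >= 100
    ORDER BY total DESC
    """
).fetchall()
print(rows)  # bob's group is dropped because its total is below 100
```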
27. Which SQL statement is used to update existing records in a table?
A. INSERT
B. UPDATE *
C. DELETE
D. MODIFY
28. What does the JOIN clause do in SQL?
A. Combines rows from two or more tables based on a related column *
B. Filters rows based on a condition
C. Groups rows based on a condition
D. Sorts the result set
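Illustration for question 28: a minimal sketch comparing INNER JOIN and LEFT JOIN with Python's built-in `sqlite3` module. The `customers` and `orders` tables are hypothetical; the point is that the customer without orders appears only in the LEFT JOIN result.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, item TEXT);
    INSERT INTO customers VALUES (1, 'ann'), (2, 'bob');
    INSERT INTO orders VALUES (10, 1, 'book'), (11, 1, 'pen');
    """
)

# INNER JOIN keeps only customers that have at least one matching order.
inner = conn.execute(
    "SELECT c.name, o.item FROM customers c "
    "INNER JOIN orders o ON o.customer_id = c.id ORDER BY o.id"
).fetchall()

# LEFT JOIN keeps every customer; missing order columns come back as NULL (None).
left = conn.execute(
    "SELECT c.name, o.item FROM customers c "
    "LEFT JOIN orders o ON o.customer_id = c.id ORDER BY c.id, o.id"
).fetchall()

print(inner)
print(left)
```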
29. Which type of JOIN returns all rows from both tables, even if there is no match?
A. INNER JOIN
B. LEFT JOIN
C. RIGHT JOIN
D. FULL OUTER JOIN *
30. What is the purpose of the DISTINCT keyword in SQL?
A. To sort the result set
B. To remove duplicate rows from the result set *
C. To filter rows based on a condition
D. To count the number of rows
31. Which SQL statement is used to delete records from a table?
A. REMOVE
B. DELETE *
C. DROP
D. TRUNCATE
32. What does the TRUNCATE TABLE statement do?
A. Deletes specific rows from a table
B. Deletes all rows from a table but keeps the table structure *
C. Deletes the entire table
D. Updates all rows in a table
33. Which SQL function is used to find the average value of a numeric column?
A. SUM()
B. AVG() *
C. COUNT()
D. MAX()
34. What is the purpose of the UNION operator in SQL?
A. Combines rows from two tables based on a condition
B. Combines the result sets of two SELECT statements into one *
C. Filters rows based on a condition
D. Groups rows based on a condition
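Illustration for question 34: a short sketch, again using Python's built-in `sqlite3` module and invented tables, showing that UNION merges two result sets and removes duplicate rows, whereas UNION ALL keeps them.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE store_a (product TEXT);
    CREATE TABLE store_b (product TEXT);
    INSERT INTO store_a VALUES ('apple'), ('pear');
    INSERT INTO store_b VALUES ('pear'), ('plum');
    """
)

union = conn.execute(
    "SELECT product FROM store_a UNION SELECT product FROM store_b ORDER BY product"
).fetchall()
union_all = conn.execute(
    "SELECT product FROM store_a UNION ALL SELECT product FROM store_b ORDER BY product"
).fetchall()

print(union)      # 'pear' appears once
print(union_all)  # 'pear' appears twice
```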
35. Which SQL statement is used to create a new table?
A. CREATE TABLE *
B. INSERT TABLE
C. ADD TABLE
D. MAKE TABLE
36. What does the ALTER TABLE statement do?
A. Deletes a table
B. Modifies the structure of an existing table *
C. Inserts new rows into a table
D. Updates existing rows in a table
37. Which SQL constraint ensures that a column cannot have NULL values?
A. UNIQUE
B. PRIMARY KEY
C. NOT NULL *
D. FOREIGN KEY
38. What is the purpose of the FOREIGN KEY constraint?
A. Ensures that all values in a column are unique
B. Links two tables together by referencing the primary key of another table *
C. Ensures that a column cannot have NULL values
D. Automatically increments the value of a column
39. Which SQL function is used to find the highest value in a column?
A. MIN()
B. MAX() *
C. AVG()
D. COUNT()
40. What does the LIKE operator do in SQL?
A. Compares two values for equality
B. Searches for a specified pattern in a column *
C. Filters rows based on a range of values
D. Groups rows based on a condition
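Illustration for question 40: a minimal sketch of LIKE pattern matching run through Python's built-in `sqlite3` module. The `employees` table is invented. Case sensitivity of LIKE differs between database systems, so the example sticks to patterns where case does not matter.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT)")  # hypothetical table
conn.executemany(
    "INSERT INTO employees VALUES (?)",
    [("Anna",), ("Brian",), ("Joanna",), ("Ann",)],
)

# % matches any run of characters; _ matches exactly one character.
starts_with_an = conn.execute(
    "SELECT name FROM employees WHERE name LIKE 'An%' ORDER BY name"
).fetchall()
four_letters_ending_na = conn.execute(
    "SELECT name FROM employees WHERE name LIKE '__na' ORDER BY name"
).fetchall()

print(starts_with_an)
print(four_letters_ending_na)
```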

Excel (20 Questions)

41. Which Excel function is used to calculate the sum of a range?

• A. AVERAGE
• B. COUNT
• C. SUM *
• D. CONCATENATE

42. Which Excel feature allows users to filter data based on conditions?

• A. Pivot Table
• B. Data Validation
• C. Conditional Formatting
• D. AutoFilter *

43. What is the shortcut key to open the Format Cells dialog box in Excel?

• A. Ctrl + F
• B. Ctrl + 1 *
• C. Ctrl + Shift + F
• D. Alt + Enter

44. Which function is used to find the highest value in a range?

• A. MIN
• B. MAX *
• C. LARGE
• D. AVERAGE

45. What is the purpose of the VLOOKUP function in Excel?

• A. To look up values in a row
• B. To look up a value in the first column of a range and return a value from the same row *
• C. To validate data entry
• D. To find duplicate values

46. What feature allows you to quickly apply a format based on specific conditions?

• A. Data Validation
• B. Conditional Formatting *
• C. Pivot Table
• D. Goal Seek

47. Which Excel tool allows you to summarize data and analyze patterns?

• A. Pivot Table *
• B. AutoFilter
• C. CONCATENATE
• D. Flash Fill

48. What does the CONCATENATE function do?

• A. Merges multiple cells into one
• B. Combines text from different cells *
• C. Counts the number of characters in a cell
• D. Converts text to uppercase

49. How can you lock a cell reference in a formula?

• A. Using the & symbol
• B. Using the $ symbol *
• C. Using parentheses
• D. Using the % symbol

50. What is the default file format of an Excel workbook?

• A. .xls
• B. .xlsx *
• C. .csv
• D. .xlsm
51. Which function is used to count only numeric values in a range?

• A. COUNTA
• B. COUNT *
• C. COUNTIF
• D. COUNTBLANK

52. What does the TRIM function do in Excel?

• A. Removes all spaces in a text string
• B. Removes leading, trailing, and extra spaces between words *
• C. Trims text to a specified length
• D. Converts text to lowercase
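Illustration for question 52: a Python analogue (not Excel itself) of the behaviour described in option B. The helper name `excel_like_trim` is made up, and the sketch only handles ordinary space characters.

```python
def excel_like_trim(text: str) -> str:
    """Strip leading/trailing spaces and collapse runs of spaces to a single space."""
    return " ".join(part for part in text.split(" ") if part)

print(repr(excel_like_trim("  Data   Analyst  ")))  # 'Data Analyst'
```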

53. What is the purpose of the IF function?

• A. To filter data
• B. To perform logical tests *
• C. To find duplicate values
• D. To create charts

54. How do you apply an absolute reference to a cell in a formula?

• A. Use parentheses around the reference
• B. Add a $ symbol before the column and row *
• C. Use brackets around the reference
• D. Add a # symbol before the column and row

55. What is the primary use of the INDEX function in Excel?

• A. To return the position of a value in a range
• B. To return a value from a specified row and column *
• C. To look up values in a vertical range
• D. To create a table of contents

56. Which function can be used to count cells that meet a specific condition?

• A. COUNT
• B. COUNTA
• C. COUNTIF *
• D. COUNTBLANK

57. What does the MATCH function do?

• A. It looks up and returns a value from a range
• B. It returns the relative position of a value in a range *
• C. It combines text values
• D. It finds the sum of a range
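Illustration for questions 55 and 57: a Python analogue (not Excel itself) of how MATCH and INDEX are commonly paired. MATCH finds a 1-based position in one column and INDEX returns the value at that position in another. The helper functions and the product data are hypothetical, and only exact matching on a single column is sketched.

```python
def match(value, values):
    """Like MATCH(value, range, 0): 1-based position of the first exact match."""
    return values.index(value) + 1

def index(values, position):
    """Like INDEX(range, position) on a single column: value at a 1-based position."""
    return values[position - 1]

products = ["pen", "book", "lamp"]  # hypothetical lookup column
prices = [1.5, 12.0, 30.0]          # hypothetical return column

pos = match("book", products)  # position of "book" in the lookup column
price = index(prices, pos)     # value at the same position in the return column
print(pos, price)
```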

58. What feature in Excel automatically fills in values based on a pattern it detects in your data?

• A. Flash Fill *
• B. AutoFormat
• C. Data Validation
• D. Goal Seek

59. What is the purpose of the TEXT function in Excel?

• A. To convert numbers to text in a specific format *
• B. To remove text formatting
• C. To count text values in a range
• D. To convert text to numbers

60. What does the REPT function do?

• A. Repeats a given text a specified number of times *
• B. Replaces a part of a text string
• C. Removes duplicate values
• D. Rounds a number to a specified number of digits

Power BI (20 Questions)

61. What is Power BI primarily used for?

• A. Data Visualization and Business Intelligence *
• B. Database Management
• C. Data Entry
• D. Statistical Testing

62. Which component in Power BI is used for creating reports?

• A. Power Query
• B. Power Pivot
• C. Power View *
• D. Power Automate

63. What is the primary purpose of Power BI?
A. Data visualization and business intelligence *
B. Database management
C. Data entry
D. Statistical testing
64. Which Power BI component is used to create interactive visualizations?
A. Power Query
B. Power View *
C. Power Pivot
D. Power Automate
65. What is the purpose of Power Query in Power BI?
A. To create visualizations
B. To transform and clean data *
C. To perform statistical analysis
D. To manage user permissions
66. Which Power BI feature allows you to create calculated columns?
A. Power Query
B. DAX (Data Analysis Expressions) *
C. Power View
D. Power Pivot
67. What is a measure in Power BI?
A. A column in a table
B. A calculated field used in aggregations *
C. A visualization type
D. A data source
68. Which visualization is best for showing trends over time?
A. Pie chart
B. Line chart *
C. Bar chart
D. Table
69. What is the purpose of a slicer in Power BI?
A. To filter data in a report *
B. To create calculated columns
C. To connect to a data source
D. To perform data transformations
70. Which Power BI feature allows you to create relationships between tables?
A. Power Query
B. Model View *
C. Power View
D. Power Pivot
71. What is the purpose of the DAX function CALCULATE?
A. To filter data in a calculation *
B. To create a new table
C. To merge tables
D. To create a visualization
72. Which Power BI feature allows you to share reports with others?
A. Power BI Service *
B. Power BI Desktop
C. Power Query
D. Power Pivot
73. What is the purpose of the Power BI Gateway?
A. To connect to on-premises data sources *
B. To create visualizations
C. To clean data
D. To perform statistical analysis
74. Which DAX function is used to sum values in a column?
A. SUM() *
B. AVERAGE()
C. COUNT()
D. MAX()
75. What is the purpose of the Power BI Q&A feature?
A. To ask questions about your data using natural language *
B. To create calculated columns
C. To connect to data sources
D. To clean data
76. Which Power BI feature allows you to add custom visuals to a report?
A. Power Query
B. Power View
C. Marketplace *
D. Power Pivot
77. What is the purpose of the Power BI Mobile App?
A. To create reports
B. To view and interact with reports on mobile devices *
C. To clean data
D. To perform statistical analysis
78. Which DAX function is used to calculate the average of values in a column?
A. SUM()
B. AVERAGE() *
C. COUNT()
D. MAX()
79. What is the purpose of the Power BI Report Server?
A. To host and share Power BI reports on-premises *
B. To clean data
C. To create visualizations
D. To perform statistical analysis
80. Which Power BI feature allows you to create hierarchies?
A. Power Query
B. Model View *
C. Power View
D. Power Pivot

Data Mining (10 Questions)

81. Which of the following is NOT a data mining technique?

• A. Classification
• B. Clustering
• C. Data Replication *
• D. Association Rule Mining

83. What is the purpose of classification in data mining?
A. To group similar data points together
B. To predict categorical outcomes *
C. To find relationships between variables
D. To reduce data dimensionality
84. Which algorithm is commonly used for clustering?
A. Decision Tree
B. K-Means *
C. Linear Regression
D. Apriori
85. What is the purpose of association rule mining?
A. To find relationships between variables in large datasets *
B. To predict numerical outcomes
C. To group similar data points together
D. To reduce data dimensionality
86. Which technique is used to reduce the number of features in a dataset?
A. Clustering
B. Dimensionality Reduction *
C. Classification
D. Association Rule Mining
87. What is the purpose of anomaly detection in data mining?
A. To identify unusual patterns or outliers *
B. To group similar data points together
C. To predict categorical outcomes
D. To find relationships between variables
88. Which algorithm is commonly used for classification?
A. K-Means
B. Decision Tree *
C. Apriori
D. Principal Component Analysis
89. What is the purpose of regression in data mining?
A. To predict numerical outcomes *
B. To group similar data points together
C. To find relationships between variables
D. To reduce data dimensionality
90. Which technique is used to discover patterns in sequential data?
A. Clustering
B. Sequence Mining *
C. Classification
D. Association Rule Mining
91. What is the purpose of text mining in data mining?
A. To analyze and extract meaningful information from text data *
B. To predict numerical outcomes
C. To group similar data points together
D. To reduce data dimensionality

Python for Data Analysis (20 Questions)

91. What is the purpose of the Pandas library in Python?
A. Data visualization
B. Data manipulation and analysis *
C. Machine learning
D. Web scraping
92. Which of the following libraries is used for data visualization in Python?
A. Matplotlib *
B. Scikit-learn
C. NumPy
D. Pandas
95. What does the read_csv() function in Pandas do?
A. Reads data from a CSV file into a DataFrame *
B. Writes data to a CSV file
C. Cleans data in a DataFrame
D. Visualizes data
96. Which Python library is used for machine learning?
A. Pandas
B. Matplotlib
C. Scikit-learn *
D. NumPy
97. What is the purpose of the groupby() function in Pandas?
A. To group rows of data based on a column *
B. To merge two DataFrames
C. To clean missing data
D. To visualize data
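Illustration for question 97: a plain-Python sketch of the split-apply-combine idea behind `groupby()`. It deliberately avoids the Pandas API so it runs with the standard library only; the rows are invented, and the result is roughly what a group-wise sum over a `region` column would give.

```python
from collections import defaultdict

# Made-up rows; in Pandas these would be rows of a DataFrame.
rows = [
    {"region": "north", "sales": 100},
    {"region": "south", "sales": 80},
    {"region": "north", "sales": 50},
]

# Split: collect the sales values for each region.
groups = defaultdict(list)
for row in rows:
    groups[row["region"]].append(row["sales"])

# Apply and combine: aggregate each group into one summary value.
totals = {region: sum(values) for region, values in groups.items()}
print(totals)
```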
98. Which function is used to check for missing values in a Pandas DataFrame?
A. isnull() *
B. dropna()
C. fillna()
D. replace()
99. What is the purpose of the merge() function in Pandas?
A. To combine two DataFrames based on a common column *
B. To group rows of data
C. To clean missing data
D. To visualize data
100. Which Python library is used for numerical computations?
A. Pandas
B. Matplotlib
C. NumPy *
D. Scikit-learn
101. What does the describe() function in Pandas do?
A. Provides summary statistics for numerical columns *
B. Cleans missing data
C. Visualizes data
D. Groups rows of data
102. Which function is used to drop missing values in a Pandas DataFrame?
A. isnull()
B. dropna() *
C. fillna()
D. replace()
103. What is the purpose of the pivot_table() function in Pandas?
A. To create a summary table with aggregated data *
B. To merge two DataFrames
C. To clean missing data
D. To visualize data
104. Which Python library builds on Matplotlib to provide higher-level statistical visualizations?
A. Matplotlib
B. Seaborn *
C. Pandas
D. NumPy
105. What does the corr() function in Pandas do?
A. Calculates the correlation between columns *
B. Cleans missing data
C. Groups rows of data
D. Visualizes data
106. Which function is used to apply a function along an axis (to each row or column) of a Pandas DataFrame?
A. apply() *
B. map()
C. filter()
D. reduce()
107. What is the purpose of the concat() function in Pandas?
A. To concatenate DataFrames vertically or horizontally *
B. To merge DataFrames based on a common column
C. To clean missing data
D. To visualize data
108. Which Python library is commonly used for statistical time series modeling, such as ARIMA models?
A. Pandas
B. NumPy
C. Statsmodels *
D. Scikit-learn
109. What does the value_counts() function in Pandas do?
A. Counts the frequency of unique values in a column *
B. Cleans missing data
C. Groups rows of data
D. Visualizes data
110. Which function is used to rename columns in a Pandas DataFrame?
A. rename() *
B. replace()
C. drop()
D. fillna()
111. What is the purpose of the cut() function in Pandas?
A. To bin numerical data into discrete intervals *
B. To clean missing data
C. To group rows of data
D. To visualize data
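Illustration for question 111: a standard-library sketch of binning numeric values into labelled intervals, the idea behind `cut()`. It is not the Pandas implementation; the edges, labels, and the helper `bin_label` are assumptions for the example. Intervals are treated as right-closed, and values outside every interval map to `None`.

```python
from bisect import bisect_left

edges = [0, 18, 65, 120]               # hypothetical bin edges
labels = ["minor", "adult", "senior"]  # one label per interval

def bin_label(x):
    """Label of the right-closed interval (edges[i], edges[i+1]] containing x, else None."""
    i = bisect_left(edges, x) - 1
    return labels[i] if 0 <= i < len(labels) else None

ages = [5, 18, 19, 65, 70, 130]
binned = [bin_label(a) for a in ages]
print(binned)
```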
112. Which Python library is used for natural language processing?
A. Pandas
B. NLTK *
C. NumPy
D. Scikit-learn

Data Engineering & Data Warehouse (30 Questions)

101. What is the primary function of a data warehouse?

• A. Storing transactional data
• B. Analyzing and reporting historical data *
• C. Performing real-time updates
• D. Managing user authentication
102. Which of the following is a common ETL tool?

• A. Power BI
• B. Tableau
• C. Apache NiFi
• D. Talend *

103. What is the main advantage of using a star schema in a data warehouse?

• A. Normalized data storage
• B. Improved query performance *
• C. Reduced storage requirements
• D. Increased data redundancy

104. What does OLAP stand for?

• A. Online Analytical Processing *
• B. Online Application Processing
• C. Offline Analytical Processing
• D. Operational Layered Application Platform

105. Which type of database system is commonly used in data warehousing?

• A. NoSQL databases
• B. Relational databases *
• C. Time-series databases
• D. Key-value stores

106. In a data warehouse, what is a fact table?

• A. A table that stores dimensional data
• B. A table that contains business process measurements *
• C. A table used for indexing data
• D. A table that stores metadata

107. What is the main difference between OLAP and OLTP?

• A. OLAP is optimized for transactional processing, while OLTP is optimized for analytics
• B. OLAP is optimized for analytics, while OLTP is optimized for transactional processing *
• C. OLAP and OLTP are the same
• D. OLAP focuses on operational data, while OLTP focuses on historical data

108. What is data partitioning in a data warehouse?

• A. Dividing a database into multiple logical sections *
• B. Encrypting data for security
• C. Creating duplicate copies of data
• D. Reducing data redundancy

109. What is the purpose of a dimension table in a data warehouse?

• A. To store transactional records
• B. To provide descriptive attributes for analysis *
• C. To hold aggregated metrics
• D. To replace fact tables

110. Which cloud service is widely used for data warehousing?

• A. AWS Redshift *
• B. Google Drive
• C. Microsoft OneDrive
• D. GitHub

111. What is the purpose of ETL in data engineering?
A. To extract, transform, and load data into a data warehouse *
B. To visualize data
C. To perform statistical analysis
D. To clean data
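Illustration for question 111: a toy sketch of the three ETL steps using only the standard library. An in-memory CSV stands in for a source file and an in-memory SQLite table stands in for the warehouse; the column names, the cleaning rules, and the `sales` table are all assumptions for the example.

```python
import csv
import io
import sqlite3

# Extract: read raw rows (an in-memory CSV stands in for a source file).
raw = io.StringIO("name,amount\n ann ,10\nBOB,n/a\ncat,25\n")
records = list(csv.DictReader(raw))

# Transform: standardize names and drop rows whose amount is not numeric.
clean = []
for rec in records:
    try:
        clean.append((rec["name"].strip().lower(), float(rec["amount"])))
    except ValueError:
        continue  # skip rows such as "n/a" that cannot be parsed

# Load: write the cleaned rows into a target table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (name TEXT, amount REAL)")
conn.executemany("INSERT INTO sales VALUES (?, ?)", clean)
loaded = conn.execute("SELECT name, amount FROM sales ORDER BY name").fetchall()
print(loaded)
```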
112. Which tool is commonly used for data pipeline orchestration?
A. Apache Airflow *
B. Tableau
C. Power BI
D. Talend
113. What is the purpose of a data lake?
A. To store raw data in its native format, whether structured, semi-structured, or unstructured, for later analysis *
B. To perform real-time analytics
C. To clean and transform data
D. To visualize data
114. Which of the following is a characteristic of a data warehouse?
A. Optimized for real-time transactions
B. Optimized for historical data analysis *
C. Stores unstructured data
D. Requires no schema
115. What is the purpose of a slowly changing dimension (SCD)?
A. To track changes in dimension data over time *
B. To store transactional data
C. To clean data
D. To visualize data

116. What is the purpose of Change Data Capture (CDC)?

A. To visualize data changes

B. To capture and track data changes in a database *

C. To perform statistical analysis

D. To clean data

117. What is the purpose of a data pipeline?

A. To visualize data

B. To automate the flow of data from source to destination *

C. To perform statistical analysis

D. To clean data

118. Which of the following is a characteristic of a data lake?

A. Structured data

B. Schema-on-write

C. Schema-on-read *

D. Transactional processing

119. What is the purpose of data modeling in data warehousing?

A. To visualize data

B. To design the structure of the data warehouse *

C. To perform statistical analysis

D. To clean data

120. What is the purpose of a staging area in ETL?

A. To store transformed data

B. To store raw data before transformation *

C. To visualize data

D. To perform statistical analysis

121. What is the purpose of a data vault architecture?

A. To provide a highly normalized data warehouse *

B. To provide a denormalized data warehouse

C. To provide a data lake

D. To provide a data mart

122. Which of the following is a cloud-based ETL service?

A. Apache Airflow

B. Apache NiFi

C. AWS Glue *

D. Talend

123. What is the purpose of a data mart?

A. A central repository for all enterprise data

B. A subset of a data warehouse focused on a specific business unit *

C. A repository for raw data

D. A tool for data visualization

124. What is the purpose of data lineage?

A. To track the flow of data from source to destination *

B. To visualize data

C. To perform statistical analysis

D. To clean data

125. Which of the following is a NoSQL database commonly used in data engineering?

A. MySQL

B. PostgreSQL

C. MongoDB *

D. SQL Server

126. What is the purpose of data orchestration?

A. To visualize data

B. To automate and manage data pipelines *

C. To perform statistical analysis

D. To clean data

127. What is the purpose of a message queue in data engineering?

A. To store data

B. To decouple data producers and consumers *

C. To visualize data

D. To perform statistical analysis

128. What is the purpose of data versioning?

A. To track changes to data over time *

B. To visualize data

C. To perform statistical analysis

D. To clean data

129. Which technique is used to ensure data consistency in a distributed system?

A. Data visualization

B. Data replication *

C. Data cleaning

D. Data analysis

130. In the ETL process, what does the "Transform" step primarily involve?

A. Moving data from its source to a staging area.

B. Cleaning, standardizing, and enriching data. *

C. Loading processed data into a target system.

D. Monitoring data quality.

Ethical Issues, Data Privacy, and Data Literacy (20 Questions)

1. What is data privacy?

A. The ability to access any data

B. The right of individuals to control their personal data *

C. The process of cleaning data

D. The ability to visualize data

2. What is data literacy?

A. The ability to read and write data

B. The ability to understand, analyze, and communicate with data *

C. The process of cleaning data

D. The ability to visualize data

3. What is algorithmic bias?

A. Fair and unbiased algorithms

B. Systematic and repeatable errors in an algorithm that create unfair outcomes *

C. Random errors in algorithms

D. Algorithms with no errors

4. What is informed consent in the context of data collection?

A. Collecting data without telling individuals

B. Telling individuals their data is being collected, but not why

C. Obtaining permission from individuals after fully explaining how their data will be used *

D. Collecting data only from public sources

5. What is data anonymization?

A. Making data publicly available

B. Removing personally identifiable information from data *

C. Sharing data with third parties

D. Keeping data in its original form

6. What is the purpose of GDPR (General Data Protection Regulation)?

A. To regulate data in the United States

B. To protect the personal data of individuals in the European Union *

C. To encourage free data sharing

D. To regulate data in China

7. Which principle emphasizes that data should only be collected for specified, explicit, and legitimate purposes?

A. Data minimization

B. Purpose limitation *

C. Storage limitation

D. Accuracy

8. What is data governance?

A. The process of cleaning data

B. The overall management of the availability, usability, integrity, and security of data *

C. The process of visualizing data

D. The process of collecting data

9. What is the potential ethical issue of using predictive models for hiring decisions?

A. Lack of data

B. Algorithmic bias leading to discrimination *

C. Data accuracy

D. Data privacy

10. What is the importance of transparency in data analysis?

A. Hiding the data analysis process

B. Making the data analysis process and results understandable to stakeholders *

C. Using complex data analysis techniques

D. Only sharing data with experts

11. What is the ethical concern related to data aggregation?

A. Data accuracy

B. Potential re-identification of individuals from combined data *

C. Data storage

D. Data visualization

12. What is the purpose of differential privacy?

A. To make data publicly available

B. To add statistical noise to data to protect individual privacy *

C. To remove all data

D. To encrypt data

13. What is the ethical consideration related to data ownership?

A. Sharing all data freely

B. Determining who has the right to control and use data *

C. Ignoring data security

D. Visualizing data without context

14. What is the importance of data security?

A. Making data easily accessible

B. Protecting data from unauthorized access and breaches *

C. Sharing data without restrictions

D. Ignoring data integrity

15. What is the potential ethical issue of using facial recognition technology?

A. Data accuracy

B. Potential for mass surveillance and privacy violations *

C. Data storage

D. Data visualization

16. What does the term "data literacy" encompass beyond basic understanding of data?

A. Only data visualization skills

B. Ability to critically evaluate data and its sources *

C. Only data cleaning skills

D. Only data collection skills

17. What is the ethical concern related to data profiling?

A. Data accuracy

B. Potential for discriminatory practices based on inferred characteristics *

C. Data storage

D. Data visualization

18. What is the concept of data sovereignty?

A. Data sharing without restrictions

B. The idea that data is subject to the laws and governance structures within the country where it is collected *

C. Ignoring data privacy

D. Only using data for commercial purposes

19. What is the "right to be forgotten" in GDPR?

A. The right to share data with anyone

B. The right to have personal data erased under certain conditions *

C. The right to collect any data

D. The right to ignore data privacy

20. What is the potential ethical concern related to the use of AI in decision-making processes?

A. Data accuracy

B. The lack of transparency and explainability in AI decisions *

C. Data storage

D. Data visualization
