MCQs Topic 5 Emerging Issues in Data Analytics
MCQs Topic 5 Emerging Issues in Data Analytics
1. What is the term used to describe the process of using machine learning algorithms to analyze large
datasets and identify patterns that can be used to make predictions?
b) Deep learning
c) Predictive modeling
2. What is the process of collecting and analyzing data in real-time, and using that data to make
immediate decisions called?
a) Real-time analytics
b) Predictive analytics
c) Prescriptive analytics
d) Descriptive analytics
1. What is the term used to describe the problem of making predictions based on historical data that
may not accurately reflect future trends?
a) Overfitting
b) Sampling bias
c) Selection bias
d) Confounding variables
Answer: a) Overfitting
2. What is the term used to describe the process of adjusting statistical models to account for errors or
inaccuracies in the data?
a) Calibration
b) Validation
c) Bias correction
d) Regularization
Answer: d) Regularization
1. What is the term used to describe the process of removing personally identifiable information from
datasets in order to protect privacy?
a) Data anonymization
b) Data encryption
c) Data classification
d) Data mining
2. What is the term used to describe the practice of using algorithms to make decisions that affect
people's lives, such as credit scores or job applications?
a) Algorithmic bias
b) Predictive modeling
c) Machine learning
d) Decision automation
1. What is the process of ensuring that data is not accessed or modified by unauthorized users?
a) Data backup
b) Data encryption
d) Data retention
3. What is the term used to describe the process of storing data in multiple locations, in order to prevent
loss of data due to hardware failure or other disasters?
a) Data redundancy
b) Data availability
c) Data privacy
d) Data retention
1. What is the term used to describe the process of using multiple processors or computers to perform
data analysis tasks?
a) Parallel computing
b) Distributed computing
c) Cloud computing
d) Grid computing
2. What is the term used to describe the process of summarizing large datasets into smaller, more
manageable subsets?
a) Sampling
b) Filtering
c) Aggregation
d) Clustering
Answer: c) Aggregation
3. Which of the following is a limitation of using SQL to analyse large datasets?
Answer: b) SQL is slow and inefficient when dealing with large datasets
4. What is the term used to describe the process of using statistical models to predict future outcomes
based on historical data?
a) Predictive modeling
b) Descriptive modelling
c) Prescriptive modelling
d) Inferential modeling
5. What is the term used to describe the process of identifying outliers or anomalies in data?
a) Data profiling
b) Data cleansing
c) Data quality
d) Data scrubbing
6. Which of the following is a technique used in data analytics to identify patterns or relationships
between variables?
a) Regression analysis
b) Decision trees
c) Clustering
d) Neural networks