Exam Questions Based on DSI436 Module Outline

The document outlines exam questions for the DSI436 module on Data Analytics and Visualization using Matplotlib and Seaborn. It includes both theoretical and practical sections, covering topics such as data preparation, descriptive statistics, correlation, regression analysis, hypothesis testing, time series analysis, and data visualization techniques. Each question specifies the marks allocated and the key concepts to be addressed.

Uploaded by Ashley Zhanje

The exam questions below cover all key topics from the Data Analytics and Visualization Using Matplotlib and Seaborn (DSI436) module.

SECTION A: THEORY QUESTIONS

(Answer ALL questions in this section)

QUESTION 1: Introduction to Data Analytics and Visualization (10 Marks)

(a) Explain the importance of data analysis in decision-making. (4 Marks)
(b) Discuss two data analysis techniques used in the industry today. (4 Marks)
(c) What are the key components of the Python ecosystem for data analysis? (2 Marks)

QUESTION 2: Data Preparation and Cleaning [PYTHON] (15 Marks)

(a) Explain the role of data preprocessing in data analytics. (3 Marks)
(b) Describe the steps involved in data cleaning, highlighting the following:

• Handling missing values (4 Marks)
• Detecting and removing outliers (4 Marks)
• Resolving data inconsistencies (4 Marks)
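
The three cleaning steps named in part (b) could be sketched in pandas roughly as follows. This is only one possible approach: the toy dataset, the column names, median imputation, and the 1.5 × IQR outlier rule are all assumptions chosen for illustration, not requirements of the question. Inconsistencies are resolved first here so that later steps see uniform labels.

```python
import numpy as np
import pandas as pd

# Hypothetical toy dataset: names and values are invented for illustration.
df = pd.DataFrame({
    "city": ["Harare", "harare ", "Bulawayo", "Mutare", "Bulawayo", "Mutare"],
    "sales": [120.0, 130.0, np.nan, 125.0, 5000.0, 135.0],
})

# 1. Resolve inconsistencies: normalise whitespace and capitalisation.
df["city"] = df["city"].str.strip().str.title()

# 2. Handle missing values: impute with the median, which is robust to outliers.
df["sales"] = df["sales"].fillna(df["sales"].median())

# 3. Detect outliers with the 1.5 * IQR rule and drop them.
q1, q3 = df["sales"].quantile([0.25, 0.75])
iqr = q3 - q1
within_fences = df["sales"].between(q1 - 1.5 * iqr, q3 + 1.5 * iqr)
clean = df[within_fences].reset_index(drop=True)

print(clean)
```

Dropping rows is only one way to treat outliers; capping the values or investigating them as possible data-entry errors are common alternatives.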

QUESTION 3: Descriptive Statistics [EXCEL] (15 Marks)

(a) Define descriptive statistics and explain its importance in data analytics. (4 Marks)
(b) Compare and contrast mean, median, and mode as measures of central tendency. (4 Marks)
(c) Discuss two measures of dispersion used to summarize data variability. (4 Marks)
(d) Explain how boxplots and histograms help in data visualization. (3 Marks)

QUESTION 4: Correlation and Regression Analysis [EXCEL] (15 Marks)

(a) What is the difference between correlation and regression? (4 Marks)
(b) Explain the meaning of a correlation coefficient and how to interpret it. (3 Marks)
(c) Describe the process of building a simple linear regression model using Excel. (4 Marks)
(d) How do we evaluate the performance of a regression model? Discuss two techniques. (4 Marks)

QUESTION 5: Estimation & Hypothesis Testing [EXCEL] (15 Marks)

(a) Define hypothesis testing and explain its importance in data analysis. (4 Marks)
(b) Differentiate between the null hypothesis ($H_0$) and the alternative hypothesis ($H_1$). (4 Marks)
(c) Describe the steps involved in conducting a t-test in Excel. (4 Marks)
(d) Explain the concept of confidence intervals and how they are used in data estimation. (3 Marks)

QUESTION 6: Multiple Regression [EXCEL] (10 Marks)

(a) What is multiple regression analysis, and how does it differ from simple regression? (4 Marks)
(b) Discuss the significance of R-squared ($R^2$) in a multiple regression model. (3 Marks)
(c) Explain how multiple regression can be used to predict sales in a business setting. (3 Marks)

QUESTION 7: Time Series Analysis [PYTHON] (15 Marks)

(a) Define time series analysis and discuss three key characteristics of time series data. (6 Marks)
(b) Describe how to perform time series decomposition in Python using Statsmodels. (4 Marks)
(c) What is the difference between trend analysis and seasonality detection? (5 Marks)

SECTION B: PRACTICAL QUESTIONS

(Answer any THREE questions in this section)

QUESTION 8: Data Visualization with Matplotlib [PYTHON] (20 Marks)

(a) Write Python code to generate the following plots using Matplotlib:

• Line plot (4 Marks)
• Bar chart (4 Marks)
• Scatter plot (4 Marks)
• Histogram (4 Marks)

(b) Explain how to customize the following in Matplotlib:

• Adding labels and titles (2 Marks)
• Adjusting colors and line styles (2 Marks)
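
One way to approach this question is sketched below: the four plot types from part (a) are drawn on a 2 × 2 grid, and each panel also shows the customizations from part (b) (titles, axis labels, colors, and a line style). The data, the output filename, and the particular colors are invented for illustration.

```python
import matplotlib.pyplot as plt
import numpy as np

# Invented sample data, for illustration only.
rng = np.random.default_rng(42)
x = np.arange(1, 11)
y = x ** 2
categories = ["A", "B", "C", "D"]
counts = [23, 17, 35, 29]
samples = rng.normal(loc=50, scale=10, size=200)

fig, axes = plt.subplots(2, 2, figsize=(10, 8))

# Line plot with a custom color, dashed line style, and circular markers.
axes[0, 0].plot(x, y, color="tab:blue", linestyle="--", marker="o")
axes[0, 0].set_title("Line plot")
axes[0, 0].set_xlabel("x")
axes[0, 0].set_ylabel("x squared")

# Bar chart of counts per category.
axes[0, 1].bar(categories, counts, color="tab:orange")
axes[0, 1].set_title("Bar chart")
axes[0, 1].set_xlabel("Category")
axes[0, 1].set_ylabel("Count")

# Scatter plot of the same (x, y) pairs.
axes[1, 0].scatter(x, y, color="tab:green")
axes[1, 0].set_title("Scatter plot")
axes[1, 0].set_xlabel("x")
axes[1, 0].set_ylabel("x squared")

# Histogram of 200 normally distributed samples in 20 bins.
axes[1, 1].hist(samples, bins=20, color="tab:red", edgecolor="black")
axes[1, 1].set_title("Histogram")
axes[1, 1].set_xlabel("Value")
axes[1, 1].set_ylabel("Frequency")

fig.suptitle("Four basic Matplotlib plots")
fig.tight_layout()
fig.savefig("matplotlib_plots.png")  # or plt.show() in an interactive session
```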

QUESTION 9: Advanced Data Visualization with Seaborn [PYTHON] (20 Marks)

(a) Explain the role of Seaborn in statistical data visualization. (4 Marks)
(b) Write Python code to generate the following Seaborn visualizations:

• Jointplot (4 Marks)
• Pairplot (4 Marks)
• Violin plot (4 Marks)
• Heatmap (4 Marks)

QUESTION 10: Visual Perception and Design Principles (20 Marks)

(a) Explain how human visual perception influences the effectiveness of data visualizations. (5 Marks)
(b) Discuss three design principles that improve data visualization quality. (6 Marks)
(c) How do color schemes and themes enhance readability in Seaborn plots? (4 Marks)
(d) Write a Python script to apply aesthetic themes in Seaborn (use sns.set_style()). (5 Marks)

QUESTION 11: Data Reduction & Dimensionality Reduction [PYTHON] (20 Marks)

(a) What is dimensionality reduction, and why is it important? (5 Marks)
(b) Explain the difference between Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA). (6 Marks)
(c) Write Python code to perform PCA on a dataset using Scikit-Learn. (5 Marks)
(d) Describe how feature selection differs from feature extraction in data preprocessing. (4 Marks)
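
For part (c), a minimal PCA sketch is shown below. It assumes the Iris dataset bundled with Scikit-Learn and a target of two components; both are choices made for illustration. Features are standardized first because PCA is sensitive to the scale of each variable.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Iris: 150 samples, 4 numeric features (ships with scikit-learn).
X, y = load_iris(return_X_y=True)

# Standardize so each feature has zero mean and unit variance.
X_scaled = StandardScaler().fit_transform(X)

# Project the 4 features onto the 2 directions of greatest variance.
pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X_scaled)

print(X_reduced.shape)                # (150, 2)
print(pca.explained_variance_ratio_)  # share of variance kept per component
```

Because the new columns are combinations of the original features rather than a subset of them, this is an instance of feature extraction, which is the distinction part (d) asks about.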

QUESTION 12: Interactive Dashboards & Real-Time Data Visualization (20 Marks)

(a) Explain the benefits of using interactive dashboards for business intelligence. (5 Marks)
(b) Compare Tableau and Streamlit for creating dashboards. (5 Marks)
(c) Write Python code to create a Streamlit dashboard that displays a filtered dataset. (10 Marks)
