0% found this document useful (0 votes)

4 views

End semester Answer key format-fods

The document outlines the examination details for the course 'Foundations of Data Science' at Jai Shriram Engineering College, including an answer key for various questions related to data science concepts. It covers topics such as project charters, data warehousing, correlation coefficients, and data visualization techniques using Python libraries like NumPy and Matplotlib. Additionally, it includes tasks for analyzing sales data using pandas and creating different types of plots.

Uploaded by

Dhanasekar Sethupathi

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

End semester Answer key format-fods

Uploaded by

Dhanasekar Sethupathi

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

Q.

P Code: 100114
JAI SHRIRAM ENGINEERING COLLEGE
An Autonomous Institution
B.E / B.Tech Degree Examinations Nov/ Dec – 2024

Course Code: CS3352 Course Name: Foundations of Data Science

Semester:3rd semester Max Marks: 100

Answer key
Part – A
10 x 2 = 20
1. Question: Identify the importance of project charter.
Answer:

A project charter authorizes a project, defines objectives, scope, and stakeholder roles,
ensuring alignment and clarity. It acts as a reference document throughout the project
lifecycle.

2. Question: Define Data Warehousing.

Answer:
Data warehousing involves collecting and storing data from multiple sources in a
centralized repository. It facilitates efficient querying, reporting, and decision-making.

3. Question: Given the following data set: 5,7,8,10,12,14,15,18,20.Calculate the

interquartile range.
Answer:

4. Question: Apply the formula to convert Z score to original score

Answer:
5. Question: Define z scores.
Answer:

6. Question: Identify the properties of correlation coefficient.

Answer:

7. Question: Name some NumPy Array attributes.

Answer:

8. Question: Write a comment to create two-dimensional array?

Answer:

# Create a 2D NumPy array using np.array()

import numpy as np
# Create a 2D array with 3 rows and 4 columns
array_2d = np.array([[1, 2, 3, 4], [5, 6, 7, 8],[9, 10, 11, 12]])
print(array_2d)
9. Question: How can you set different colors for bar plot?
Answer:
Use the color parameter in plt.bar().
plt.bar(x, y, color=['red', 'blue', 'green'])

10. Question: State the purpose of histogram

Answer:
 To visualize the distribution of numerical data.
 Shows the frequency of data within specific intervals (bins).
 Helps identify patterns, such as skewness or modality.

Scheme of Evaluation
Part – B
5 X 13 = 65
15. b) .

import matplotlib.pyplot as plt

import numpy as np
# Generate sample data for three groups
np.random.seed(42)
# Group 1
weights_group1 = np.random.uniform(56, 64, 20)
heights_group1 = np.random.uniform(120, 180, 20)
# Group 2
weights_group2 = np.random.uniform(60, 68, 20)
heights_group2 = np.random.uniform(140, 200, 20)
# Group 3
weights_group3 = np.random.uniform(66, 72, 20)
heights_group3 = np.random.uniform(160, 240, 20)
# Plotting the scatter plot
plt.figure(figsize=(8, 6))
# Group 1
plt.scatter(weights_group1, heights_group1, label='Group 1', color='blue', alpha=0.7)
# Group 2
plt.scatter(weights_group2, heights_group2, label='Group 2', color='green', alpha=0.7)
# Group 3
plt.scatter(weights_group3, heights_group3, label='Group 3', color='red', alpha=0.7)
# Adding labels, title, and legend
plt.title("Group wise Weight vs Height scatter plot")
plt.xlabel("weight")
plt.ylabel("height")
plt.legend()
plt.grid(True)
# Show plot
plt.show()

Part – C
1 X 15 = 15
16. a) You have been provided with a CSV file named "sales_data.csv" that contains
sales data for acompany. The file has the following columns: "Date", "Product",
"Quantity", and" Revenue". Your task is to load the data into a pandas Data Frame
and perform the following analysis.
Each 3 marks
i. Calculate the total revenue generated by the company.
ii. Find the product that generated the highest revenue.
iii. Calculate the average quantity sold per day.
iv. Group the data by month and calculate the total revenue for each month.
v. Plot a line graph showing the monthly revenue over time.
Answer

Python program
import pandas as pd
import matplotlib.pyplot as plt

# Load the CSV file into a DataFrame

df = pd.read_csv("sales_data.csv")

# Ensure 'Date' column is in datetime format

df['Date'] = pd.to_datetime(df['Date'])

# i. Calculate the total revenue generated by the company

total_revenue = df['Revenue'].sum()
print(f"Total Revenue: {total_revenue}")

# ii. Find the product that generated the highest revenue

highest_revenue_product = df.groupby('Product')['Revenue'].sum().idxmax()
print(f"Product with highest revenue: {highest_revenue_product}")

# iii. Calculate the average quantity sold per day

avg_quantity_per_day = df.groupby('Date')['Quantity'].sum().mean()
print(f"Average quantity sold per day: {avg_quantity_per_day}")

# iv. Group the data by month and calculate the total revenue for each month
df['Month'] = df['Date'].dt.to_period('M') # Group by month
monthly_revenue = df.groupby('Month')['Revenue'].sum()

# v. Plot a line graph showing the monthly revenue over time

plt.figure(figsize=(10, 6))
monthly_revenue.plot(kind='line', marker='o')
plt.title('Monthly Revenue Over Time')
plt.xlabel('Month')
plt.ylabel('Total Revenue')
plt.grid(True)
plt.show()

OR
b) Develop an example for contour plot,histogram,3D plotting and line plot for
Matplotlib.
Answer

import matplotlib.pyplot as plt

from mpl_toolkits.mplot3d import Axes3D
import numpy as np

# Prepare a grid and data for Contour Plot and 3D Plot

x = np.linspace(-5, 5, 50)
y = np.linspace(-5, 5, 50)
X, Y = np.meshgrid(x, y)
Z = np.sin(np.sqrt(X**2 + Y**2))

# Random data for Histogram

data = np.random.randn(1000)

# Data for Line Plot

x_line = np.linspace(0, 10, 100)
y_line = np.sin(x_line)

# Create a figure with 4 subplots

fig = plt.figure(figsize=(14, 10))

# 1. Contour Plot
ax1 = fig.add_subplot(2, 2, 1)
contour = ax1.contour(X, Y, Z, levels=10, cmap='viridis')
fig.colorbar(contour, ax=ax1)
ax1.set_title('Contour Plot')
ax1.set_xlabel('X-axis')
ax1.set_ylabel('Y-axis')

# 2. Histogram
ax2 = fig.add_subplot(2, 2, 2)
ax2.hist(data, bins=30, color='blue', alpha=0.7, edgecolor='black')
ax2.set_title('Histogram')
ax2.set_xlabel('Data')
ax2.set_ylabel('Frequency')
# 3. 3D Plot
ax3 = fig.add_subplot(2, 2, 3, projection='3d')
ax3.plot_surface(X, Y, Z, cmap='viridis', edgecolor='none')
ax3.set_title('3D Surface Plot')
ax3.set_xlabel('X-axis')
ax3.set_ylabel('Y-axis')
ax3.set_zlabel('Z-axis')

# 4. Line Plot
ax4 = fig.add_subplot(2, 2, 4)
ax4.plot(x_line, y_line, label='sin(x)', color='red', linewidth=2)
ax4.set_title('Line Plot')
ax4.set_xlabel('X-axis')
ax4.set_ylabel('Y-axis')
ax4.legend()
ax4.grid(True)

# Adjust layout and show the plots

plt.tight_layout()
plt.show()
Course In-Charge HoD

Worksheet-1 (Python)
No ratings yet
Worksheet-1 (Python)
9 pages
Certificate
No ratings yet
Certificate
25 pages
Informatics Practices Practical List22-2323
100% (1)
Informatics Practices Practical List22-2323
7 pages
pp DWDM 4 5
No ratings yet
pp DWDM 4 5
26 pages
Week2 lab
No ratings yet
Week2 lab
8 pages
DEV RECORD AIDS
No ratings yet
DEV RECORD AIDS
24 pages
Informatics Practices Practical List22-2323
No ratings yet
Informatics Practices Practical List22-2323
6 pages
Remove (2)
No ratings yet
Remove (2)
38 pages
2020-21 XIIInfo - Pract.S.E.155
No ratings yet
2020-21 XIIInfo - Pract.S.E.155
11 pages
DOC 20241119 WA0039. Pages Deleted (1) Merged Cropped (2)
No ratings yet
DOC 20241119 WA0039. Pages Deleted (1) Merged Cropped (2)
38 pages
Be A 65 Ads Exp 2
No ratings yet
Be A 65 Ads Exp 2
10 pages
Experiment - 6 DATE: 28.2.2020 Data Analytics Lab: Seq (1, 3, by 0.2)
No ratings yet
Experiment - 6 DATE: 28.2.2020 Data Analytics Lab: Seq (1, 3, by 0.2)
3 pages
SampleQuestion- AIOL 2024
No ratings yet
SampleQuestion- AIOL 2024
5 pages
batch1 ds
No ratings yet
batch1 ds
15 pages
Institute For Future Education, Entrepreneurship and Leadership (iFEEL), Karla, Lonavala PGDM-2022-24 (Semester-1) - End-Term Exam
No ratings yet
Institute For Future Education, Entrepreneurship and Leadership (iFEEL), Karla, Lonavala PGDM-2022-24 (Semester-1) - End-Term Exam
2 pages
Final Practical File 2022-23
No ratings yet
Final Practical File 2022-23
87 pages
Experiment Number: 3: Aim:-Study of The Linear Regression in The Machine Learning Using The Boston Housing Dataset. 1)
No ratings yet
Experiment Number: 3: Aim:-Study of The Linear Regression in The Machine Learning Using The Boston Housing Dataset. 1)
16 pages
3rd Semester DDM AI DAA DEV Print Pages For Spiral Record 25-1-24 - Removed
No ratings yet
3rd Semester DDM AI DAA DEV Print Pages For Spiral Record 25-1-24 - Removed
28 pages
EDA Lab Manual
100% (2)
EDA Lab Manual
93 pages
DVST practicle finalll
No ratings yet
DVST practicle finalll
22 pages
Lab 02 - Introduction to Pandas
No ratings yet
Lab 02 - Introduction to Pandas
6 pages
Journal 12
No ratings yet
Journal 12
54 pages
dfs manual
No ratings yet
dfs manual
43 pages
univds
No ratings yet
univds
8 pages
EDA LAB MANUAL (1) (1)
No ratings yet
EDA LAB MANUAL (1) (1)
34 pages
Ai Class 12 Practical
No ratings yet
Ai Class 12 Practical
21 pages
IP Book 12 Question Bank
No ratings yet
IP Book 12 Question Bank
20 pages
Informatic Practices Hhw (3)
No ratings yet
Informatic Practices Hhw (3)
59 pages
Tution Representation
No ratings yet
Tution Representation
38 pages
609008987-EDA-Lab-Manual
No ratings yet
609008987-EDA-Lab-Manual
93 pages
I.P Practical Solution - Plotting
No ratings yet
I.P Practical Solution - Plotting
9 pages
DATA HANDLING AND CSV 2024- 2025
No ratings yet
DATA HANDLING AND CSV 2024- 2025
12 pages
DEV Experiment No.3
No ratings yet
DEV Experiment No.3
10 pages
SSCE-2025 PRACTICAL TEST SOLUTION
No ratings yet
SSCE-2025 PRACTICAL TEST SOLUTION
7 pages
Matplotlib linechatsy
No ratings yet
Matplotlib linechatsy
38 pages
Lab_sneha
No ratings yet
Lab_sneha
20 pages
Pragya File
No ratings yet
Pragya File
31 pages
IDS-1
No ratings yet
IDS-1
30 pages
DNN ALL Practical 28
No ratings yet
DNN ALL Practical 28
34 pages
DATA SCIENCE EXPERIMENTS
No ratings yet
DATA SCIENCE EXPERIMENTS
31 pages
CS605 DA
No ratings yet
CS605 DA
21 pages
Holidays Homework - 20231204 - 195647 - 0000
No ratings yet
Holidays Homework - 20231204 - 195647 - 0000
15 pages
Practical List 2022-23
100% (1)
Practical List 2022-23
4 pages
CS3361 Set1
No ratings yet
CS3361 Set1
5 pages
sowmi DS
No ratings yet
sowmi DS
27 pages
Ilovepdf Merged (2) Merged
No ratings yet
Ilovepdf Merged (2) Merged
65 pages
DVPD Final Lab Word PDF
No ratings yet
DVPD Final Lab Word PDF
93 pages
L and T Projects - Colabs
No ratings yet
L and T Projects - Colabs
7 pages
Assignment-1 (Python Pandas-Series Object and Data Frame: 1. Answer The Following
100% (1)
Assignment-1 (Python Pandas-Series Object and Data Frame: 1. Answer The Following
8 pages
Informatic Practices Hhw
No ratings yet
Informatic Practices Hhw
21 pages
UT1class12IP2425
No ratings yet
UT1class12IP2425
2 pages
14401172022_tanu raman ml lab file
No ratings yet
14401172022_tanu raman ml lab file
21 pages
Ml Cyber Lab
No ratings yet
Ml Cyber Lab
16 pages
Cycle 1
No ratings yet
Cycle 1
110 pages
Bda Assign
No ratings yet
Bda Assign
15 pages
MLCyberLab
No ratings yet
MLCyberLab
9 pages
DEV Lab Material
No ratings yet
DEV Lab Material
16 pages
G Pandey Practical
No ratings yet
G Pandey Practical
33 pages
Ai Class 12 Practical 2
No ratings yet
Ai Class 12 Practical 2
21 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Python Network Programming Applications
No ratings yet
Python Network Programming Applications
185 pages
UPCAT Language Proficiency Tips and Tricks
No ratings yet
UPCAT Language Proficiency Tips and Tricks
11 pages
Analysis History
No ratings yet
Analysis History
2 pages
Probability Project: Design Your Own Game
No ratings yet
Probability Project: Design Your Own Game
6 pages
Be8256 Basic Mechanical Engineering Syllabus
No ratings yet
Be8256 Basic Mechanical Engineering Syllabus
1 page
Grade 3 The Fisherman and His Wife Close Reading Exemplar
No ratings yet
Grade 3 The Fisherman and His Wife Close Reading Exemplar
25 pages
Welcome To Bulacan Polytechnic College
No ratings yet
Welcome To Bulacan Polytechnic College
16 pages
CBSE - Class 10 - Math - CH 05 - Arithmetic Progressions - HOTS - Answers
No ratings yet
CBSE - Class 10 - Math - CH 05 - Arithmetic Progressions - HOTS - Answers
19 pages
Day 3 - Functions of Communication
No ratings yet
Day 3 - Functions of Communication
14 pages
Journal of Sports Medicine - An Open Access Journal
No ratings yet
Journal of Sports Medicine - An Open Access Journal
1 page
Module 3: Element Properties Lecture 8: Numerical Integration: One Dimensional
No ratings yet
Module 3: Element Properties Lecture 8: Numerical Integration: One Dimensional
5 pages
2002 Exam Answers
No ratings yet
2002 Exam Answers
5 pages
Syllabus For QTP 11 (HPO-M47) Certification Exam: o o o o
No ratings yet
Syllabus For QTP 11 (HPO-M47) Certification Exam: o o o o
14 pages
Skills Speaking Part3 Linkers
No ratings yet
Skills Speaking Part3 Linkers
6 pages
English Notes Class 11
100% (3)
English Notes Class 11
67 pages
Sip Report
50% (2)
Sip Report
54 pages
Zhou 2011
No ratings yet
Zhou 2011
9 pages
Lesson Plan Template
No ratings yet
Lesson Plan Template
66 pages
Wilcoxon Rank Sum Test
100% (1)
Wilcoxon Rank Sum Test
4 pages
Lingo Dingo and The Vietnamese Astronaut Lo Res
No ratings yet
Lingo Dingo and The Vietnamese Astronaut Lo Res
18 pages
Deborah Schroeder Saulnier Employee Engagement
No ratings yet
Deborah Schroeder Saulnier Employee Engagement
6 pages
New Assignment 4
No ratings yet
New Assignment 4
6 pages
Non-Linear Alignment Dynamics in Suspensions of Platelets Under Rotating Magnetic Fields PDF
No ratings yet
Non-Linear Alignment Dynamics in Suspensions of Platelets Under Rotating Magnetic Fields PDF
7 pages
Receiving Receipts FAQS
No ratings yet
Receiving Receipts FAQS
13 pages
Confidence Intervel
No ratings yet
Confidence Intervel
2 pages
Aapm TG 40
67% (3)
Aapm TG 40
41 pages
Download Full The Econometrics of Multi-dimensional Panels 2nd Edition Laszlo Matyas PDF All Chapters
100% (14)
Download Full The Econometrics of Multi-dimensional Panels 2nd Edition Laszlo Matyas PDF All Chapters
40 pages
Toward An Integrative Theory of Urban Design - Bahrainy, H. & Bakhtiar, A PDF
No ratings yet
Toward An Integrative Theory of Urban Design - Bahrainy, H. & Bakhtiar, A PDF
120 pages
International Human Resource Management Assignment Briefing Sheet Autumn 2010
No ratings yet
International Human Resource Management Assignment Briefing Sheet Autumn 2010
6 pages
Eq-Trp & MRP
No ratings yet
Eq-Trp & MRP
9 pages

End semester Answer key format-fods

Uploaded by

End semester Answer key format-fods

Uploaded by

Q.

Course Code: CS3352 Course Name: Foundations of Data Science

2. Question: Define Data Warehousing.

3. Question: Given the following data set: 5,7,8,10,12,14,15,18,20.Calculate the

4. Question: Apply the formula to convert Z score to original score

6. Question: Identify the properties of correlation coefficient.

7. Question: Name some NumPy Array attributes.

8. Question: Write a comment to create two-dimensional array?

# Create a 2D NumPy array using np.array()

10. Question: State the purpose of histogram

import matplotlib.pyplot as plt

# Load the CSV file into a DataFrame

# Ensure 'Date' column is in datetime format

# i. Calculate the total revenue generated by the company

# ii. Find the product that generated the highest revenue

# iii. Calculate the average quantity sold per day

# v. Plot a line graph showing the monthly revenue over time

import matplotlib.pyplot as plt

# Prepare a grid and data for Contour Plot and 3D Plot

# Random data for Histogram

# Data for Line Plot

# Create a figure with 4 subplots

# Adjust layout and show the plots

You might also like