Home_Assignment_CO3 (2)
Home_Assignment_CO3 (2)
2024-25-ODD-SEM
DATA ANALYTICS AND VISUALIZATION
(22CS2227F)
CO-3- Home-Assignment
1. A retail company operates in multiple regions, and it wants to analyze its sales performance to identify
trends and areas for improvement. The company has provided a dataset containing sales records, including the
following columns: Date, Region, Product Category, Sales Amount, and Units Sold. Dataset Description:
a.Date: The date of the sale. b.Region: The region where the sale occurred (e.g., North, South, East, West).
c.Product Category: The category of the product sold (e.g., Electronics, Furniture, Clothing). d.Sales Amount:
The total amount of sales in dollars. e.Units Sold: The number of units sold. i).Create a bar chart to visualize
the total Sales Amount across different regions. ii).Create a stacked bar chart to show the distribution of Sales
Amount across different product categories within each region.
2. A retail company is analyzing its sales data from the past year. The data includes the number of units
sold per product category each month. The company wants to understand the distribution of sales to optimize
inventory and marketing strategies. Dataset: The dataset contains the following columns: a.Product_Category:
The category of the product (e.g., Electronics, Clothing, Groceries, etc.) b.Month: The month in which the
sales were recorded c.Units_Sold: The number of units sold in that month for that category Distribution of
Sales: Create histograms to visualize the distribution of units sold for each product category. Analyze the shape
of the distribution for each category. Is the distribution normal, skewed, or uniform?
3. Create a case study where you use text data from a dataset (e.g., product reviews, tweets) to create a
comprehensive report. Describe the process of text extraction, analysis, and visualization. What insights did
you gain, and how did the visualizations help convey your findings?23. Explain how you would use
customized ticks to highlight specific ranges or data points in a time series plot. What considerations would
you keep in mind when choosing tick intervals and labels? Discuss the differences between major and minor
ticks in a plot. How can customizing both types of ticks enhance the information conveyed in a graph?
Illustrate your answer with an example.
4. Plot a scatter plot with three different markers. Customize the legend to include markers that match the
style used in the plot, and adjust the font size of the legend labels. Dataset: a.Scatter Plot 1: x1 = [1, 2, 3, 4], y1
= [4, 5, 6, 7], marker = 'o' b.Scatter Plot 2: x2 = [2, 3, 4, 5], y2 = [3, 4, 5, 6], marker = 's' c.Scatter Plot 3: x3 =
[1, 3, 5, 7], y3 = [2, 3, 4, 5], marker = '^' Requirements: 1.Plot the scatter points with different markers. 2.Add
a legend that includes markers and customize the font size to be larger.
Photos:
GRADE SHEET
S.NO ROLL NO Name of the Student Marks Awarded for ALM( Total 8 Marks)