0% found this document useful (0 votes)
3 views

DAUP_presentation

The document presents a data analysis project using Python, focusing on a retail dataset containing 1000 customer transactions. It includes descriptive statistics, data manipulation techniques, and machine learning methods, highlighting the performance of Random Forest and Gradient Boosting models. Predicted sales for the next month are provided for different product categories.

Uploaded by

Rochit Limje
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

DAUP_presentation

The document presents a data analysis project using Python, focusing on a retail dataset containing 1000 customer transactions. It includes descriptive statistics, data manipulation techniques, and machine learning methods, highlighting the performance of Random Forest and Gradient Boosting models. Predicted sales for the next month are provided for different product categories.

Uploaded by

Rochit Limje
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

$

DATA ANALYSIS
%

USING PYTHON
PROJECT PRESENTATION $0.00
ADD TO CART

Submitted to : Prof. Rekha Gaur


TEAM MEMBERS
NAME PRN

Amit Kumar Chaudhary 23020841090

Vinutna Kolachana 23020841096

Kousani Das 23020841102

Sakshi Meshram 23020841125

Rochit Limje 23020841149


DATASET OVERVIEW
This dataset is a snapshot of a retail landscape, capturing essential attributes that drive retail
operations and customer interactions.
Contains 1000 customer transactions with details like date, product, price, and quantity.

Retail_Dataset
DESCRIPTIVE STATISTICS

Slicing: dataset.Age[:20] retrieves first 20 ages.


Index Retrieval: dataset.iloc[10] fetches the 10th row.
Queries: Filtered rows where Age > 25 (831 rows) and
Age > mean (513 rows).
Apply Method: dataset['Product
Category'].apply(str.upper) converts categories to
uppercase.
Sorting: Sorted by Total Amount (descending) and
Quantity (ascending).
GroupBy: Grouped by Product Category and Age for
aggregation.

HTML_File
PLOTS

HTML_File
MACHINE LEARNING
METHODS
Performance comparision
Random Forest:
Accuracy: 33%
Precision: 34%
Gradient Boosting:
Accuracy: 34%
Precision: 35%

Both models struggle to classify items accurately, suggesting


the need for better features or more data.
Predicted sales for next month:
Beauty: $26,993.46
Clothing: $33,882.73
Electronics: $16,771.75
HTML_File
THANK YOU

You might also like