Data Research Using Marpho Technique
Data Research Using Marpho Technique
Summary:
This report is a data analysis of cars dataset, a collection of personal cars data collected from several
European countries between 2015 and 2017. The dataset contains 3552913 records capturing individual
car details like the make, model, colour, body type, fuel, transmission, doors, seating, engine,
manufacture year, price, and listing information like creating date, year and last login date. The dataset is
further refined by data cleaning to improving data quality.
Data cleaning:
1 Original dataset contains 3.5 million records
2 Analyzing columsn Maker and Model and found 0.5 million records are missing make and model
Price column contained values too low and too high.
After filtering the data for make and model missing data and records proced too low and too high
we have 1.25 million records .
The new table CARS_CLEAN1 will be used to further analyze the clean data.
There is a possibility for additional categorical column creation based on the data range .
Analysis:
There are 44 makers with large volume of cars from Audi , Mercedes-Benz, BMW and Volkswagen
Large Volume of cars are from year 2015 , followed by 2012 and 2014
Maker vs Volume pareto chart shows that Audi, Mercedes-Benz, BMW and Volkswage have large
volume of cars in the market .
Maker vs Average price pareto chart shows that lamborghini , rolls-ryce , tesla , Aston-martin ,
Bentley , Porsche and Maserati shows large average price of cars in the market .
Models with high Average price are less in volume compared to models with lower average price
Fuel Type vs Price:
Diesel vehicles are pricer compare to Gasoline .
Conclusion:
There is a large volume of Diesel Cars with Avergage price greater than the gasoline cars. Car Models with
higer average price are low in volume in comparision to low average proced cars.
Audi, Mercedes-Benz, BMW and Volkswage are in large volumes compare to reset othercar makers.