Project 4
Project 4
Description:
Objective:
The main goal is to analyze the hiring process data to draw meaningful insights
that can contribute to improving the company's hiring process.
Approach:
Data Exploration:
1. Use Excel functions like COUNT and IF to identify missing values in relevant
columns.
2. Decide on the best strategy to handle missing data:
• For numerical data: Consider imputation with mean or median.
• For categorical data: Use mode or imputation based on context.
Clubbing Columns:
1. Identify columns with multiple categories that can be combined for simplified
analysis.
2. Use Excel functions like CONCATENATE or & to merge columns.
Outlier Detection:
Data Summary:
Your Task: Determine the gender distribution of hires. How many males and
females have been hired by the company?
Your Task: What is the average salary offered by this company? Use Excel
functions to calculate this.
Your Task: Create class intervals for the salaries in the company. This will help you
understand the salary distribution.
From the above class interval of salary, I have observed that the salary range between
40100-50099 is offered to the maximum no.of employees(777) when considered both
hired and rejected.
From the above class interval of salary, I have observed that the salary range between
40100-50099 is offered to a maximum no.of employees(523).when considered only
the hired employees.
Your Task: Use a pie chart, bar graph, or any other suitable visualization to show
the proportion of people working in different departments.
0 200 400 600 800 1000 1200 1400 1600 1800 2000
113 70
202 Finance Department
176
1332 General Management
Human Resource Department
Marketing Department
485 1843 Operations Department
230 Production Department
246
Purchase Department
Sales Department
Service Department
E. Position Tier Analysis: Different positions within a company often have different
tiers or levels.
Your Task: Use a chart or graph to represent the different position tiers within the
company. This will help you understand the distribution of positions across different
tiers.
0 200 400 600 800 1000 1200 1400 1600 1800 2000
Count of Post Tiers
1 1
3 1
1 b9
232
982 463 c-10
c5
527
c8
1747
c9
787
i1
i4
222
320
i5
88 i6
1792
i7
m6
m7
n10
Conclusion:
The approach involves a systematic and structured process of data exploration,
cleaning, analysis, and visualization using Microsoft Excel. It emphasizes
leveraging Excel functions, tools, and statistical measures to derive meaningful
insights from the hiring process data. The final output is a well-documented report
that presents findings, recommendations, and a clear understanding of the hiring
process analytics.