0% found this document useful (0 votes)
58 views

HR Data Analysis - Power BI, Excel

The HR Data Analysis project utilizes tools like Power BI and Excel to analyze employee data across various dimensions. Key tasks include data cleaning, creating pivot tables, visualizations, and establishing relationships between datasets. The project also covers advanced techniques such as DAX calculations, star schema design, and parameterized queries for enhanced data analysis.

Uploaded by

harsh kumar
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
58 views

HR Data Analysis - Power BI, Excel

The HR Data Analysis project utilizes tools like Power BI and Excel to analyze employee data across various dimensions. Key tasks include data cleaning, creating pivot tables, visualizations, and establishing relationships between datasets. The project also covers advanced techniques such as DAX calculations, star schema design, and parameterized queries for enhanced data analysis.

Uploaded by

harsh kumar
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

HR DATA ANALYSIS

PSYLIQ INTERNSHIP PROJECT


SHAMIL P
[email protected]

Tools Used : Power BI, Excel


INTRODUCTION
 Welcome to the HR Data Analysis Project!

 This project is to explore tasks using tools like MS


Excel and Power BI.

 The dataset consist of 5 Excel files( General Data,


Employee Survey Data, Manager Survey Data, In Time,
Out Time).

Topics covered include:

 Mastering Data Cleaning techniques.

 Navigating Power Query for seamless data


importing.

Diving into the world of DAX with the DAX editor.

Excel wizardry – crafting dynamic pivot tables and


diverse chart types. Ready for the next data adventure!
1. Using excel, how would you filter the dataset to only show employees aged 30
and above?

• Click on the column header of the “Age” column.


• Go to the “Data” tab and click on the “Filter” button. This will add filter arrows to the column headers.
• Click on filter arrow. Then “Number Filters”  “Greater Than or Equel To”  (Add 30)  Ok

SHAMIL P
2. Create a pivot table to summarize the average monthly income by job role.

SHAMIL P
3. Apply conditional formatting to highlight employees with monthly income above
the company's average income.

SHAMIL P
4. Create a bar chart in Excel to visualize the distribution of employee ages.

SHAMIL P
5. Identify and clean any missing or inconsistent data in the "Department" column.

There is no missing or inconsistent data in the Department


column

SHAMIL P
6. In Power BI, establish a relationship between the "EmployeeID" in the employee data and the
"EmployeeID" in the time tracking data.

SHAMIL P
7. Using DAX, create a calculated column that calculates the average years an employee has spent
with their current manager.

SHAMIL P
8. Using Excel, create a pivot table that displays the count of employees in each Marital Status
category, segmented by Department

SHAMIL P
9. Apply conditional formatting to highlight employees with both above-average Monthly Income
and above-average Job Satisfaction.

• Cells with green color filling represent employees


who have both above-average monthly income and
above-average job satisfaction.

SHAMIL P
10.In Power BI, create a line chart that visualizes the trend of Employee Attrition over the years.

SHAMIL P
11. Describe how you would create a star schema for this dataset, explaining the benefits of
doing so.
• Creating a star schema involves structuring a database with a central fact table surrounded by
dimension tables, forming a star-like shape.
• Consider general_data table as a fact table and rest other tables as dimension tables. We can create
star schema by connecting employee id column of fact table with all the dimension table of primary
key(i.e, employee id) .
Benefits
1. Improved Performance: Faster query execution due to fewer joins.
2. Simplified Maintenance: Changes to dimensions don't impact the core data.
3. Scalability: Easily expand with new dimensions and measures.
4. Readability: Clear structure for intuitive data analysis.
5. Flexible Analysis: Granular insights across various dimensions.
6. BI Tool Compatibility: Seamless integration with BI tools.
7. Data Warehouse Best Practice: Aligns with industry standards.

SHAMIL P
12. Create a hierarchy in Power BI that allows users to drill down from Department to Job Role to
further narrow their analysis.

SHAMIL P
13. How can you set up parameterized queries in Power BI to allow users to filter data based 2 of
2 on the Distance from Home column?

1. Create Parameter
• In Power BI desktop, open Power Query
• Home  Manage parameter  New Parameter. Then name the parameter (Distance Parameter) and
set the data type (Decimal Number). Then define a default value.
2. Apply the parameter in query
• In Power Query Editor, identify the column to filter based on parameter (Distance From Home)
• Click on the drop-down arrow on the column header and choose “Number Filters”  "Greater Than"
or “Equals”.
• Instead of entering a specific value, enter the created parameter.
3. Close and Visualize
• Close the Power Query Editor.
• Build visualizations and use a slicer for Distance Parameter.

SHAMIL P
14. In Excel, calculate the total Monthly Income for each Department, considering only the
employees with a Job Level greater than or equal to 3.

SHAMIL P
15. Explain how to perform a What-If analysis in Excel to understand the impact of a 10% increase
in Percent Salary Hike on Monthly Income.

Steps:
1. Select the cell that holds the Monthly Income formula.
2. Go to the "Data" tab in the Excel ribbon.
3. Click on "What-If Analysis" and opt for "Data Table."
4. Indicate the cell with the Percent Salary Hike in the Row Input Cell box.
5. Keep the Column Input Cell box empty or reference the Monthly Income formula.
6. Create a list of various percentage scenarios in a separate column.
7. In a nearby column, compute Monthly Income for each scenario.
8. The Data Table displays Monthly Income values for each scenario, illustrating the impact of a 10%
increase in Percent Salary Hike.

SHAMIL P
16. Verify if the data adheres to a predefined schema. What actions would you take if you find
inconsistencies
• Verifying if the data adheres to a predefined schema involves checking if the data structure and content align
with the expected format and standards.
Actions for Inconsistencies
1. Data Profiling: Use data profiling tools to analyze the dataset and identify patterns, data types, and potential
anomalies.
2. Column Validation: Check each column against the predefined schema to ensure it contains the expected
data type and format.
3. Missing Values: Identify and handle missing values in the dataset.
4. Data Type Conversion: Convert data types if needed to match the predefined schema. For example, ensure
that numeric columns contain only numbers, date columns contain valid dates, etc.
5. Data Cleaning: Cleanse the data by identifying and rectifying inconsistencies.
• As per current status of the data, there is a need to reorder EmployeeID column, changing data type of
'TotalWorkingHours' column, flling NA values and blank values from general_data, employee_survey_data,
manager_survey_data.
• There is 'EmployeeID' label missing in the 'in-time' and 'out-time' data.

SHAMIL P
SHAMIL P
THANK YOU
SHAMIL P
[email protected]

You might also like