HR Data Analysis - Power BI, Excel
HR Data Analysis - Power BI, Excel
SHAMIL P
2. Create a pivot table to summarize the average monthly income by job role.
SHAMIL P
3. Apply conditional formatting to highlight employees with monthly income above
the company's average income.
SHAMIL P
4. Create a bar chart in Excel to visualize the distribution of employee ages.
SHAMIL P
5. Identify and clean any missing or inconsistent data in the "Department" column.
SHAMIL P
6. In Power BI, establish a relationship between the "EmployeeID" in the employee data and the
"EmployeeID" in the time tracking data.
SHAMIL P
7. Using DAX, create a calculated column that calculates the average years an employee has spent
with their current manager.
SHAMIL P
8. Using Excel, create a pivot table that displays the count of employees in each Marital Status
category, segmented by Department
SHAMIL P
9. Apply conditional formatting to highlight employees with both above-average Monthly Income
and above-average Job Satisfaction.
SHAMIL P
10.In Power BI, create a line chart that visualizes the trend of Employee Attrition over the years.
SHAMIL P
11. Describe how you would create a star schema for this dataset, explaining the benefits of
doing so.
• Creating a star schema involves structuring a database with a central fact table surrounded by
dimension tables, forming a star-like shape.
• Consider general_data table as a fact table and rest other tables as dimension tables. We can create
star schema by connecting employee id column of fact table with all the dimension table of primary
key(i.e, employee id) .
Benefits
1. Improved Performance: Faster query execution due to fewer joins.
2. Simplified Maintenance: Changes to dimensions don't impact the core data.
3. Scalability: Easily expand with new dimensions and measures.
4. Readability: Clear structure for intuitive data analysis.
5. Flexible Analysis: Granular insights across various dimensions.
6. BI Tool Compatibility: Seamless integration with BI tools.
7. Data Warehouse Best Practice: Aligns with industry standards.
SHAMIL P
12. Create a hierarchy in Power BI that allows users to drill down from Department to Job Role to
further narrow their analysis.
SHAMIL P
13. How can you set up parameterized queries in Power BI to allow users to filter data based 2 of
2 on the Distance from Home column?
1. Create Parameter
• In Power BI desktop, open Power Query
• Home Manage parameter New Parameter. Then name the parameter (Distance Parameter) and
set the data type (Decimal Number). Then define a default value.
2. Apply the parameter in query
• In Power Query Editor, identify the column to filter based on parameter (Distance From Home)
• Click on the drop-down arrow on the column header and choose “Number Filters” "Greater Than"
or “Equals”.
• Instead of entering a specific value, enter the created parameter.
3. Close and Visualize
• Close the Power Query Editor.
• Build visualizations and use a slicer for Distance Parameter.
SHAMIL P
14. In Excel, calculate the total Monthly Income for each Department, considering only the
employees with a Job Level greater than or equal to 3.
SHAMIL P
15. Explain how to perform a What-If analysis in Excel to understand the impact of a 10% increase
in Percent Salary Hike on Monthly Income.
Steps:
1. Select the cell that holds the Monthly Income formula.
2. Go to the "Data" tab in the Excel ribbon.
3. Click on "What-If Analysis" and opt for "Data Table."
4. Indicate the cell with the Percent Salary Hike in the Row Input Cell box.
5. Keep the Column Input Cell box empty or reference the Monthly Income formula.
6. Create a list of various percentage scenarios in a separate column.
7. In a nearby column, compute Monthly Income for each scenario.
8. The Data Table displays Monthly Income values for each scenario, illustrating the impact of a 10%
increase in Percent Salary Hike.
SHAMIL P
16. Verify if the data adheres to a predefined schema. What actions would you take if you find
inconsistencies
• Verifying if the data adheres to a predefined schema involves checking if the data structure and content align
with the expected format and standards.
Actions for Inconsistencies
1. Data Profiling: Use data profiling tools to analyze the dataset and identify patterns, data types, and potential
anomalies.
2. Column Validation: Check each column against the predefined schema to ensure it contains the expected
data type and format.
3. Missing Values: Identify and handle missing values in the dataset.
4. Data Type Conversion: Convert data types if needed to match the predefined schema. For example, ensure
that numeric columns contain only numbers, date columns contain valid dates, etc.
5. Data Cleaning: Cleanse the data by identifying and rectifying inconsistencies.
• As per current status of the data, there is a need to reorder EmployeeID column, changing data type of
'TotalWorkingHours' column, flling NA values and blank values from general_data, employee_survey_data,
manager_survey_data.
• There is 'EmployeeID' label missing in the 'in-time' and 'out-time' data.
SHAMIL P
SHAMIL P
THANK YOU
SHAMIL P
[email protected]