Module 4
Module 4
Introduction to Excel:
Excel offers a wide range of features & tools for data management, including data validation, sorting/filtering,
pivot tables, charts & graphs, and more. It's commonly used for financial analysis, budgeting, data entry,
inventory management, and other data-driven tasks.
Excel allows users to perform calculations on large datasets quickly & easily. Users can create complex
formulas using functions such as SUM, AVERAGE, MAX, MIN, COUNT, and more. Excel also offers users
the ability to create custom functions to perform specific calculations.
Excel is a popular spreadsheet program developed by Microsoft that allows users to organize, analyze, and
manipulate data. It provides users with a grid-like interface, commonly referred to as a worksheet, where data
can be entered, stored, and manipulated using various functions and formulas.
Excel is widely used in business, education, and research. Its popularity is due to its ease of use, versatility, and
powerful capabilities. It's a valuable tool for organizing and analyzing data and presenting findings in a clear
and concise manner.
Removing duplicates: You can remove duplicate values in a dataset to ensure that each record is unique. To
remove duplicates, select the data range, go to the "Data" tab, and click on "Remove Duplicates".
Concatenating data: You can combine two or more columns of data into a single column using the
CONCATENATE function. To do this, select the cell where you want to concatenate the data, type
"=CONCATENATE(" and then select the cells you want to combine. You can also add text or special characters
in between the cells.
Splitting data: You can split a single column of data into multiple columns using the Text to Columns feature.
To do this, select the column, go to the "Data" tab, and click on "Text to Columns". Then, select the delimiter
that separates the data (such as a comma or space) and choose where to place the split data.
Creating pivot tables: You can summarize and analyze large datasets using pivot tables. To create a pivot
table, select the data range, go to the "Insert" tab, and click on "PivotTable". Then, choose where to place the
pivot table and select the fields you want to summarize.
Arithmetic Manipulation in Excel
Arithmetic Operators in Excel:
Operator Condition Name Excel Formula Description
+ Addition = A1 + B1 Makes addition of two or more than two
numeric values and gives a numeric value as a
result.
- Subtraction = A1 - B1 Subtracts two or more than two numeric
values and gives a numeric value in return.
* Multiplication = A1 * B1 Multiplies one numeric value with the other
and gives a numeric value in return.
/ Division = A1 / B1 Divides the numerator(should be numeric)
value with the denominator(should be
numeric) value and gives a numeric value in
return.
% Percent =A1% Converts the number present in a cell into a
percentage(i.e. divides the number by 100)
^ Exponentiation = A1^2 Raises the power of first numeric value by
second numeric value.
Excel is a powerful tool for performing arithmetic operations on data. Here are some basic arithmetic
manipulation tasks that you can perform in Excel:
1. ➕ Addition: To add two or more values, simply type the "+" sign between them. For example, to add
the values in cells A1 and A2, you can type "=A1+A2" into another cell.
2. ➖ Subtraction: To subtract one value from another, type the "-" sign between them. For example, to
subtract the value in cell A2 from the value in cell A1, you can type "=A1-A2" into another cell.
3. Multiplication: To multiply two or more values, type the "" sign between them. For example, to
multiply the values in cells A1 and A2, you can type "=A1A2" into another cell.
4. ➗ Division: To divide one value by another, type the "/" sign between them. For example, to divide the
value in cell A1 by the value in cell A2, you can type "=A1/A2" into another cell.
5. 🆙 Exponents: To raise a value to a power, use the "^" sign. For example, to square the value in cell A1,
you can type "=A1^2" into another cell.
6. 🔢 Basic functions: Excel also provides basic functions for arithmetic operations, such as SUM,
AVERAGE, MAX, MIN, and COUNT. To use a function, type "=FUNCTION(range)" into a cell,
replacing "FUNCTION" with the name of the function and "range" with the cells you want to include
in the calculation. For example, to find the sum of the values in cells A1 to A10, you can type
"=SUM(A1:A10)" into another cell.
1. VLOOKUP: 🔍 This function looks up a value in a table and returns a corresponding value from a
specified column. This is useful for searching through large datasets. For example, "=VLOOKUP(A1,
B1:C10, 2, FALSE)" will search for the value in cell A1 in the first column of the range B1:C10, and
return the corresponding value in the second column.
2. SUMIF: 🔎➕ This function adds up a range of values that meet specified criteria. For example,
"=SUMIF(A1:A5,">10")" will add up the values in cells A1 to A5 that is greater than 10.
3. IFERROR: ❌👉📝 This function allows you to specify what should be displayed if a formula returns an
error. For example, "=IFERROR(A1/B1, "Divide by zero error")" will display the message "Divide by
zero error" if the formula returns an error.
4. CONCAT: 🔀📄 This function combines two or more strings of text into a single string. This is similar to
the CONCATENATE function, but it is more concise. For example, "=CONCAT(A1," ",B1)" can be
written as "=CONCAT(A1:B1)".
5. DATE: 📅 This function returns the serial number of a date, given the year, month, and day. For
example, "=DATE(2023,4,19)" will return the serial number for April 19, 2023.
6. INDEX: 🔍🔎 This function returns the value of a cell in a specified row and column of a range. This is
useful for retrieving data from large datasets. For example, "=INDEX(A1:C10, 3, 2)" will return the
value in the third row and second column of the range A1:C10.
IF Statement: The IF statement is used to check a condition and return a value if the condition is true, and
another value if the condition is false.
=IF(logical_test, value_if_true, value_if_false)
For example, =IF(A1>10, "Yes", "No") will check if the value in cell A1 is greater than 10, and return
"Yes" if true and "No" if false.
AND Statement: The AND statement is used to check if multiple conditions are true.
=AND(condition1, condition2, ...)
For example, =AND(A1>10, B1<20) will return TRUE if both the values in cells A1 and B1 meet the specified
conditions
OR Statement: The OR statement is used to check if at least one of multiple conditions is true.
=OR(condition1, condition2, ...)
For example, =OR(A1>10, B1<20) will return TRUE if either the value in cell A1 or B1 meet the specified
conditions.
NOT Statement: The NOT statement is used to reverse the result of a logical test.
=NOT(logical_test)
For example, =NOT(A1>10) will return TRUE if the value in cell A1 is less than or equal to 10.
Nested IF: A nested IF statement allows you to include multiple IF statements within each other.
=IF(logical_test1, value_if_true1, IF(logical_test2, value_if_true2, value_if_false2))
For example, =IF(A1>10, "High", IF(A1>5, "Medium", "Low")) will check if the value in cell A1 is greater
than 10 and return "High", if it is greater than 5 but less than or equal to 10, it will return "Medium", and if it is
less than or equal to 5, it will return "Low".
Example 1:
Excel workbook :
Scenario: You work for a small business that sells products online. Your boss has asked you to analyze the sales
data from the past year to identify which products are the most popular and which months had the highest sales.
You need to use data filtering in Excel to answer the following questions:
1. Which product sold the most units last year?
2. Which month had the highest sales revenue?
3. Which product had the highest average sales price?
Solution
To find out which product sold the most units last year, you can use data filtering. Follow these steps:
Select the entire table.
Go to the "Data" tab and click on the "Filter" button.
Click on the filter dropdown for the "Units Sold" column and select "Sort Largest to Smallest."
The product with the highest number of units sold will be at the top of the list.
In this case, Product A sold the most units last year (180 total).
To find out which month had the highest sales revenue, you can use data filtering. Follow these steps:
Select the entire table.
Go to the "Data" tab and click on the "Filter" button.
Click on the filter dropdown for the "Sales Revenue" column and select "Sort Largest to Smallest."
The month with the highest sales revenue will be at the top of the list.
In this case, March had the highest sales revenue ($3600 total).
To find out which product had the highest average sales price, you can use data filtering. Follow these steps:
Select the entire table.
Go to the "Data" tab and click on the "Filter" button.
Click on the filter dropdown for the "Sales Price" column and select "Sort Largest to Smallest."
The product with the highest average sales price will be at the top of the list.
In this case, Product C had the highest average sales price ($40 per unit).
Now, if a user attempts to enter a price less than $10 or greater than $100 in any of the selected cells, an error
message will appear, and the invalid entry will be rejected.
Data validation is a feature in Excel that allows you to control what users can input into a cell or range of cells.
It helps to ensure that data is entered accurately and consistently. Here are the steps to set up data validation in
Excel:
1. Select the cell or range of cells that you want to apply data validation to.
2. Go to the "Data" tab in the Excel ribbon and click on "Data Validation".
3. In the "Settings" tab, select the type of validation you want to apply (e.g. whole number, decimal, date,
time, text length, etc.).
4. Configure the validation criteria according to your needs. For example, if you want to allow only whole
numbers between 1 and 100, select "Whole number" as the validation criteria, and set the minimum
value to 1 and the maximum value to 100.
5. Optionally, you can add an input message and/or an error alert to inform users about the data validation
rules and provide guidance if they make an incorrect entry.
6. Click OK to apply the data validation to the selected cells.
Once data validation is set up, users will be prevented from entering data that does not meet the specified
validation criteria. If they attempt to do so, an error message will appear, and the invalid entry will be rejected.
Example 2
Scenario: You work for a company that tracks employee hours using an Excel spreadsheet. You want to ensure
that the data entered into the spreadsheet is accurate and valid. You need to use data validation in Excel to
ensure the following:
1. The employee ID numbers entered into the spreadsheet are between 1000 and 9999.
2. The hours worked by each employee are between 0 and 24.
3. The overtime hours worked by each employee are between 0 and 10.
Refer to the workbook for the dataset.
Pivot tables can save you a lot of time and effort when it comes to analyzing and summarizing data in Excel.
With some practice and experimentation, you can unlock their full potential and gain insights into your data that
would be difficult or time-consuming to obtain otherwise.
Open your workbook and work this out. You have a sales data set with columns for "Product", "Region",
"Month", and "Sales". Here's an example of how you could create a pivot table to summarize this data:
1. Select the data range that you want to summarize (e.g. A1:D50).
2. Go to the "Insert" tab in the Excel ribbon and click on "PivotTable".
3. In the "Create PivotTable" dialog box, make sure the data range is correct and choose where to place
the pivot table (e.g. new worksheet or existing worksheet).
4. Drag the "Product" field to the "Rows" area, the "Region" field to the "Columns" area, and the "Sales"
field to the "Values" area.
5. Drag the "Month" field to the "Filters" area (optional).
6. The pivot table should now summarize your data by product and region, with the sales totals for each
combination.
Example 3
1. In the PivotTable Field List in the workbook, drag the following fields to their respective areas:
Drag "Product" to the "Rows" area.
Drag "Region" to the "Columns" area.
Drag "Sales" to the "Values" area.
2. By default, Excel will sum the values. If you want to change the aggregation method, you can click on
the drop-down arrow next to "Sum of Sales" in the Values area and select a different calculation, such
as Average or Count.
3. To further analyze the data, you can:
Drag the "Month" field to the "Filters" area to filter the data by specific months.
Drag additional fields to the Rows or Columns areas to add more dimensions to the analysis.
Line graphs :
A line graph is a type of chart that displays data as a series of points connected by straight lines. Line graphs are
commonly used to show trends over time or to compare multiple sets of data.
In Microsoft Excel, creating a line graph is a simple process. Here are the steps:
1. Enter your data into Excel. The first column should contain the x-axis values (such as dates or times),
and the second column should contain the corresponding y-axis values.
2. Select the data range you want to include in your line graph.
3. Go to the "Insert" tab on the Excel ribbon and select "Line" from the "Charts" section.
4. Choose the type of line graph you want to create. Excel offers several different types, including basic
line graphs, stacked line graphs, and 100% stacked line graphs.
5. Excel will create a basic line graph for you, with the x-axis values displayed along the bottom and the
y-axis values displayed along the left side.
6. Customize your line graph as desired by adding a chart title, axis titles, and data labels. You can also
adjust the appearance of the chart by changing the colors, fonts, and other design elements.
7. Save your line graph by going to "File" > "Save" or "Save As" and choosing a file format.
Bar graphs:
A bar graph is a type of chart that displays data using rectangular bars. Bar graphs are commonly used to
compare different categories or values. The height of each bar represents the value of the data. There are several
types of bar graphs in Excel, including vertical bar graphs, horizontal bar graphs, stacked bar graphs, and
clustered bar graphs. To create a bar graph in Excel:
Pie Charts:
A pie chart is a type of chart that displays data as a circular graph, with each slice of the pie representing
a different category or value. Pie charts are commonly used to show the proportion of each category or
value. In Excel, you can create both 2D and 3D pie charts.
Excel Shortcuts
Storytelling with data is all about using your data to tell an engaging story that captures the attention and
imagination of your audience.
Creating cool charts, graphs, and other visual aids can take your audience on an exciting journey of discovery
that reveals all sorts of insights and information.
Data visualization is essential in storytelling with data because it helps to make complex information more
accessible, understandable, and memorable to your audience.
By using charts, graphs, and other visual aids, you can distill large amounts of data into easily digestible nuggets
of information that can be quickly grasped and retained by your audience.
Data visualization can also help to reveal patterns, trends, and relationships that might not be immediately
apparent from raw data.
By using different types of visualizations, you can highlight key insights and create a compelling narrative that
engages your audience and communicates your message effectively.
Whether you're a data nerd or just someone who loves a good story, data visualization is the key to unlocking
the power of your data and making an impact that lasts.
Bar charts are ideal for comparing values between different categories or groups.
Pie charts are useful for showing the proportions or percentages of a whole.
Scatterplots are ideal for visualizing the relationship between two variables.
Heat maps are great for showing the distribution of values across a two-dimensional space.
Bubble charts are useful for displaying three-dimensional data, where the size of the bubble represents a third
variable.
Treemaps are ideal for showing hierarchical data, where each level is represented by a rectangle or square.
Line charts are great for showing trends over time or comparing multiple trends on the same chart, not for
comparing values between different categories or groups.
Bar charts are ideal for comparing values between different categories or groups, not for showing proportions or
percentages of a whole. For that purpose, pie charts are more appropriate.
Pie charts are useful for showing the proportions or percentages of a whole. The description matches bubble
charts, where the size of the bubble represents a third variable.
Scatterplots are ideal for visualizing the relationship between two variables, usually showing how one variable
changes concerning the other.
Heat maps are indeed great for showing the distribution of values across a two-dimensional space, usually using
colors to represent the intensity of values.
Bubble charts are useful for displaying three-dimensional data, where the size of the bubble represents a third
variable. Treemaps, on the other hand, are ideal for showing hierarchical data, where each level is represented
by a rectangle or square.
Treemaps are ideal for showing hierarchical data, where each level is represented by a rectangle or square. They
are not used for comparing trends over time or multiple trends on the same chart.
Tasks
to do:
1. Introduction to Storytelling with Data: Explain the concept of storytelling with data and its significance
in various domains. Describe how data can be used to convey meaningful insights and support
decision-making processes.
2. The Importance of Data Visualization in Storytelling: Elaborate on the importance of data visualization
in the storytelling process. Discuss how visual representations of data can enhance understanding,
engagement, and retention of information.
3. Choosing the Right Chart Type for Different Data Scenarios: Identify various types of data
visualizations (e.g., bar charts, line charts, scatter plots, etc.) and explain the appropriate scenarios for
using each type. Illustrate examples using the given dataset.
4. Understanding the Principles of Good Data Visualization: Outline the fundamental principles of
creating effective data visualizations, such as simplicity, clarity, accuracy, and relevance. Provide real-
world examples demonstrating these principles.
5. How to Use Excel to Create Compelling Data Visualizations: Provide a step-by-step guide on using
Microsoft Excel to create compelling data visualizations. Include instructions on loading the dataset,
selecting appropriate chart types, customizing visuals, and adding necessary labels.
Solution:
1. Introduction to Storytelling with Data: Data-driven storytelling involves using data to communicate
meaningful narratives. It enables decision-makers to grasp complex information quickly and make
data-backed decisions. Through storytelling, data analysts can engage their audience and convey
insights effectively.
2. The Importance of Data Visualization in Storytelling: Data visualization is a crucial aspect of data-
driven storytelling. It transforms raw data into visual representations, such as charts or graphs,
making patterns and trends easily interpretable. Engaging visualizations capture attention, enhance
comprehension, and make the storytelling process more impactful.
3. Choosing the Right Chart Type for Different Data Scenarios: Depending on the data characteristics,
different chart types can be used. For instance:
Use a bar chart to compare the number of admitted and non-admitted students.
Utilize a scatter plot to visualize the relationship between GRE scores and GPAs.
Create a stacked bar chart to display admission counts based on rank categories.
4. Understanding the Principles of Good Data Visualization: Effective data visualization adheres to
several principles:
Simplicity: Avoid clutter and unnecessary elements that distract from the main message.
Clarity: Ensure that the visuals are easy to understand, with clear labels and legends.
Accuracy: Represent data faithfully and avoid misrepresentation or distortion.
Relevance: Focus on displaying data that directly supports the story being told.
5. How to Use Excel to Create Compelling Data Visualizations: Below are the steps to create data
visualizations in Excel using the dataset:
Step 1: Open Microsoft Excel and import the dataset from the given link.
Step 2: Clean the data if necessary (remove duplicates, handle missing values).
Step 3: Select the data you want to visualize.
Step 4: Choose the appropriate chart type based on the data and the story you want to tell (e.g., bar
chart, scatter plot, line chart).
Step 5: Customize the chart by adding titles, axis labels, and data labels.
Step 6: Apply appropriate formatting and color schemes to make the visual appealing. Step 7: Analyze
the visualizations and draw insights that contribute to the data story.
Introduction to dashboards:
A dashboard is a visual representation of data that displays important information at a glance. It's like a control
center for your data where you can monitor your key metrics and track your progress over time.
Creating a dashboard in Excel or other data visualization tools can be a great way to turn your data into
actionable insights. By presenting your data in a clear, concise, and visually appealing way, you can make it
easier to understand and act on.
Dashboards can be used for a wide range of purposes, from tracking sales performance to monitoring website
traffic to analyzing financial data. By customizing your dashboard to your specific needs, you can get a
complete picture of your data and make informed decisions based on it.
Dashboards are a visual representation of data that displays important information at a glance. They act as a
control center for your data where you can monitor your key metrics and track your progress over time.
Key Performance Indicators (KPIs) are measurable values that track progress and show how well a company is
meeting its goals. They serve as important compasses to guide businesses and organizations toward success.
One of the benefits of using dashboards is that they save you time by presenting your data in a clear and concise
way, allowing you to quickly identify trends and patterns.
Dashboards provide a visual representation of your data, making it easier to identify areas of improvement and
take action based on your insights.
By having all of your data in one place, dashboards can lead to increased productivity, as you can quickly and
easily access the information you need to make informed decisions.
Dashboards can be customized to meet your specific needs, allowing you to focus on the KPIs that are most
important to your business.
Whether you're a data analyst or a business owner, a dashboard can help you to achieve your goals and drive
success.
1. Use Excel's built-in features: Excel has several built-in features that allow for interactivity, such as drop-
down lists, checkboxes, and radio buttons. By using these features, you can enable users to filter and sort
data dynamically.
2. Utilize PivotTables: PivotTables are a powerful tool in Excel for summarizing and analyzing large
datasets. They can be used to create interactive dashboards that allow users to filter data by selecting
different criteria.
3. Incorporate charts and graphs: Charts and graphs are a great way to visually display data and make it
more accessible to users. By including interactive charts and graphs in your dashboard, users can hover over
data points to see additional information, or click on elements to drill down into the data.
4. Use slicers: Slicers are a feature in Excel that allow users to filter data in a PivotTable or PivotChart. They
can be customized to include buttons, sliders, or other interactive elements, making it easy for users to
interact with your dashboard.
1. Add buttons and drop-down lists: Buttons and drop-down lists can be used to create interactive
menus and allow users to navigate through different parts of your dashboard. They can also be used to
switch between different views or change the data displayed in a chart.
2. Incorporate macros: Macros are small programs that automate repetitive tasks in Excel. By using
macros, you can create interactive dashboards that respond to user input and dynamically update data.
What are some built-in features in Excel that can be used to create interactive dashboards?
Checkboxes and radio buttons
How can charts and graphs enhance the interactivity of an Excel dashboard?
By enabling users to hover over data points for additional information
Which Excel feature allows users to filter data in a PivotTable or PivotChart?
Slicers
How can buttons and drop-down lists be used in an Excel dashboard?
To switch between different views or change displayed data
What are macros in Excel primarily used for in the context of interactive dashboards?
Automating repetitive tasks and updating data dynamically
What is the primary advantage of creating dashboard templates in Excel?
They help save time and maintain consistency in design and formatting.
Problem Statement:
You are a data analyst working for a sports analytics company. Your task is to create an interactive dashboard
using Excel to analyze the performance of batsmen in the Indian Premier League (IPL) based on the given
dataset. The dataset contains information about various batsmen, including their total runs, number of outs,
number of balls faced, batting average, and strike rate. Your dashboard should provide insights into the
performance of batsmen and help stakeholders make informed decisions.
Tasks to do:
1. Data Preparation:
Import the provided dataset into Excel.
Clean the data if necessary (e.g., handle missing values, data types).
2. Choosing the Right Chart Types:
Based on the dataset, select appropriate chart types (e.g., bar charts, line charts, pie charts) to visualize
the batsmen's performance metrics (total runs, average, strike rate, etc.).
Justify your choice of chart types for each metric and explain how they effectively represent the data.
3. Creating Interactive Dashboards:
Utilize Excel's interactive features (e.g., pivot tables, slicers, drop-down lists) to make the dashboard
user-friendly and interactive.
Create dynamic visualizations that allow stakeholders to filter and explore the data interactively.
4. Creating Dashboard Templates:
Design a dashboard template that can be reused for future IPL seasons or other sports analytics
projects.
Include placeholders for new data and ensure that the template is easily adaptable.
5. Sharing Dashboards with Stakeholders:
Demonstrate how to share the interactive dashboard with stakeholders using Excel's sharing and
collaboration features.
Explain the methods for ensuring data security and access control while sharing the dashboard.
Solution:
Data Preparation:
Download the dataset from the provided link and import it into Excel.
Check for any missing values or inconsistencies in the data.
Ensure that the data types are appropriate for each column (e.g., numeric for total_runs, average, strikerate; text
for batsman).
Use another line chart to display the strike rate of each batsman.
Justify these chart choices, explaining how they help in comparing and analyzing batsmen's
performances effectively.
Creating Interactive Dashboards:
Utilize pivot tables to summarize the data and enable dynamic filtering.
Create slicers for batsman names, allowing stakeholders to select specific players and see their
statistics.
Add drop-down lists to select different metrics (e.g., total_runs, average, strikerate) and update the
visualizations accordingly.
Creating Dashboard Templates:
Design the dashboard in a user-friendly and visually appealing manner.
Keep areas for updating data clearly marked as placeholders.
Provide instructions on how to replace data and add new player information easily.
Sharing Dashboards with Stakeholders:
Save the interactive dashboard in a sharable format (e.g., Excel file, PDF).
Share the file with stakeholders using a secure and appropriate method (e.g., email, shared drive, online
collaboration platform).
Set access permissions to ensure data security and control who can make changes to the dashboard.
Tableau is a powerful data visualization and business intelligence software that helps organizations analyze and
visualize their data in an interactive and informative way.
Tableau connects to a wide range of data sources, including spreadsheets, databases, and cloud services, making
it easy for organizations to access and analyze their data in one place.
With an intuitive drag-and-drop interface, users can create interactive dashboards, reports, and visualizations
without the need for technical skills. Tableau provides a wide range of visualization options, including bar
charts, line charts, scatter plots, heat maps, and more.
The ability to interact with data is one of Tableau's key features. Users can hover over data points, drill down
into details, and filter data sets to find the information they need. This allows organizations to explore their data
and make informed decisions quickly.
Tableau has robust collaboration and sharing capabilities, allowing teams to work together on data projects and
share insights with others in the organization. It also provides a platform for publishing interactive dashboards
and reports to the web, allowing stakeholders to access and explore data from anywhere.
Why dashboarding?
Dashboarding helps in presenting data visually in an organized and easy-to-understand manner. This enables
decision-makers to quickly and effectively analyze data, track KPIs, identify trends and patterns, and make
informed decisions. Additionally, dashboards provide real-time updates, allow for quick data comparison, and
facilitate collaboration among teams.
Different dashboards tools:
1. Tableau
2. Power BI
3. QlikView
4. Looker
5. TIBCO Spotfire
6. SAP Lumira
7. IBM Cognos Analytics
8. Microsoft Excel
9. Oracle BI
10. Google Data Studio.
Why Tableau?
Tableau is considered better than other dashboarding tools for several reasons:
Application of Tableau
Tableau is widely used in various industries for data analysis and visualization, some real-life applications are:
1. Business Intelligence: Companies use Tableau to analyze sales, marketing, and financial data to make
informed business decisions.
2. Healthcare: Hospitals and healthcare organizations use Tableau to analyze patient data, monitor patient
health and track medical outcomes.
3. Retail: Retailers use Tableau to analyze customer behavior and sales data to optimize inventory and
make informed decisions.
4. Finance: Financial institutions use Tableau to analyze stock market trends, customer spending, and
other financial data to make informed investment decisions.
5. Government: Government agencies use Tableau to analyze census data, crime statistics, and other data
to inform policy and resource allocation decisions.
6. Manufacturing: Manufacturing companies use Tableau to monitor production processes, identify areas
for improvement, and analyze supply chain data.
7. Education: Universities and schools use Tableau to analyze student performance data and inform
decision-making around curriculum and resources.
Let’s understand the use of tableau for the company’s KPI with the help of conversation.
Person 1: Hey, have you checked out the latest dashboard on Tableau for the company's KPIs?
Person 2: No, I haven't. How does it look?
Person 1: It's really informative and interactive. You can see all the key metrics in one place and easily track
our progress over time.
Person 2: That sounds great. Can you show me how to access it?
Person 1: Sure, just log into Tableau and go to the "Business Performance" dashboard. From there you can see
all the KPIs, including revenue, customer satisfaction, marketing metrics, and target acquisition cost.
Person 2: Oh, I see. And what if I want to see the data for a specific time period?
Person 1: You can use the filters on the right side of the screen to select the date range you're interested in. You
can also see the data by different segments or categories, depending on the KPI.
Person 2: That's very helpful. Thanks for showing me this. I think it's going to be a valuable tool for our team.
Business KPIs, or Key Performance Indicators, are metrics used to measure the success and progress of a
company toward its goals and objectives. These metrics help businesses track their performance in key areas
such as revenue, customer satisfaction, marketing, and cost management. By monitoring these KPIs, businesses
can identify areas for improvement and make informed decisions to drive growth and success. Examples of
common business KPIs include customer acquisition cost, conversion rates, website traffic, net promoter score,
and return on investment (ROI). The specific KPIs used by a business will depend on its goals and the industry
it operates in.
Dashboards are used to visualize and present data in a clear and easily understandable way.
Tableau is a powerful data visualization tool that allows users to create interactive and dynamic visualizations.
Tableau finds application in various industries, including finance, marketing, and healthcare, to analyze and
present data effectively.
Business KPIs, or Key Performance Indicators, are measurable metrics that help organizations track their
performance and progress towards goals.
1. Go to the Tableau website: Visit the Tableau website (https://www.tableau.com) and click on the "TRY
TABLEAU FOR FREE" button.
2. Scroll down and click on ‘DOWNLOAD A FREE TRIAL’ on Tableau Desktop.
3. Create an account: Fill in the required information to create an account and download your free trial.
4. Download Tableau: After creating an account, navigate to the download page and select the appropriate
version of Tableau for your operating system (Windows or Mac).
5. Install Tableau: Once the download is complete, double-click the installer file to start the installation
process. Follow the on-screen instructions to complete the installation process.
6. Click on the ‘Install’ button.
7. Activate Tableau: After the installation is complete, launch Tableau and enter your product key to
activate the software or click on ‘Continue Trial’
8. Connect to data: Once Tableau is activated, you can connect to your data source by clicking on the
"Connect" button and selecting the type of data source(you can use a sample superstore dataset that is
present in Tableau as default as mentioned below) you want to connect to.
Tableau Desktop is one of the core components of the Tableau platform, but there are several other features and
tools available that work together to provide a comprehensive data analysis and visualization solution. Here are
some of the other features of the Tableau platform:
Tableau Server/Online: A web-based platform that enables data sharing, collaboration, and publishing of
interactive data visualizations, dashboards, and reports.
Tableau Reader: A free desktop application that allows users to view and interact with Tableau content created
by others.
Tableau Mobile: A mobile app for iOS and Android devices that provides access to Tableau content on the go.
Tableau Prep: A data preparation tool that enables users to clean, shape, and combine data from various sources
before analyzing it in Tableau.
Tableau Public: A free version of Tableau that allows users to publish data visualizations and dashboards to the
web for anyone to see.
Tableau Reader Cloud: A cloud-based service for deploying Tableau content to an unlimited number of users,
without the need for local installations of Tableau Reader.
Tableau Connectors: A set of pre-built connectors that allow Tableau to connect to various data sources,
including databases, spreadsheets, cloud services, and more.
Workbook, Worksheet, and Dashboard in Tableau
In Tableau Desktop, a workbook is a file that contains one or more views of data, as well as any related
calculations, filters, and data connections. A worksheet is a single view of data within a Tableau workbook. A
dashboard is a collection of worksheets and other objects, such as charts, tables, images, and text, arranged on a
single page to provide an overview of data and insights.
Here are some key differences between a Tableau workbook, worksheet, and dashboard:
1. Workbook: A workbook is a file that can contain multiple worksheets and dashboards, as well as any
related calculations, filters, and data connections.
2. Worksheet: A worksheet is a single view of data that is created in a Tableau workbook. It can contain
charts, tables, and other data visualizations.
3. Dashboard: A dashboard is a collection of worksheets and other objects arranged on a single page to
provide an overview of data and insights. Dashboards can also contain filters, parameters, and other
interactivity to allow users to explore and interact with data.
In summary, a workbook is an overall file that contains one or more worksheets and dashboards, while a
worksheet is a single view of data within a workbook, and a dashboard is a collection of worksheets and other
objects arranged on a single page.
1. Connect to Data: Connect to your data source, whether it's a spreadsheet, database, or cloud service, to
bring data into Tableau. In this case, we are using play_store.csv and user_reviews.csv and using the
inner join as the merging option and we are choosing connection as ‘Extract’.
2. Create Worksheets: Create a worksheet to start building your visualization by dragging and dropping
fields onto the Rows and Columns shelves.
Creating worksheet ‘Content rating by Installation’.
3. Create Dashboards: Combine multiple worksheets into a dashboard to display multiple visualizations in
one place. You can also add filters, parameters, and actions to interact with your data.
4. Publish and Share: Once you're satisfied with your workbook, publish it to Tableau Server or Tableau
Online to make it accessible to others. You can also share it via Tableau Public or Tableau Reader.
Before starting working on Tableau Public. Let’s learn about Tableau Public first.
Tableau Public
Tableau Public is a free version of Tableau's data visualization software that allows individuals and
organizations to share data stories and insights with a global audience. Unlike the commercial versions of
Tableau, Tableau Public uses public data and is intended for non-commercial use only.
1. Data Visualization: Tableau Public provides a range of data visualization tools and options, including
charts, maps, and dashboards, to help users explore, analyze, and communicate data insights.
2. Public Data: Tableau Public uses public data, meaning users can connect to, visualize, and share data
that is available to the general public, but cannot connect to proprietary or confidential data sources.
3. Sharing: Tableau Public allows users to publish their work to the Tableau Public website, where it can
be easily shared and viewed by others.
4. Collaboration: Tableau Public provides collaboration features, allowing users to work with others on
shared projects and data stories.
Tableau Public is a powerful tool for individuals and organizations looking to share data stories and insights
with a global audience. Its focus on public data and its ease of use make it an excellent choice for data
journalists, data educators, and other non-commercial users.
Let’s learn About Tableau Public using conversation between two persons.
A Tableau worksheet is the primary interface used to build visualizations and explore data in Tableau. It
consists of various components, including the data pane, the view pane, and the shelves. Here is a breakdown of
the anatomy of a Tableau worksheet:
Data Pane: The data pane is located on the left side of the worksheet and is used to connect to data sources and
to define the fields that will be used in the analysis. It also allows users to view and manipulate the data source,
including creating calculated fields, grouping data, and defining hierarchies.
View Pane: The view pane is located in the center of the worksheet and is used to build visualizations. It
displays the fields that have been added to the worksheet, as well as any filters, marks, or labels that have been
applied.
Shelves: The shelves are located on the top and bottom of the worksheet and are used to control the display of
the view pane. There are several different types of shelves:
Rows: The rows shelf is used to specify the fields that should be used to create rows in the visualization. For
example, if you want to create a bar chart that displays the sales by product category, you would add the product
category field to the rows shelf.
Columns: The columns shelf is used to specify the fields that should be used to create columns in the
visualization. For example, if you want to create a bar chart that displays the sales by year, you would add the
year field to the columns shelf.
Marks: The marks shelf is used to control the appearance of the marks in the visualization. It allows users to
specify the type of mark (e.g., bar, line, circle), the size of the mark, the color of the mark, and other visual
properties.
Filters: The filters shelf is used to apply filters to the data. It allows users to select a field and then specify the
values or conditions that should be included or excluded from the visualization.
Pages: The pages shelf is used to create a series of visualizations that can be displayed one after the other, based
on the values of a selected field. For example, if you want to create a series of bar charts that show the sales by
region, you would add the region field to the pages shelf.
A Tableau dashboard is a collection of visualizations, such as worksheets, charts, maps, and tables, that are
arranged on a single page for easy and interactive analysis of data. Here are the components of the Tableau
dashboard screen:
Menu Bar: The menu bar is located at the top of the screen and contains the standard file, edit, and view options,
as well as various other options for working with the dashboard.
Toolbar: The toolbar is located just below the menu bar and contains various tools and options for working with
the dashboard, such as undo/redo, select, and zoom.
Dashboard Pane: The dashboard pane is located on the right side of the screen and is used to organize the
different components of the dashboard. It allows users to add new sheets, create containers, and adjust the layout
of the dashboard.
Objects Pane: The objects pane is located on the left side of the screen and is used to access the worksheets,
charts, and other visualizations that can be added to the dashboard. It also provides access to filters and
parameters that can be used to interact with the data.
Filters Shelf: The filters shelf is located at the top of the screen and is used to apply filters to the data displayed
in the dashboard. Filters can be applied to individual visualizations or to the entire dashboard.
Pages Shelf: The pages shelf is located below the filters shelf and is used to create a series of visualizations that
can be displayed one after the other, based on the values of a selected field.
Views: The views are the visualizations that are added to the dashboard. They can be worksheets, charts, maps,
tables, or other types of visualizations. Each view can be resized, moved, and customized to fit the overall
layout of the dashboard.
Titles and Text: Titles and text can be added to the dashboard to provide context and explanation for the
visualizations. They can be added as individual text boxes or as part of the overall layout of the dashboard.
Tableau is a powerful data visualization tool that can connect to a wide range of data sources, including:
Excel spreadsheets: Tableau can directly connect to Excel spreadsheets and extract data from individual sheets
or the entire workbook.
Text files: Tableau can extract data from text files, such as CSV, TSV, and tab-delimited files.
Databases: Tableau can connect to a variety of databases, including SQL Server, Oracle, MySQL, PostgreSQL,
and more.
Cloud-based data sources: Tableau can connect to cloud-based data sources, such as Amazon Redshift, Google
BigQuery, Microsoft Azure SQL Database, and more.
Web-based data sources: Tableau can extract data from web-based data sources, such as Google Analytics,
Salesforce, and more.
Hadoop: Tableau can connect to Hadoop-based data sources, such as Cloudera, Hortonworks, and MapR.
To connect to a data source in Tableau, follow these steps:
Open Tableau and click on "Connect to Data" on the start page or select "Connect to Data" from the File menu.
Select the type of data source you want to connect to from the list of available options.
Follow the prompts to enter the necessary connection details, such as server name, port number, and credentials.
Once the connection is established, Tableau will display the available tables and fields from the data source.
Drag and drop the desired fields onto the view to start creating visualizations.
This platform offers various tools and features for data visualization and analysis, including workbooks,
worksheets, and dashboards. – Tableau
A single view that represents a combination of dimensions, measures, and marks in Tableau.Worksheet
Two individuals have a conversation about how to publish interactive data visualizations online using this free
version of Tableau. - Tableau Public
The central area in Tableau where visualizations are created, allowing users to drag and drop data fields to build
their views. -Worksheet canvas
Task 1:
Sharing and Collaborating on Tableau Dashboards
Instructions:
Create a dashboard with multiple visualizations using the "Superstore" dataset or any other dataset of your
choice.
Customize the layout and design of the dashboard to make it visually appealing.
Add interactive elements, such as filters and parameters, to enhance user experience.
Publish the dashboard to Tableau Public or Tableau Server.
Hints:
You can import the "Superstore" dataset into Tableau using the "Excel" connector or any other appropriate
connector depending on your dataset format.
Step 2: Data Exploration and Preparation
Use Tableau's Data Source tab to explore the data, check for missing values, and perform any necessary data
cleaning.
Step 3: Visualizations
To create visualizations, drag and drop the desired dimensions and measures onto the "Rows" and "Columns"
shelves in Tableau.
For example, to create a bar chart showing sales by product category, you would drag "Product Category" to
"Columns" and "Sales" to "Rows."
Step 4: Dashboard Design and Layout
To create a new dashboard, go to the "Dashboard" tab in Tableau and drag visualizations from the "Sheets" tab
to the dashboard area.
Customize the size and positioning of each visualization to create an appealing layout.
Step 5: Adding Interactivity
To add filters, right-click on a field and select "Show Filter" to add it to the dashboard. Users can now
interactively filter the data.
To add parameters, go to the "Data" menu and create a new parameter. You can then use this parameter in
calculations and visualizations to provide interactive elements.
Step 6: Publishing the Dashboard
To publish the dashboard to Tableau Public or Tableau Server, click on the "Server" menu and select "Publish
Workbook."
Follow the steps to publish the dashboard to the desired location.
Choosing the right chart type is crucial to communicate insights from your data effectively.
Accurately represent data: The primary goal of data visualization is to represent the data displayed accurately.
Choose a chart type that clearly and accurately conveys the information you want.
Highlight trends and patterns: Data visualization is also useful for identifying trends and patterns in your data.
Choose a chart type that highlights these trends and patterns effectively.
Tailor to the audience: Consider the needs and preferences of your audience when selecting a chart type.
Different chart types may be more effective for different audiences or purposes.
Communicate insights: The ultimate goal of data visualization is to communicate insights and inform decision-
making. Choose a chart type that effectively communicates the insights you want to share.
Handle data volume: Different chart types have different capacities for handling large volumes of data. Choose a
chart type that can handle the data you need to display.
Highlight comparisons: Sometimes, you may want to highlight comparisons between different data points.
Choose a chart type that effectively highlights these comparisons.
Emphasize outliers: Outliers can provide valuable insights and be misleading if not presented effectively.
Choose a chart type that effectively emphasizes outliers in your data.
What Is a Chart and Types of Charts?
In Tableau, a chart is a visual representation of data that helps people understand it better. Charts are great for
presenting large amounts of information in a simple and easy-to-understand way. There are many types of charts
available in Tableau, and the type of chart you choose depends on the type of data you have and the story you
want to tell.
Here are some of the most common types of charts in Tableau:
Line Chart - A line chart in Tableau is used to show trends over time. It is best used when you have data that
changes over a period of time.
Bar Chart - A bar chart in Tableau is used to compare different data points. It is best used when you want to
compare data across different categories.
Pie Chart - A pie chart in Tableau is used to show the proportion of different categories in a dataset. It is best
used when you have data that can be broken down into parts.
Area Chart - An area chart in Tableau is similar to a line chart, but the area under the line is filled in. It is best
used when you want to show how much of a whole is taken up by different parts.
Scatter Plot - A scatter plot in Tableau is used to show the relationship between two variables. It is best used
when you want to see if there is a correlation between two things.
Map Chart - A map chart in Tableau is used to show data on a map. It is best used when you want to show the
distribution of data across different regions or locations.
Bubble Chart - A bubble chart in Tableau is used to show data points as bubbles. It is best used when you want
to show the relationship between three variables.
Gantt Chart - A Gantt chart in Tableau is used to show the duration of different tasks or activities. It is best used
when you want to show the progress of a project over time.
LINE CHART:
Line Charts are one of the most common chart types used in Tableau. They are ideal for displaying trends over
time, like stock prices or weather patterns.
A Line Chart uses a continuous line to connect data points on the x and y axis. This helps visualize how data
changes over time.
Tableau offers a variety of customization options for Line Charts. You can adjust the line thickness, color, and
style. You can also add labels, markers, and tooltips to give more information about the data.
Creating a Line Chart in Tableau is easy. Just drag your date or time field to the columns or rows shelf, and your
measure field to the rows or columns shelf. Then, choose "Line" from the "Show Me" panel.
PIE CHART:
Pie Charts are used to show the proportion of different categories in a data set. They are ideal for displaying
data in a clear and easy-to-understand way.
A Pie Chart in Tableau is created by dividing a circle into slices, with each slice representing a category. The
size of the slice corresponds to the proportion of that category in the data.
Tableau offers a variety of customization options for Pie Charts. You can adjust the color scheme, labels, and
data labels to make your chart more visually appealing.
Creating a Pie Chart in Tableau is easy. Just drag your categorical variable to the "Columns" or "Rows" shelf
and your measure to the "Angle" shelf. Then, choose "Pie Chart" from the "Show Me" panel.
AREA CHART:
Area Charts are similar to Line Charts, but with an added area shaded in between the line and the x-axis. This
shading can help emphasize the magnitude of changes over time.
Area Charts are commonly used to display cumulative data over time, like population growth or revenue. They
can also be used to compare two or more measures over time.
Tableau offers a variety of customization options for Area Charts, including the ability to adjust the shading
color and opacity, as well as the thickness and style of the line.
To create an Area Chart in Tableau, simply drag your date or time field to the columns or rows shelf, and your
measure field to the rows or columns shelf. Then, choose "Area" from the "Show Me" panel.
HISTOGRAM:
Histograms are a type of chart used to visualize the distribution of data in a dataset. They group data into "bins"
and display the frequency of data points within each bin.
Histograms are useful for understanding patterns in data, such as identifying outliers or seeing where most of the
data falls on a spectrum.
Tableau provides various customization options for histograms. You can adjust the size of the bins, color, and
layout. You can also add labels, axis titles, and tooltips to give more context to the data.
Creating a histogram in Tableau is easy. Just drag your measure field to the columns or rows shelf and choose
"Histogram" from the "Show Me" panel. You can also adjust the bins and other settings from the "Format" pane.
SCATTER PLOT:
Scatter Plots are a type of chart that display the relationship between two measures. Each data point is
represented by a dot on the x and y axes.
Scatter Plots are useful for identifying patterns and trends in data, and for detecting outliers or anomalies.
Tableau allows you to customize your Scatter Plot with various options, such as changing the size, color, and
shape of the data points. You can also add labels and tooltips to provide more information about each data point.
Creating a Scatter Plot in Tableau is easy. Just drag your two measures to the columns and rows shelf, and
choose "Scatter Plot" from the "Show Me" panel.
MAP CHART:
Map Charts are an excellent way to visualize data geographically. They display data points on a map, making it
easy to see patterns and trends.
Tableau offers a variety of map types, including scatter plots, heat maps, and choropleth maps. You can
customize your map with colors, markers, and tooltips to help viewers understand the data.
To create a Map Chart in Tableau, you will need a location field, such as a city or country name, and a measure
field, such as sales or population. You can then drag and drop these fields onto the "Rows" and "Columns"
shelves and select "Map" from the "Show Me" panel.
Tableau's mapping features also allow you to zoom in and out, filter data by region, and even overlay multiple
maps for comparison.
BUBBLE CHART:
Bubble Charts are a type of visualization that use circles (or bubbles) to represent data points. The size of the
circle represents the value of a specific measure, while the x and y axis represent different dimensions.
Bubble Charts are great for comparing and analyzing multiple measures at once. They're especially useful for
visualizing data that has three variables, like revenue, profit, and sales volume.
Tableau offers many customization options for Bubble Charts. You can adjust the size, color, and transparency
of the bubbles. You can also add labels, tooltips, and reference lines to provide more context for the data.
To create a Bubble Chart in Tableau, simply drag your dimensions and measures to the appropriate shelves, and
choose "Bubble" from the "Show Me" panel.
GANTT CHART:
Gantt Charts are a type of bar chart that show the duration of tasks or events over time. They are useful for
project management, scheduling, and resource allocation.
A Gantt Chart in Tableau displays tasks as horizontal bars along a timeline, with the length of the bar
representing the duration of the task.
Tableau offers a range of customization options for Gantt Charts. You can adjust the color, size, and shape of
the bars. You can also add labels, tooltips, and other annotations to help explain the data.
To create a Gantt Chart in Tableau, you will need to use date and time fields to specify the start and end dates of
each task. Simply drag the relevant fields to the appropriate shelves, and then choose "Gantt Bar" from the
"Show Me" panel.
RADAR CHART:
A Radar Chart, also known as a Spider Chart or Polar Chart, displays multivariate data in a radial manner. It's a
useful chart type for comparing multiple variables at once, and for highlighting similarities and differences
between them. A Radar chart in Tableau is circular, with radii extending from the center to each data point. The
data points are connected by lines to create a polygon shape. While a Radar chart in Tableau is useful for
comparing multiple variables at once, it is generally only suitable for showing a small number of variables. This
is because the chart can become cluttered and difficult to read if there are too many variables.
In Tableau, you can create Radar Charts by dragging multiple dimensions or measures to the Rows or Columns
shelf, and then selecting the "Polygon" mark type.
The resulting chart will have a central point and radial axes for each variable, with a line connecting data points
for each category. You can customize the chart's appearance by adjusting the colors, labels, and axes.
Radar Charts can be particularly useful for comparing products, companies, or individuals across multiple
dimensions, such as customer satisfaction, pricing, or performance.
WATERFALL CHART:
Waterfall Charts are used to visualize the cumulative effect of positive and negative values over time or
categories.
The chart starts at a baseline and then shows how each value affects the overall total, either by adding to it or
subtracting from it. The result is a series of floating bars that represent the changes in the cumulative total.
The name "Waterfall Chart" comes from the way the bars resemble a waterfall.
Tableau offers a variety of customization options for Waterfall Charts. You can change the color, style, and
label of the bars, and you can also add reference lines and annotations to provide context.
Creating a Waterfall Chart in Tableau is easy. Just drag your dimension and measure fields to the Rows or
Columns shelf and choose "Waterfall" from the "Show Me" panel.
PARETO CHART:
A Pareto Chart is a combination of a bar graph and a line graph that displays data in descending order of
frequency.
The bars represent individual values, and the line represents the cumulative total. This helps you identify which
values have the biggest impact on your data.
Pareto Charts are useful for identifying the most important issues to address, such as the top reasons for
customer complaints or the most common defects in a manufacturing process.
In Tableau, you can customize your Pareto Chart by changing the colors, labels, and axes. You can also add
annotations and filters to provide more context.
To create a Pareto Chart in Tableau, you first need to create a bar chart of your data. Then, add a line chart that
shows the cumulative percentage of the total. Finally, synchronize the axes so they share a common scale.
FUNNEL CHART:
Funnel Charts are ideal for visualizing a sequence of events, like a sales funnel or a marketing campaign. They
show how the number of items in a process decreases as it moves towards the final stage.
Funnel Charts use a series of bars of different lengths to represent the different stages in the process. The bars
are arranged in descending order, with the longest bar representing the initial stage and the shortest bar
representing the final stage.
Tableau offers a variety of customization options for Funnel Charts. You can adjust the colors, labels, and the
size of the bars.
To create a Funnel Chart in Tableau, just drag your dimension field to the columns or rows shelf, and your
measure field to the rows or columns shelf. Then, choose "Funnel" from the "Show Me" panel.
HEAT MAP:
Heat Maps are a type of visualization that use color to represent data values in a table. They are particularly
useful when you have a large dataset and want to quickly identify patterns and trends.
Tableau allows you to customize your Heat Map with different color schemes, labels, and tooltips. You can also
adjust the size and shape of the cells to better visualize your data.
To create a Heat Map in Tableau, simply drag and drop your fields into the Rows and Columns shelves, and
then choose "Heat Map" from the "Show Me" panel.
The intensity of the color represents the value of the data. For example, in a Heat Map of temperature data,
hotter temperatures would be represented by darker colors.
TREE MAP:
A Tree Map is a chart type that shows hierarchical data in a rectangular, nested layout. It's a great way to
visualize the size and proportion of data within different categories.
In a Tree Map, each rectangle represents a different category or sub-category, and its size is proportional to the
data value. The rectangles are nested within each other to represent the hierarchy of the data.
Tableau allows you to customize your Tree Map by adjusting the colors, labels, and size of the rectangles. You
can also add tooltips to show additional information about the data.
To create a Tree Map in Tableau, simply drag and drop your dimension and measure fields onto the "Rows" and
"Columns" shelves. Then, choose "Tree Map" from the "Show Me" panel.
CHLOREPATH CHART:
Chloropleth Charts are a type of map visualization that color-codes geographic areas based on a data metric.
This allows you to quickly see patterns and trends in your data across different regions.
Tableau offers a range of customization options for Chloropleth Charts. You can adjust the colors and shading,
as well as add labels, tooltips, and legends to help interpret the data.
To create a Chloropleth Chart in Tableau, you'll need a geographic field, such as a city, state, or country, and a
corresponding data field to display on the map.
Once you have your data ready, just drag the geographic field to the "Rows" or "Columns" shelf, and the data
field to the "Color" shelf. Then, choose the map type you want to use, and Tableau will create a Chloropleth
Chart for you.
WORD CLOUD:
Word Clouds are a type of visualization that displays words in different sizes, based on their frequency or
importance in a given text or dataset.
In Tableau, Word Clouds can be created by simply dragging a text field to the "Text" card, and a numeric field
to the "Size" card.
Tableau offers a range of customization options for Word Clouds, including font, color scheme, and the number
of words displayed.
Word Clouds can be used to quickly identify trends, patterns, or outliers in large datasets, making them a
powerful tool for data exploration and analysis.
To create a Word Cloud in Tableau, just drag your text and numeric fields to the appropriate cards, and choose
"Word Cloud" from the "Show Me" panel.
SUNBURST CHART:
A Sunburst Chart is a type of visualization that displays hierarchical data in a circular format. The outer ring
represents the main categories, while the inner rings represent subcategories.
Each section of the chart is divided into arcs, with the size of the arc representing the proportion of the data. The
color of each arc can be customized to represent a specific category or subcategory.
Sunburst Charts are great for exploring and analyzing complex data structures. They make it easy to identify
patterns and relationships within the data.
To create a Sunburst Chart in Tableau, simply drag and drop your hierarchical data onto the "Marks" card. Then,
choose "Pie" from the "Show Me" panel, and drag the "Size" and "Color" fields onto the corresponding shelves.
RIBBON CHART:
Ribbon Charts are a unique way to display the relationship between two dimensions, such as a time period and a
category.
They are visually similar to stacked bar charts, but with a smooth ribbon connecting the bars. The width of the
ribbon indicates the magnitude of the data.
Tableau offers several customization options for Ribbon Charts, including color schemes and layout options.
To create a Ribbon Chart in Tableau, simply drag your dimensions to the rows and columns shelves, and your
measure to the size or color shelves. Then, choose "Ribbon" from the "Show Me" panel.
✅ ✅ ✅
Line Chart
✅ ✅ ✅
Bar Chart &
Column Chart
✅
Pie Chart
✅ ✅
Scatter Plot
✅ ✅ ✅
Area Chart
✅ ✅
Histograms
✅ ✅
Gantt Chart
✅ ✅
Bubble Chart
✅
Doughnut
Chart
✅ ✅
Radar Chart
✅
Waterfall
Chart
✅
Pareto Chart
✅
Funnel Chart
✅ ✅
Heat Map
✅ ✅
Tree Map
✅ ✅
Choropleth
Chart
✅
Word Cloud
✅
Sunburst
Chart
✅
Ribbon Chart
Line Chart: Best for showing trends over time, such as stock prices or website traffic.
Bar Chart & Column Chart: Great for visualizing categorical data, such as sales by region or number of website
visitors by day of the week.
Pie Chart: Useful for showing how parts contribute to a whole, like the percentage breakdown of a budget or
market share.
Scatter Plot: Perfect for visualizing relationships between two variables, such as the correlation between
temperature and sales.
Area Chart: Ideal for showing how a quantity changes over time, like the total revenue of a company.
Histograms: Best for showing the distribution of a single variable, such as the age distribution of a population.
Gantt Chart: Useful for showing the progress of a project over time, such as the different stages of a
construction project.
Bubble Chart: Great for visualizing three variables on a single chart, such as the relationship between revenue,
profit, and market share.
Doughnut Chart: Similar to a pie chart, but with a hole in the middle.
Pareto Chart: A combination of a column chart and a line chart, used to show the relative frequency of different
occurrences.
Funnel Chart: Used to show the different stages of a process, such as a sales funnel.
Heat Map: Best for visualizing data in a table format, using color to represent the value.
Tree Map: Useful for showing hierarchical data, such as the distribution of sales across different product
categories and subcategories.
Choropleth Chart: Used to show data on a geographic map, using color to represent the value.
Word Cloud: A visual representation of text data, where the size of each word represents its frequency.
Sunburst Chart: Used to show hierarchical data, where the size of each segment represents its value.
Ribbon Chart: A variation of a bar chart, used to show the relationship between two variables.
Chapter 4: Business Analysis
Dashboarding with Power BI
Installing PowerBI
Here are the steps to install Power BI Desktop:
Here are the steps to import data from Excel into Power BI:
1. Open Power BI Desktop and select "Get Data" from the Home tab in the ribbon.
2. In the "Get Data" dialog box, select "Excel" and then click "Connect".
3. In the "File" field, browse to the location of the Excel file that you want to import.
4. Select the Excel file and then click "Open".
5. In the "Navigator" dialog box, select the sheet or table you want to import and click "Load".
1. If you want to transform or clean the data before importing it, select "Transform Data" before clicking
"Load".
2. In the Power Query Editor, you can perform various data transformations like filtering, sorting,
merging, etc.
3. Once you have transformed the data as needed, click "Close & Apply" to import the data into Power
BI.
4. The imported data will now appear in the "Fields" pane on the right-hand side of the Power BI Desktop
screen.
5. You can now start building visualizations and reports using the imported data.
1. Open Power BI Desktop and open the report or dashboard you want to view.
2. In the "Fields" pane on the right-hand side of the screen, select the table or visualization you want to
view the data for.
3. The data for the selected table or visualization will appear in the "Values" section of the
"Visualizations" pane on the right-hand side of the screen.
4. You can customize the view of the data by adding filters, sorting, and grouping the data as needed.
5. To add a filter, select the table or visualization you want to filter and then drag the column you want to
filter by into the "Filters" section of the "Visualizations" pane. You can then select the values that you
want to filter by.
6. To sort the data, select the table or visualization you want to sort and then click on the column header
you want to sort by.
7. Once you have customized the view of the data, you can save the report or dashboard and share it with
others.
Once the "Format" pane is open, users can make various changes to the appearance of their visuals, including
the color scheme, font size and style, background color, and many other properties. The changes made in the
"Format" pane are applied in real-time, allowing users to see the effect of their changes immediately.
The "Format" pane is a powerful tool that can be used to create professional-looking reports and dashboards that
are tailored to specific audiences or use cases. With its intuitive interface and extensive customization options,
the "Format" pane is essential for any Power BI user who wants to create compelling data visualizations.
Format Painter is a useful tool in Power BI that allows users to copy formatting from one visual to another
quickly. It can be accessed by selecting a visual, clicking the Format Painter button on the Home tab, and then
clicking on the visual to which you want to apply the formatting. This can save time and effort when formatting
multiple visuals in a report.
1. Open your Power BI report and go to the page that contains the visual you want to format.
2. Select the visual that you want to copy the formatting from.
3. Go to the Home tab in the ribbon menu and click on the Format Painter button. It looks like a
paintbrush.
4. The cursor will turn into a paintbrush icon. Click on the visual or element that you want to apply the
formatting to. This can be a single visual or a group of visuals.
5. Once you have clicked on the target visual or element, the formatting will be applied instantly. You can
also use the Format Painter on multiple visuals or elements by clicking and dragging over them to
select them.
6. To turn off the Format Painter, simply click on the button again or press the Esc key on your keyboard.
1. Select the visuals that you want to align or position. You can do this by clicking on each visual while
holding down the Ctrl key, or by using the Select tool in the Visualizations pane.
2. Go to the Format tab in the Visualizations pane. Here, you will see a section called "Alignments."
3. Use the "Align Left," "Align Center," "Align Right," "Align Top," "Align Middle," and "Align
Bottom" buttons to align the selected visuals horizontally or vertically. You can also use the "Distribute
Horizontally" and "Distribute Vertically" buttons to evenly space out the selected visuals.
4. To adjust the position of a visual, click and drag it to the desired location. You can also use the
horizontal and vertical position fields in the Visualizations pane → general →Properties , to set precise
values for the position of the visual.
5. To resize a visual, click and drag one of the handles on the edges of the visual. You can also use the
Width and Height fields in the Visualizations pane to set precise values for the visual size.
Default Summarization: By default, Power BI automatically summarizes numerical data using the "Sum"
aggregation function. This can be changed to another aggregation function such as "Count" or "Average" by
selecting the visual in the report and then choosing the desired aggregation function from the "Values" drop-
down menu in the "Visualizations" pane.
Specific Visualization Summarization: Some visualizations in Power BI allow for more specific
summarization options. For example, individual columns can be set to different aggregation functions in a table
visualization by selecting the column and then choosing the desired aggregation function from the "Modeling"
tab in the "Visualizations" pane.
Number Formatting:
To change the number formatting in Power BI, you can follow the below steps:
Go to the Model View at the left side of the page .
In the Data pane, select the number data that you want to format.
In the properties section next to Data, you can see the options ‘Data Type’, ‘Format’ and ‘Decimal places’ , etc.
by which you can format your number data.
Choose the format that you want to apply to your data, such as currency, percentage, or decimal places.
Date Formatting:
To change the date formatting in Power BI, you can follow the below steps:
Go to the Model View at the left side of the page.
In the Data pane, select the date data you want to format.
In the properties section next to Data, you can see the options ‘Data Type’, ‘Format’ and ‘Decimal places’ , etc.
by which you can format your number data.
Choose the format that you want to apply to your date fields, such as the month name, day, year, or a custom
format.
By combining matrices and charts in your Power BI report, you can create a more comprehensive view of your
data and provide users with a variety of ways to interact with it.
Whether you're exploring trends, identifying outliers, or analyzing patterns, using different visualizations can
help you gain deeper insights and make more informed decisions.
Matrix:
In Power BI, you can create matrices to display summarized data in a grid format. Here's how to create a matrix
in Power BI:
1. Drill Down Data: Drill down allows you to explore and analyze data at different levels of granularity.
You can drill down from a higher-level summary view to a more detailed view of the data by clicking
on the data point or using the drill down button in the visualizations pane. You can also drill down on a
particular dimension or hierarchy to get more specific information.
2. See Data and Records: Power BI allows you to see the underlying data and records that make up your
visualizations. To see the data and records behind a visualization, you can click on the ellipsis (...) in
the top right corner of the visualization and select "See Data." This will open up a table view of the
data that you can interact with.
3. Export Data: Power BI allows you to export data from your visualizations in several different formats.
To export data, you can click on the ellipsis (...) in the top right corner of the visualization and select
"Export data." You can then choose the format you want to export the data in, such as CSV, Excel, or
PDF. You can also choose to export the summarized data or the underlying data.
These features allow you to dive deeper into your data and gain more insights and understanding of your data
sets.
1. Axis Range: Power BI allows you to choose the maximum and minimum values or range of values you
want to show on your continuous axis. Go to “format” in the visualizations pane → Axis→ Range.
2. Axis values: You can customize the axis values by clicking on the axis and then click "Format" in the
Visualizations pane. Under the "Axis" section → values, you can change the style, font size, color, and
other options.
3. Axis Title: Click on the Title option under the Axis in format to customize the Title of the Axis.
4. Bar colors: To change the colors of the bars, click on the bars and then click "colors" in the
Visualizations pane. You can choose a single color for all the bars or assign different colors to
individual bars based on the data.
5. Legend: You can customize the legend by clicking on it and then click "Format" in the Visualizations
pane. Under the "Legend" section, you can change the position, font size, color, and other options.
6. Data labels: Data labels can be used to display the actual values for each bar in the chart. To add data
labels, click on the chart and then click on "Data labels" in the Visualizations pane. From there, you
can choose which data to display and customize the appearance of the labels, such as font size, color,
and position.
7. Zoom slider: The zoom slider is a useful feature in Power BI that allows you to zoom in and out of
your chart. To add a zoom slider, click on the chart and then click on "Format" in the Visualizations
pane. From there, you can turn on the "Data Zoom" option, which will add a zoom slider to your chart.
You can then adjust the zoom level using the slider.
8. Gridlines: Gridlines can be used to make it easier to read the chart and compare data points. To add
gridlines, click on the chart and then click on "Format" in the Visualizations pane. From there, you can
turn on the "Gridlines" option and customize the appearance of the gridlines, such as color and style.
1. To have a selected visualization highlight another visualization on the page, select the highlight
icon.
1. To have a selected visualization not impact any other visualization on the page, select the no
impact icon.
Charting Guidelines: Histograms, Bullet Charts, Heat Maps and Highlight Tables
Histograms:
Use histograms to show the distribution of a single variable.
Use appropriate bin sizes and intervals to accurately display the data.
Bullet Charts:
Use bullet charts to show progress toward a goal or target.
Use different colors to indicate different levels of achievement.
Heat Maps and Highlight Tables:
Use heat maps and highlight tables to show patterns or trends in large datasets.
Use a color scheme that is easy to read and understand.
Use tooltips to provide additional information about each cell.
TreeMaps:
Use treemaps to show hierarchical data.
Use different colors or shades to indicate different levels of the hierarchy.
Box-and-Whisker Plots:
Use box-and-whisker plots to show the distribution of a dataset.
Use different colors or styles to differentiate between different categories or groups.
Include axis labels and units to provide context for the data.
In general, it's important to choose the right type of chart for the data and purpose of the visualization. Keeping
the chart simple, easy to read, and visually appealing is also important. Consider using color and other
formatting options to highlight key data points and make the chart more engaging.
Text Boxes
1. To add a text box in Power BI Desktop, navigate to the Home tab, choose Insert, and then select Text
box.
2. An empty text box will be placed on the canvas. You can move the text box to your preferred location
by selecting the grey area at the top and dragging it. To adjust the size, use any of the sizing handles to
resize the text box.
3. Once you have positioned the text box, type your text into it. You can customize the font style by
setting the color, size, and subscript or superscript, and also modify text alignment, indents, and
bulleted lists.
4. To add further formatting, select the text box, and utilize the Format text box pane. This can include
changing the color theme, adding bullets or superscripts, and specifying a precise size and position for
the text box.
5. When finished editing the text box, select any blank area on the report canvas.
6. In Power BI service, you have the option to pin the text box to a dashboard. Select the pin icon and
then choose the dashboard to which you want to pin it.
Visual Level, Page Level and Report Level Filters - Basic Filters
In both Power BI Desktop and Power BI service, the Filters pane will appear on the right-hand side of the report
canvas. If it is not visible, click the ">" icon located at the upper-right corner to expand it.
Filters can be applied at three distinct levels within the report, including the visual level, the page level, and the
report level.
For the examples, I used the retail analysis sample available in Power bi . Install the Retail Analysis sample .
Visual Level
1. To begin, click on the Report icon in Power BI Desktop.
2. If not already open, navigate to the Visualizations, Filters, and Fields panes and open them.
3. Choose a visual to activate, such as the scatter chart on the Overview page. All fields within the visual
will be displayed in the Visualizations pane, as well as listed in the Filters pane under the "Filters on
this visual" heading.
4. Next, select the field that you want to add as a new visual-level filter from the Fields pane, and drag it
into the "Filters on this visual" area. For example, if you want to filter by category, drag the Category
field into the "Add data fields here" section, and select the desired category (e.g., Kids).
5. The scatter chart will now be filtered by the selected category, while the other visuals on the report will
remain unchanged.
Page Level
1. To get started, open the Retail Analysis report in Power BI Desktop.
2. Once the report is open, click on the Report icon and navigate to the District Monthly Sales page.
3. If not already open, open the Visualizations, Filters, and Fields panes.
4. Next, select the field that you want to add as a new page-level filter from the Fields pane, and drag it
into the "Add data fields here" section located in the "Filters on this page" area. For example, if you
want to filter by the district manager, drag the District Manager field into the "Add data fields here"
section.
1. Select the values you want to filter and set either Basic or Advanced filtering controls.
All visualizations on the page will be redrawn to reflect the filter change. If you save the report with the filter,
users who view the report in Reading view will be able to interact with the filter by selecting or clearing values.
Report Level
1. To add a report-level filter in Power BI Desktop, start by clicking on the Report icon.
2. Next, open the Visualizations and Filters pane and the Fields pane, if they're not already open.
1. From the Fields pane, select the field that you want to add as a new report-level filter and drag it into
the Report-level filters area.
2. Select the values that you want to filter.
The visuals on the active page and on all pages in the report will change to reflect the new filter. If you save the
report with the filter, users who view the report in Reading view will be able to interact with the filter by
selecting or clearing values.
Create a Slicer
We have used the Retail Analysis Sample PBIX file available on Power bi to create a slicer.
1. To open the Retail Analysis Sample PBIX.pbix file in Power BI Desktop, go to the menu bar and select
File > Open report.
2. Then, select the Report icon on the left pane to open the file in report view.
3. Once you're on the Overview page, create a new slicer by selecting the Slicer icon in the Visualizations
pane, provided that nothing is selected on the report canvas.
1. Select the new slicer and go to the Fields pane. Expand the District table and choose DM to add the
data to the slicer.
2. Adjust the size and position of the elements on the report canvas to make room for the slicer. If the
slicer is too small, some of its items may be cut off.
The slicer is now populated with a list of district manager names and their corresponding selection boxes.
1. Try selecting different names on the slicer and observe how it affects the other visualizations on the
page. To deselect a name, simply click on it again. You can select multiple names by holding down the
Ctrl key. If all names are selected, it has the same effect as selecting none.
However, it's important to note that slicers can sometimes produce misleading or inaccurate results. This is
particularly true when users select multiple items from the slicer, as the filter will show data that meets any of
the selected criteria, rather than data that meets all of them. This can result in data that is less precise and less
accurate.
To address this issue, Power BI includes a slicer warning feature that alerts users when they have selected
multiple items from a slicer. The warning message reminds users that the filter may produce inaccurate results
and encourages them to use the "Select all" option or to select a smaller number of items for more accurate
filtering.
In summary, slicers are a powerful tool in Power BI for filtering data and exploring insights, but users should be
aware of the limitations and potential inaccuracies that can arise from using them with multiple selections. The
slicer warning feature helps to mitigate these issues and encourage more accurate filtering.
1. Click on the three dots at the top right corner of the multiple chart.
2. Select the sort small multiples option in the drop-down menu.
3. Now select the field by which you want to sort .
However, creating bookmarks that others can use requires editing permissions for the report.
Drillthrough
Drillthrough is a powerful feature in Power BI that allows users to navigate from a summary report to a detailed
report. It allows users to focus on specific data points and explore them in more detail.
1. In order to enable drill through, you can generate a target report page that includes the desired visuals
for the specific entity you intend to provide drillthrough for example I had generated Sales vs Sub-
category as shown below :
2. Next, navigate to the Build visual section of the Visualizations pane on the drillthrough target page,
and move the field that you want to enable drillthrough for into the Drill through well. Below I dropped
the category field to drillthrough . This acts as summary data for the sub-category .
1. Now you can drill through the subcategory data by selecting category in the drill through menu .
2. Or you can create a new report, “Summary” in the example. The new report should contain the
summary of the target report you had created earlier . In the example I had created Sales vs Category
since the target report contains sales vs sub-category .
3. Now add the sub-category to drill through in the Summary page .
4. Now on the Target page, you can right-click on any data point and select drillthrough →Summary .
5. You get the summary of the selected data point in the summary page . You can do the same in the
summary page, here you can drill through each category.
Buttons and Actions
By utilizing buttons in Power BI, it's possible to generate reports that function much like applications and foster
an environment where users can engage with Power BI content by hovering, clicking, and performing other
interactions.
On the Insert ribbon in Power BI Desktop, you can access the Buttons drop-down menu to select the
desired button from a collection of options
Once you have selected a button on the report canvas, the Format pane presents multiple options for
customizing the button to meet your specific requirements, such as modifying the shape.
Button States
Buttons in Power BI have four potential states, which are Default, On hover, On press, and Disabled.
The Format pane enables you to adjust formatting for each state independently, providing ample flexibility to
tailor your buttons. The Shape, Style, and Rotation cards in the Format pane allow you to modify the button
appearance for each state.
1. Access the Format pane, select the Button tab, and expand the appropriate card.
1. Then, under Apply settings, select State at the top of the card, and choose the settings you wish to apply
for that state.
In the included image, the Style card and Icon are expanded, and the State is On hover, with the Icon type set to
the Right arrow.
Actions for Buttons
In Power BI, it is possible to choose the action that will be triggered
when a user clicks a button. Some of the available actions include:
Back: This action takes the user back to the previous page of the report,
which can be helpful for drillthrough pages.
Bookmark: This action displays the report page associated with a
bookmark that has been defined for the current report. To learn more
about bookmarks in Power BI, refer to the relevant documentation.
Drillthrough: This action takes the user to a drillthrough page that is
filtered based on their selection, without using bookmarks. For more
information on using drillthrough buttons in reports, see the Power BI
documentation.
Page navigation: This action directs the user to a different page within
the report, without using bookmarks. Details on how to create page
navigation can be found in the Power BI documentation.
Q&A: This action opens a Q&A Explorer window that allows users to
ask natural-language questions about the data. When a Q&A button is
selected, the Q&A Explorer will launch automatically.
Web URL: This action opens a web
page in a browser.
Go to the Button tab in the Format pane and turn the Action toggle to On.
Expand the card to access the Action options, and select the Type of button action you want.
To test the button, hold down the Ctrl key and select the button you have created in your report.
1. Changing the page size: You can adjust the size of the report page to suit your needs. Simply select
the Page Size option from the Format pane and choose the size you want.
2. Using themes: Power BI includes a variety of built-in themes that can be applied to your report page to
give it a consistent and polished look. To change the theme, select the Theme option from the view
pane and choose a theme from the drop-down menu.
3. Background color and wallpaper :You can change the canvas color by selecting the "Format" tab in
the Visualizations pane, and then selecting "Page background" from the drop-down menu. From there,
you can select a color or even use an image as the background.
The Wallpaper can be changed in the similar way.
Tooltip Pages
Tooltip pages in Power BI are a feature that allows you to create additional information or data visualizations
that appear when you hover over a particular visual on your report. When you hover over a visual, a tooltip
appears with additional information or a link to another page or report. This allows you to provide more context
to your visualizations without cluttering your main report.
To create a tooltip page in Power BI, you can follow these steps:
1. Select the visual that you want to add the tooltip to.
2. Navigate to the "Visualizations" pane and select the "Format" tab.
3. Expand the "Tooltip" section and toggle the "Tooltip" switch to "On".
1. Click on the "Type" drop-down menu and select "Report page".
2. Choose the report page that you want to link to from the tooltip.
3. Customize the tooltip page by adding visuals or information that you want to display when the user
hovers over the visual on the main report page.
4. You can also customize the text and background of the tooltip page in the Text and Background
options available in the Tooltips drop-down menu.
You can also add visuals to the tooltip page that are not included in the main report, providing additional
information or detail that may not fit within the constraints of the main report. Tooltip pages can help to make
your report more interactive and informative for your users.
1. Open your Power BI report and go to the "Visualizations" pane on the right-hand side of the screen.
2. Click on the three dots in the top-right corner of the pane and select Get more visuals .
3. In the marketplace search bar, type in the name of the custom visual you want to import and hit enter or
you can select from All visuals, Organizational visuals, and from AppSource available in Power BI
visuals.
1. Click on the visual you want to import from the search results, and then click "Add".
2. Review the details of the custom visual and click "Import" if you want to proceed.
3. Once the custom visual is imported, it will appear in the "Visualizations" pane. You can then drag it
onto your report canvas and begin using it.
To import local visualization select “Import a visual from a file” , and then select the visual on your local
computer you want to import .
Introduction to Mapping:
Power BI is a powerful data visualization tool that enables users to create interactive maps that can help them
analyze and understand their data more intuitively.
Formatting Maps
Customize the map style: To customize the map style, you can select the "Format" pane and choose the "Map"
tab. Here, you can change the map type, such as selecting a satellite view or a road map view, and adjust the
zoom level.
Change the color scheme: To change the color scheme of the map, you can select the "colors" tab in the
Bubbles menu in the "Format" pane. Here, you can choose a color palette, adjust the color saturation, and set the
minimum and maximum values.
Add data labels: To add data labels to the map, select the "Category labels" tab in the "Format" pane. Here, you
can choose to display data labels, adjust the font size and color, and customize the label format.
Other general customizations include tooltips, Legend, etc.
These are some basic steps to format maps in Power BI. With these options, you can create customized,
interactive maps that display your data in a visually appealing way.
Color Blindness
Color blindness can make it difficult for people to distinguish between certain colors, which can be a significant
challenge when using maps in Power BI. To make maps accessible to people with color blindness, it's important
to use color schemes that have high contrast and are easily distinguishable. Here are some tips to make your
maps more accessible:
1. Avoid using colors that are close in hue or saturation, as people with color blindness may have
difficulty distinguishing between them.
2. Use high-contrast colors, such as black and white, to differentiate between features on the map.
3. Use patterns, textures, or shapes in addition to colors to represent different features on the map.
4. Use color-blind-friendly palettes, which are designed to be easily distinguishable by people with color
blindness. Some examples include the "Color Blind 10" palette and the "Cividis" palette.
5. Use data labels or annotations to provide additional context and information about the features on the
map.
By following these tips, you can create maps in Power BI that are accessible and easily understood by people
with color blindness.
Users can add various types of data to their maps, including points, lines, polygons, and heat maps, and can
overlay them with additional data layers such as demographics, weather data, and imagery. ArcGIS Maps also
offers a wide range of base maps to choose from, including satellite imagery, topographic maps, and street maps.
In addition to creating static maps, ArcGIS Maps for Power BI also allows users to create interactive dashboards
that can be used to explore data and gain insights. Users can add filters and slicers to their maps, which can be
used to dynamically update the data displayed on the map based on user selections.
ArcGIS Maps for Power BI is available as an add-in for Power BI and requires a subscription to ArcGIS Online
or ArcGIS Enterprise to use.
1. Once you've connected to your data source, create a new report in Power BI.
2. Select the "KPI" visualization from the Visualizations pane.
3. Drag and drop the data field you want to use as the KPI value into the "Value" field well. This could be
a metric like revenue, profit margin, or customer satisfaction.
4. Set the target or goal for your KPI. You can either set a static target value or use a dynamic target that
updates based on other metrics or data. The below chart represents KPI of sum of profit by region .
5. Customize the formatting of your KPI to match your branding or design preferences. You can adjust
colors, fonts, and other formatting options.
6. Add your KPI to a dashboard or report page, along with other visualizations and metrics that provide
additional context and insight.
By following these steps, you can create effective KPIs in Power BI that helps you monitor performance, track
progress toward goals, and make data-driven decisions.
Gauges
Gauges are visual representations of data that help to quickly and easily convey information about performance.
They are commonly used to show progress toward a specific goal or target. Gauges typically have a range of
values, with a needle or pointer that indicates the current value. Examples of gauges include speedometers, fuel
gauges, and temperature gauges.
Creating gauges in Power BI is a straightforward process. Here are the steps you can follow:
1. Cards Cards are simple visualization that displays a single metric or value. They are useful when you
want to quickly highlight a key performance indicator (KPI) or other important metrics. To create a
card in Power BI:
Click on the "Card" visualization icon from the Visualizations pane.
Drag and drop the data field you want to display on the card into the "Values" field well.
Customize the card by adjusting the formatting options, such as font size, color, and background.
2. Multi-row cards Multi-row cards are similar to cards but can display multiple metrics or values in a
single visualization. They are useful when you want to display a summary of several KPIs or other
metrics. To create a multi-row card in Power BI:
Click on the "Multi-row card" visualization icon from the Visualizations pane.
Drag and drop the data fields you want to display on the card into the "Values" field well.
Customize the multi-row card by adjusting the formatting options, such as font size, color, and
background. The below multi-row card shows the sum of profits and discounts in different regions.
You can also add additional fields to the multi-row card, such as images or icons, to make it more
visually appealing.
Both cards and multi-row cards are highly customizable in Power BI, allowing you to format them to match
your branding or design preferences. You can also use them in combination with other visualizations, such as
charts and tables, to create informative and engaging dashboards and reports.
Quick Measures
Quick Measures is a feature in Power BI that allows users to create new measures by using a simple interface
instead of writing DAX expressions. It can save time and effort for users who may need to improve in DAX.
1. Select the table that contains the column you want to use for the measure.
2. Click on the "New Quick Measure" button under the "Quick Measures" section of the ribbon.
3. Choose the calculation you want to perform from the list of available options, such as sum, average,
count, or min/max.
4. Select the column you want to use for the calculation from the dropdown menu.
5. Click "OK" to create the measure.
Once you create a Quick Measure, it will appear in the "Fields" pane as a regular measure, and you can use it in
visualizations and calculations like any other measure.
Note that Quick Measures are based on DAX expressions, so it is still helpful to have some familiarity with
DAX functions and expressions. Quick Measures may also be unsuitable for complex calculations or advanced
scenarios, where you may need to write custom DAX expressions.
Create Reference Lines by Using Analytics Pane, Including the Forecast Feature
The Analytics pane in Power BI is a feature that allows users to add advanced analytical calculations and
visualizations to their data in a report.
The Analytics pane is available in the Visualizations pane of the Power BI Desktop application and can be
opened by selecting a visual on the canvas and then clicking on the "Analytics" button in the Visualizations
pane. Once the Analytics pane is open, users can choose from a variety of analytical calculations such as
averages, running totals, percent of totals, and more. These calculations can be added to a visual to provide
additional insights into the data being analyzed.
To add a reference line using the Analytics pane, follow these steps:
Forecasting
The forecasting feature can be utilized if your data source includes time data. By selecting a visual, you can
access the Forecast section of the Analytics pane and make adjustments to the forecast based on various inputs,
such as the Forecast length or the Confidence interval. The provided image demonstrates a simple line visual
with forecasting applied. Experimenting with forecasting can help you imagine how it may be beneficial for
your models.
The following visual supports x-axis constant line, y-axis constant line, and symmetry shading:
Scatter chart
The following visuals allow for the use of constant line, min line, max line, average line, median line, and
percentile line from the Analytics pane:
Area chart
Clustered bar chart
Clustered column chart
Line chart
Scatter chart
The following visuals only support the use of a constant line from the Analytics pane:
Stacked area chart
Stacked bar chart
Stacked column chart
Waterfall chart
100% Stacked bar chart
100% Stacked column chart
The following visuals support the use of a trend line if time data is present:
Area chart
Clustered column chart
Line chart
Line and clustered column chart
Scatter chart
Identify Outliers
Power BI provides several methods to identify outliers in your data. Here are some of them:
1. Scatter chart: A scatter chart can show outliers by displaying data points that are far away from the
general trend.
2. Box and Whisker chart: This chart displays the distribution of a dataset using quartiles. Any data
points that are outside the whiskers are considered outliers.
3. Z-score: A Z-score measures how many standard deviations a data point is from the mean. A Z-score
greater than 3 or less than -3 indicates an outlier.
4. Modified Z-score: The modified Z-score is a variation of the Z-score that is more robust to outliers. A
modified Z-score greater than 3.5 or less than -3.5 indicates an outlier.
Using different methods to identify outliers is essential, as each approach may have limitations based on the data
structure and distribution. By utilizing a combination of these techniques, you can obtain a comprehensive
understanding of outliers in your data.
Use Clustering
Clustering involves grouping data points in a dataset that share similar properties, forming clusters of objects
with common characteristics. This technique is classified as unsupervised learning, meaning that there is no
need for pre-assigned labels for each data object.
Applications of clustering include identifying anomalies like fraudulent activities or defective mechanical parts,
as well as customer segmentation for marketing strategies and analyzing rideshare data.
Grouping:
1. Select the column you want to group from the Fields pane and drag it to the Values area of the table or
matrix visual.
2. Right-click on the column and select "New Group" from the context menu.
3. In the "New Group" dialog box, enter a name for the group.
4. Add the values you want to group by selecting them from the list of available values.
5. Click "OK" to create the group.
6. The group will appear as a new column in the Fields pane, which you can use in your visualizations.
Binning:
1. Select the column you want to bin from the Fields pane and drag it to the Values area of the table or
matrix visual.
2. Right-click on the column and select "New Bins" from the context menu.
3. In the "New Bins" dialog box, select the number of bins you want to create and the range of values to
bin.
4. You can also customize the bin size, start and end points, and other properties.
5. Click "OK" to create the bins.
6. The bins will appear as a new column in the Fields pane, which you can use in your visualizations.
Grouping and binning can help you organize data into meaningful categories, making it easier to analyze and
visualize. They are particularly useful for numerical data that can be divided into discrete ranges or categories.
1. Open Power BI Desktop and click on "Get Data" to import your data into Power BI.
2. Create a new report page by clicking on the "New Page" button.
3. From the "Visualizations" pane, select the "Table" visualization and add it to the report page.
4. From the "Fields" pane, drag and drop the fields you want to include in the table.
5. Customize the table by formatting the columns, adding filters, and changing the column headers.
6. Click on the "File" menu and select "Export" > "Power BI Paginated Report".
7. In the "Export to Power BI Paginated Report" dialog box, choose a location to save the report and enter
a name for the report.
8. Customize the report properties, such as the paper size and orientation, in the "Report Properties"
section.
9. Click on the "Export" button to create the paginated report.
10. Once the report is created, you can view it by opening it in Power BI Report Builder or any other tool
that supports paginated reports.
Creating a paginated report requires some basic knowledge of Power BI and report design. It is important to
select the appropriate data sources, format the report correctly, and design it in a way that is easy to understand
and navigate.
If there are no data connections, the Power Query Editor will appear as a blank pane, ready to receive data.
Once a query is loaded, the Power Query Editor becomes more functional. After establishing a data connection :
1. you'll notice that many buttons in the ribbon become active and you can interact with the data in your
query.
2. The queries you have created are listed in the left pane and can be selected, viewed, and edited.
3. In the center pane, the data from the selected query is displayed and can be manipulated.
4. Additionally, the Query Settings pane appears, which lists the query's properties and applied steps.
The "in" keyword is used to separate the variable assignments from the main expression of the function. The
main expression is evaluated after all of the variables have been defined.
For example, here is an M function that uses "let" and "in" to define two variables and return their sum:
let
a = 1,
b=2
in
a+b
In this example, the variables "a" and "b" are defined with the values 1 and 2, respectively. The main expression
of the function adds these two variables together and returns the result, which is 3.
Structure of an M function
The structure of an M function in Power BI consists of a number of elements, including:
1. Function name: This is the name given to the function, which should be descriptive and indicate what
the function does.
2. Function parameters: These are the inputs to the function, which can be optional or required. Each
parameter is defined by its name and data type.
3. Function body: This is the code that defines what the function does. The body of the function can
include variables, expressions, and other functions.
4. Function return value: This is the value that the function returns when it is called. The return value can
be any data type, including tables, lists, or scalar values.
Here is an example of a simple M function that takes two parameters and returns their sum:
typescriptCopy code
let
Sum = (a as number, b as number) =>
a+b
in
Sum
In this example, the function name is "Sum", and it takes two parameters of type "number" named "a" and "b".
The body of the function simply adds the two parameters together, and the return value is the result of the
addition.
Steps to apply M Functions in Query editor :
1. Open Power BI Desktop and load your data .
2. Click on ‘Transform Data’ button in the Home tab .
1. This will open the Power Query Editor .For example , I had loaded the Superstore data and you can see
it in the power query editor .
The advanced editor box appears as shown below , this is where you can write your M functions .You can
already see a m function , this refers to the loaded data .Any change you make in the data like sorting columns ,
deleting or adding columns will appear as a m function in this editor .
Dax Functions:
DAX stands for Data Analysis Expressions, and it is the formula language used in Power BI for creating custom
calculations and metrics.
In Power BI, DAX is used to create calculated columns, calculated tables, and measures.
To use DAX in Power BI, you can create a new measure or calculated column by selecting the "New measure"
or "New column" button from the "Modeling" tab.
You can then enter your DAX formula in the formula bar, and Power BI will automatically calculate the
results for you.
DAX includes a wide range of functions that can be used to create complex calculations, such as filtering,
aggregation, time intelligence, and more. By mastering DAX, you can create sophisticated data models in Power
BI that enable powerful data analysis and visualization.
The syntax of DAX functions in Power BI is generally similar to that of Excel functions. Here is an example of
the syntax of the SUM function in DAX: SUM(<column>)
Dax Functions
There are many DAX functions available in Power BI that can be used for creating calculations and metrics.
Here are some commonly used DAX functions in Power BI:
1. SUM: Calculates the sum of a column.
2. AVERAGE: Calculates the average of a column.
3. COUNT: Counts the number of rows in a table or column.
4. MAX: Returns the maximum value in a column.
5. MIN: Returns the minimum value in a column.
6. CALCULATE: Evaluates an expression in a modified filter context.
7. FILTER: Returns a table that has been filtered by a specified condition.
8. RELATED: Returns a related value from a different table in a one-to-many relationship.
9. IF: Returns one value if a condition is true and another value if it is false.
10. SWITCH: Evaluates an expression and returns a result based on a series of cases.
11. CONCATENATE: Combines two or more strings into one string.
12. YEAR: Returns the year of a date.
13. MONTH: Returns the month of a date.
14. DAY: Returns the day of a date.
15. TODAY: Returns the current date.
16. TOTALYTD: Returns the total of a measure for the year to date.
17. RANKX: Returns the rank of a value in a column.
18. DATEDIFF: Calculates the difference between two dates in days, months, or years.
Home Menu - Manage Columns, Reduce Rows and Use First Row as Headers
The Home menu in Power Query Editor of Power BI is where you can perform basic data transformation tasks,
such as managing columns, reducing rows, and using the first row as headers. Here's how to perform these
tasks:
1. Manage Columns: To manage columns in Power BI, go to the Home tab and select the "Manage
Columns" button. From here, you can perform various column transformation tasks, such as renaming
columns, changing data types, removing columns, and adding calculated columns.
2. Reduce Rows: To reduce the number of rows in your data, go to the Home tab and select the "Reduce
Rows" button. From here, you can filter your data based on specific criteria, such as removing
duplicates, filtering by value, or removing blank rows.
3. Use First Row as Headers: To use the first row of your data as headers, go to the Home tab and select
the "Use First Row as Headers" button. This will convert the first row of your data into column
headers, which can make it easier to work with and analyze your data.
By using these basic data transformation tools in the Home menu of Power BI, you can quickly clean and
prepare your data for analysis and visualization.
Here's a brief introduction to Sort and Filter options in the Home menu of Power Query:
1. Sort: The Sort function can be used to sort the data in ascending or descending order based on one or
more columns. To use the Sort function, select the column(s) that you want to sort, click the Sort
Ascending or Sort Descending button in the Transform tab of the Home menu, and select the
appropriate option. Users can also choose to sort by multiple columns by clicking on the drop-down
arrow next to the Sort Ascending/Descending button.
2. Filter: The Filter function can be used to exclude or include rows based on certain criteria. To use the
Filter function, select the column(s) that you want to filter, click on the Filter button in the Transform
tab of the Home menu, and select the appropriate option. Users can also choose to filter by multiple
columns by clicking on the drop-down arrow next to the Filter button.
The Filter function offers several filter types such as Equals, Not Equals, Greater Than, Less Than, and other
options depending on the data type of the selected column. Users can also apply multiple filters on different
columns by using the Add Column Filter option.
Split Column:
The Split Column function in the Home menu of Power Query Editor allows users to split a column into
multiple columns based on a specific delimiter or character. This is useful when dealing with data that contains
multiple values in a single column or when trying to separate information into different columns for analysis.
Here's a brief introduction on how to use the Split Column function in the Home menu of Power Query
Editor:
1. Select the column you want to split: Click on the column header to select the column that you want to
split.
2. Click on the Split Column button: In the Transform tab of the Home menu, click on the Split Column
button and select the appropriate option from the dropdown menu.
3. Choose the delimiter or character to split the column: In the Split Column dialog box, select the
delimiter or character that you want to use to split the column. You can choose from several options
such as Comma, Space, or Custom.
4. Choose how many new columns you want to create: Select how many new columns you want to create
based on the number of delimited values in the original column. You can choose to split the column
into two or more new columns.
5. Rename the new columns: After the column has been split, the new columns will be automatically
named with the original column name and an index number. You can rename the new columns by
right-clicking on the column header and selecting Rename.
The Split Column function can also be used to split a column by position or by the number of characters.
This option can be selected in the Split Column dialog box.
1. Merge Columns: The Merge Columns function allows users to combine data from multiple columns
into a single column. This is useful when dealing with data that has been split across multiple columns,
such as a first name and last name. Select the merge columns option from the “Transform” tab and the
merge dialogue box appears
2. Group By: The Group By function allows users to group data by one or more columns and perform
aggregations on the grouped data. This is useful for summarizing data and creating subtotals.
1. Pivot Column: The Pivot Column function allows users to pivot data in a table by converting rows
into columns. This is useful when dealing with data that is arranged in a vertical format and needs to be
transformed into a horizontal format.
2. Unpivot Column: The Unpivot Column function allows users to unpivot data in a table by converting
columns into rows. This is useful when dealing with data that is arranged in a horizontal format and
needs to be transformed into a vertical format.
3. Fill: The Fill function allows users to fill empty cells or missing values with data from adjacent cells or
by using a specific value. This is useful for cleaning data and ensuring consistency in a dataset.
4. Replace Values: The Replace Values function allows users to replace specific values in a column with
other values. This is useful for cleaning data and correcting errors in a dataset.
5. Text Transformations: The Text Transformations function includes a range of options for
manipulating text in a column, such as changing case, trimming whitespace, and extracting substrings.
This is useful for cleaning and normalizing text data.
1. In Power Query Editor, select the first table or query you want to merge.
2. Click on the "Merge Queries" button and select any one option ‘to merge queries’ or to ‘merge as new ’
in the "Combine" tab of the ribbon.
3. In the "Merge" dialog box, select the second table or query you want to merge.
4. Select the common column(s) that the two tables share and click "OK". This will create a new merged
table.
1. By default, the merge creates an "Inner" join. You can change this by selecting a different join type
from the dropdown menu in the Merge dialog box.
2. You can also use fuzzy matching options to merge and then click on ok .
Note that you can merge more than two tables or queries by repeating these steps for each additional table or
query.
Expand Table
Expanding tables in Power Query Editor in Power BI allows you to split a single column that contains multiple
values into separate columns. Here are the steps to expand a table in Power Query Editor in Power BI:
1. Inner Join: This type of join only includes rows that have matching values in both tables. Any rows
that do not have a match in the other table are excluded.
2. Left Outer Join: This type of join includes all rows from the left table and any matching rows from the
right table. If there are no matching rows in the right table, the resulting table will contain null values
for the right table columns.
3. Right Outer Join: This type of join includes all rows from the right table and any matching rows from
the left table. If there are no matching rows in the left table, the resulting table will contain null values
for the left table columns.
4. Full Outer Join: This type of join includes all rows from both tables, regardless of whether there is a
match in the other table. If there is no matching row in the other table, the resulting table will contain
null values for the columns from that table.
When joining tables in Power Query Editor, you can choose which type of join to use by selecting it from the
dropdown menu in the "Merge" dialog box. By default, the Inner Join is selected.
1. In Power Query Editor, select the first query you want to append.
2. Click on the "Append Queries" button in the "Combine" tab of the ribbon.
3. In the "Append Queries" dialog box, select the queries you want to append.
4. You can choose to between Two tables or Three or more tables .
1. You can choose to append the rows from the second query to the bottom of the first query (vertically)
or to the right of the first query (horizontally). Select the desired option and click "OK".
2. If you selected to append the rows vertically, the columns from the second query will be added to the
columns of the first query. If you selected to append the rows horizontally, the rows from the second
query will be added to the rows of the first query.
3. Once you have made all the desired changes, click "Close & Apply" to load the appended query into
Power BI.
1. In Power Query Editor, select the query that has the mismatched data type.
2. Click on the column with the mismatched data type to select it.
3. Click on the "Data Type" button in the "Transform" tab of the ribbon.
1. Select the desired data type from the dropdown menu.
2. Repeat steps 2-4 for each column with a mismatched data type.
3. Once you have resolved the data type mismatches, you should be able to append the queries together as
described above.
Combine Files
To combine files in Power Query Editor, you can follow these steps:
1. To initiate the procedure of merging files from a particular folder, opt for Get data, then select Folder
under File, and finally Connect.
2. To view the files in a folder within Power Query Editor, you need to input the folder path and click on
OK. Afterwards, select the option "Transform data" to see the files.
3. If you want to merge binary files in Power Query Editor, you can either click on the "Combine Files"
icon beside the "Content" label or go to Home and then choose "Combine Files".
1. Click the "Expand" icon next to the Folder Path column to expand it.
2. Select the columns you want to include in your combined table by checking the boxes next to their
names.
3. Click the "Combine" icon in the "Combine" group in the "Home" tab.
4. The combine files transform operates in the following way:
it examines every input file to identify the appropriate file type, which may include text, Excel
workbook, or JSON file.
Moreover, this transform enables you to choose a particular element from the initial file (such as an
Excel workbook) and extract it.
1. Subsequently, the combine files transform automatically executes the following actions:
Generates a sample query that encompasses all the necessary extraction operations within a single file.
Generates a function query that receives the file/binary input as a parameter for the exemplar query.
The exemplar query and the function query are connected, so that any modifications to the exemplar
query are propagated to the function query.
Applies the function query to the original query with input binaries, such as the Folder query. The
function query is executed for each row of the binary inputs and then expands the resulting data
extraction as top-level columns .
The combine files feature enables you to merge all the files in a designated folder effortlessly, provided they
possess comparable file types and structures (such as identical columns).
Transform menu
The Transform menu in Power Query Editor is one of the primary menus used to perform data transformations
in Power BI. This menu contains a wide range of data transformation options enabling users to modify and
reshape data according to their needs.
Some of the key features and capabilities of the Transform menu include:
1. Filtering: Allows users to filter rows based on specific criteria or conditions, such as text matching or
value ranges.
2. Grouping: Allows users to group data by one or more columns and perform aggregate functions, such
as summing or averaging, on the grouped data.
3. Pivot: Enables users to pivot data by transforming rows into columns and columns into rows.
4. Unpivot: The opposite of Pivot, which turns columns into rows and rows into columns.
5. Split Column: Allows users to split a column into multiple columns based on a specific delimiter, such
as a comma or a space.
6. Merge Columns: Enables users to merge two or more columns into a single column, with the option to
add a separator between the merged values.
7. Transpose: Transposes the data, turning rows into columns and columns into rows.
8. Fill: Allows users to fill missing values in a column, either by copying the value from the previous or
next row or by using a specific value or formula.
9. Extract: Enables users to extract specific parts of a text string, such as extracting the first name from a
full name column.
These are just a few of the many powerful features available in the Transform menu of Power Query Editor. By
using these tools, users can quickly and easily reshape and clean their data to prepare it for analysis and
reporting in Power BI.
Pivot Column
In Power Query Editor in Power BI, the Pivot Column transformation allows you to transform rows into
columns based on the values in a specified column. This is useful when you have data in a tabular format and
want to summarize it or aggregate it by specific columns.
Here's how you can use the Pivot Column transformation in Power Query Editor:
Unpivot
In Power Query Editor in Power BI, the Unpivot transformation allows you to transform columns into rows.
This is useful when you have data in a wide format, where each column represents a specific attribute, and you
want to convert it to a long format, where each row represents a unique combination of attributes.
Here's how you can use the Unpivot transformation in Power Query Editor:
1. Select the columns that you want to unpivot.
2. Select "Unpivot Columns" from the transform menu and select to unpivot ‘all columns’ or only
‘selected columns’ .
3. The selected columns get unpivoted .
1. Unpivot and then aggregate: You can use the Unpivot transformation to convert columns into rows,
and then aggregate the resulting values to create summary data. For example, you could unpivot
columns that represent monthly sales figures, and then aggregate those values by product or region to
create a summary of total sales.
2. Unpivot and then filter: You can use the Unpivot transformation to create a long format table and then
filter the resulting rows based on specific criteria. For example, you could unpivot columns that
represent different types of expenses, and then filter the resulting rows to show only expenses that
exceed a certain threshold.
3. Unpivot and then merge: You can use the Unpivot transformation to create a long format table and
then merge that table with another table based on a common key. For example, you could unpivot
columns that represent different types of products, and then merge that table with a separate table that
contains pricing information for those products.
4. Unpivot and then transform data types: You can use the Unpivot transformation to create a long
format table and then transform the data types of the resulting columns. For example, you could
unpivot columns that contain string values, and then transform those values into numerical values so
that they can be used in calculations.
5. Unpivot and then pivot: You can use the Unpivot transformation to create a long format table and
then pivot that table to create a summary of data. For example, you could unpivot columns that
represent different types of products, and then pivot that table to show the total sales for each product
category.
These are just a few examples of how you can use the Unpivot transformation in conjunction with other
transform features in Power Query Editor to create more complex data transformations. The possibilities are
endless, and the best approach will depend on the specific requirements of your analysis or reporting needs.
When you select the "Merge Columns" option, after you select the columns that you want to merge merge
dialogue box appears where you can choose to specify a separator character to be used between the merged
values, such as a comma, semicolon, or space.
Once you have selected the columns and separator character, the software will create a new column that
contains the merged values. The new column will be added at the place of the selected columns.
The "Merge Columns" option can be useful for consolidating data from different sources, such as combining a
first name column and a last name column into a single full name column.
When you select the "Extract" option, you will be prompted to specify the pattern or position that you want to
extract. The available options may vary depending on the software you are using, but common options include:
Extracting characters from a specific position: The options First characters ,last characters and
range allows you to extract a specific number of characters from the beginning, end, or middle of the
text string.
Extracting text between, before and after delimiters: This option allows you to extract text that is
enclosed between specific characters, such as a pair of parentheses or quotation marks.
Once you have specified the extraction pattern, the software will create a new column that contains the extracted
values. You can choose to insert the new column at a specific location in the dataset, or replace the original
column with the extracted values.
To execute this operation, you can click on the Parse button located in the Transform tab within the Power
Query Editor. By doing so, you can parse the contents of the column and transform it accordingly.
The sample table provided for this article includes columns that require parsing.
The SalesPerson column contains JSON text strings with details of the salesperson's FirstName and
LastName.
{
"id" : 249319,
"FirstName": "Lesa",
"LastName": "Byrd"
}
Similarly, the Country column contains XML text strings with information about the Country and the
Division to which the account has been assigned, as shown in the example.
<root>
<id>1</id>
<Country>USA</Country>
<Division>BI-3316</Division>
</root>
When we parse the columns of the above table and expand the columns we get the below output
The "Statistics" option generates a summary of descriptive statistics for the selected column. This includes
statistics such as the count, mean, median, standard deviation, minimum, and maximum values. It can also
provide information on the distribution of the data, including the skewness and kurtosis. This option is useful
when you need to understand the characteristics of your data and get an overview of its distribution.
The "Standard" option is used to standardize the values in a column. This involves scaling the values to have a
mean of zero and a standard deviation of one. Standardization can be useful when you want to compare
variables with different measurement units or scales. You can compare the variables on a level playing field by
standardizing them.
Both of these options are useful for data exploration and preparation, especially when you're working with large
datasets and need to get a quick overview of the data.
1. "Scientific": This option allows you to convert the values in the selected column to scientific notation.
You can specify the number of significant digits and the exponent base.
2. "Trigonometry": This option provides functions for calculating trigonometric values such as sine,
cosine, tangent, and their inverse functions for each value in the selected column.
3. "Rounding": This option provides functions for rounding the values in the selected column to a
specified number of decimal places or significant digits.
4. "Information": This option provides functions for testing the values in the selected column, such as
whether they are even or odd, positive or negative, or whether they contain a specific text string.
These options are useful when you need to perform more specialized operations on numerical data. The
scientific option can be used to convert large or small numbers to more manageable formats for analysis. At the
same time, the trigonometry functions are useful for analyzing cyclical data such as waveforms. The rounding
functions can be useful when you need to control the precision of calculations. In contrast, the information
functions can help you test the characteristics of your data for further analysis.
Transform - Dates and time
The time and date columns in the Transform menu of Power Query Editor are used to manipulate and transform
time and date data within a data set.
The time and date functions in Power Query Editor can be used to clean, transform, and prepare time and date
data for analysis or visualization. These functions are particularly useful when working with data sources that
store time and date information in various formats or time zones. They allow you to standardize the data for
easier analysis.
Creating a List of Dates
To create a list of dates in Power Query Editor, you can use the "List.Dates" function.
1. Open the Power Query Editor by clicking on "Edit Queries" in the Home tab of the Power BI Desktop
ribbon.
2. In the Power Query Editor, click on "New Source" in the "Home" tab and select "Blank Query".
3. In the "Formula Bar", type the following formula:
= List.Dates(#date(2020,1,1),365,#duration(1,0,0,0))
This formula creates a list of 365 dates starting from January 1, 2022, with a step of one day.
4. Click on "Done" in the Formula Bar to apply the formula.
5. Rename the query by right-clicking on it in the "Queries" pane and selecting "Rename".
6. Click on "Close & Apply" in the "Home" tab to close the Power Query Editor and apply the changes to
your data model.
The resulting query will generate a table with one column containing the list of dates. You can then use this
table as a source for your visuals and analysis in Power BI.
When you select the "Date" option, a new dialog box will appear, where you can choose the source column that
contains your date or date/time values, and then specify the type of date you want to create based on the values
in the source column. You can choose from various options, such as "Year", "Quarter", "Month", "Day",
"Weekday", and more.
For example, if you have a column with date/time values, you could use the "Date" option to create a new
column that only contains the year value from the original column. Alternatively, you could create a new
column that contains the day of the week for each date value in the original column.
Once you have selected the source column and specified the type of date you want to create, Power Query will
generate a new column with the requested date values based on the data in the source column.
1. Understand the different date formats: Different cultures and languages may use different formats
for writing dates. For example, in the United States, dates are often written in the format of
month/day/year (e.g., 03/12/2023), while in many other countries, the format is day/month/year (e.g.,
12/03/2023). Additionally, some countries may use different separators, such as hyphens or dots, to
separate the different components of the date.
2. Use the appropriate date functions: Depending on the programming language or software you are
using, there may be different functions or methods for working with dates in other cultures or
languages. Make sure you are using the appropriate functions for the language and culture you are
working with.
3. Be aware of different calendar systems: Different cultures and languages may use different calendar
systems. For example, the Gregorian calendar is used in many countries, but some countries may use
different calendar systems, such as the Chinese lunar calendar or the Islamic calendar. If you are
working with dates in a culture or language that uses a different calendar system, you may need to use
special functions or algorithms to convert between calendar systems.
4. Pay attention to language and cultural differences: When working with dates in other cultures or
languages, be aware of any language or cultural differences that may affect how dates are written or
interpreted. For example, in some cultures, dates may be written with the year first (e.g., 2023-03-12),
while in other cultures, the day may come first (e.g., 12 March 2023). Additionally, some cultures may
use different words or phrases to describe dates or date-related concepts.
Overall, when working with dates in other cultures or languages, it is important to be mindful of the differences
in date formats, calendar systems, and language and cultural norms. By taking these factors into account, you
can ensure that your date calculations and manipulations are accurate and culturally appropriate.
When you select the "Time" option, it will create a new column that contains only the time component of the
original date/time value. The format of the new column will be in a time format, such as "hh:mm:ss".
This can be useful if you have a column that contains both date and time values, but you only need to work with
the time component. For example, if you have a column that contains the start and end times for a task, you can
extract the time component to perform calculations based on the duration of the task.
Some of the operations you can perform on the Duration column include:
1. Converting duration values to a standard format: You can convert duration values to a standard format,
such as hours, minutes, and seconds.
2. Extracting components of the duration: You can extract specific components of the duration, such as
the total number of hours, minutes, or seconds.
3. Performing calculations with durations: You can perform various calculations with duration values,
such as adding or subtracting a certain number of hours, minutes, or seconds, or finding the difference
between two durations.
Overall, the Duration column in the Transform menu of Power Query Editor provides a range of useful tools for
working with time duration values in your data, helping you transform and manipulate it the way you need it.
View Menu: The View menu in Power Query Editor allows you to change the view of your data table. You can
find this menu on the Home tab of the Power Query Editor. There are various options available in the View
menu, such as sorting, filtering, grouping, and more. These options can help you customize the view of your
data table based on your specific needs.
Help Menu: The Help menu in Power Query Editor provides access to the online help resources and
community forums. You can find this menu on the Home tab of the Power Query Editor. There are various
options available in the Help menu, such as getting help, accessing the community forum, and more. These
options can help you learn more about Power Query Editor and find solutions to any issues you may face while
working with your data.
Conditional Column
The "Conditional Column" option in the "Add Column" menu in Power Query Editor allows you to create a new
column based on a condition that you specify.
When you select this option, a dialog box will appear where you can define the condition for the new column
using the Power Query formula language. The formula should evaluate to true or false for each row in the table.
If the formula evaluates to true, the value of the new column will be set to a specified value; if the formula
evaluates to false, the value of the new column will be set to a different specified value.
For example, let's say you have a column named "Sales" in a table and you want to create a new column named
"Profitable" that indicates whether the sale is profitable or not based on a certain profit margin threshold. You
could use the "Conditional Column" option to create this new column with a formula like this:
pythonCopy code
if [Sales] * 0.6 > [Cost], "Profitable", "Not Profitable"
This formula checks whether the sales amount is at least 60% greater than the cost of the sale. If it is, the value
of the "Profitable" column will be set to "Profitable"; if not, it will be set to "Not Profitable".
1. Check the syntax of your formula: Make sure that your formula is written correctly and follows the
rules of the Power Query formula language. Look for missing or mismatched parentheses, quotation
marks, and commas. You can also use the "Formula Bar" at the top of the Power Query Editor window
to check your formula syntax and get suggestions for completing your formula.
2. Verify column references: Check that the column references in your formula match the column names
in your data source. If a column is renamed or removed from your data source, any formulas that
reference that column will return an error. You can also use the "Applied Steps" pane in Power Query
Editor to trace the dependencies of your conditional column and ensure that all required columns are
available.
3. Handle null or missing values: If your formula references a column that contains null or missing
values, you may need to handle those values explicitly to avoid errors. You can use functions like "if
error" or "null. if" to handle null or missing values and return a default value instead.
4. Troubleshoot step-by-step: If you are still having trouble with your conditional column, you can use
the "View Native Query" option in Power Query Editor to see the SQL code generated by your steps.
This can help you identify any issues with your conditional column or other steps in your query.
By following these steps, you can resolve errors with conditional columns and create new columns with custom
formulas in Power Query Editor.
The "Index Column" option allows you to add a column containing a unique index value for each row in your
table. This is useful if you want to track the order of the rows or reference specific rows in a formula or
calculation. You can specify the starting value for the index and the increment value (i.e. how much the index
should increase by for each row).
The "Duplicate Column" option allows you to create a new column that duplicates the values in an existing
column. This can be useful if you want to modify the original column without losing the original values, or if
you want to perform calculations on the same column multiple times with different criteria. You simply select
the column you want to duplicate and give the new column a name.
Note that in order to use these options, you must first select the column or columns that you want to modify.
You can do this by clicking on the column header in the table preview or by selecting the column name in the
"Queries & Connections" pane.
View menu:
Formula Bar: Shows the M code for the current step in the query.
Applied Steps: Displays a list of all the steps in the current query, and allows you to select and modify them.
Data Preview: Displays a preview of the data after the current step in the query.
Column Profiles: Provides an overview of the data in each column, including data types, null values, and unique
values.
Query Dependencies: Shows a visual representation of the relationship between queries.
Help menu:
Power Query Documentation: Opens the Microsoft Power Query documentation in your default browser.
Diagnostics: Provides information about any errors or warnings in the current query.
Show Logs: Opens the Power Query trace logs in a new window.
Feedback: Opens the Microsoft Feedback hub where you can provide feedback or report issues.
In addition to these menus, the Column Properties pane can be accessed by right-clicking on a column header in
the data preview and selecting "Column Profile." This pane displays additional information about the selected
column, such as data type, minimum and maximum values, and distribution. You can also modify certain
properties of the column, such as the data type or renaming the column.
Overall, these menus and tools can be incredibly helpful in navigating and working with your data in Power
Query.
Data Preview
In the View menu of the Power Query Editor, there are several options related to the display of your data,
including:
1. Monospaced: This option displays your data in a fixed-width font, with each character taking up the
same amount of space. This can be helpful for aligning columns of data and making it easier to read.
2. Show Whitespace: This option displays whitespace characters, such as spaces and tabs, in your data.
This can be helpful for identifying issues with data quality, such as trailing spaces in text fields.
3. Column Quality: This option provides an overview of the quality of the data in each column,
including the percentage of null values, the number of distinct values, and the data type. This
information can be helpful for identifying potential issues with your data, such as columns with many
null values.
4. Column Distribution: This option displays a histogram of the values in the selected column, showing
the distribution of values across a range of bins. This can be helpful for identifying patterns in your
data and understanding the distribution of values.
5. Column Profile: This option provides a summary of each column in your data, including data types,
null values, and unique values. This information can be helpful for identifying potential data quality
issues and guiding your data transformation steps.
Advanced Editor
In the View menu of the Power Query Editor, there is an option called "Advanced Editor." This opens the
Advanced Editor window, which allows you to view and edit the M code behind your query.
1. Functions: Functions are reusable pieces of code that can be called multiple times with different inputs
to produce consistent outputs. Functions can be used to perform complex data transformations and
make your queries more modular and easier to maintain.
2. Parameters: Parameters are values that can be passed into a function or query to make it more
dynamic. Parameters can be used to control which data is imported or transformed, or to specify input
values for calculations or filters. Parameters can be created using the "New Parameter" option in the
"Home" tab. Parameters can be useful for creating dynamic dashboards or reports that allow users to
specify inputs such as dates or categories.
Parameters can be managed in the "Manage Parameters" option in the "Home" tab. This options allow you to
create, edit, and delete parameters, and to specify default values and data types. Once you have created a
function or parameter, you can call it from other queries or functions by referencing its name and passing in any
necessary inputs.
To work with Datetimezone data type in Power Query Editor, there are several functions available:
DateTimeZone.FromText: This function converts a text value to a Datetimezone value. The function takes two
arguments, the text value and the time zone offset, and returns a Datetimezone value. Syntax :
DateTimeZone.From(dateTime as datetime, timeZone as text)
Example:
cCopy code
DateTimeZone.From(#datetime(2023,3,13,12,0,0), "Eastern Standard Time")
Output:
Copy code
3/13/2023 12:00:00 PM -05:00
DateTimeZone.LocalNow: This function returns the current date and time in the local time zone as a
Datetimezone value.
scssCopy code
DateTimeZone.LocalNow()
Output:
Copy code
3/13/2023 12:00:00 PM -05:00
DateTimeZone.SwitchZone: This function converts a Datetimezone value from one time zone to another. The
function takes two arguments, the Datetimezone value and the new time zone offset, and returns a new
Datetimezone value.
vbnetCopy code
DateTimeZone.SwitchZone(dateTimeZone as datetimezone, timeZone as text)
Example:
lessCopy code
DateTimeZone.SwitchZone(DateTimeZone.From(#datetime(2023,3,13,12,0,0), "Eastern Standard Time"),
"Pacific Standard Time")
Output:
Copy code
3/13/2023 9:00:00 AM -08:00
DateTimeZone.ToLocal: This function converts a Datetimezone value to the local time zone. The function
takes one argument, the Datetimezone value, and returns a new Datetimezone value.
Syntax: DateTimeZone.ToLocal(dateTimeZoneValue)
Example: Suppose the input datetimezone value is #2023-03-13T08:30:00Z, which represents a datetime in
UTC
DateTimeZone.ToText: This function converts a Datetimezone value to a text value. The function takes two
arguments, the Datetimezone value and the text format, and returns a text value.
Syntax: DateTimeZone.ToText(dateTimeZoneValue, [optionalFormatString])
Example: Suppose the input datetimezone value is #2023-03-13T08:30:00Z, which represents a datetime in
UTC.
SQL Server offers a range of features, including high availability, security, scalability, and performance. It
supports a variety of data types, including integers, decimals, strings, and dates. SQL Server also supports
advanced features such as spatial data, full-text search, and XML data.
SQL Server consists of several components, including the Database Engine, Integration Services, Analysis
Services, Reporting Services, and Master Data Services. The Database Engine is the core component of SQL
Server, which stores, manages, and processes data. Integration Services is used for ETL (Extract, Transform,
Load) operations, Analysis Services is used for data analysis and OLAP (Online Analytical Processing),
Reporting Services is used for creating reports, and Master Data Services is used for data management.
1. Open Power BI and click on "Get Data" from the Home tab.
2. Select the database type you want to connect to (e.g., SQL Server, Oracle, MySQL, etc.).
3. Enter the server name and database name.
1. Choose the authentication method (Windows or database credentials) and enter the necessary
credentials.
2. Select the tables or views you want to import into Power BI and click on "Load" to import the data.
Query Folding in Power Editor Power BI:
Query folding is a technique used by Power BI to optimize queries by pushing as much of the query processing
as possible back to the data source. This helps to reduce the amount of data that needs to be transferred to Power
BI, improving performance and reducing resource consumption.
To use query folding in Power Editor in Power BI, follow these steps:
1. Right click on the Query and select ‘Edit Query’ option in Power BI , this opens power query editor .
2. Select the query you want to optimize and click on "Advanced Editor."
3. Verify that the query has been folded by checking the "Applied Steps" pane for a step labeled "Source"
or "Native Query."
1. If the query has not been folded, try to simplify or rewrite the query to use native database functions or
operators that can be folded.
2. After making any changes, click on "Close & Apply" to apply the changes and reload the data.
By using query folding in Power Editor, you can improve the performance of your Power BI reports and reduce
the load on your data source.
Import: This setting caches imported tables. If a query requests data from an Import table, it can only
be fulfilled from cached data.
DirectQuery: Tables with this setting are not cached. Queries that return data from DirectQuery tables
can only be fulfilled by executing on-demand queries to the data source, using the query language
specific to that data source (e.g. SQL).
Dual: Tables with this setting can function as either cached or not cached, depending on the context of
the query submitted to the Power BI dataset. Depending on the query, data can be fulfilled from either
cached data or on-demand queries to the data source.
It is important to note that changing the Storage mode from Import to another mode is a permanent action and
cannot be reversed.
1. INNER JOIN: This type of join returns only the rows that have matching values in both tables.
2. LEFT JOIN: This type of join returns all the rows from the left table and the matching rows from the
right table. If there is no matching row in the right table, then the result will have NULL values.
3. RIGHT JOIN: This type of join returns all the rows from the right table and the matching rows from
the left table. If there is no matching row in the left table, then the result will have NULL values.
4. FULL OUTER JOIN: This type of join returns all the rows from both tables, including the rows that
do not have matching values in the other table. If there is no matching row in one of the tables, then the
result will have NULL values.
To expand multiple tables in SQL Server using a JOIN statement, follow these steps:
1. Identify the columns that are related between the tables.
2. Determine the type of JOIN that you want to use.
3. Write the SQL statement using the appropriate JOIN statement and related columns.
4. Execute the SQL statement to retrieve the expanded data.
Here is an example SQL statement that uses an INNER JOIN to expand two tables:
sqlCopy code
SELECT *
FROM Table1
INNER JOIN Table2
ON Table1.RelatedColumn = Table2.RelatedColumn
This statement returns all the rows from Table1 and Table2 where the RelatedColumn values match. The *
symbol in the SELECT statement retrieves all columns from both tables.
You can modify this statement to use a different type of JOIN or to retrieve specific columns from the tables.
1. From the connector selection, choose the Azure SQL database option.
2. Provide the server and database name (optional) in the SQL Server database.
3. Choose either the Import or DirectQuery data connectivity mode.
4. Optionally, select and enter advanced options that can modify the connection query, such as a
command timeout or a native query (SQL statement). For more information, refer to the "Connect
using advanced options" section.
5. Select OK.
6. If this is your first time connecting to the database, select the authentication type, input your
credentials, and choose the level to apply the authentication settings to. Then, select Connect.
If you want to establish an encrypted connection to the Azure SQL database, follow the instructions in "Enable
encrypted connections to the Database Engine".
1. In the Navigator, select the desired database information, and then choose either Load to load the data
or Transform Data to continue transforming the data in Power Query Editor.
Using Azure SQL Database in Power BI
Azure SQL Database can be used as a data source in Power BI to create reports and visualizations. Here are the
steps to follow:
1. Connect to Azure SQL Database: In Power BI, go to Home > Get Data > Azure > Azure SQL
Database. Enter the server name and database name, and then select the appropriate authentication
method.
2. Select Tables: Once connected, select the tables you want to use as the data source. You can choose to
import the data into Power BI or create a direct connection to the Azure SQL Database.
3. Create visualizations: After selecting the tables, you can create various visualizations and reports
using the data. Power BI offers a wide range of visualization options such as tables, charts, maps, and
more.
4. Publish the report: Once you have created the report, you can publish it to the Power BI service. This
allows you to share the report with others and collaborate on it.
5. Refresh the data: If you have chosen to create a direct connection to the Azure SQL Database, you can
configure the data to be automatically refreshed at regular intervals. This ensures that the data is always
up-to-date in the report.
Overall, using Azure SQL Database as a data source in Power BI provides a powerful and flexible way to create
insightful reports and visualizations from your data.
Use the Microsoft Dataverse
Microsoft Dataverse (formerly known as Common Data Service) is a cloud-based data platform offered by
Microsoft as part of the Power Platform suite of products. It provides a secure and scalable database for storing
and managing business data in the cloud.
Dataverse allows businesses to create custom data models that can be used across various applications within
the Microsoft ecosystem, including Power Apps, Power Automate, Power Virtual Agents, and Power BI. The
data models can be created using a visual designer or code, and can be extended using custom code or pre-built
connectors.
Dataverse also provides advanced security features such as role-based access control, data encryption, and audit
trails. It can be integrated with other cloud-based services such as Azure Active Directory, Azure Data Factory,
and Azure Logic Apps.
1. Connect to Microsoft Dataverse: In Power BI, go to Home > Get Data > Power Platform >
Dataverse. Enter the URL of your Dataverse environment and then select the appropriate authentication
method.
2. Select Tables: Once connected, select the tables you want to use as the data source. You can choose to
import the data into Power BI or create a direct connection to the Dataverse.
3. Create visualizations: After selecting the tables, you can create various visualizations and reports
using the data. Power BI offers a wide range of visualization options such as tables, charts, maps, and
more.
Overall, using Microsoft Dataverse as a data source in Power BI provides a powerful and flexible way to create
insightful reports and visualizations from your data. It also provides a unified platform for data storage and
management across various applications in the Microsoft ecosystem.
Prerequisites
In order to utilize Power BI alongside Dataverse, the following components are necessary:
1. Firstly, you will need to download and install Power BI Desktop, which is a free application
designed to operate on your local computer. You can easily obtain Power BI desktop by visiting the
provided link.
2. Secondly, you must have access to a Power Platform environment with specific privileges, which
include read permissions to access data in a table, and maker privileges to modify a table in
make.powerapps.com. These privileges are granted through a security role such as system
customizer or environment maker.
3. Additionally, you will require an appropriate Power BI license that permits you to construct and
distribute Power BI reports.
Finally, to use the Dataverse connector, you need to enable the "Enable TDS endpoint" setting within your
environment. Further information on this can be found in the feature settings management section.
There are two types of page refresh available: fixed interval and change detection, when utilizing automatic
page refresh.
Fixed Interval :
For fixed interval refresh type, the visuals in the report page are updated based on a constant interval, such as
one second or five minutes. When the set interval is reached, all visuals on the page send an update query to the
data source and update accordingly.
Change Detection :
On the other hand, change detection refresh type refreshes visuals based on detecting changes in the data instead
of a fixed interval. This measure polls for changes in your DirectQuery source, and you must also select how
often Power BI Desktop will check for changes when defining the measure. It's worth noting that this refresh
type is only supported in workspaces that are part of a Premium capacity when published to the service, and
LiveConnect sources such as Analysis Services and Power BI datasets are not supported.
The process of creating reports with automatic page refresh in Power BI Desktop can be carried out, provided
that you are connected to a supported data source, such as DirectQuery sources and some LiveConnect
scenarios. The availability of automatic page refresh is subject to this restriction for both types of refresh.
To activate automatic page refresh, go to the report page you want to refresh automatically and access the
Visualizations pane. In this pane, you can locate the Page refresh section at the bottom, under the Formatting
button (a paint roller).
Once there, you can toggle the page refresh on or off, select the refresh type, and view the inputs and
information (which depend on the refresh type selected). Keep in mind that the Page refresh card is only
accessible when connected to a DirectQuery source and that the toggle must be in the "On" position to activate
automatic page refresh.
1. Connect to your big data source: Power BI supports many big data sources, including Hadoop,
Spark, and Azure Data Lake. You can connect to your big data source by clicking "Get Data" on the
Home tab in the Power BI Desktop, and then selecting the appropriate data source from the list.
2. Transform your data: Once you have connected to your big data source, you may need to transform
your data to make it suitable for analysis. Power BI provides a powerful set of tools for data
transformation, such as splitting columns, merging tables, and aggregating data.
3. Build your data model: Power BI uses a data modeling language called DAX to create relationships
between tables and define calculations. You can use DAX to build your data model and create measures
that summarize your data.
4. Create your visualizations: Power BI provides a wide range of visualizations that you can use to
create interactive reports and dashboards. You can drag and drop visualizations onto the canvas and
configure them to display the data in the way you want.
5. Once you have created your report, you can publish it to the Power BI service and share it with others.
The Power BI service provides a range of sharing options, such as sharing with specific individuals,
sharing with a group, or embedding the report in a web page.
In summary, Power BI provides a powerful set of tools for working with big data sources. By connecting to your
big data source, transforming your data, building your data model, creating your visualizations, and sharing your
reports, you can gain insights into your big data and make data-driven decisions.
Here are some steps to identify query performance issues and use Query Diagnostics in Power BI:
1. Identify the query: Start by identifying the query that is causing the performance issue. This can be
done by reviewing the queries that are being executed when the report is loaded or by identifying the
query that is taking the longest time to execute.
2. Check for query errors: Check for any errors in the query that could be causing performance issues.
Common errors include syntax errors or missing or incorrect parameters.
3. Check for query complexity: Review the complexity of the query, including the number of tables
being queried and the number of joins being performed. Simplifying the query may improve
performance.
4. Use Query Diagnostics: If you are still experiencing performance issues, you can use Query
Diagnostics to gather detailed information about the query execution. To enable Query Diagnostics, go
to File > Options and settings > Options > Diagnostics and select the "Enable tracing for query
analysis" option.
1. Analyze the Query Diagnostics report: Once you have enabled Query Diagnostics, you can run the
report and analyze the results. The report provides information about the query execution time, the
number of rows returned, and the performance of individual queries.
2. Take action: Based on the results of the Query Diagnostics report, you can take action to improve
query performance. This might include optimizing the query, adjusting the data model, or optimizing
the underlying data source.
In summary, identifying and diagnosing query performance issues is an important aspect of optimizing the
performance of your data models in Power BI. By following these steps, you can identify the query causing the
performance issue, diagnose the issue using Query Diagnostics, and take action to improve performance.
Activity 1:
Research on types of DAX functions available on Power BI:
There are several types of DAX functions available in Power BI that can be used to perform a wide range of
calculations and data analysis tasks. Here are some of the main categories of DAX functions:
1. Aggregation functions: These functions are used to perform calculations on numeric data, such as
SUM, AVERAGE, MIN, MAX, COUNT, and DISTINCTCOUNT.
2. Filter functions: These functions are used to filter data based on specific criteria, such as FILTER,
CALCULATETABLE, ALL, and VALUES.
3. Time intelligence functions: These functions are used to perform calculations on time-based data, such
as TOTALYTD, TOTALQTD, TOTALMTD, and DATESYTD.
4. Information functions: These functions are used to retrieve information about data, such as ISBLANK,
ISTEXT, ISNUMBER, and TYPE.
5. Text functions: These functions are used to manipulate text data, such as CONCATENATE, LEFT,
RIGHT, and SUBSTITUTE.
6. Logical functions: These functions are used to evaluate logical expressions, such as IF, AND, OR, and
NOT.
7. Table functions: These functions are used to create and manipulate tables, such as FILTER,
CALCULATETABLE, UNION, and INTERSECT.
8. Math and Trig functions: These functions are used to perform mathematical and trigonometric
calculations, such as ROUND, ABS, SQRT, and SIN.
9. Statistical functions: These functions are used to perform statistical calculations, such as AVERAGEIF,
COUNTIF, and STDEV.
Activity 2:
Show the Superstore data of only the State California :
To get the data of the state California , you need to filter the the State/Province column and select ‘equal to’ →
California .
Activity 3:
Use the Returns sheet from the Superstore Dataset to create a table showing
the Count of Returns .
To do this , select the Returned column and use the pivot column feature from
the transform menu and count as aggregate function , you will get the below
result .
Activity 4:
In the Orders file of the Superstore dataset extract the first three letter code
of the Product ID .
For doing this , select the Product ID column → Select extract from Transform
menu →enter the count three → click ok .
Activity 5:
Activity 6:
Create a column where it follows the following clauses :
1. If category equals Office supplies then deliver them to Offices .
2. If category equals Furniture then deliver to homes .
3. Else deliver to Company .
Select ‘Conditional Column’ option from Add column Menu , then enter as
below :
Customer & Web Analytics
Understanding customer behavior and preferences through website data helps businesses tailor their
offerings to meet customer needs effectively.
Key performance indicators (KPIs) for website analytics provide valuable metrics to assess the success
of online efforts and measure performance against business goals.
Analyzing website traffic and user engagement using tools like Google Analytics uncovers actionable
insights for enhancing user experience and optimizing content.
A/B testing and experimental design empower businesses to make data-driven decisions, leading to
continuous website improvement and better conversion rates.
Creating user personas and segments enables targeted marketing strategies that resonate with
specific customer groups, increasing engagement and conversion.
Conversion rate optimization techniques focus on refining the customer journey to enhance website
conversions and boost overall business revenue.
Identifying upselling and cross-selling opportunities through website data allows businesses to
leverage existing customer relationships for additional sales and revenue growth.
Click-Through Rates (CTRs): CTR measures the percentage of users who click on a specific link or call-to-
action (CTA). By analyzing CTRs, businesses can identify which links or CTAs are most effective in driving
customer engagement. This data helps optimize website design, placement of important elements, and
messaging to increase conversions.
Conversion Funnel Analysis: Conversion funnels represent the steps a user takes from entering a website to
completing a desired action, such as making a purchase. By analyzing the conversion funnel, businesses can
identify the stages where customers drop off or experience difficulties. This information helps pinpoint areas for
improvement, optimize the user experience, and increase conversion rates.
Search Queries and On-Site Search: Many websites have search functionalities that allow users to search for
specific products, information, or content. By analyzing search queries and on-site search behavior, businesses
can gain insights into what customers are looking for. This information can guide content creation, product
offerings, and website optimization efforts to better align with customer needs.
Exit Pages and Bounce Rates: Exit pages refer to the last pages customers view before leaving a website,
while bounce rates indicate the percentage of visitors who leave a website after viewing only one page.
Analyzing exit pages and bounce rates can reveal potential issues or areas of disinterest for customers.
Businesses can use this data to improve the content or user experience on those pages, reducing bounce rates
and encouraging users to explore further.
Heatmaps and Click Tracking: Heatmaps visually represent where users interact most on a webpage, showing
areas of high engagement and interest. Click tracking records and analyzes user clicks on specific elements or
links. By utilizing heatmaps and click tracking data, businesses can understand which elements attract the most
attention and which ones go unnoticed. This information helps optimize website layout, design, and user
interface elements for better usability and conversion rates.
Customer Segmentation and Personalization: By tracking user behavior and preferences on a website,
businesses can identify patterns and segment their customers based on their interests, purchase history, or
engagement level. This segmentation allows for personalized marketing strategies, targeted messaging, and
customized user experiences to cater to specific customer segments.
Overall, analyzing website data provides businesses with valuable insights into customer behavior and
preferences. By leveraging this information, businesses can optimize their website, content, and user experience
to better meet customer needs, increase engagement, and drive conversions.
KPIs are specific metrics that businesses use to track and evaluate the performance of their website. These
metrics provide valuable insights into different aspects of your website's effectiveness, such as visitor
engagement, conversion rates, and overall user experience. It's like having a dashboard that displays the vital
signs of your website's health.
Here are some commonly used KPIs for website analytics:
Traffic: This KPI measures the number of visitors coming to your website. It includes metrics such as total
visits, unique visitors, and traffic sources. It's like counting the footsteps of your virtual visitors as they explore
your online domain.
Bounce Rate: The bounce rate measures the percentage of visitors who leave your website after viewing only
one page. A high bounce rate may indicate that visitors didn't find what they were looking for or that your
website needs improvements. It's like measuring how quickly your visitors decide to stay or leave your digital
party.
Conversion Rate: This KPI measures the percentage of visitors who complete a desired action, such as making
a purchase, filling out a form, or subscribing to a newsletter. It's like tracking the number of people who respond
to your website's call-to-action and become valuable customers or leads.
Average Session Duration: This KPI measures the average amount of time visitors spend on your website. It
gives you an idea of how engaging and valuable your content is to your audience. It's like measuring how long
your visitors stay at your virtual party and enjoy the experience.
Click-Through Rate (CTR): The CTR measures the percentage of users who click on a specific link or call-to-
action. It's commonly used for email marketing campaigns, banner ads, or other promotional elements on your
website. It's like tracking the effectiveness of your digital signposts and guiding your visitors to take action.
By monitoring and analyzing these KPIs, businesses can gain insights into the effectiveness of their website,
identify areas for improvement, and make data-driven decisions to optimize their online presence.
Click-Through Rates (CTRs) is a website analytics metric that measures the percentage of users who
click on a specific link or call-to-action.
Conversion Funnel Analysis helps identify the stages where customers drop off or experience
difficulties in completing a desired action on a website.
Exit Pages and Bounce Rates is a term that refers to the last pages customers view before leaving a
website, and what indicates the percentage of visitors who leave after viewing only one page.
Heatmaps is a data visualization tool visually represents areas of high engagement and interest on a
webpage.
Analyzing website traffic and user engagement using tools like Google Analytics:
Google Analytics is a powerful tool that helps businesses track and measure various aspects of their website's
performance. It provides valuable insights into who visits your website, how they got there, and what they do
while they're there. It's like having a super-sleuth that keeps a watchful eye on every visitor that steps foot on
your digital turf.
By analyzing website traffic, businesses can see which pages are popular, where visitors are coming from, and
what keywords or campaigns are driving the most traffic. It's like a treasure map that reveals the hidden paths
that lead visitors to your virtual doorstep.
User engagement is another important aspect that Google Analytics helps uncover. It shows how visitors
interact with your website, such as the pages they view, the time they spend on each page, and the actions they
take. It's like having a spy that discreetly observes every move your visitors make, giving you valuable insights
into their preferences and behaviors
A/B testing and experimental design for website optimization
A/B testing and experimental design are like the mad scientists of website optimization! They help businesses
make data-driven decisions and improve the user experience by conducting experiments and testing different
versions of webpages or elements. Here's a breakdown of A/B testing and experimental design for website
optimization:
A/B Testing:
A/B testing, also known as split testing, involves comparing two versions of a webpage or element (A and B) to
determine which one performs better. The process includes dividing the audience into two groups, showing each
group a different version, and measuring their performance using relevant metrics. The objective is to identify
the version that yields the highest desired outcome.
a. Define the Objective: Clearly define the objective of the A/B test. For example, it could be to
increase click-through rates on a specific call-to-action button or to improve the conversion rate on a
landing page.
b. Identify Variables: Identify the specific elements or variables that you want to test. This could
include headlines, images, color schemes, button placement, or different page layouts.
c. Create Variations: Develop multiple versions (A and B) of the webpage or element, each with a
different variation of the identified variables. It's essential to change only one variable at a time to
accurately measure its impact on the outcome.
d. Split Test and Implement: Split the audience randomly into two groups, with each group exposed
to either version A or B. Ensure that the test is conducted simultaneously to mitigate external factors
that may skew the results.
e. Gather Data and Analyze: Collect relevant data on user interactions and conversions for both
versions. Analyze the data to determine which variation performs better statistically.
f. Implement the Winning Version: Implement the better-performing variation as the default version
on the website, based on statistically significant results.
g. Continuous Testing and Iteration: Keep testing and refining other variables to further optimize
the website's performance. It's an ongoing process!
Let's say you have an e-commerce website and want to increase the conversion rate on your product
page. You have identified two variables to test: the color of the "Add to Cart" button and the
placement of customer reviews.
In Version A (Control):
The "Add to Cart" button is green. Customer reviews are displayed below the product description.
In Version B (Variant):
The "Add to Cart" button is orange.
Customer reviews are displayed in a prominent section next to the product image. To conduct the A/B
test, you randomly divide your website visitors into two groups. Group A sees Version A, while Group
B sees Version B. Both versions are live simultaneously for a specified duration.
During the test, you track relevant metrics like click-through rates, add-to-cart rates, and overall
conversions for each version. You collect data on user interactions, such as the number of clicks on the
"Add to Cart" button and the percentage of visitors who proceed to checkout.
After collecting sufficient data, you analyze the results statistically. Let's say you find that Version B
with the orange "Add to Cart" button and prominent customer reviews performs significantly better in
terms of add-to-cart rates and overall conversions.
Based on these findings, you implement Version B as the default version on your product page. By
making this change, you expect to see an improvement in the conversion rate, as more users are
enticed to add products to their cart and proceed to checkout.
Remember, A/B testing is an iterative process, and you can continue to test and refine other variables
on your website to further optimize its performance and enhance the user experience.
So, by conducting A/B testing, you can make data-driven decisions and continuously improve your
website to achieve higher conversion rates and better meet your business goals!
Experimental Design:
Experimental design provides a systematic approach to conducting experiments, including A/B testing, to ensure
reliable and valid results. It involves careful planning, execution, and analysis of experiments. Here are some
key principles:
a. Randomization: Randomly assign participants or visitors to different versions to reduce bias and ensure a
fair comparison.
b. Control Group: Include a control group that represents the existing version for comparison.
c. Sample Size and Statistical Power: Determine an appropriate sample size to ensure statistical power and
reliable results.
d. Valid and Reliable Metrics: Choose metrics that accurately reflect the desired outcome and consistently
measure them across both versions.
e. Data Analysis: Apply statistical techniques to evaluate the data and determine the significance of observed
differences.
f. Replication and Iteration: Repeat the experiment or conduct similar experiments on different elements to
validate findings and improve results' robustness.
By applying A/B testing and experimental design principles, businesses can optimize their websites, enhance
user experience, and make informed decisions based on reliable data.
Google Analytics is a powerful tool that helps businesses track and measure various aspects of their
website's performance. It provides valuable insights into who visits your website, how they got there,
and what they do while they're there.
User engagement is another important aspect that Google Analytics helps uncover. It shows how
visitors interact with your website, such as the pages they view, the time they spend on each page,
and the actions they take.
A/B testing, also known as split testing, involves comparing two versions of a webpage or element (A
and B) to determine which one performs better. The process includes dividing the audience into two
groups, showing each group a different version, and measuring their performance using relevant
metrics.
To conduct the A/B test, you randomly divide your website visitors into two groups. Group A sees
Version A, while Group B sees Version B. Both versions are live simultaneously for a specified duration.
User Personas:
User personas are fictional representations of your ideal customers, each embodying a specific set of
demographics, interests, goals, and pain points. To create user personas:
1. Research: Conduct market research, surveys, and interviews to gather insights about your target
audience. Explore their demographics, motivations, challenges, and preferences.
2. Identify Patterns: Analyze the collected data to identify common patterns and characteristics among
your audience. Look for similarities in age, gender, location, interests, and purchasing behaviors.
3. Persona Creation: Create distinct personas based on the identified patterns. Give each persona a
name, photo, background story, and key traits that represent a specific segment of your audience.
4. Persona Details: Dive deeper into each persona's characteristics, including their goals, challenges,
preferred communication channels, and buying behaviors. Understand their pain points and how your
product or service can address their needs.
5. Persona Empathy: Develop empathy for each persona by understanding their motivations, aspirations,
and frustrations. This will help you tailor your marketing messages and strategies to resonate with each
persona.
Background:
Annie is a 32-year-old working professional who lives in a busy city. She is health-conscious and leads
an active lifestyle. Annie works long hours and finds it challenging to make time for fitness due to her
hectic schedule.
Demographics:
Age: 32
Gender: Female
Location: Urban area
Goals:
Stay fit and maintain a healthy lifestyle.
Find convenient ways to incorporate exercise into her busy routine.
Achieve her fitness goals and track progress effectively.
Challenges:
By creating the user persona of Active Annie, the online fitness platform can better understand her
needs and design targeted marketing strategies:
Messaging: Craft messages that highlight the convenience, flexibility, and time-saving aspects of the
platform to appeal to Annie's busy lifestyle.
Features and Benefits: Emphasize features like on-demand workouts, personalized schedules, and
progress tracking to address Annie's challenges and goals.
Content Creation: Develop content that showcases outdoor workout ideas, success stories of
individuals with busy schedules, and tips for staying motivated.
Social Media Engagement: Engage Annie through social media platforms by sharing inspiring
stories, fitness challenges, and providing a platform for her to connect with like-minded individuals.
Personalized Recommendations: Provide personalized workout recommendations and reminders
based on Annie's preferences, goals, and schedule.
By tailoring their marketing efforts to suit Active Annie's specific needs and preferences, the online
fitness platform can effectively engage and attract similar individuals, ultimately driving conversions
and long-term customer satisfaction.
User Segments:
User segments involve dividing your audience into groups based on shared characteristics or behaviors. Here's
how to create user segments:
1. Analyze Data: Analyze data from various sources, such as website analytics, customer surveys, or
purchase history, to identify commonalities among your audience.
2. Segmentation Criteria: Determine the criteria for segmenting your audience. This could include
demographics, behavior, interests, engagement level, or past purchase behavior.
3. Segment Creation: Create segments based on the chosen criteria. For example, you might have
segments like "young professionals," "frequent purchasers," or "budget-conscious shoppers."
4. Segment Attributes: Define the attributes and characteristics of each segment. Understand their needs,
preferences, pain points, and how they interact with your brand.
5. Targeted Strategies: Develop targeted marketing strategies for each segment, tailoring your
messaging, offers, and channels to match their specific preferences and motivations.
Conversion rate
optimization (CRO)
techniques
Conversion Rate Optimization (CRO) techniques are all about optimizing your website or online platform to
improve the percentage of visitors who take a desired action, such as making a purchase, filling out a form, or
subscribing to a newsletter.
Here's how website data can help uncover upselling and cross-selling opportunities:
For Upselling:
Opportunities for upselling arise when a customer is already considering a purchase or has made a purchase
decision. The goal is to encourage the customer to buy a higher-priced or upgraded version of the product or add
complementary items to their purchase.
Problem Statement: In this activity, we will explore A/B testing concepts and perform an analysis of
advertisement conversion rates using a provided dataset. The dataset contains information about user
interactions with ads and their conversion behavior.
Advanced Charts
Visualization best practices and chart selection for different data types and analysis goals
Line Charts: Visualize trends and changes over time. Watch your data come alive as it dances across
the graph, revealing hidden patterns and insights.
Bar Charts: Unleash the power of comparison! See your data rise and fall, as bars compete for your
attention. It's like a friendly race where insights win the trophy!
Pie Charts: Deliciously slice and dice your data! Create a feast for the eyes with colorful sections that
represent proportions. It's a visual treat that will leave you hungry for more insights!
Scatter Plots: Connect the dots and uncover correlations! Let your data points scatter across the plot
like confetti, revealing relationships that were once hidden. It's like solving a puzzle and finding the
missing piece!
Histograms: Step into the world of distributions! Witness data frequencies take shape in beautifully
crafted bars. It's like capturing the essence of your data and displaying it in an artistic masterpiece!
Heat Maps: Feel the heat of data intensity! Watch as colors blend and patterns emerge, providing a
sizzling display of information. It's like visualizing your data in a vibrant kaleidoscope!
Line charts are a powerful tool to visualize trends over time. This visualization method can reveal
hidden patterns and insights as data points move gracefully across the graph.
Bar charts provide a dynamic way to compare data, showcasing how values rise and fall. Just like a
friendly race, these bars compete for attention, with the ultimate goal of uncovering valuable insights.
Pie charts transform data into a visual feast, allowing viewers to slice and dice proportions. These
colorful sections resemble slices of a delicious dessert, enticing curiosity for more meaningful insights.
Scatter plots connect data points like confetti, enabling the discovery of correlations between
variables. Each data point scatters across the plot, contributing to the puzzle of relationships, much
like finding a missing piece.
Advanced chart types and techniques, such as heatmaps, sankey diagrams, and sparklines.
Heatmaps:
Heatmaps are powerful tools for visualizing data density and patterns! They use colors to represent
values and display the intensity of data across a grid or matrix. Each cell in the heatmap is assigned a
color based on its value, creating a visual representation that allows you to identify hotspots, trends,
and variations. Heatmaps are especially useful for analyzing large datasets and identifying correlations
between variables. They help you discover insights by visualizing data in a way that's easy to interpret
and digest. It's like unlocking the hidden treasures within your data, as the colors guide you towards
valuable insights!
Sankey Diagrams:
Sankey diagrams visualize the flow and transformation of data through interconnected paths! They
are ideal for illustrating complex processes, such as energy flows, customer journeys, or supply chain
dynamics. Sankey diagrams use varying widths of interconnected flows to represent the quantity or
value of data transitioning from one state to another. The thickness of the flows depicts the
magnitude of the data movement, allowing you to identify major contributors or bottlenecks in the
process. Sankey diagrams provide a holistic view of the data flow, enabling you to understand
relationships, spot inefficiencies, and optimize the system. It's like watching a river of information,
where the widths of the streams guide your understanding of the data flow and reveal valuable
insights!
Sparklines:
Sparklines are compact and powerful data visualizations that provide a snapshot of trends or patterns!
They are often small line or bar charts that can be seamlessly integrated within text or tables, allowing
you to present data alongside your narrative. Sparklines condense the essence of your data into a
concise visual representation, highlighting variations and providing a quick overview of the data's
trajectory. They are perfect for presenting changes over time, such as stock prices, sales figures, or
website traffic. Sparklines help you communicate key insights in a visually appealing and space-
efficient manner, making your data storytelling more engaging. It's like capturing the magic of your
data within a single spark, where even a tiny line can speak volumes!
Tools for creating advanced charts, such as D3.js, Plotly, and Tableau
D3.js:
D3.js, the wizard of data visualization! This JavaScript library empowers you to create stunning, custom
visualizations by harnessing the power of HTML, CSS, and SVG. With D3.js, you can conjure up
mesmerizing charts, animations, and interactive experiences. It's like having a paintbrush in your hand,
allowing you to craft visual masterpieces that bring your data to life!
Plotly:
Plotly, the sorcerer of interactive charts! This versatile data visualization library offers an array of tools
and APIs for creating dynamic and interactive charts. Whether you're working with Python, R, or
JavaScript, Plotly has your back. It's like wielding a wand that effortlessly conjures up rich, interactive
plots and graphs, captivating your audience with its spellbinding capabilities!
Tableau:
Tableau, the magical dashboard conjurer! This powerful data visualization software allows you to
create stunning visualizations, reports, and interactive dashboards with ease. Tableau's intuitive
interface and drag-and-drop functionality make it accessible to both data wizards and aspiring
sorcerers. It's like having a crystal ball that reveals hidden insights, enabling you to tell compelling
stories with your data!
Problem Statement 1:
You are given a dataset that contains information about the sales of different products in a store over
a period of time. Your task is to create an interactive line chart using Plotly to visualize the sales trend
of two selected products. The chart should have interactive features such as hovering over data points
to display additional information and the ability to toggle the visibility of each product's sales line.
# Import necessary libraries
import pandas as pd
import plotly.graph_objects as go
Problem Statement 2:
You are given a dataset containing information about the stock prices of a company over a certain
period of time. Your task is to create an interactive candlestick chart using Plotly to visualize the stock
price movement. The chart should display the opening, closing, high, and low prices for each trading
day, and should have interactive features such as zooming, panning, and hover tooltips to provide
detailed information about each candlestick.
# Import necessary libraries
import pandas as pd
import plotly.graph_objects as go
1. Conversion Rate: The conversion rate is a crucial metric that measures the percentage of website
visitors who take the desired action, such as making a purchase, signing up for a newsletter, or filling
out a form. It indicates how effective your website is at turning visitors into customers.
2. Average Order Value (AOV): The AOV represents the average amount customers spend per order.
Monitoring AOV helps businesses understand the average value of transactions and identify
opportunities to increase sales by encouraging customers to spend more.
3. Customer Lifetime Value (CLV): CLV is a prediction of the net profit attributed to the entire future
relationship with a customer. Understanding CLV helps businesses assess the long-term value of
customers and develop strategies to retain high-value customers.
1. Identify Key Business Goals: Start by identifying the primary objectives of your e-commerce business.
Examples may include increasing sales, expanding market reach, improving customer satisfaction, or
reducing cart abandonment rates.
2. Set Specific and Measurable Objectives: Once the goals are established, set specific and measurable
objectives to track progress. For instance, if the goal is to increase sales, the objective might be to
achieve a 15% increase in revenue over the next quarter.
3. Align Objectives with KPIs: Map each objective to relevant KPIs. For example, if the objective is to
reduce cart abandonment, the related KPI could be the cart abandonment rate. Keep Goals Realistic:
Ensure that the goals are realistic and attainable, considering the resources and capabilities of your e-
commerce business.
1. Monitoring KPIs is essential for e-commerce businesses as they provide valuable insights into their
performance.
2. To assess the long-term value of customers, e-commerce businesses analyze Customer Lifetime
Value (CLV).
3. One of the primary objectives for e-commerce businesses could be increasing sales.
4. When setting objectives for e-commerce dashboards, it is important to make them specific and
measurable.
Selecting and Designing Appropriate Charts and Visualizations for E-commerce KPIs
Selecting the right charts and visualizations is crucial to effectively communicate insights and trends related to
e-commerce KPIs. Consider the following tips:
1. Bar Charts and Line Graphs: Use bar charts to compare different KPIs over time or compare
performance across different categories. Line graphs are suitable for showing trends and changes in
KPIs over time.
2. Pie Charts: Pie charts are ideal for displaying the composition of a whole. Use them to show the
proportion of revenue generated from different product categories or customer segments.
3. Scatter Plots: Scatter plots can be helpful in identifying relationships between two KPIs, such as
correlating website traffic with conversion rates.
4. Heatmaps: Heatmaps can reveal patterns and trends in large datasets, making them useful for analyzing
user behavior on an e-commerce website, such as identifying popular browsing areas.
Identifying Trends and Patterns in E-commerce Data Using Exploratory Analysis Techniques
Exploratory data analysis (EDA) is a crucial step in understanding e-commerce data and uncovering meaningful
insights. Here's how you can approach it:
1. Data Cleaning: Start by cleaning the data to remove any missing values, outliers, or irrelevant
information that might skew the analysis.
2. Data Visualization: Utilize various charts and graphs to visualize the data and explore relationships
between different variables. This helps identify patterns and trends at a glance.
3. Segmentation: Segment your data based on relevant criteria like customer demographics, purchase
behavior, or product categories. Analyzing each segment separately can reveal valuable insights.
4. Time Series Analysis: For time-based data, perform a time series analysis to identify seasonal patterns,
trends, and any recurring events that might impact your e-commerce KPIs.
1. Analyze Performance Data: Thoroughly analyze your e-commerce KPIs and performance data to
identify areas that need improvement or have the most significant impact on your goals.
2. Identify Opportunities: Pinpoint specific opportunities for improvement based on the analysis. For
instance, if the conversion rate is low, focus on optimizing the checkout process.
3. A/B Testing: Implement A/B testing to compare different strategies or changes, such as website layout,
product descriptions, or pricing, and measure their impact on KPIs.
4. Continuous Monitoring: Continuously monitor the impact of the implemented changes to ensure they
are effective and adjust strategies as needed.
Measuring the Impact of Marketing Campaigns and Other Initiatives on Ecommerce Metrics
To measure the impact of marketing campaigns and initiatives, follow these steps:
1. Define Metrics for Each Campaign: Determine the specific e-commerce KPIs that align with the goals
of each marketing campaign, such as increased website traffic or higher conversion rates.
2. Implement Tracking Mechanisms: Use tools like Google Analytics to track the performance of your
campaigns. Set up custom tracking parameters to attribute website activity to specific campaigns.
3. Analyze Pre and Post-Campaign Data: Compare performance data before and after the campaign to
gauge its effectiveness. Look for significant changes in relevant KPIs to assess the impact.
4. Calculate Return on Investment (ROI): Calculate the ROI of each campaign by comparing the revenue
generated from the campaign with the cost of running it.
1. Identify Relevant Data Sources: Determine which data sources are critical for your e-commerce
business, such as website analytics, customer reviews, social media platforms, or sales data.
2. Data Integration Tools: Use data integration tools or platforms to bring data from different sources
together into a centralized data repository.
3. Data Cleaning and Transformation: Clean and transform the data as needed to ensure consistency and
accuracy across all sources.
4. Dashboard Design: Design your e-commerce dashboard to display the integrated data in a visually
appealing and easy-to-understand manner. Use charts, graphs, and tables to present the data effectively.
1. A/B testing is used to compare different strategies or changes and measure their impact on KPIs.
2. To measure the impact of marketing campaigns, define specific e-commerce KPIs that align with
campaign goals.
3. Data integration tools are used to bring data from different sources together into a centralized data
repository.
4. The purpose of continuously monitoring the impact of implemented changes is to ensure they are
effective.
the process of creating an interactive e-commerce dashboard using Microsoft Power BI. The dashboard will
display key performance indicators (KPIs) and visualizations to help us understand the performance of an e-
commerce business. We'll integrate data from multiple sources and design the dashboard to provide valuable
insights.
1. Locate the "Get data" button located on the lower left side of the Home screen and click on it.
2. Navigate to the "Files" section and proceed to click on the "Get" option.
3. Choose the "Local File" option and select the desired file (e.g., an Excel file) from your system.
4. After selecting the file, proceed by clicking the "Import" button.
5. Upon successful import, you will be presented with a screen displaying your file name.
6. After the process is complete, your imported data set will be visible under the "Recent" section of the
Home screen. Additionally, you can access the same data set from the "My workspace" screen.
You will be presented with two options: either paste or manually enter the data or pick a published dataset.
Select the option that best fits your needs and preferences.
1. Follow the same steps as above to access the "Manage relationships" dialog box.
2. Instead of selecting "Autodetect," click on "New," which will open the "Create relationship" dialog
box.
3. In the "Create relationship" dialog box, select the first table from the drop-down list and choose the
column you want to use for the relationship.
4. Next, select the second table you wish to establish a relationship with from the second drop-down list.
5. Select the corresponding column in the second table.
6. Once you've selected the appropriate columns, click "OK" to create the relationship manually.
By using either the autodetect feature or manually creating relationships, you can effectively establish
connections between tables in Power BI, allowing for seamless data analysis and visualization.
1. From the right sidebar, locate the desired visual from the list of available options.
2. Click and drag the chosen visual to the axis or canvas area where you want it to be displayed.
3. You can add multiple fields to each axis by dragging and dropping them onto the appropriate areas of
the visual.
By leveraging these visualizations, you can effectively present and analyze your data in a visually appealing and
insightful manner within your Power BI report.
Shortcomings
1. Limited Page Size: Power BI dashboards are limited to a single-page
canvas, which might restrict the ability to present a comprehensive
overview of complex ecommerce datasets.
2. Steeper Learning Curve: Mastering Power BI's features and functionalities
requires significant time and effort, making it challenging for beginners to
create advanced visualizations.
3. Data Size Limitations: Large datasets with millions of rows may lead to
performance issues, causing delays in data refresh and visualization
rendering.
4. Data Connectivity: Power BI might lack direct connectors to certain niche
ecommerce platforms or custom databases, necessitating custom
solutions for data integration.
As the Sales Manager of a company, you want to improve the sales analysis
process and gain valuable insights into sales performance, product analysis, and
shipping patterns using Power BI. Your goal is to create three interactive
dashboards - Sales Analysis, Product Analysis, and Shipping Analysis - that will
enable you to monitor and understand key metrics related to sales, profits,
product categories, shipping modes, and shipping costs. These dashboards should
provide a clear visualization of the data, allowing you to make data-driven
decisions to optimize sales strategies and operational efficiencies.
Dataset:
You will use the "Superstore Sales Dataset" available on Kaggle. You can
download the dataset from the following link:
https://www.kaggle.com/datasets/laibaanwer/superstore-sales-dataset
Steps:
Step 1: Data Import and Exploration
Download the "Superstore Sales Dataset" from the provided Kaggle link.
Launch Power BI and click on "Get Data."
Choose "Excel" as the data source and select the downloaded file.
Review the dataset structure and preview the data to understand the
available fields.
Step 2: Data Cleaning and Transformation
Calculate the total sales and total profit for each year and display them on
the dashboard.
Add filters for year and quarter to enable users to filter through sales data
dynamically.
Visualize the sales and profit data using line charts and column charts.
Include a summary section that displays the total sales and profit made
between 2011 and 2014.
Highlight the sales and profit figures for the year 2014 and compare them
with previous years.
Create a pie chart to show the percentage of profit contributed by each
product category.
Utilize slicers to enable users to filter data based on different months.
Step 4: Product Analysis Dashboard