Biva Question and Ans
Biva Question and Ans
How to handle missing data while reading raw data files in the preparation phase for visual
analytics? Justify with examples.
Handling missing data is an essential step in the Prepare phase of visual analytics. It ensures data
quality and improves the accuracy of visual interpretations and statistical results.
o Used when the number of missing records is small and won’t impact analysis
significantly.
Imputation:
o Replacing missing values using statistical methods such as mean, median, mode, or
predictive modeling.
o Example: Filling in missing “income” values in a customer dataset using the median
income of similar records.
o Example: If January sales are missing, use December's sales figure as a proxy.
Domain-specific logic:
o Example: If 'Order Status' is missing but 'Shipping Date' exists, status can be inferred
as “Shipped”.
In SAS Visual Analytics, during the Prepare phase, users can clean and calculate new data items
using the interface or with data transformation scripts.
Visual analytics is the science of analytical reasoning supported by interactive visual interfaces. It
enables users to explore complex datasets through visual means to derive actionable insights.
Interactive Dashboards & Reports: Users can drag and drop elements, filter data, and drill
down into insights.
Advanced Analytics Integration: Use statistical models, forecasting, and machine learning
with ease.
Multi-device Access: View reports from web browsers, mobile apps, or MS Office add-ins.
Collaborative Sharing: Share interactive reports and dashboards with others in the
organization.
3. List different job roles who use visual analytics and explain briefly.
Visual analytics is used by multiple roles within an organization, each with distinct responsibilities:
2. Data Administrator:
3. IT Administrator:
5. SAS Administrator:
These roles collaborate within platforms like SAS Visual Analytics, where each user type has role-
based capabilities tailored to their function.
Theory
UNIT 4
UNIT 5
UNIT 6
TABLUEA
🔹 Sorting in Tableau
Sorting helps organize and analyze data more efficiently. In Tableau, sorting can be applied to dimensions, and there are
two main types:
1. ✅ Computed Sorting
Definition: Automatic sorting applied through the Sort dialog box on an axis.
Options:
When to use: When sorting based on field values like numbers or dates.
🧪 Example:
2. ✋ Manual Sorting
Purpose: Useful for user-defined visual preferences, not based on field values.
When to use: When the sort order is subjective or doesn’t follow data values.
🧪 Example:
You manually move the "Home Office" segment below "Consumer", even if Home Office has lower sales — to highlight it
or match a presentation flow.
Filtering is the process of removing unwanted values or limiting data displayed in your visualizations. Tableau supports
simple filters (e.g., categories) and advanced filters (e.g., context filters, calculations).
1. Filter Dimensions
Used on: Dimension fields (categorical data like Region, Category, Product Name)
Action:
2. Filter Measures
These filters are applied on the measure fields. Filtering is based on the calculations applied to the measure fields.
Hence, while in dimension filters you use only values to filter, in measures filter you use calculations based on fields.
Purpose: Filter using calculations, e.g., "greater than X", "less than Y".
🧪 Example:
Action:
✅ Result: View now only includes subcategories with average profit over 20.
3. Filter Dates
Tableau treats the date field in three different ways while applying the date field. It can apply filter by taking a relative
date as compared to today, an absolute date, or range of dates. Each of this option is presented when a date field is
dragged out of the filter pane.
🧪 Example:
Action:
✅ Result: View shows profit trends for the selected date range.
Quick Filters are filters that can be added directly to the worksheet using a right-click on any dimension or measure. They
offer fast, interactive ways for users to filter data without opening the full filter dialog.
The filter then appears on the right-hand side of the worksheet for interactive selection.
Multiple Values (Custom List) Search and select one or more values manually
Single Value (Slider) Use a slider to choose a single value (mainly for date/numeric filters)
Wildcard Match Filter values that match a pattern (e.g., starts with “A”)
Steps:
✅ Result: Only sub-categories starting with “A” (like “Accessories” and “Art”) are displayed.
🧼 Clearing a Quick Filter
To remove a filter:
This will remove the applied filter and show all data again.
Context filter
The normal filters in Tableau are independent of each other. It means each of the filter reads all the rows from the
source data and creates its own result. However, there may be scenarios where you might want the second filter to
process only the records returned by the first filter. In such a case, the second filter is known as dependent filters
because they process only the data that passes through the context filter. Context Filters serve two main purposes.
Improves performance − If you set a lot of filters or have a large data source, the queries can be slow. You can
set one or more context filters to improve the performance.
Creates a dependent numerical or top N filter − You can set a context filter to include only the data of interest,
and then set a numerical or a top N filter.
✅ Goal: Show only the Top 10 Sub-Categories by Sales, but only for Furniture.
🔧 Steps:
Step 1:
Step 2:
Step 3:
🛑 At this point, Tableau still shows the Top 10 overall, then filters down to only Furniture sub-categories — which might
show fewer than 10 items.
Step 4:
One of the important filtering options in Tableau is to apply some conditions to already existing filters. These conditions
can be very simple like finding only those sales which are higher than a certain amount or it can be a complex one based
on a certain formula. The conditions can also be applied to create a range filter.
Step 1:
Step 2:
Choose:
o By field
o Sales
o Sum
✅ Result: The view now displays only those Sub-Categories whose total Sales ≥ 100,000, across all Segments.
Top filters
the Top option in Tableau filter is used to limit the result set from a filter. For example, from a large set of records on
sales you want only the top 10 values. You can apply this filter using the inbuilt options for limiting the records in many
ways or by creating a formula. In this chapter, you will explore the inbuilt options.
🔧 Steps:
Step 1:
Right-click on the Sub-Category field in the Filters shelf (or drag it there if it isn’t yet)
Choose:
o By field
o Enter 5
✅ Result: Tableau now shows only the top 5 Sub-Categories based on highest total sales.
Filtering is a core part of data analysis and visualization, allowing users to focus on relevant subsets of data. Tableau
provides powerful, flexible filter options for both dimensions (categorical data) and measures (numerical data).
✅ Creating Filters
➤ How to Create:
1. Drag a field (dimension or measure) to the Filters shelf (above the Marks card).
When you drag a measure like Sales to the Filters shelf, Tableau provides numeric filtering options:
🧪 Example:
Dimensions are fields like Category, Region, Product, etc. When used as filters, Tableau offers these options:
Filter Type Description
Condition Apply logic using aggregate measures (e.g., SUM(Sales) > 10,000)
🧪 Example:
➤ To Edit a Filter:
➤ To Clear a Filter:
TABLUE CHARTS
3. Choose the chart type from the Marks card or Show Me panel.
4. Drag fields to the Marks pane and drop them on Color, Size, Label, Tooltip, etc.
BAR CHARTS
A bar chart represents data in rectangular bars with the length of the bar proportional to the value of the variable.
Tableau automatically produces a bar chart when you drag a dimension to the Row shelf and measure to the Column
shelf. We can also use the bar chart option present in the Show Me button.
Drag the Profit field to the Color section in the Marks card
Drag another dimension (e.g., Segment) to the Color area in the Marks card
✅ This creates a stacked bar chart showing contribution by category (e.g., by segment)
In a line chart, a measure and a dimension are taken along the two axes of the chart area. The pair of values
for each observation becomes a point and the joining of all these points create a line showing the variation
or relationship between the dimensions and measures chosen.
This chart will show the variation of Sales across different Ship Modes.
Measures: Use two or more measures (e.g., Sales and Profit Ratio).
This chart will display multiple lines for each measure, allowing you to compare the variation of each measure
across the chosen dimension.
After creating a line chart, drop another measure (e.g., Profit Ratio) into the Labels pane in the Marks card.
This will label each point on the line chart with the average value of the new measure (Profit Ratio), allowing
for easy visualization of the values at each point.
Pie Chart
A pie chart represents data as slices of a circle with different sizes and colors. The slices are labeled and the numbers
corresponding to each slice is also represented in the chart. You can select the pie chart option from the Marks card to
create a pie chart.
Choose one dimension and one measure to create a simple pie chart. For example, take the dimension named region
with the measure named profit. Drop the Region dimension in the colors and label marks. Drop the Profit measure into
the size mark. Choose the chart type as Pie. The following chart appears which shows the 4 regions in different colors.
You can choose a dimension with hierarchy and as you go deeper into the hierarchy, the chart changes reflect the level of
the dimension chosen. In the following example, we take the dimension Sub-Category which has two more levels -
Manufacturer and Product Name. Take the measure profit and drop it to the Labels mark. The following pie chart
appears which shows the values for each slice.
Goal: Visualize the relationship between Sales and Profit, categorized by Sub-Category.
Steps:
Result: Each point represents a sub-category, showing how it performs in terms of sales and profit.
Step:
Result: Each point has a unique color representing its sub-category, improving clarity.
Step:
Result: The scatter plot updates to show more granular data, such as profit and sales per manufacturer under
each sub-category.
Tableau - Bubble Chart
Bubble charts display data as a cluster of circles. Each of the values in the dimension field represents a circle whereas the
values of measure represent the size of those circles. As the values are not going to be presented in any row or column,
you can drag the required fields to different shelves under the marks card.
Steps:
Result: Each bubble represents a ship mode, with size based on profit and color for category distinction.
Step:
Result: Bubbles now show the sales figures, offering clearer quantitative insight.
Step:
Result: Bubbles are shaded by sales volume—darker for higher sales, lighter for lower.
Tableau - Histogram
A histogram represents the frequencies of values of a variable bucketed into ranges. Histogram is similar to bar chart but
it groups the values into continuous ranges. Each bar in histogram represents the height of the number of values present
in that range.
Tableau creates a histogram by taking one measure. It creates an additional bin field for the measure used in creating a
histogram.
Steps:
Result: Tableau automatically bins Quantity values (e.g., 0–4811 range split into 12 bins), showing how many
entries fall into each range.
Step:
Result: Each bar is color-coded by Region, showing how different regions contribute to each Quantity bin—a
stacked histogram.
HEAT MAP
Heat Map: Heatmap is defined as a graphical representation of data using colors to visualize the value of the
matrix. In this to represent more common values or higher activities brighter colors basically reddish colors are
used and to less common or activity values darker colors are preferred. Heatmap is also defined by the name of
the shading matrix.
6. Choose “Heat Map” from the Show Me panel or the Marks card.
7. Drag a measure (like Sales or Profit) to Color on the Marks card to encode intensity.
Rows: Sub-Category
✅ Result: A more detailed heat map showing profit differences across sub-categories, markets, and shipping
modes.
Tableau needs one Dimension and one Measure to create a Waterfall chart.
Sort the bars in ascending order by clicking the sort icon on the axis.
Change the chart type to Gantt Bar from the Marks card.
🔍 This converts your bars into positionable Gantt blocks, forming the structure for a waterfall.
plaintext
CopyEdit
-[Sales]
Click OK.
🔍 This field helps in sizing the Gantt bar correctly from its starting point.
🔍 Now, your Gantt bars will extend by the sales value, creating the Waterfall Chart.
Optionally, set color ranges manually (e.g., green for increase, red for decrease).
The Box Plots take one or more measures with zero or more dimensions.
Using the Sample-superstore, plan to find the size of profits for the respective category for each Ship mode
values. To achieve this objective, following are the steps.
Step 1 − Drag and drop the dimension category to the Columns shelf and profit to the Rows shelf. Also drag the
dimension Ship mode to the right of Category in Columns shelf.
Step 2 − Choose Box-and-Whisker plot from Show Me. The following chart appears which shows the box plots.
Here, Tableau automatically reassigns the ship mode to the Marks card.
A Tree Map is created using one or more dimension with one or two measures.
Using the Sample-superstore, plan to find the size of profits for each Ship mode values. To achieve this
objective, following are the steps.
Step 1 − Drag and drop the measure profit two times to the Marks Card. Once to the Size shelf and again to the
Color shelf.
Step 2 − Drag and drop the dimension ship mode to the Label shelf. Choose the chart type Tree Map from Show
Me. The following chart appears.
UNIT 5
• Filters allow users to easily reduce or customize data displayed within a dashboard.
• Parameters often work like filters but can be customized by the dashboard designer to limit or direct user interaction.
• Actions are set up at the dashboard level to apply rules based on when and where a user clicks on an element in the
dashboard.
• Legends can be used to provide context and allow users to highlight specific data elements.
• Highlighters may be used to show linked data elements across multiple charts or tables.
✅ 7. Establishing Interactions
Interactivity is what makes a Tableau dashboard dynamic. You don’t want your dashboard to be a static report — you
want users to click, explore, and get answers.
1. Filter Actions
o Use Case: Clicking on a state in a map filters sales charts and KPIs to show only that state’s data.
2. Highlight Actions
o Use Case: Hovering over a bar in a chart highlights that category across other visuals.
o How: Dashboard → Actions → Add Action → Highlight
3. URL Actions
🔍 Example:
All other charts (KPIs, trends) update to show data only for the selected state.
Bottom Line: Interactions make the dashboard more explorable, flexible, and engaging for the user.
Tables are used in Tableau to display detailed, structured data, usually where summary charts aren’t enough.
Show exact values (e.g., Sales, Profit, Discount for each product).
2. Conditional Formatting: Highlight cells with color based on values (e.g., profit < 0 in red).
🔍 Example:
This is helpful for someone needing to see every record rather than just summaries or graphs.
Tip: Keep tables neat. Avoid large scroll-heavy views. Break into tabs or use filters.
This is closely connected to interactivity but focuses on navigation and external linking within the dashboard.
🔹 Types of Links/Actions:
o Use Case: Add a button that users click to go to another sheet or dashboard.
o Customizable: Can use images (e.g., logos or icons) instead of default arrow.
o Use Case: Embed a live website (e.g., company homepage, product site) directly in your dashboard.
3. URL Actions:
o Use Case: When a user clicks a product name, it opens that product’s page online.
o How: Add URL Action from Dashboard → Actions → Add Action → URL.
o Clicking on a chart (e.g., a state) can act like a link to bring users to another sheet that shows that
state’s details.
🔍 Example:
In the dashboard, a “Back to Overview” button could be placed on a detail view so users can easily navigate back.
Why it matters: These links improve flow, guide users through stories, and connect Tableau to the wider ecosystem.
✅ Summary Table:
7. Interactions Filters, highlights, dynamic responses Make charts respond to clicks & hovers
10. Links & Actions Navigation, URL integration Move between views or open websites
✅ 1. Designing Dashboards
A dashboard is a canvas in Tableau where you place multiple visual elements — like charts, maps, KPIs, filters — to
create a cohesive, interactive data story.
🔹 Why It Matters:
🔹 Best Practices:
Use tiled containers for structure and floating only when needed.
🔍 Example:
A dashboard showing:
Filter by Market/Region
Tip: Use vertical containers to organize top-down flows and horizontal ones for side-by-side charts.
A simple report in Tableau refers to a single visualization (or worksheet) that focuses on a specific analysis.
🔹 Steps to Create:
3. Choose a visualization type from the “Show Me” panel (e.g., bar chart).
🔍 Example:
Graphs are the visual core of any Tableau dashboard — they help users quickly identify patterns, trends, and outliers.
🔹 Best Practices:
🔍 Example:
Tip: Always choose chart types that best match your analysis goal.
Filters help you and your users focus on relevant data, while report sections help organize content visually.
🔹 Types of Filters:
🔹 Organizing Sections:
🔍 Example: