UNIT II
UNIT II
3003
Unit I : INTRODUCTION TO BUSINESS ANALYTICS 9
Analytics and Data Science – Analytics Life Cycle – Types of Analytics – Business Problem
Definition – Data Collection – Data Preparation – Hypothesis Generation – Modeling –
Validation and Evaluation – Interpretation – Deployment and Iteration
Unit II : BUSINESS INTELLIGENCE 9
Data Warehouses and Data Mart - Knowledge Management – Types of Decisions – Decision
Making Process - Decision Support Systems – Business Intelligence – OLAP –, Analytic
functions
Unit III : BUSINESS FORECASTING 9
Introduction to Business Forecasting and Predictive analytics - Logic and Data Driven
Models – Data Mining and Predictive Analysis Modeling – Machine Learning for Predictive
analytics
Unit IV : HR & SUPPLY CHAIN ANALYTICS 9
Human Resources – Planning and Recruitment – Training and Development - Supply chain
network - Planning Demand, Inventory and Supply – Logistics – Analytics applications in
HR & Supply Chain
Unit V :MARKETING & SALES ANALYTICS 9
Marketing Strategy, Marketing Mix, Customer Behavior – selling Process – Sales Planning –
Analytics applications in Marketing and Sales
Sl. No. Contents
2 Knowledge Management
3 Types of Decisions
6 Business Intelligence
7 OLAP
8 Analytic functions
UNIT - II BUSINESS INTELLIGENCE
1. DATA WAREHOUSES AND DATA MART
• Business intelligence (BI) is all about turning an organization’s data into insights that can be
used to inform business decisions. BI analysts will use BI tools, software or services to
access and analyze datasets and translate their findings into reports, summaries, dashboards,
graphs, charts or maps.
• In recent years, the advent of modern data visualization and reporting tools has transformed
the discipline, empowering businesses to use big data insights to identify, develop and create
new business opportunities.
Business Intelligence Definition
• “a set of methodologies, processes, architectures and technologies that transform raw data
into meaningful and useful information”. This can then be “used to enable more effective
strategic, tactical and operational insights and decision-making”.
• This definition acknowledges that data cannot be effectively analyzed or used to generate
meaningful insights if it is poor quality.
• BI should not be confused with ‘business analytics’. Business intelligence is descriptAive
and uses metrics to generate clear snapshots of business performance. Meanwhile, business
analytics is predictive, and describes what organizations should do in future to generate
better outcomes.
Data Warehouse
• A Data Warehouse (DW) is a relational database that is designed for query and analysis
rather than transaction processing. It includes historical data derived from transaction data
from single and multiple sources.
• A Data Warehouse provides integrated, enterprise-wide, historical data and focuses
on providing support for decision-makers for data modeling and analysis.
• A Data Warehouse is a group of data specific to the entire organization, not only to a
particular group of users.
• It is not used for daily operations and transaction processing but used for making
decisions.
• A Data Warehouse can be viewed as a data system with the following attributes:
• It is a database designed for investigative tasks, using data from various
applications.
• It supports a relatively small number of clients with relatively long interactions.
• It includes current and historical data to provide a historical perspective of
information.
• Its usage is read-intensive.
• It contains a few large tables.
• "Data Warehouse is a subject-oriented, integrated, and time-variant store of
information in support of management's decisions."
Subject-Oriented
A data warehouse target on the modeling and analysis of data for decision-makers.
Therefore, data warehouses typically provide a concise and straightforward view around a
particular subject, such as customer, product, or sales, instead of the global organization's
ongoing operations. This is done by excluding data that are not useful concerning the
subject and including all data needed by the users to understand the subject.
Integrated
A data warehouse integrates various heterogeneous data sources like RDBMS, flat files, and
online transaction records. It requires performing data cleaning and integration during data
warehousing to ensure consistency in naming conventions, attributes types, etc., among
different data sources.
Time-Variant
Historical information is kept in a data warehouse. For example, one can retrieve files from 3
months, 6 months, 12 months, or even previous data from a data warehouse. These variations
with a transactions system, where often only the most current file is kept.
Non-Volatile
The data warehouse is a physically separate data storage, which is transformed from the
source operational RDBMS. The operational updates of data do not occur in the data
warehouse, i.e., update, insert, and delete operations are not performed. It usually requires
only two procedures in data accessing: Initial loading of data and access to data.. Therefore,
the DW does not require transaction processing, recovery, and concurrency capabilities,
which allows for substantial speedup of data retrieval. Non-Volatile defines that once
entered into the warehouse, and data should not change.
Goals of Data Warehousing
• To help reporting as well as analysis
• Maintain the organization's historical information
• Be the foundation for decision making.
• Data Mart
A Data Mart is a subset of a directorial information store, generally oriented to a specific
purpose or primary data subject which may be distributed to provide business needs. Data
Marts are analytical record stores designed to focus on particular business functions for a
specific community within an organization. Data marts are derived from subsets of data in a
data warehouse, though in the bottom- up data warehouse design methodology, the data
warehouse is created from the union of organizational data marts.
The fundamental use of a data mart is Business Intelligence (BI)
applications. BI is used to gather, store, access, and analyze record. It can be used by smaller
businesses to utilize the data they have accumulated since it is less expensive than
implementing a data warehouse.
Reasons for creating a data mart
• Creates collective data by a group of users
• Easy access to frequently needed data
• Ease of creation
• Improves end-user response time
• Lower cost than implementing a complete data warehouses
• Potential clients are more clearly defined than in a comprehensive data
warehouse
• It contains only essential business data and is less cluttered.
Explicit knowledge can be easily collected, organized and transferred through digital means.
– A theory of the world, conceived of as a set of all of the conceptual entities describing
classes of objects, relationships, processes, and behavioral norms. Often referred to as
Globalization
Historically three factors, land, labor and capital were the key to economic success •
Knowledge has become a fourth factor. • Knowledge-based businesses can grow without
traditional land, labor, and capital requirements. • Key competitive factor will be how well
an organization acquires and applies knowledge
• Other factors • Rapid change: firms must be nimble and adaptive to compete
• Downsizing: sometimes the wrong people get fired when creating a leaner
organization
• Managing Info and Comm Overload: data must be categorized in some manner if
it is to be useful rather than overwhelming
• Knowledge Embedded in Products: the intangibles that add the most value to goods and
services are becoming increasingly knowledge-based
• Sustainable Competitive Advantage: KM is the way to do this. Shorter
innovation life cycles keep companies ahead of the competition.
• Categorizing knowledge
Knowledge Codification
Knowledge must be used or shared to be of value. • Codification puts the knowledge into a
form that makes it easy to find and use. • It is difficult to measure knowledge in discreet
units (since it changes over time). • Knowledge has a shelf life.
Knowledge must be used or shared to be of value. • Codification puts the knowledge into a
form that makes it easy to find and use. • It is difficult to measure knowledge in discreet
units (since it changes over time). • Knowledge has a shelf life
• Knowledge Transfer
Knowledge Transfer describe four different modes of knowledge conversion
(transfer):
Socialization: from tacit knowledge to tacit knowledge
Externalization: from tacit knowledge to explicit knowledge
Combination: from explicit knowledge to explicit knowledge
Internalization: from explicit knowledge to tacit knowledge
• The Knowledge Management Process
• Business intelligence goes as far back as the 19th century and the beginnings of
entrepreneurship. and it has been developing steadily over many years.
• BI enables organizations to integrate data across the enterprise, unlock the information and
empower knowledge worker to make better (and faster) decisions – it focuses on explicit
knowledge.
• However, KM deals with the creation of new knowledge and the dispersion of existing
knowledge throughout an organization – it encompasses both tacit and explicit knowledge,
thus, we can say that KM can influence the very nature of business intelligence.
3. TYPES OF DECISIONS
• Modern cloud-native organizations have constantly growing streams of raw data flowing
from every corner of the enterprise. Determining the impact this data has on business
performance can be an overwhelming task requiring teams of analysts. That’s where
employing business intelligence (BI) can help.
• By presenting current and historical data within a business context, the data insights
supplied by BI tools enable organizations to make smarter, more confident decisions that
provide strategic direction for years to come.
• Instead of relying on intuition and “gut feel,” companies can use BI to find new ways to
increase revenue, track performance, boost operational efficiency, identify market trends,
expose problems, and much, much more.
• Before we dig deeper into the primary types of decisions in business intelligence, let’s define
what we mean by decision-making in a business context and understand how business
intelligence factors into the process.
• Decision-making defined
• Simply put, decision-making is the process of deciding something, especially with a group of
people. From a business decision perspective, the aim is to achieve business objectives to
satisfy stakeholder requirements, needs, and expectations.
• For the decision to be effective, however, decision makers must forecast the outcome of each
option and determine which is best for a particular situation. That makes decision support
systems (DSS) like decision intelligence and business intelligence absolute essentials.
• Business intelligence refers to the technology tools and processes that enable businesses to
organize, analyze, and contextualize business data from around the company. Business
intelligence tools and decision-making transform raw data into meaningful and actionable
information.
• BI is the means through which organizations make smarter business decisions. While data
fuels the engine, integrating BI-related infrastructure like a data warehouse, dashboards,
reports, data discovery tools, and cloud data services make it possible to extract insights from
your data.
• The role of business intelligence
• Companies make big mistakes when they base business decisions on what they think will
happen instead of relying on facts.
Using BI and advanced analytics, organizations can extract crucial facts from the mountain
of data, transforming it into information companies can act on to make informed strategic
decisions. The result: improved business processes, operational efficiency, and business
productivity.
• Business intelligence decisions
• Business intelligence decisions typically fall into three categories: strategic, tactical, and
operational.
• An organization needs to gain a complete understanding of these types of decisions in
business intelligence to make better-informed decisions that lead to increased
customer retention, stakeholder satisfaction, operational efficiency, and revenue.
• The relationship between business intelligence and business analytics
• Business intelligence tells you what is currently happening and what happened in the past
to bring you to that state.
• On the other hand, business analytics is an umbrella term for predictive data analysis
techniques and prescriptive .
• Using business intelligence and analytics efficiently is the difference between
companies that succeed and those that fail in the modern environment.
• Three primary types of business intelligence decisions
• Business intelligence supports the three types of decision-making mentioned above:
strategic, tactical, and operational. Its frequency and organizational impact characterize each.
Strategic decisions
• Strategic decisions comprise the highest level of organizational business decisions and are
usually less frequent and made by the organization’s executives. Yet, their impact is
enormous and far-reaching.
• Some types of strategic decisions include selecting a particular market to penetrate, a
company to acquire, or whether to hire additional staff.
• Decisions made at this level usually involve significant expenditure. However, they are
generally non-repetitive in nature and are taken only after careful analysis and evaluation of
many alternatives.
Tactical decisions
• Tactical decisions (or semi structured decisions) occur with greater frequency (e.g., weekly
or monthly) and fall into the mid-management level. Often, they relate to the
implementation of strategic decisions.
Gathering information
• If you’re like most of us, you’re using spreadsheets and dashboards to visualize and present
data on the Web. But some businesses are turning to dashboards to make smart decisions that
improve operations, enhance employee satisfaction, process more data, and have greater
visibility into the bottom line.
Design and analyze
• Routinely encountering data requires a person to make decisions on how to process the data.
It can be viewed as a few steps involved in this process. First, it involves the analysis of data,
followed by the identification of the most appropriate decision- making model. The
implementation comes after the analysis of data. The data analysis process is then repeated,
and the model is altered based on the latest information or discoveries, if necessary.
Select and implement using ad hoc query, what-if, and forecasting.
• The importance of data in business is beyond dispute. But, the problem is that most companies
are not making the most of the information available to them. What they have is not necessarily
the right data and is not being put to its best use. Ad hoc queries, what-if scenarios, and
forecasting are particularly effective tools for uncovering hidden information. It can influence
risk and make decisions. Ad-hoc query, What-If, and Forecasting are three terms used in
business intelligence to help make decisions.
Do evaluations using the vital tools.
• Effective decision-making is critical to business success. The good news is that business
intelligence tools can help you make better decisions. Using dashboards and reports can help
you identify key performance indicators, make comparisons between data sets. Also, identify
opportunities to improve your business. If you have any experience managing a business, you
know that these vital tools are a necessity, providing you with the information you need to make
business decisions. These tools can be used for assessing your performance, motivating staff,
and improving the customer experience.
Separate the components that are not related.
• At one point in time, every decision we make appears to have little connection to anything else.
This is because we are able to separate decisions into distinctly different components, such as
the sales forecast, the profit, the hiring policy, and so on. The trick is to recognize that these
components are inseparable. Moreover, a decision made without the sales forecast is like an
engine without the air. You get nowhere.
• . A decision made without a profit is like an engine without fuel. You get nowhere. A
decision made without a hiring policy is like an engine without the air. You get nowhere
Increase the coherence.
• Business intelligence is a technology that extracts and analyzes historical data to help
organizations make better decisions. It is a key asset for businesses that use it to gain value from
all their data and to make better decisions on how to spend their resources.
• There is a trend among different organizations in today’s world to adopt the use of Business
Intelligence. The use of business intelligence in these organizations helps in making
decisions faster and more effectively. With the help of business intelligence, organizations
can keep track of their processes. Moreover, the performance of their business, and the
consumption of resources and get insights into the organization’s current situation. Business
intelligence solutions help in making more effective decisions.
2. Model-driven DSS
A model-driven DSS allows a user to analyze and manipulate specific models of data, such
as statistics, finances or scheduling.
These decision support systems are specific to the type of model the user wants to interact
with and typically offer less data than other DSS types.
They analyze scenarios and data to allow the user to manipulate a model, such as creating a
work schedule. They might use simple analysis tools or complex statistics, depending on the
model's purpose and the user's needs.
Managers, staff and third parties who interact with a business might use a model-
driven DSS.
Software examples of a model-driven DSS include:
Scheduling software
Financial modeling
Decision analysis modeling
Optimization software
3.Knowledge-driven DSS
With a knowledge-driven DSS, a knowledge-management system monitors
continually updated data about an organization to support decisions.
The DSS uses diagnosis, prediction, interpretation and classification to recommend actions
consistent with the business.
A knowledge-driven DSS can be helpful to managers because it performs tasks faster than a
human might. They can also help consumers decide which products and services to buy. This
kind of DSS often relies on a data-mining component.
Managers, staff and external users, such as customers, might use a knowledge- driven
DSS.
Software examples of a knowledge-driven DSS include:
Software that identifies new or current customers who might be interested in products
Product selection software
4. Document-driven DSS
A document-driven DSS retrieves unstructured information from a variety of electronic
sources. It searches web pages, documents in databases and other information based on a
user's search terms to gather relevant information.
A document-driven DSS might be specific to a business' private files or as broad as a
common internet search engine.
Anyone using a database's search function or an internet search engine is using a document-
driven DSS.
Software examples of a document-driven DSS include:
Search engines
Database search software
Article databases with search functions
5. Communication-driven DSS
A communication-driven DSS uses tools to support communication and
collaboration.
Email is an example of a communication-driven DSS. This type of DSS includes share tools
that allow multiple people to work on a project at once and software that allows for digital
communication between people.
It improves a shared project's efficiency and effectiveness and can help facilitate
meetings and conversations.
Internal team members, virtual business meeting hosts and online chat and video meeting
software users can benefit from a communication-driven DSS.
A communication-driven DSS might also be called a group DSS. A communication- driven
DSS focuses on communication and collaboration, while a group DSS helps groups
streamline the decision-making process.
A communication-driven DSS, for example, might help two people who work for the same
company on different shifts share documents. It might also allow employees on opposite
sides of the country to meet virtually to view a shared file.
Software examples of a communication-driven DSS include:
Chat and instant messaging services
Collaboration software, such as document sharing and editing software
Email
6. Intelligent DSS
Any DSS with artificial intelligence in its design is an example of intelligent DSS (IDSS).
Within an IDSS, AI does data mining and processing to filter through large datasets. An
IDSS is designed to offer similar services to a human consultant. They're programmed to
identify patterns and trends to guide decision-making. They can also resolve problems and
analyze solutions. AI components add advantages, such as fuzzy logic and machine learning,
to a DSS.
Managers, diagnosticians and other decision-makers might use an IDSS.
Software examples of an intelligent DSS include:
Smart manufacturing systems
Medical diagnostic systems
7. Manual DSS
A manual DSS relies on individuals instead of computers to support decision- making. A
group of experts analyzes the strengths, weaknesses, opportunities and threats of their
organization or project.
A manual DSS is much slower than a computer-based DSS, but certain types of
analysis still need a human eye at every step.
Economists, executives and managers might use a manual DSS.
Examples of manual DSS include:
Cost-benefit analyses
Decision matrixes
8. Hybrid DSS
A hybrid DSS combines parts of multiple DSS types to create a complex outcome. Large
issues in industries such as finance and health care might require the tools of multiple
decision support systems, such as a knowledge-driven DSS and a data-driven DSS.
A hybrid DSS might use additional software to help these components work together.
Sometimes a human analyzes and combines the results of each DSS.
A hybrid DSS might also describe a system in which a human works with a DSS to extract
and manipulate data. Medical professionals, financial decision-makers and researchers might
use a hybrid DSS.
Software examples of a hybrid DSS include:
Risk assessment
Clinical DSS
Web-based DSS
Decision Support System Examples.
GPS route planning.
Crop-planning.
Clinical DSS
Components of a decision support system
DSS database. The database draws on a variety of sources, including data internal to the
organization, data generated by applications, and external data purchased from third parties
or mined from the Internet. The size of the DSS database will vary based on need, from a
small, standalone system to a large data warehouse.
DSS software system. The software system is built on a model (including decision context
and user criteria).
Statistical models. These models are used to establish relationships between events
and factors related to that event. For example, they could be used to analyze sales in
relation to location or weather.
Sensitivity analysis models. These models are used for “what-if” analysis.
Optimization analysis models. These models are used to find the optimum value for a
target variable in relation to other variables.
Forecasting models. These include regression models, time series analysis,
and other models used to analyze business conditions and make plans.
Backward analysis sensitivity models. Sometimes called goal-seeking analysis, these
models set a target value for a particular variable and then determine the values other
variables need to hit to meet that target value.
DSS user interface. Dashboards and other user interfaces that allow users to interact with
and view results.
7. BUSINESS INTELLIGENCE
• Business intelligence combines business analytics, data mining, data visualization, data tools
and infrastructure, and best practices to help organizations make more data-driven decisions.
3. Reporting
One area where BI tools commonly help business users is by designing, scheduling and
generating reports, for example regular performance, sales or marketing reports.
Reports output by BI tools efficiently gather and present information to support the
management, planning and decision making process.
Once the report is designed it can be automatically run at set intervals and sent to a
predefined distribution list so key people can see regularly updated numbers.
Dice
The dice operation describes a sub cube by operating a selection on two or more
dimension.
Pivot
The pivot operation is also called a rotation. Pivot is a visualization operations which rotates the
data axes in view to provide an alternative presentation of the data. It may contain swapping the
rows and columns or moving one of the row-dimensions into the column dimensions.
8
Types of OLAP
numeric_expr represents the data to the trend. This is the Y-axis, usually a measure
column.
series is the X-axis is a list of numerics or time dimension attribute columns.
partitionBY is a list of dimension attribute columns that are in the view but not on the
X-axis.
model_type is one of the following (‘LINEAR’, ‘EXPONENTIAL’).
result_type is one of the following (‘VALUE’, ‘MODEL’). ‘VALUE‘ will return all the
regression Y values given X in the fit. ‘MODEL’ will return all the parameters in a
JSON format string.
TRENDLINE((Sales),(Order Date) BY (Product Sub Category), 'LINEAR', 'VALUE')
numeric_expr = Sales
series = Order Date (Time dimension)
partitionBy = Product Sub Category
model_type = LINEAR
result_type = VALUE
CLUSTER((dimension_expr1 , ... dimension_exprN), (expr1, ... exprN),
output_column_name, options, [runtime_binded_options])
dimension_expr represents the list of dimensions, e.g., (productID, companyID), to be
clustered.
expr represents the list of dimension attributes or measures to be used to
cluster the dimension_expr.
output_column_name is the output column. The valid values are ‘clusterId’, ‘clusterName’,
‘clusterDescription’, ‘clusterSize’, ‘distanceFromCenter’, ‘centers’.
options mean the string list of name=value pairs separated by ‘;’. The value can include
%1 … %N, specified using runtime_binded_options.
runtime_binded_options is an optional comma-separated list of run-time binded columns or
literal expressions.
OUTLIER((dimension_expr1 , ... dimension_exprN), (expr1, .. exprN),
output_column_name, options, [runtime_binded_options]))])
dimension_expr indicates a list of dimensions.
expr represents a list of dimension attributes or measures to find outlier.
output_column_name indicates the output column name. Valid values are 'isOutlier'
and 'distance'.
options indicates a string list of name/value pairs separated by a semi-colon
(;). The value can include %1 ... %N, which can be specified
using runtime_binded_options.
runtime_binded_options is an option comma separated list (,) of run-time binded
columns and or literal expressions.
REGR(y_axis_measure_expr, (x_axis_expr), (category_expr1, ..., category_exprN),
output_column_name, options, [runtime_binded_options])
1. What do you understand by Business Intelligence? (CO3, K1)
The term Business Intelligence refers to a collective meaning, including technologies, tools,
applications, practices for the data collection, and providing those data to the users, especially to
help in running the business or a part of it.
2. What are the primary objectives of Business Intelligence? (CO3,K3) The
primary objectives of Business Intelligence are:
• Business Intelligence is leverage to make the following enterprise-level decisions.
• Business Intelligence helps in identifying the wrong tracks and approaches of a business.
• Business Intelligence can cluster the data for analysis and then compile them to
monitor corrective actions.
• Business Intelligence is also useful for determining whether a company is executing
as per plan.
• Identification and extraction of trends and insights from business are possible using Business
Intelligence tools.
3. Define the term Data Warehousing? (CO3,K1)
• Data Warehousing is the repository system used to analyze and report data from various
heterogeneous sources and forms. These data will be available from the oracle database, SQL
Server, Postgres, or a simple excel sheet. The warehouse uses one central mechanism called the
repository, through which the business analyst can fetch all historical reports associated with that
data.
4. Mention some characteristics of the Data warehouse(CO3,K2)
• A data warehouse is a separate database responsible for storing historical
information records and is kept separate from an operational database.
• Processed and analyzed data from a data warehouse helps make top management strategic
and tactical decisions based on the analysis.
Analyzing data in the data warehouse helps the business analysts and users see the current
business trends.
• The data warehouse is also responsible for consolidating historical data analysis.
5. What are the popular Business Intelligence (BI) tools used by Business Analysts?
(CO3,k2)
• Microsoft BI
• Cognos
• MicroStrategy
• Tableau
• SAS
• Business Objects
• OBIEE
• Hyperion
Q. Questions CO K Level
No. Level
1 Discuss about Data Warehouses and Data Mart in CO3 K2
Detail
2 Explain in how Knowledge Management support in BI? CO3 K3