Screenshot 2024-10-17 at 2.05.17 PM
Screenshot 2024-10-17 at 2.05.17 PM
MBA232C6
Unit 1
RELEVANCE
▪ Increasing complexities associated with businesses in the form of scale of operations and
competition
▪ Demand deeper understanding of the market and customers to serve better and succeed in the
market. One of the main reasons for analytics is the scale of operations.
▪ Competitive Advantage – Nvidia, Google, Amazon
▪ Remove inefficiencies in the system
▪ Provides ability to make better decision
▪ In 2021, the overall amount of data generated in the world was estimated to be around 79
zettabytes. Approximately 402.74 million terabytes of data are created each day. Around 147
zettabytes of data will be generated this year. 181 zettabytes of data will be generated in 2025.
REASON FOR THE RISE OF ANALYTICS
▪Advanced software techniques are available e.g. advanced data structures, advanced
database systems, cloud computing, etc.
▪Clean data is available, now most organizations have robust software infrastructure
which helps in capturing clean customer, vendors, and sales data
▪Advanced hardwares are available which can store huge data in such a way that it can
be easily available for analysis without any time lag. Also, the cost is quite reasonable
e.g. GPU/TPU processors, distributed networks, etc.
▪Advanced business problem-solving techniques are providing new alternatives to
tackle business problems e.g. Agile and lean six sigma frameworks for business
excellence
NVIDIA MKT CAP
➢If Nvidia was a country – 7th largest
in the world (Market capitalization)
https://ojdigitalsolutions.com/amazon-sales-data/
SO, WHAT IS ANALYTICS AND WHAT IS ITS SCOPE?
Davenport and Harris (2007) and Hopkins et al. (2010) reported that there was a high
correlation between use of analytics and business performance.
BUSINESS ANALYTICS ACROSS DOMAINS (SCOPE)
Scope:
• Data collection and Management
• Data Analysis
• Predictive modelling
• Data visualization
• Decision-making support
• Customer-Making support
• Customer Behavior Analysis
• Market research
• Inventory management
• Financial forecasting
• Operations optimization
• Customer Behaviour analysis
• Sales and Marketing analysis
• Supply chain Optimization
• Financial analysis
• Process improvement
• HRM and analysis
QUIZ
Question 1: For which company does this problem arise : selling fake items in place of
genuine items?
A. E-commerce platforms
B. Logistics and delivery services
C. Fraudulent sellers
D. Data analytics and machine learning companies
Question 2: Which type of company manages a large inventory with a wide variety of
products, each represented by a unique SKU?
A. E-commerce platforms
B. Logistics and delivery services
C. Fraudulent sellers
D. Data analytics and machine learning companies
Question 3: Which type of company specializes in forecasting demand for products?
A. E-commerce platforms
B. Logistics and delivery services
C. Fraudulent sellers
D. Data analytics and machine learning companies
Question 4: Which type of company experiences a significant number of canceled
orders placed by customers before delivery?
A. E-commerce platforms
B. Logistics and delivery services
C. Fraudulent sellers
D. Data analytics and machine learning companies
Question 5: Which type of company predicts what a customer is likely to buy in the
future?
A. E-commerce platforms
B. Logistics and delivery services
C. Fraudulent sellers
D. Data analytics and machine learning companies
QUIZ – RECAP OF SESSION 1
A. Healthcare
B. Stock Market
C. Gambling
D. Banking
SESSION-2
BUSINESS ANALYTICS DEFINITION
Business
context
Data
Technology
Science
Business Context:
- BA applications/ projects starts with business context.
- Ability of the organizations to ask the right questions – helps in selecting relevant
data and right tools – results in a good analytics storyboard.
Technology:
- used for data capture, data storage, data preparation, data analysis, and data share.
- Most data are unstructured – need software for analysis – automation of actionable
items.
Data Science:
- Consists of statistical and operations research techniques, machine learnings and deep
learning algorithms.
- Given the objective or problem, data science component of analytics is to identify the
most appropriate statistical model / machine learning algorithms that can be used.
DATA DRIVEN DECISION MAKING
A typical data-driven decision-making process uses the following steps:
1. Identify the problem or opportunity for value creation. (Amazon Prime Air)
2. Identify sources of data - primary as well secondary data sources. (Customer feedback, GPS
and telematics data, traffic patterns, Online reviews, etc.)
3. Pre-process the data for issues such as missing and incorrect data. Generate derived variables
and transform the data if necessary (Normalization, standardization, handling skewed data,
less sensitive to outliers, storage efficiency, meeting assumptions of statistical tools). Prepare
the data for analytics model building.
4. Divide the data sets into subsets training and validation data sets.
5. Build analytical models and identify the best model(s) using model performance in validation
data.
6. Implement Solution/Decision/Develop Product
(Refer notes for Applications of typical data-driven decisions in various industries)
ANALYTICS CAN BE USED TO SOLVE VARIOUS KINDS OF PROBLEMS
Toyota Inventory
Management system
Healthcare
BUSINESS ANALYTICS CAN BE GROUPED
INTO THREE TYPES:
1. Descriptive analytics,
2. Predictive analytics, and
3. Prescriptive analytics
DESCRIPTIVE ANALYTICS
It involves the exploration and analysis of data to answer questions such as "What
has happened?" and "What is the current state?"
Descriptive analytics often includes tasks such as:
1. Data Exploration (identify patterns, distribution, outliers, Data cleaning, Data
Profiling, and data quality assessment); 2. Data Visualization (Charts, Graphs,
Dashboards); 3. Summary Statistics (mean, median, mode, range, std. deviation); 4.
Data Segmentation (grouping or categorizing data to understand the variation within
data); and 5. Data Reporting (highlighting key observations, presentation)
- It focuses on historical data and identifies the hidden trends, patterns and
relationships using graphs, measures of frequency, measures of central tendency,
measures of Dispersion or variation, and measures of position.
-Simplest form of analytics that mainly use simple descriptive statistics, data
visualization techniques, and business-related queries to understand past data.
- The primary objective is data summarization and understanding the trend in the past
data which can be useful for generating insights.
PREDICTIVE ANALYTICS
Predictive analytics is a branch of business analytics that uses historical data, statistical
algorithms, and machine learning techniques to make predictions and forecasts about
future events or outcomes. It answers to the question ‘What will happen in the future?’
Here are some examples of predictive analytics applications:
1. Customer Churn Prediction: Customers who are likely to churn or cancel their subscription.
2. Demand Forecasting: Retailers & manufacturers forecast future demand of their product.
3. Fraud Detection: Financial institutions detect fraudulent transactions by analyzing patterns and
anomalies in data.
4. Predictive Maintenance: To optimize equipment maintenance schedules.
5. Credit Risk Assessment: Assess the default risk and determine appropriate credit limits or
interest rates.
6. Healthcare Outcome Prediction: Likelihood of readmission, probability of disease.
DESCRIPTIVE ANALYTICS VS. PREDICTIVE ANALYTICS
Focus It focuses on answering questions such as It focuses on answering questions such as "What is
"What happened?" and "What is the current likely to happen?" and "What is the probability of a
state?" specific event occurring?"
Insights Descriptive analytics provides insights into Predictive analytics uses statistical modeling and
the past by analyzing and summarizing data machine learning algorithms to uncover
through measures like mean, median, mode, relationships and patterns in historical data and
standard deviation, and data visualization applies them to predict future behavior or
techniques. outcomes.
Examples Data profiling, data visualization, summary Customer churn prediction, demand forecasting,
statistics, data segmentation, and reporting are fraud detection, predictive maintenance, credit risk
common techniques used in descriptive assessment, and healthcare outcome prediction are
analytics. some common applications of predictive analytics.
PRESCRIPTIVE ANALYTICS
Prescriptive analytics is the highest level of business analytics that goes beyond
descriptive and predictive analytics. It focuses on providing recommendations or
prescriptions for actions to optimize decisions and outcomes. It acts as a solution
builder for a problem.
Here are some examples of prescriptive analytics applications:
1. Supply Chain Optimization (Flipkart’s Supply Chain Management: uses BA for
route optimization, Inventory optimization, Warehouse management etc.)
2. Pricing Optimization (Ola Cabs: Dynamic pricing algorithms based on rider
demand, driver availability, traffic conditions, driver incentives)
3. Resource Allocation in Healthcare (Apollo Hospitals – uses BA for optimal
allocation of medical staff (doctors, nurses, technicians), medical equipment (MRI
machines, surgical instruments), and hospital beds based on predicted demand and
resource availability)
4. Portfolio Optimization (Edelweiss Asset Management Limited - uses real-time
monitoring analytics tools, and optimization models to construct portfolios that
maximize returns given a specific level of risk tolerance or minimize risk for a given
level of expected return).
• Prescriptive Analytics: Amazon uses prescriptive analytics to optimize its logistics and
delivery operations. They analyze various factors such as transportation routes, weather
conditions, and customer preferences to determine the most efficient delivery options.
APPLICATION: COMPANIES USING ALL THE
THREE TYPES OF ANALYTICS
Netflix:
• Descriptive Analytics:
• Predictive Analytics:
• Prescriptive Analytics:
APPLICATION: COMPANIES USING ALL
THE THREE TYPES OF ANALYTICS
Uber:
• Descriptive Analytics:
• Predictive Analytics:
• Prescriptive Analytics:
APPLICATION: COMPANIES USING ALL
THE THREE TYPES OF ANALYTICS
• Descriptive Analytics:
• Predictive Analytics:
• Prescriptive Analytics:
APPLICATION: COMPANIES USING ALL THE
THREE TYPES OF ANALYTICS
Walmart:
• Descriptive Analytics:
• Predictive Analytics:
• Prescriptive Analytics:
TECHNIQUES: DESCRIPTIVE, PREDICTIVE AND
PRESCRIPTIVE;
Descriptive Analytics Predictive Analytics: Prescriptive Analytics:
Decision Analysis:
Multi-Criteria Decision
Analysis (MCDA):
BIG DATA ANALYTICS
Big data refers to extremely large and complex data sets that exceed the capabilities of
traditional data processing tools and techniques to capture, store, manage, and analyze.
Big data is characterized by the 5 V's: volume (large amount of data), velocity (High speed
data generation and processing), variety (diverse data type and different sources) , veracity
(data quality and accuracy - biased, incomplete, noise and abnormality in data) and value
(offers benefits).
Key characteristics of big data analytics include:
1. Advanced Analytics:
2. Scalability:
3. Real-time or Near Real-time Analysis:
4. Data Variety:
5. Data Integration:
6. Business Value:
WEB AND SOCIAL MEDIA ANALYTICS
Web and social media analytics refers to the process of collecting, analyzing, and
interpreting data from websites and social media platforms to gain insights and make
data-driven decisions.
General steps followed in conducting web and social media analytics is as follows:
1. Data Collection: done through various methods such as web scraping, APIs
(Application Programming Interfaces), or 3rd party analytical tools.
2. Data Cleaning and Preparation: (Removing duplicates, handling missing data etc.)
3. Data Analysis: (includes statistical, network, sentimental analysis, text mining etc.)
4. Visualization and Reporting: (Charts, graphs, dashboards, reports etc.)
5. Interpretation and Action: (helps in making business decisions, optimizing
strategies, solving problems, improve customer experiences).
FROM A BUSINESS PERSPECTIVE, WEB AND SOCIAL
MEDIA ANALYTICS IS FOUND TO HIGHLY RELEVANT:
1. Customer Insights: To create more targeted marketing campaigns, tailor products or
services to customer needs, and improve overall customer experiences
2. Marketing Optimization: To identify effective marketing channels, track the success
of campaigns
3. Brand Monitoring and Reputation Management: To proactively manage the brand
image, address any negative feedback
4. Competitor Analysis: To understand competitor strengths and weaknesses, and
adapting strategies accordingly.
5. Market Research and Trend Analysis: To help in staying ahead of industry trends,
identifying new market opportunities
6. Customer Service and Support: To provide timely and personalized responses, and
enhancing overall customer support experience.
INTRODUCTION TO MACHINE LEARNING
❑ Google: "Machine learning algorithms are a set of computational methods and models
that allow computers to automatically learn from data and make predictions or decisions
without explicit programming.“
❑ It requires dataset to train the machine learning model, consisting of input features
(attributes) and corresponding output labels or targets.
❑ ML models are evaluated based on their performance using metrics such as accuracy,
precision, recall, and F1-score (F1-score is a measure of a model's accuracy that combines
both precision and recall metrics into a single value. It is particularly useful when dealing
with imbalanced datasets where one class is more frequent than the other).
❑ Challenges in machine learning where a model may perform too well on training data
but poorly on new, unseen data (overfitting) or fail to capture the underlying patterns in
the data (underfitting).
TYPES OF MACHINE LEARNING
1. Supervised Learning: A type of machine learning where the model is trained on labeled
data, allowing it to learn relationships between input features and output labels.
Examples:
❑ Linear regression: Predicting house prices based on features like area, number of
bedrooms, and location.
❑ Classification: Identifying whether an email is spam or not based on its content and
attributes.
❑Support Vector Machines (SVM): Classifying images into different categories, such as
recognizing handwritten digits.
2. Unsupervised Learning:
It is a type of machine learning that learns from data without human supervision.
Unsupervised learning involves training a model on unlabeled data, where the input data
does not have corresponding target labels. The model learns to identify patterns,
structures, or relationships in the data without specific guidance.
Examples: