SlideShare a Scribd company logo
Large Scale Data Analytics
Shankar Radhakrishnan
shankar.r3@cognizant.com
linkedin.com/in/connect2shankar
Scenario
• Insurer uses meteorological data for pricing model
• At present data from 2000 weather stations are
collected for analysis
• Plan is to use 10,000 weather station data 

( or more )
• Stochastic simulation needs to run to ID pattern in
weather data, to determine pricing
• Volumetric : peta-bytes of information 

( for 1 region )
2
Trends
3
Data Analytics Is Mostly About $$, Customers, Markets
4
How Widespread Is Data Analytics?
5
Expectations On Payback Period ( Aggressive )
6
Large Scale Data Analytics
7
“Involves using different algorithms, 

distributed platforms, tools and techniques
to analyze big data and provide actionable
insights”
Big Data
“ Data sets that are very large in volume and complex “
8
New platforms, tools and techniques

have emerged to manage Big Data
We broke away from traditional

ways to process and analyze them
Data Structures
 
Vector, Matrix,
Or Complex
Structure
Free Text
Image or
Binary Data
Data “bags”
Iterative
Logic Or
Complex
Branching
Advanced
Analytic
Routines
Rapidly
Repeated
Measurements
Extreme
Low
Latency
Access to
all data
required
Search Ranking X X X X X X
Ad Tracking X X X X X X X X  
Location or Proximity Tracking X   X X     X X  
Social CRM X X X X X X      X
Document Similarity Testing X X X X X X   X X
Genomic Analysis X X X X X
Customer Cohort groups X X   X X X     X
Fraud Detection X X X X X X X X X
Smart Utility Metering X X X X X X
Churn Analysis X X X X X X   X  
Satellite Image Analysis X X X X
Game Gesture Analysis X X X X X X X X
Data Bag Exploration X X X X X X
9
Business Interests : Well Informed Customer Executive
10
Speech to Text
Conversion
Voice Data
Unstructured data Analytical System
Customer Persona
• Customer Persona -
Demographics,

Top interactions, 

Channel Preferences,
Dissatisfies
• Customer Lifetime Value
• Recent Contact History
• Customer Sentiment &
Trend during the call
Customer’s state of mind
Sentimental
Analysis
Social media
Depositions
Complaints
Other Channel
information
(ATM, Branch)
Big Data Warehouse
Traditional Warehouse
Decision Engine • Customer Executive Dashboard
presents all intelligence
required to make a decision
• The decision engine also
presents important decisions
to be taken for the particular
customer issue
Well Informed Customer Executive…
Customer calls
BankingCallCenter
Executive
understands the
customerproblemExecutive authenticates
customer and pulls up
CustomerPersona
Executive reviews
risk of attrition
against Customer
LifetimeValue
Executive reviews
Last 5 call center
and banking
transactions
Executiveviews
customer’s state of
mind (riskof
attrition )through a
barometer chart
Analytical Solution -
Converts Speech to
textAnalytical engine
listens to
customer voice
Suggested top 5
Actions required
DecisionEngine
Executive performs below actions based on his analysis and
recommendations from Decision engine
1. Reversal of overdraft fee
2. One time fee waiver on Cheque book (predicting customer
need based on historic usage cycles )
3. Cash back Reward card for a minimum spend of $X through
debitcard
4. Offer interest revision for investment products or mortgage
5. Promote new mutual funds or credit cards based on
customer willingness
Analytical engine
monitors
sentiment
Executive analyzes Customer
Persona (demographic /
Preferences / Satisfiers /
dissatisfiesetc )
11
Business Interests : Fraud Prevention
12
Envisaged Benefits
▪ New fraud patterns can be identified by building ‘analytical models’ to run against historical data
▪ ‘Web crawling’, ‘Contextual text analysis’, ‘Natural Language Processing’ allows fraud behavior
identification from social media. It may increase Fraud detection success rate
▪ ‘Real time’ models to capture behavioral patters and do pattern analysis against History data to
evaluate Fraud case validity. The model learns by self and updates ‘Fraud pattern master sets.
▪ Brings ‘artificial intelligent’ fraud pattern detection and analysis
▪ ‘Real time’ (in the order of .5-1 minute refresh rate) alerts to Fraud analysts about ‘self learned’ fraud
patterns based on new customer behavior patterns
Big Data Usage
▪ Formation of key value groups to the order of XcY (where X no. of attributes that are relevant to Fraud
and Y is no. of attributes that should be combined to identify patterns)
▪ High speed history data loading from source systems
▪ Efficient Real time fraud detection by identifying patterns through customer behavioral events and
processing them over X yrs. of history data – e.g. using HBase
Scenario
Formation of Fraud pattern reference tables using
▪ Real time data coming from different departments like IVR, WEB, Customer profile, Transactions etc
▪ Real time Mining and analysis of history data to form prior patterns (no. of years in range to 50-100 TB)
Fraud Pattern Detection…
13
Legacy Fraud
Data
Customer
Profile Data
IVR Audio
Data
Web / Online
Card
Transaction
Data
Fraud
Pattern
Master Table
Fraud Analyst
History Data
Processing to
determine
Fraud
Patterns over
X years
Real-time
Customer
Behavior
Analysis for
Fraud
Detection
Customer
Behavior Change
Events
Customer
Behavior Change
Events
Customer
Behavior Change
Events
Real time Analysis of
behavior patterns over
historical data
Real time update to
Master Table on New
Fraud Patterns
Real time alert to 

Fraud Analyst
RDBMS RDBMS
(JSON
Files) RDBMS
Customer
Behavior Change
Events
Fraud Prevention…
14
Benefits
15
BenefitsIndustry
Financial services
▪ Customer Insights – Integrating Transactional data (CRM/Payments) and unstructured Social feeds
▪ Regulatory Compliance – Risk exposures across asset classes, LOBs and firms
▪ Fraud Detection in Credit Cards & Financial Crimes (AML) in Banks
Travel, Hospitality & Retail
▪ Customer centricity – Customer behavior analysis from Omni channel retailing & Social feeds
▪ Markdown Optimization – Improve markdown based on actual customer buying patters
▪ Market basket analysis – Narrow down market basket analysis by demographics
Life Science
▪ Improve targeting & predictions – Automatic Detection of Adverse Drug Effects (ADEs)
▪ Patient data analysis – Longitudinal Patient Data (LPD) analysis
▪ Predictive Sciences – Analyze Preclinical Side Effect Profiles of Marketed Drugs
Healthcare (Payers & Providers)
▪ Cost of Care – Drug effectiveness & Cost of Care Analysis based on electronic Health Records (EMR)
▪ Self Service Healthcare – Increase in mHealth & eHealth to allow consumer access to health information
▪ Claims Analytics – Analyze insurance claims data for fraud detection & preferred treatment plans
Communication,
Media & Entertainment
▪ Discover churn patterns based on Call data records (CDRs) and activity in subscribers’ networks
▪ Digital Asset Management (DAM) – Analyze & capitalize digital data assets
Manufacturing
▪ Proactive Maintenance & Recommendation – Sensor Monitoring for automobile, buildings & machinery
▪ Energy Efficiency – Leveraging Smart meters for utility energy consumption
▪ Location or Proximity Tracking – Location based analytics using GPS Data
Hi-Tech
▪ Extend and complement conventional information supply chain with big data path
▪ Predictive analysis and real time decision support
Hadoop
16
Hadoop - HDFS
17
Hadoop - MapReduce
18
Hadoop - MapReduce
19
Apache Spark
20
Spark
Iterative
Processing
Batch
Processing
Machine
Learning
SQL
Stream
Processing
Graph
Processing
Hadoop
21
NoSQL Databases
22
NoSQL Databases
23
Modern Data Architecture
24
Lambda Architecture
25
Lambda Architecture
26
Data Analytics Lifecycle
27
Analytics - Trends
• Big Data Analytics In The Cloud
• AWS, AWS-Redshift
• Hadoop
• Enterprise Data Operating
System
• Data Analytics Platform
• SQL on Hadoop
• NoSQL
• IoT ( Internet of Things )
28
• Multi-polar Analytics
• Predictive Analytics ( Spark )
• In-memory Analytics
• Data Lake
• Deep Learning
• Machine Learning
• Neural Networks
• Data Monetization
Q & A
Thank You !
“Any Sufficiently Advanced Technology Is
Indistinguishable From Magic “
- Arthur C. Clarke
Ad

More Related Content

What's hot (20)

Rulex big data and analytics
Rulex big data and analyticsRulex big data and analytics
Rulex big data and analytics
Ordine Ingegneri Savona
 
Data Analytics in Azure Cloud
Data Analytics in Azure CloudData Analytics in Azure Cloud
Data Analytics in Azure Cloud
Microsoft Canada
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banks
Pankaj Baid
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicago
KM Chicago
 
Banking Big Data Analytics
Banking Big Data AnalyticsBanking Big Data Analytics
Banking Big Data Analytics
Bria Strategy Group
 
Data mining
Data miningData mining
Data mining
udhaysweety
 
Data mining on Financial Data
Data mining on Financial DataData mining on Financial Data
Data mining on Financial Data
AmarnathVenkataraman
 
How advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorHow advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sector
Michael Haddad
 
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Swiss Big Data User Group
 
Big Data Retail Banking
Big Data Retail Banking Big Data Retail Banking
Big Data Retail Banking
Sandeep Bhagat
 
Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013
nkabra
 
Data science in finance industry
Data science in finance industryData science in finance industry
Data science in finance industry
Institute of Contemporary Sciences
 
Big Data in Banking (Data Science Thailand Meetup #2)
Big Data in Banking (Data Science Thailand Meetup #2)Big Data in Banking (Data Science Thailand Meetup #2)
Big Data in Banking (Data Science Thailand Meetup #2)
Data Science Thailand
 
How is Big Data extending the life of the banking sector?
How is Big Data extending the life of the banking sector?How is Big Data extending the life of the banking sector?
How is Big Data extending the life of the banking sector?
NexSoftsys
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
Utkarsh Sharma
 
Finance and Accounting BPM
Finance and Accounting BPMFinance and Accounting BPM
Finance and Accounting BPM
Bob Samuels
 
Hidden security and privacy consequences around mobility (Infosec 2013)
Hidden security and privacy consequences around mobility (Infosec 2013)Hidden security and privacy consequences around mobility (Infosec 2013)
Hidden security and privacy consequences around mobility (Infosec 2013)
Huntsman Security
 
Data driven approach to KYC
Data driven approach to KYCData driven approach to KYC
Data driven approach to KYC
Pankaj Baid
 
Analytics in banking preview deck - june 2013
Analytics in banking   preview deck - june 2013Analytics in banking   preview deck - june 2013
Analytics in banking preview deck - june 2013
Everest Group
 
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTOR
arpit bhadoriya
 
Data Analytics in Azure Cloud
Data Analytics in Azure CloudData Analytics in Azure Cloud
Data Analytics in Azure Cloud
Microsoft Canada
 
AI powered decision making in banks
AI powered decision making in banksAI powered decision making in banks
AI powered decision making in banks
Pankaj Baid
 
Predictive analytics km chicago
Predictive analytics km chicagoPredictive analytics km chicago
Predictive analytics km chicago
KM Chicago
 
How advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sectorHow advanced analytics is impacting the banking sector
How advanced analytics is impacting the banking sector
Michael Haddad
 
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Swiss Big Data User Group
 
Big Data Retail Banking
Big Data Retail Banking Big Data Retail Banking
Big Data Retail Banking
Sandeep Bhagat
 
Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013Big data in marketing at harvard business club nick1 june 15 2013
Big data in marketing at harvard business club nick1 june 15 2013
nkabra
 
Big Data in Banking (Data Science Thailand Meetup #2)
Big Data in Banking (Data Science Thailand Meetup #2)Big Data in Banking (Data Science Thailand Meetup #2)
Big Data in Banking (Data Science Thailand Meetup #2)
Data Science Thailand
 
How is Big Data extending the life of the banking sector?
How is Big Data extending the life of the banking sector?How is Big Data extending the life of the banking sector?
How is Big Data extending the life of the banking sector?
NexSoftsys
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
Utkarsh Sharma
 
Finance and Accounting BPM
Finance and Accounting BPMFinance and Accounting BPM
Finance and Accounting BPM
Bob Samuels
 
Hidden security and privacy consequences around mobility (Infosec 2013)
Hidden security and privacy consequences around mobility (Infosec 2013)Hidden security and privacy consequences around mobility (Infosec 2013)
Hidden security and privacy consequences around mobility (Infosec 2013)
Huntsman Security
 
Data driven approach to KYC
Data driven approach to KYCData driven approach to KYC
Data driven approach to KYC
Pankaj Baid
 
Analytics in banking preview deck - june 2013
Analytics in banking   preview deck - june 2013Analytics in banking   preview deck - june 2013
Analytics in banking preview deck - june 2013
Everest Group
 
USE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTORUSE OF DATA MINING IN BANKING SECTOR
USE OF DATA MINING IN BANKING SECTOR
arpit bhadoriya
 

Similar to Large Scale Data Analytics (20)

Big Data Analytics Summit - April, 2014
Big Data Analytics Summit - April, 2014Big Data Analytics Summit - April, 2014
Big Data Analytics Summit - April, 2014
shankar_radhakrishnan
 
¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?
Fabricio Quintanilla
 
BA4206 UNIT 4.pptx business analytics ppt
BA4206 UNIT 4.pptx business analytics pptBA4206 UNIT 4.pptx business analytics ppt
BA4206 UNIT 4.pptx business analytics ppt
LogeshThondamar
 
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
5733   a deep dive into IBM Watson Foundation for CSP (WFC)5733   a deep dive into IBM Watson Foundation for CSP (WFC)
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
Arvind Sathi
 
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHow to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
Hortonworks
 
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
EVOLVING PATTERNS IN BIG DATA - NEIL AVERYEVOLVING PATTERNS IN BIG DATA - NEIL AVERY
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
Big Data Week
 
Cognitive Computing and Data Science expertise at SoftServe
Cognitive Computing and Data Science expertise at SoftServeCognitive Computing and Data Science expertise at SoftServe
Cognitive Computing and Data Science expertise at SoftServe
Iurii Milovanov
 
Big Data use cases in telcos
Big Data use cases in telcosBig Data use cases in telcos
Big Data use cases in telcos
Mohamed Zuber Khatib
 
Big Data use cases in telcos
Big Data use cases in telcosBig Data use cases in telcos
Big Data use cases in telcos
Mohamed Zuber Khatib
 
AP-Summary-Aug-09-2022_capabilities .pdf
AP-Summary-Aug-09-2022_capabilities .pdfAP-Summary-Aug-09-2022_capabilities .pdf
AP-Summary-Aug-09-2022_capabilities .pdf
kcdelllaptop
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An Overview
MachinePulse
 
Predictive Analytics Overview
Predictive Analytics OverviewPredictive Analytics Overview
Predictive Analytics Overview
Vijaykumar Adamapure
 
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
MassTLC
 
Relying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services ExperienceRelying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services Experience
Cloudera, Inc.
 
Big Data Done Right by Successful Organizations
Big Data Done Right by Successful OrganizationsBig Data Done Right by Successful Organizations
Big Data Done Right by Successful Organizations
Euro IT Group
 
Big data analytics in payments
Big data analytics in payments Big data analytics in payments
Big data analytics in payments
Ashish Anand
 
Aanlytics on Telecom
Aanlytics on TelecomAanlytics on Telecom
Aanlytics on Telecom
Aaum Research and Analytics Private Limited
 
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity
 
Big Data solution for multi-national Bank
Big Data solution for multi-national BankBig Data solution for multi-national Bank
Big Data solution for multi-national Bank
Ritu Sarkar
 
Problems of Application of Machine Learning in the CRM - panel
Problems of Application of Machine Learning in the CRM - panel Problems of Application of Machine Learning in the CRM - panel
Problems of Application of Machine Learning in the CRM - panel
Data Science Society
 
Big Data Analytics Summit - April, 2014
Big Data Analytics Summit - April, 2014Big Data Analytics Summit - April, 2014
Big Data Analytics Summit - April, 2014
shankar_radhakrishnan
 
¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?¿Como los modelos predictivos cambian los negocios?
¿Como los modelos predictivos cambian los negocios?
Fabricio Quintanilla
 
BA4206 UNIT 4.pptx business analytics ppt
BA4206 UNIT 4.pptx business analytics pptBA4206 UNIT 4.pptx business analytics ppt
BA4206 UNIT 4.pptx business analytics ppt
LogeshThondamar
 
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
5733   a deep dive into IBM Watson Foundation for CSP (WFC)5733   a deep dive into IBM Watson Foundation for CSP (WFC)
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
Arvind Sathi
 
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and HortonworksHow to Become an Analytics Ready Insurer - with Informatica and Hortonworks
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
Hortonworks
 
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
EVOLVING PATTERNS IN BIG DATA - NEIL AVERYEVOLVING PATTERNS IN BIG DATA - NEIL AVERY
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
Big Data Week
 
Cognitive Computing and Data Science expertise at SoftServe
Cognitive Computing and Data Science expertise at SoftServeCognitive Computing and Data Science expertise at SoftServe
Cognitive Computing and Data Science expertise at SoftServe
Iurii Milovanov
 
AP-Summary-Aug-09-2022_capabilities .pdf
AP-Summary-Aug-09-2022_capabilities .pdfAP-Summary-Aug-09-2022_capabilities .pdf
AP-Summary-Aug-09-2022_capabilities .pdf
kcdelllaptop
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An Overview
MachinePulse
 
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
McKinsey MassTLC Big Data Seminar Keynote - February 28, 2014
MassTLC
 
Relying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services ExperienceRelying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services Experience
Cloudera, Inc.
 
Big Data Done Right by Successful Organizations
Big Data Done Right by Successful OrganizationsBig Data Done Right by Successful Organizations
Big Data Done Right by Successful Organizations
Euro IT Group
 
Big data analytics in payments
Big data analytics in payments Big data analytics in payments
Big data analytics in payments
Ashish Anand
 
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity at Corinium Chief Data Officer Forum Keynote - Brian Cox
Captricity
 
Big Data solution for multi-national Bank
Big Data solution for multi-national BankBig Data solution for multi-national Bank
Big Data solution for multi-national Bank
Ritu Sarkar
 
Problems of Application of Machine Learning in the CRM - panel
Problems of Application of Machine Learning in the CRM - panel Problems of Application of Machine Learning in the CRM - panel
Problems of Application of Machine Learning in the CRM - panel
Data Science Society
 
Ad

Recently uploaded (20)

Web and Graphics Designing Training in Rajpura
Web and Graphics Designing Training in RajpuraWeb and Graphics Designing Training in Rajpura
Web and Graphics Designing Training in Rajpura
Erginous Technology
 
Top 10 IT Help Desk Outsourcing Services
Top 10 IT Help Desk Outsourcing ServicesTop 10 IT Help Desk Outsourcing Services
Top 10 IT Help Desk Outsourcing Services
Infrassist Technologies Pvt. Ltd.
 
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In FranceManifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
chb3
 
Build 3D Animated Safety Induction - Tech EHS
Build 3D Animated Safety Induction - Tech EHSBuild 3D Animated Safety Induction - Tech EHS
Build 3D Animated Safety Induction - Tech EHS
TECH EHS Solution
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Social Media App Development Company-EmizenTech
Social Media App Development Company-EmizenTechSocial Media App Development Company-EmizenTech
Social Media App Development Company-EmizenTech
Steve Jonas
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
Mastering Advance Window Functions in SQL.pdf
Mastering Advance Window Functions in SQL.pdfMastering Advance Window Functions in SQL.pdf
Mastering Advance Window Functions in SQL.pdf
Spiral Mantra
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
Big Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur MorganBig Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur Morgan
Arthur Morgan
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdfSAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
Precisely
 
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager APIUiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPathCommunity
 
AI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global TrendsAI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global Trends
InData Labs
 
Linux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdfLinux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdf
RHCSA Guru
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
Web and Graphics Designing Training in Rajpura
Web and Graphics Designing Training in RajpuraWeb and Graphics Designing Training in Rajpura
Web and Graphics Designing Training in Rajpura
Erginous Technology
 
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In FranceManifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
chb3
 
Build 3D Animated Safety Induction - Tech EHS
Build 3D Animated Safety Induction - Tech EHSBuild 3D Animated Safety Induction - Tech EHS
Build 3D Animated Safety Induction - Tech EHS
TECH EHS Solution
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Social Media App Development Company-EmizenTech
Social Media App Development Company-EmizenTechSocial Media App Development Company-EmizenTech
Social Media App Development Company-EmizenTech
Steve Jonas
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
Mastering Advance Window Functions in SQL.pdf
Mastering Advance Window Functions in SQL.pdfMastering Advance Window Functions in SQL.pdf
Mastering Advance Window Functions in SQL.pdf
Spiral Mantra
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
Big Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur MorganBig Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur Morgan
Arthur Morgan
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdfSAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
Precisely
 
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager APIUiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPath Community Berlin: Orchestrator API, Swagger, and Test Manager API
UiPathCommunity
 
AI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global TrendsAI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global Trends
InData Labs
 
Linux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdfLinux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdf
RHCSA Guru
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
Ad

Large Scale Data Analytics

  • 1. Large Scale Data Analytics Shankar Radhakrishnan [email protected] linkedin.com/in/connect2shankar
  • 2. Scenario • Insurer uses meteorological data for pricing model • At present data from 2000 weather stations are collected for analysis • Plan is to use 10,000 weather station data 
 ( or more ) • Stochastic simulation needs to run to ID pattern in weather data, to determine pricing • Volumetric : peta-bytes of information 
 ( for 1 region ) 2
  • 4. Data Analytics Is Mostly About $$, Customers, Markets 4
  • 5. How Widespread Is Data Analytics? 5
  • 6. Expectations On Payback Period ( Aggressive ) 6
  • 7. Large Scale Data Analytics 7 “Involves using different algorithms, 
 distributed platforms, tools and techniques to analyze big data and provide actionable insights”
  • 8. Big Data “ Data sets that are very large in volume and complex “ 8 New platforms, tools and techniques
 have emerged to manage Big Data We broke away from traditional
 ways to process and analyze them
  • 9. Data Structures   Vector, Matrix, Or Complex Structure Free Text Image or Binary Data Data “bags” Iterative Logic Or Complex Branching Advanced Analytic Routines Rapidly Repeated Measurements Extreme Low Latency Access to all data required Search Ranking X X X X X X Ad Tracking X X X X X X X X   Location or Proximity Tracking X   X X     X X   Social CRM X X X X X X      X Document Similarity Testing X X X X X X   X X Genomic Analysis X X X X X Customer Cohort groups X X   X X X     X Fraud Detection X X X X X X X X X Smart Utility Metering X X X X X X Churn Analysis X X X X X X   X   Satellite Image Analysis X X X X Game Gesture Analysis X X X X X X X X Data Bag Exploration X X X X X X 9
  • 10. Business Interests : Well Informed Customer Executive 10 Speech to Text Conversion Voice Data Unstructured data Analytical System Customer Persona • Customer Persona - Demographics,
 Top interactions, 
 Channel Preferences, Dissatisfies • Customer Lifetime Value • Recent Contact History • Customer Sentiment & Trend during the call Customer’s state of mind Sentimental Analysis Social media Depositions Complaints Other Channel information (ATM, Branch) Big Data Warehouse Traditional Warehouse Decision Engine • Customer Executive Dashboard presents all intelligence required to make a decision • The decision engine also presents important decisions to be taken for the particular customer issue
  • 11. Well Informed Customer Executive… Customer calls BankingCallCenter Executive understands the customerproblemExecutive authenticates customer and pulls up CustomerPersona Executive reviews risk of attrition against Customer LifetimeValue Executive reviews Last 5 call center and banking transactions Executiveviews customer’s state of mind (riskof attrition )through a barometer chart Analytical Solution - Converts Speech to textAnalytical engine listens to customer voice Suggested top 5 Actions required DecisionEngine Executive performs below actions based on his analysis and recommendations from Decision engine 1. Reversal of overdraft fee 2. One time fee waiver on Cheque book (predicting customer need based on historic usage cycles ) 3. Cash back Reward card for a minimum spend of $X through debitcard 4. Offer interest revision for investment products or mortgage 5. Promote new mutual funds or credit cards based on customer willingness Analytical engine monitors sentiment Executive analyzes Customer Persona (demographic / Preferences / Satisfiers / dissatisfiesetc ) 11
  • 12. Business Interests : Fraud Prevention 12 Envisaged Benefits ▪ New fraud patterns can be identified by building ‘analytical models’ to run against historical data ▪ ‘Web crawling’, ‘Contextual text analysis’, ‘Natural Language Processing’ allows fraud behavior identification from social media. It may increase Fraud detection success rate ▪ ‘Real time’ models to capture behavioral patters and do pattern analysis against History data to evaluate Fraud case validity. The model learns by self and updates ‘Fraud pattern master sets. ▪ Brings ‘artificial intelligent’ fraud pattern detection and analysis ▪ ‘Real time’ (in the order of .5-1 minute refresh rate) alerts to Fraud analysts about ‘self learned’ fraud patterns based on new customer behavior patterns Big Data Usage ▪ Formation of key value groups to the order of XcY (where X no. of attributes that are relevant to Fraud and Y is no. of attributes that should be combined to identify patterns) ▪ High speed history data loading from source systems ▪ Efficient Real time fraud detection by identifying patterns through customer behavioral events and processing them over X yrs. of history data – e.g. using HBase Scenario Formation of Fraud pattern reference tables using ▪ Real time data coming from different departments like IVR, WEB, Customer profile, Transactions etc ▪ Real time Mining and analysis of history data to form prior patterns (no. of years in range to 50-100 TB)
  • 13. Fraud Pattern Detection… 13 Legacy Fraud Data Customer Profile Data IVR Audio Data Web / Online Card Transaction Data Fraud Pattern Master Table Fraud Analyst History Data Processing to determine Fraud Patterns over X years Real-time Customer Behavior Analysis for Fraud Detection Customer Behavior Change Events Customer Behavior Change Events Customer Behavior Change Events Real time Analysis of behavior patterns over historical data Real time update to Master Table on New Fraud Patterns Real time alert to 
 Fraud Analyst RDBMS RDBMS (JSON Files) RDBMS Customer Behavior Change Events
  • 15. Benefits 15 BenefitsIndustry Financial services ▪ Customer Insights – Integrating Transactional data (CRM/Payments) and unstructured Social feeds ▪ Regulatory Compliance – Risk exposures across asset classes, LOBs and firms ▪ Fraud Detection in Credit Cards & Financial Crimes (AML) in Banks Travel, Hospitality & Retail ▪ Customer centricity – Customer behavior analysis from Omni channel retailing & Social feeds ▪ Markdown Optimization – Improve markdown based on actual customer buying patters ▪ Market basket analysis – Narrow down market basket analysis by demographics Life Science ▪ Improve targeting & predictions – Automatic Detection of Adverse Drug Effects (ADEs) ▪ Patient data analysis – Longitudinal Patient Data (LPD) analysis ▪ Predictive Sciences – Analyze Preclinical Side Effect Profiles of Marketed Drugs Healthcare (Payers & Providers) ▪ Cost of Care – Drug effectiveness & Cost of Care Analysis based on electronic Health Records (EMR) ▪ Self Service Healthcare – Increase in mHealth & eHealth to allow consumer access to health information ▪ Claims Analytics – Analyze insurance claims data for fraud detection & preferred treatment plans Communication, Media & Entertainment ▪ Discover churn patterns based on Call data records (CDRs) and activity in subscribers’ networks ▪ Digital Asset Management (DAM) – Analyze & capitalize digital data assets Manufacturing ▪ Proactive Maintenance & Recommendation – Sensor Monitoring for automobile, buildings & machinery ▪ Energy Efficiency – Leveraging Smart meters for utility energy consumption ▪ Location or Proximity Tracking – Location based analytics using GPS Data Hi-Tech ▪ Extend and complement conventional information supply chain with big data path ▪ Predictive analysis and real time decision support
  • 28. Analytics - Trends • Big Data Analytics In The Cloud • AWS, AWS-Redshift • Hadoop • Enterprise Data Operating System • Data Analytics Platform • SQL on Hadoop • NoSQL • IoT ( Internet of Things ) 28 • Multi-polar Analytics • Predictive Analytics ( Spark ) • In-memory Analytics • Data Lake • Deep Learning • Machine Learning • Neural Networks • Data Monetization
  • 29. Q & A
  • 30. Thank You ! “Any Sufficiently Advanced Technology Is Indistinguishable From Magic “ - Arthur C. Clarke