SlideShare a Scribd company logo
© 2014 IBM Corporation
Client Approaches to
Successfully Navigate through
the Big Data Storm
June 2014
© 2014 IBM Corporation2
Does Your Big Data Project Look Like This?
IBM Presentation Template Full Version
You need cost predictability,
together with a solution that
can quickly take you places!
 Hadoop is a fascinating, exciting engine. However, it is:
 Ungoverned
 All custom, all the time
 Requires expensive, constantly changing skills
 Includes no concept of quality, governance or lineage
And, MapReduce was originally designed for finely grained fault
tolerance, which makes it slow for big data integration processing
Hadoop is just not a solution for big data integration
© 2014 IBM Corporation3
If so, that’s because 80% of the development work for a big data
project is to address Big Data Integration challenges
IBM Presentation Template Full Version
“By most accounts, 80 percent of the development effort in a big data project goes
into data integration and only 20 percent goes towards data analysis.”
Intel Corporation: Extract, Transform, and Load Big Data With
Apache Hadoop (White Paper)
Most Hadoop initiatives end up achieving garbage in,
garbage out faster, against larger data volumes and:
 MapReduce was not designed to accommodate the
processing all the logic necessary for big data
integration
 Teams forget that Hadoop initiatives require:
collecting, moving, transforming, cleansing,
integrating, exploring & analyzing volumes of
disparate data (of various types, from various
sources) --- AKA Data Integration
To succeed, you need Data Integration
capabilities that create consumable data by:
 Collecting, moving, transforming, cleansing,
governing, integrating, exploring & analyzing
volumes of disparate data
 Providing simplicity, speed, scalability and
reduced risk
© 2014 IBM Corporation4
A large US Bank needed to reduce total cost of ownership …
IBM Presentation Template Full Version
Business Problem Challenges
 Primary: Reduce Teradata total
cost of ownership
 Secondary: Allow for
new analytic exploration
& asset optimization
 Create a Data Distribution Hub / Big
Data platform to cut costs
 Move front-end processing from
Teradata to the Data Distrubion Hub
 Needed to offload ELT workload in a
cost-effective, efficient way
© 2014 IBM Corporation5
… and successfully offloaded ELT workloads to reduce costs
IBM Presentation Template Full Version
Approach Outcome
 Reduce costs by offloading ELT
workloads from Teradata to a Big
Data platform
 Leverage existing InfoSphere
Information Server data
integration skills and assets (jobs)
 Hand coding: Client would not
consider hand coding for data
integration capabilities
 Client decides to deploy IBM
PureData for Hadoop
 Client uses InfoSphere Information
Server as their single scalable &
flexible Big Data Integration solution
 Client successfully migrated their
Teradata ELT and now uses
InfoSphere Information Server to
exploit the lower cost of running
data integration on Hadoop
© 2014 IBM Corporation6
A government entity anticipated the need to support 10x increase in
incoming data volumes over 3-5 years …
IBM Presentation Template Full Version
Business Problem Project Challenges
 This Master Data Management
(MDM) client compares
frequently updated records to
identify potential national
security threats. They needed to:
– Support a 10X increase in
incoming data volumes (in
the next 3-5 years)
– Reduce high software and
hardware costs
 Create a solution that could support
scalable probabilistic matching for up
to 10X data growth
 Modernize ETL practices and remove
bottlenecks
© 2014 IBM Corporation7
… and replaced an expensive and failing hand-coding approach with
a massively scalable Big Data Integration solution
IBM Presentation Template Full Version
Approach Outcome
 Eliminate hand coding for data
integration to significantly reduce
software costs
 Deploy a data integration solution
that can scale fast enough to feed
the MDM system
 Reduce high costs of ELT running
in their database
 Removed hand coding & replaced it
with InfoSphere InfoSphere
Information Server for massively
scalable data integration processing
 Stopped running ELT in the
database, leveraging Hadoop instead
 Client purchased an end-to-end Big
Data solution from IBM – across
MDM, Hadoop, and Information
Integration areas
© 2014 IBM Corporation8
A large European telco wants to leverage big data to increase
revenue and customer satisfaction …
IBM Presentation Template Full Version
Business Problem Project Challenges
 Increase revenue & customer
satisfaction by analyzing usage
patterns of mobile devices to
match user demand
 Needed a comprehensive Big
Data platform that could keep up
with analytics requirements
 Reduce costs by reducing
inventory
 Client used Informatica for ETL,
generally, and planned to extend use
to the Big Data effort. They asked
Informatica to improve (existing)
Netezza loading performance in
support of their goals and:
– The ETL process broke with a
small sample of jobs
– They switched to an ELT
approach and encountered
technical problems
© 2014 IBM Corporation9
… and learned that ELT only was not sufficient to support Big Data
Integration
IBM Presentation Template Full Version
Approach Outcome
 Leverage a worldwide predictive
solution to anticipate customer
requirements
 Add a Hadoop layer to enrich
predictive models with
unstructured social media data
 Expand existing IBM Netezza
footprint to keep pace with new
data volumes
 Client requested a full-workload
data integration POC with IBM
 Client realized ELT only was not
sufficient for Big Data Integration
(all data integration logic cannot be
pushed into IBM Neteeza or Hadoop)
 Client found InfoSphere Information
Server can often run data integration
faster than either Neteeza or Hadoop
 Client selected InfoSphere
Information Server over Informatica
for Big Data Integration and
InfoSphere BigInsights over Cloudera
© 2014 IBM Corporation10
Plan for Success!
Successfully navigate the big data maze
IBM Presentation Template Full Version
Hadoop is not a Data
Integration platform,
80% of the work is
around Big Data
Integration, and
MapReduce is slow
To move into production
successfully, you need to
plan ahead and make
sure you have accounted
for your Big Data
Integration needs: Hand
coding does not meet
Big Data Integration
scalability, flexibility,
or performance
requirements
Get more information
about Big Data Integration requirements and key
success factors
ELT only is NOT
sufficient to meet
most Big Data
Integration
requirements,
because you cannot
push ALL the data
integration logic into
the data warehouse or
into Hadoop

More Related Content

What's hot (20)

Postgres Vision 2018: The Pragmatic Cloud
Postgres Vision 2018:  The Pragmatic CloudPostgres Vision 2018:  The Pragmatic Cloud
Postgres Vision 2018: The Pragmatic Cloud
EDB
 
Postgres Vision 2018: The Changing Role of the DBA in the Cloud
Postgres Vision 2018: The Changing Role of the DBA in the CloudPostgres Vision 2018: The Changing Role of the DBA in the Cloud
Postgres Vision 2018: The Changing Role of the DBA in the Cloud
EDB
 
PgConf 2018 - Postgres in a World of DevOps
PgConf 2018 - Postgres in a World of DevOpsPgConf 2018 - Postgres in a World of DevOps
PgConf 2018 - Postgres in a World of DevOps
EDB
 
Postgres Vision 2018: Making Modern an Old Legacy System
Postgres Vision 2018: Making Modern an Old Legacy SystemPostgres Vision 2018: Making Modern an Old Legacy System
Postgres Vision 2018: Making Modern an Old Legacy System
EDB
 
Hadoop dev 01
Hadoop dev 01Hadoop dev 01
Hadoop dev 01
Vivian S. Zhang
 
Building the Enterprise Data Lake - Important Considerations Before You Jump In
Building the Enterprise Data Lake - Important Considerations Before You Jump InBuilding the Enterprise Data Lake - Important Considerations Before You Jump In
Building the Enterprise Data Lake - Important Considerations Before You Jump In
SnapLogic
 
Postgres Vision 2018: AI Needs IA
Postgres Vision 2018: AI Needs IAPostgres Vision 2018: AI Needs IA
Postgres Vision 2018: AI Needs IA
EDB
 
Capgemini Insights and Data
Capgemini Insights and Data Capgemini Insights and Data
Capgemini Insights and Data
DataWorks Summit/Hadoop Summit
 
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
SnapLogic
 
On Demand BI
On Demand BIOn Demand BI
On Demand BI
Darren Cunningham
 
Hybrid Cloud Essential for Success
Hybrid Cloud Essential for SuccessHybrid Cloud Essential for Success
Hybrid Cloud Essential for Success
NetApp
 
Using AI-powered Automation for High Performance Data Pipelines in the Cloud
Using AI-powered Automation for High Performance Data Pipelines in the CloudUsing AI-powered Automation for High Performance Data Pipelines in the Cloud
Using AI-powered Automation for High Performance Data Pipelines in the Cloud
DevOps.com
 
Webinar: Hybrid Cloud Integration - Why It's Different and Why It Matters
Webinar: Hybrid Cloud Integration - Why It's Different and Why It MattersWebinar: Hybrid Cloud Integration - Why It's Different and Why It Matters
Webinar: Hybrid Cloud Integration - Why It's Different and Why It Matters
SnapLogic
 
Postgres Vision 2018: Data as the New Oil
Postgres Vision 2018: Data as the New OilPostgres Vision 2018: Data as the New Oil
Postgres Vision 2018: Data as the New Oil
EDB
 
NetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show GuideNetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show Guide
NetAppUK
 
Webinar: The Death of Traditional Data Integration
Webinar: The Death of Traditional Data IntegrationWebinar: The Death of Traditional Data Integration
Webinar: The Death of Traditional Data Integration
SnapLogic
 
O'Reilly ebook: Financial Governance for Data Processing in the Cloud | Qubole
O'Reilly ebook: Financial Governance for Data Processing in the Cloud | QuboleO'Reilly ebook: Financial Governance for Data Processing in the Cloud | Qubole
O'Reilly ebook: Financial Governance for Data Processing in the Cloud | Qubole
Vasu S
 
IBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big DataIBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big Data
IBM Analytics
 
Exploring the Wider World of Big Data- Vasalis Kapsalis
Exploring the Wider World of Big Data- Vasalis KapsalisExploring the Wider World of Big Data- Vasalis Kapsalis
Exploring the Wider World of Big Data- Vasalis Kapsalis
NetAppUK
 
Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies
SnapLogic
 
Postgres Vision 2018: The Pragmatic Cloud
Postgres Vision 2018:  The Pragmatic CloudPostgres Vision 2018:  The Pragmatic Cloud
Postgres Vision 2018: The Pragmatic Cloud
EDB
 
Postgres Vision 2018: The Changing Role of the DBA in the Cloud
Postgres Vision 2018: The Changing Role of the DBA in the CloudPostgres Vision 2018: The Changing Role of the DBA in the Cloud
Postgres Vision 2018: The Changing Role of the DBA in the Cloud
EDB
 
PgConf 2018 - Postgres in a World of DevOps
PgConf 2018 - Postgres in a World of DevOpsPgConf 2018 - Postgres in a World of DevOps
PgConf 2018 - Postgres in a World of DevOps
EDB
 
Postgres Vision 2018: Making Modern an Old Legacy System
Postgres Vision 2018: Making Modern an Old Legacy SystemPostgres Vision 2018: Making Modern an Old Legacy System
Postgres Vision 2018: Making Modern an Old Legacy System
EDB
 
Building the Enterprise Data Lake - Important Considerations Before You Jump In
Building the Enterprise Data Lake - Important Considerations Before You Jump InBuilding the Enterprise Data Lake - Important Considerations Before You Jump In
Building the Enterprise Data Lake - Important Considerations Before You Jump In
SnapLogic
 
Postgres Vision 2018: AI Needs IA
Postgres Vision 2018: AI Needs IAPostgres Vision 2018: AI Needs IA
Postgres Vision 2018: AI Needs IA
EDB
 
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
Webinar: It's the 21st Century - Why Isn't Your Data Integration Loosely Coup...
SnapLogic
 
Hybrid Cloud Essential for Success
Hybrid Cloud Essential for SuccessHybrid Cloud Essential for Success
Hybrid Cloud Essential for Success
NetApp
 
Using AI-powered Automation for High Performance Data Pipelines in the Cloud
Using AI-powered Automation for High Performance Data Pipelines in the CloudUsing AI-powered Automation for High Performance Data Pipelines in the Cloud
Using AI-powered Automation for High Performance Data Pipelines in the Cloud
DevOps.com
 
Webinar: Hybrid Cloud Integration - Why It's Different and Why It Matters
Webinar: Hybrid Cloud Integration - Why It's Different and Why It MattersWebinar: Hybrid Cloud Integration - Why It's Different and Why It Matters
Webinar: Hybrid Cloud Integration - Why It's Different and Why It Matters
SnapLogic
 
Postgres Vision 2018: Data as the New Oil
Postgres Vision 2018: Data as the New OilPostgres Vision 2018: Data as the New Oil
Postgres Vision 2018: Data as the New Oil
EDB
 
NetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show GuideNetApp at Gartner Symposium Show Guide
NetApp at Gartner Symposium Show Guide
NetAppUK
 
Webinar: The Death of Traditional Data Integration
Webinar: The Death of Traditional Data IntegrationWebinar: The Death of Traditional Data Integration
Webinar: The Death of Traditional Data Integration
SnapLogic
 
O'Reilly ebook: Financial Governance for Data Processing in the Cloud | Qubole
O'Reilly ebook: Financial Governance for Data Processing in the Cloud | QuboleO'Reilly ebook: Financial Governance for Data Processing in the Cloud | Qubole
O'Reilly ebook: Financial Governance for Data Processing in the Cloud | Qubole
Vasu S
 
IBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big DataIBM InfoSphere Data Replication for Big Data
IBM InfoSphere Data Replication for Big Data
IBM Analytics
 
Exploring the Wider World of Big Data- Vasalis Kapsalis
Exploring the Wider World of Big Data- Vasalis KapsalisExploring the Wider World of Big Data- Vasalis Kapsalis
Exploring the Wider World of Big Data- Vasalis Kapsalis
NetAppUK
 
Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies Data Warehousing in the Cloud: Practical Migration Strategies
Data Warehousing in the Cloud: Practical Migration Strategies
SnapLogic
 

Similar to Client approaches to successfully navigate through the big data storm (20)

Faster, Cheaper, Easier... and Successful Best Practices for Big Data Integra...
Faster, Cheaper, Easier... and Successful Best Practices for Big Data Integra...Faster, Cheaper, Easier... and Successful Best Practices for Big Data Integra...
Faster, Cheaper, Easier... and Successful Best Practices for Big Data Integra...
DataWorks Summit
 
Entry Points – How to Get Rolling with Big Data Analytics
Entry Points – How to Get Rolling with Big Data AnalyticsEntry Points – How to Get Rolling with Big Data Analytics
Entry Points – How to Get Rolling with Big Data Analytics
Inside Analysis
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
Jane Roberts
 
Big Data Driven Transformations
Big Data Driven TransformationsBig Data Driven Transformations
Big Data Driven Transformations
Piyush Malik
 
IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
Adrian Turcu
 
Big Data – wie aus Daten strategische Resourcen und Ihr Wettbewerbsvorteil we...
Big Data – wie aus Daten strategische Resourcen und Ihr Wettbewerbsvorteil we...Big Data – wie aus Daten strategische Resourcen und Ihr Wettbewerbsvorteil we...
Big Data – wie aus Daten strategische Resourcen und Ihr Wettbewerbsvorteil we...
IBM Switzerland
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
Jeff Kelly
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!
Jeffrey T. Pollock
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
Vikas Manoria
 
Think like your customer
Think like your customerThink like your customer
Think like your customer
Trisha Dutta
 
Think Like Your Customer
Think Like Your CustomerThink Like Your Customer
Think Like Your Customer
IBM Analytics
 
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data:  InterConnect 2016 Session on Getting Started with Big Data AnalyticsBig Data:  InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Cynthia Saracco
 
DMA 2014: 6 Steps to Integrate Your Big Data
DMA 2014: 6 Steps to Integrate Your Big DataDMA 2014: 6 Steps to Integrate Your Big Data
DMA 2014: 6 Steps to Integrate Your Big Data
Sameer Khan
 
New Innovations in Information Management for Big Data - Smarter Business 2013
New Innovations in Information Management for Big Data - Smarter Business 2013New Innovations in Information Management for Big Data - Smarter Business 2013
New Innovations in Information Management for Big Data - Smarter Business 2013
IBM Sverige
 
Informatica PowerCenter
Informatica PowerCenterInformatica PowerCenter
Informatica PowerCenter
Ramy Mahrous
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
Skillwise Group
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
Skillwise Group
 
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Precisely
 
Summer Shorts: Big Data Integration
Summer Shorts: Big Data IntegrationSummer Shorts: Big Data Integration
Summer Shorts: Big Data Integration
ibi
 
Integrate Big Data into Your Organization with Informatica and Perficient
Integrate Big Data into Your Organization with Informatica and PerficientIntegrate Big Data into Your Organization with Informatica and Perficient
Integrate Big Data into Your Organization with Informatica and Perficient
Perficient, Inc.
 
Faster, Cheaper, Easier... and Successful Best Practices for Big Data Integra...
Faster, Cheaper, Easier... and Successful Best Practices for Big Data Integra...Faster, Cheaper, Easier... and Successful Best Practices for Big Data Integra...
Faster, Cheaper, Easier... and Successful Best Practices for Big Data Integra...
DataWorks Summit
 
Entry Points – How to Get Rolling with Big Data Analytics
Entry Points – How to Get Rolling with Big Data AnalyticsEntry Points – How to Get Rolling with Big Data Analytics
Entry Points – How to Get Rolling with Big Data Analytics
Inside Analysis
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
Jane Roberts
 
Big Data Driven Transformations
Big Data Driven TransformationsBig Data Driven Transformations
Big Data Driven Transformations
Piyush Malik
 
IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
Adrian Turcu
 
Big Data – wie aus Daten strategische Resourcen und Ihr Wettbewerbsvorteil we...
Big Data – wie aus Daten strategische Resourcen und Ihr Wettbewerbsvorteil we...Big Data – wie aus Daten strategische Resourcen und Ihr Wettbewerbsvorteil we...
Big Data – wie aus Daten strategische Resourcen und Ihr Wettbewerbsvorteil we...
IBM Switzerland
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
Jeff Kelly
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!
Jeffrey T. Pollock
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
Vikas Manoria
 
Think like your customer
Think like your customerThink like your customer
Think like your customer
Trisha Dutta
 
Think Like Your Customer
Think Like Your CustomerThink Like Your Customer
Think Like Your Customer
IBM Analytics
 
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data:  InterConnect 2016 Session on Getting Started with Big Data AnalyticsBig Data:  InterConnect 2016 Session on Getting Started with Big Data Analytics
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Cynthia Saracco
 
DMA 2014: 6 Steps to Integrate Your Big Data
DMA 2014: 6 Steps to Integrate Your Big DataDMA 2014: 6 Steps to Integrate Your Big Data
DMA 2014: 6 Steps to Integrate Your Big Data
Sameer Khan
 
New Innovations in Information Management for Big Data - Smarter Business 2013
New Innovations in Information Management for Big Data - Smarter Business 2013New Innovations in Information Management for Big Data - Smarter Business 2013
New Innovations in Information Management for Big Data - Smarter Business 2013
IBM Sverige
 
Informatica PowerCenter
Informatica PowerCenterInformatica PowerCenter
Informatica PowerCenter
Ramy Mahrous
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
Skillwise Group
 
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Precisely
 
Summer Shorts: Big Data Integration
Summer Shorts: Big Data IntegrationSummer Shorts: Big Data Integration
Summer Shorts: Big Data Integration
ibi
 
Integrate Big Data into Your Organization with Informatica and Perficient
Integrate Big Data into Your Organization with Informatica and PerficientIntegrate Big Data into Your Organization with Informatica and Perficient
Integrate Big Data into Your Organization with Informatica and Perficient
Perficient, Inc.
 

More from IBM Analytics (20)

Data Lake: A simple introduction
Data Lake: A simple introductionData Lake: A simple introduction
Data Lake: A simple introduction
IBM Analytics
 
10 WealthTech podcasts every wealth advisor should listen to
10 WealthTech podcasts every wealth advisor should listen to10 WealthTech podcasts every wealth advisor should listen to
10 WealthTech podcasts every wealth advisor should listen to
IBM Analytics
 
Advantages of an integrated governance, risk and compliance environment
Advantages of an integrated governance, risk and compliance environmentAdvantages of an integrated governance, risk and compliance environment
Advantages of an integrated governance, risk and compliance environment
IBM Analytics
 
Cognitive banking with expert insights
Cognitive banking with expert insightsCognitive banking with expert insights
Cognitive banking with expert insights
IBM Analytics
 
Sales performance management and C-level goals
Sales performance management and C-level goalsSales performance management and C-level goals
Sales performance management and C-level goals
IBM Analytics
 
The science of client insight: Increase revenue through improved engagement
The science of client insight: Increase revenue through improved engagementThe science of client insight: Increase revenue through improved engagement
The science of client insight: Increase revenue through improved engagement
IBM Analytics
 
Expert opinion on managing data breaches
Expert opinion on managing data breachesExpert opinion on managing data breaches
Expert opinion on managing data breaches
IBM Analytics
 
Top industry use cases for streaming analytics
Top industry use cases for streaming analyticsTop industry use cases for streaming analytics
Top industry use cases for streaming analytics
IBM Analytics
 
Make data simple in the cognitive era
Make data simple in the cognitive eraMake data simple in the cognitive era
Make data simple in the cognitive era
IBM Analytics
 
IBM CDO Fall Summit 2016 Keynote: Driving innovation in the cognitive era
IBM CDO Fall Summit 2016 Keynote: Driving innovation in the cognitive eraIBM CDO Fall Summit 2016 Keynote: Driving innovation in the cognitive era
IBM CDO Fall Summit 2016 Keynote: Driving innovation in the cognitive era
IBM Analytics
 
4 common headaches with sales compensation management
4 common headaches with sales compensation management4 common headaches with sales compensation management
4 common headaches with sales compensation management
IBM Analytics
 
IBM Virtual Finance Forum 2016: Top 10 reasons to attend
IBM Virtual Finance Forum 2016: Top 10 reasons to attendIBM Virtual Finance Forum 2016: Top 10 reasons to attend
IBM Virtual Finance Forum 2016: Top 10 reasons to attend
IBM Analytics
 
Data science tips for data engineers
Data science tips for data engineersData science tips for data engineers
Data science tips for data engineers
IBM Analytics
 
How secure is your enterprise from threats?
How secure is your enterprise from threats? How secure is your enterprise from threats?
How secure is your enterprise from threats?
IBM Analytics
 
10 benefits to thinking inside Box
10 benefits to thinking inside Box10 benefits to thinking inside Box
10 benefits to thinking inside Box
IBM Analytics
 
The digital transformation of the French Open
The digital transformation of the French OpenThe digital transformation of the French Open
The digital transformation of the French Open
IBM Analytics
 
Bridging to a hybrid cloud data services architecture
Bridging to a hybrid cloud data services architectureBridging to a hybrid cloud data services architecture
Bridging to a hybrid cloud data services architecture
IBM Analytics
 
What does data tell you about the customer journey?
What does data tell you about the customer journey?What does data tell you about the customer journey?
What does data tell you about the customer journey?
IBM Analytics
 
What CEOs want from CDOs and how to deliver on it
What CEOs want from CDOs and how to deliver on itWhat CEOs want from CDOs and how to deliver on it
What CEOs want from CDOs and how to deliver on it
IBM Analytics
 
Banking in the age of the empowered consumer
Banking in the age of the empowered consumerBanking in the age of the empowered consumer
Banking in the age of the empowered consumer
IBM Analytics
 
Data Lake: A simple introduction
Data Lake: A simple introductionData Lake: A simple introduction
Data Lake: A simple introduction
IBM Analytics
 
10 WealthTech podcasts every wealth advisor should listen to
10 WealthTech podcasts every wealth advisor should listen to10 WealthTech podcasts every wealth advisor should listen to
10 WealthTech podcasts every wealth advisor should listen to
IBM Analytics
 
Advantages of an integrated governance, risk and compliance environment
Advantages of an integrated governance, risk and compliance environmentAdvantages of an integrated governance, risk and compliance environment
Advantages of an integrated governance, risk and compliance environment
IBM Analytics
 
Cognitive banking with expert insights
Cognitive banking with expert insightsCognitive banking with expert insights
Cognitive banking with expert insights
IBM Analytics
 
Sales performance management and C-level goals
Sales performance management and C-level goalsSales performance management and C-level goals
Sales performance management and C-level goals
IBM Analytics
 
The science of client insight: Increase revenue through improved engagement
The science of client insight: Increase revenue through improved engagementThe science of client insight: Increase revenue through improved engagement
The science of client insight: Increase revenue through improved engagement
IBM Analytics
 
Expert opinion on managing data breaches
Expert opinion on managing data breachesExpert opinion on managing data breaches
Expert opinion on managing data breaches
IBM Analytics
 
Top industry use cases for streaming analytics
Top industry use cases for streaming analyticsTop industry use cases for streaming analytics
Top industry use cases for streaming analytics
IBM Analytics
 
Make data simple in the cognitive era
Make data simple in the cognitive eraMake data simple in the cognitive era
Make data simple in the cognitive era
IBM Analytics
 
IBM CDO Fall Summit 2016 Keynote: Driving innovation in the cognitive era
IBM CDO Fall Summit 2016 Keynote: Driving innovation in the cognitive eraIBM CDO Fall Summit 2016 Keynote: Driving innovation in the cognitive era
IBM CDO Fall Summit 2016 Keynote: Driving innovation in the cognitive era
IBM Analytics
 
4 common headaches with sales compensation management
4 common headaches with sales compensation management4 common headaches with sales compensation management
4 common headaches with sales compensation management
IBM Analytics
 
IBM Virtual Finance Forum 2016: Top 10 reasons to attend
IBM Virtual Finance Forum 2016: Top 10 reasons to attendIBM Virtual Finance Forum 2016: Top 10 reasons to attend
IBM Virtual Finance Forum 2016: Top 10 reasons to attend
IBM Analytics
 
Data science tips for data engineers
Data science tips for data engineersData science tips for data engineers
Data science tips for data engineers
IBM Analytics
 
How secure is your enterprise from threats?
How secure is your enterprise from threats? How secure is your enterprise from threats?
How secure is your enterprise from threats?
IBM Analytics
 
10 benefits to thinking inside Box
10 benefits to thinking inside Box10 benefits to thinking inside Box
10 benefits to thinking inside Box
IBM Analytics
 
The digital transformation of the French Open
The digital transformation of the French OpenThe digital transformation of the French Open
The digital transformation of the French Open
IBM Analytics
 
Bridging to a hybrid cloud data services architecture
Bridging to a hybrid cloud data services architectureBridging to a hybrid cloud data services architecture
Bridging to a hybrid cloud data services architecture
IBM Analytics
 
What does data tell you about the customer journey?
What does data tell you about the customer journey?What does data tell you about the customer journey?
What does data tell you about the customer journey?
IBM Analytics
 
What CEOs want from CDOs and how to deliver on it
What CEOs want from CDOs and how to deliver on itWhat CEOs want from CDOs and how to deliver on it
What CEOs want from CDOs and how to deliver on it
IBM Analytics
 
Banking in the age of the empowered consumer
Banking in the age of the empowered consumerBanking in the age of the empowered consumer
Banking in the age of the empowered consumer
IBM Analytics
 

Recently uploaded (20)

From Legacy to Cloud-Native: A Guide to AWS Modernization.pptx
From Legacy to Cloud-Native: A Guide to AWS Modernization.pptxFrom Legacy to Cloud-Native: A Guide to AWS Modernization.pptx
From Legacy to Cloud-Native: A Guide to AWS Modernization.pptx
Mohammad Jomaa
 
Building Agents with LangGraph & Gemini
Building Agents with LangGraph &  GeminiBuilding Agents with LangGraph &  Gemini
Building Agents with LangGraph & Gemini
HusseinMalikMammadli
 
cloudgenesis cloud workshop , gdg on campus mita
cloudgenesis cloud workshop , gdg on campus mitacloudgenesis cloud workshop , gdg on campus mita
cloudgenesis cloud workshop , gdg on campus mita
siyaldhande02
 
With Claude 4, Anthropic redefines AI capabilities, effectively unleashing a ...
With Claude 4, Anthropic redefines AI capabilities, effectively unleashing a ...With Claude 4, Anthropic redefines AI capabilities, effectively unleashing a ...
With Claude 4, Anthropic redefines AI capabilities, effectively unleashing a ...
SOFTTECHHUB
 
The 2025 Digital Adoption Blueprint.pptx
The 2025 Digital Adoption Blueprint.pptxThe 2025 Digital Adoption Blueprint.pptx
The 2025 Digital Adoption Blueprint.pptx
aptyai
 
Marko.js - Unsung Hero of Scalable Web Frameworks (DevDays 2025)
Marko.js - Unsung Hero of Scalable Web Frameworks (DevDays 2025)Marko.js - Unsung Hero of Scalable Web Frameworks (DevDays 2025)
Marko.js - Unsung Hero of Scalable Web Frameworks (DevDays 2025)
Eugene Fidelin
 
Measuring Microsoft 365 Copilot and Gen AI Success
Measuring Microsoft 365 Copilot and Gen AI SuccessMeasuring Microsoft 365 Copilot and Gen AI Success
Measuring Microsoft 365 Copilot and Gen AI Success
Nikki Chapple
 
What’s New in Web3 Development Trends to Watch in 2025.pptx
What’s New in Web3 Development Trends to Watch in 2025.pptxWhat’s New in Web3 Development Trends to Watch in 2025.pptx
What’s New in Web3 Development Trends to Watch in 2025.pptx
Lisa ward
 
Cognitive Chasms - A Typology of GenAI Failure Failure Modes
Cognitive Chasms - A Typology of GenAI Failure Failure ModesCognitive Chasms - A Typology of GenAI Failure Failure Modes
Cognitive Chasms - A Typology of GenAI Failure Failure Modes
Dr. Tathagat Varma
 
AI in Java - MCP in Action, Langchain4J-CDI, SmallRye-LLM, Spring AI
AI in Java - MCP in Action, Langchain4J-CDI, SmallRye-LLM, Spring AIAI in Java - MCP in Action, Langchain4J-CDI, SmallRye-LLM, Spring AI
AI in Java - MCP in Action, Langchain4J-CDI, SmallRye-LLM, Spring AI
Buhake Sindi
 
Fully Open-Source Private Clouds: Freedom, Security, and Control
Fully Open-Source Private Clouds: Freedom, Security, and ControlFully Open-Source Private Clouds: Freedom, Security, and Control
Fully Open-Source Private Clouds: Freedom, Security, and Control
ShapeBlue
 
A Comprehensive Guide on Integrating Monoova Payment Gateway
A Comprehensive Guide on Integrating Monoova Payment GatewayA Comprehensive Guide on Integrating Monoova Payment Gateway
A Comprehensive Guide on Integrating Monoova Payment Gateway
danielle hunter
 
John Carmack’s Slides From His Upper Bound 2025 Talk
John Carmack’s Slides From His Upper Bound 2025 TalkJohn Carmack’s Slides From His Upper Bound 2025 Talk
John Carmack’s Slides From His Upper Bound 2025 Talk
Razin Mustafiz
 
Talk: On an adventure into the depths of Maven - Kaya Weers
Talk: On an adventure into the depths of Maven - Kaya WeersTalk: On an adventure into the depths of Maven - Kaya Weers
Talk: On an adventure into the depths of Maven - Kaya Weers
Kaya Weers
 
"AI in the browser: predicting user actions in real time with TensorflowJS", ...
"AI in the browser: predicting user actions in real time with TensorflowJS", ..."AI in the browser: predicting user actions in real time with TensorflowJS", ...
"AI in the browser: predicting user actions in real time with TensorflowJS", ...
Fwdays
 
MCP Dev Summit - Pragmatic Scaling of Enterprise GenAI with MCP
MCP Dev Summit - Pragmatic Scaling of Enterprise GenAI with MCPMCP Dev Summit - Pragmatic Scaling of Enterprise GenAI with MCP
MCP Dev Summit - Pragmatic Scaling of Enterprise GenAI with MCP
Sambhav Kothari
 
Gihbli AI and Geo sitution |use/misuse of Ai Technology
Gihbli AI and Geo sitution |use/misuse of Ai TechnologyGihbli AI and Geo sitution |use/misuse of Ai Technology
Gihbli AI and Geo sitution |use/misuse of Ai Technology
zainkhurram1111
 
Maxx nft market place new generation nft marketing place
Maxx nft market place new generation nft marketing placeMaxx nft market place new generation nft marketing place
Maxx nft market place new generation nft marketing place
usersalmanrazdelhi
 
Introducing the OSA 3200 SP and OSA 3250 ePRC
Introducing the OSA 3200 SP and OSA 3250 ePRCIntroducing the OSA 3200 SP and OSA 3250 ePRC
Introducing the OSA 3200 SP and OSA 3250 ePRC
Adtran
 
Droidal: AI Agents Revolutionizing Healthcare
Droidal: AI Agents Revolutionizing HealthcareDroidal: AI Agents Revolutionizing Healthcare
Droidal: AI Agents Revolutionizing Healthcare
Droidal LLC
 
From Legacy to Cloud-Native: A Guide to AWS Modernization.pptx
From Legacy to Cloud-Native: A Guide to AWS Modernization.pptxFrom Legacy to Cloud-Native: A Guide to AWS Modernization.pptx
From Legacy to Cloud-Native: A Guide to AWS Modernization.pptx
Mohammad Jomaa
 
Building Agents with LangGraph & Gemini
Building Agents with LangGraph &  GeminiBuilding Agents with LangGraph &  Gemini
Building Agents with LangGraph & Gemini
HusseinMalikMammadli
 
cloudgenesis cloud workshop , gdg on campus mita
cloudgenesis cloud workshop , gdg on campus mitacloudgenesis cloud workshop , gdg on campus mita
cloudgenesis cloud workshop , gdg on campus mita
siyaldhande02
 
With Claude 4, Anthropic redefines AI capabilities, effectively unleashing a ...
With Claude 4, Anthropic redefines AI capabilities, effectively unleashing a ...With Claude 4, Anthropic redefines AI capabilities, effectively unleashing a ...
With Claude 4, Anthropic redefines AI capabilities, effectively unleashing a ...
SOFTTECHHUB
 
The 2025 Digital Adoption Blueprint.pptx
The 2025 Digital Adoption Blueprint.pptxThe 2025 Digital Adoption Blueprint.pptx
The 2025 Digital Adoption Blueprint.pptx
aptyai
 
Marko.js - Unsung Hero of Scalable Web Frameworks (DevDays 2025)
Marko.js - Unsung Hero of Scalable Web Frameworks (DevDays 2025)Marko.js - Unsung Hero of Scalable Web Frameworks (DevDays 2025)
Marko.js - Unsung Hero of Scalable Web Frameworks (DevDays 2025)
Eugene Fidelin
 
Measuring Microsoft 365 Copilot and Gen AI Success
Measuring Microsoft 365 Copilot and Gen AI SuccessMeasuring Microsoft 365 Copilot and Gen AI Success
Measuring Microsoft 365 Copilot and Gen AI Success
Nikki Chapple
 
What’s New in Web3 Development Trends to Watch in 2025.pptx
What’s New in Web3 Development Trends to Watch in 2025.pptxWhat’s New in Web3 Development Trends to Watch in 2025.pptx
What’s New in Web3 Development Trends to Watch in 2025.pptx
Lisa ward
 
Cognitive Chasms - A Typology of GenAI Failure Failure Modes
Cognitive Chasms - A Typology of GenAI Failure Failure ModesCognitive Chasms - A Typology of GenAI Failure Failure Modes
Cognitive Chasms - A Typology of GenAI Failure Failure Modes
Dr. Tathagat Varma
 
AI in Java - MCP in Action, Langchain4J-CDI, SmallRye-LLM, Spring AI
AI in Java - MCP in Action, Langchain4J-CDI, SmallRye-LLM, Spring AIAI in Java - MCP in Action, Langchain4J-CDI, SmallRye-LLM, Spring AI
AI in Java - MCP in Action, Langchain4J-CDI, SmallRye-LLM, Spring AI
Buhake Sindi
 
Fully Open-Source Private Clouds: Freedom, Security, and Control
Fully Open-Source Private Clouds: Freedom, Security, and ControlFully Open-Source Private Clouds: Freedom, Security, and Control
Fully Open-Source Private Clouds: Freedom, Security, and Control
ShapeBlue
 
A Comprehensive Guide on Integrating Monoova Payment Gateway
A Comprehensive Guide on Integrating Monoova Payment GatewayA Comprehensive Guide on Integrating Monoova Payment Gateway
A Comprehensive Guide on Integrating Monoova Payment Gateway
danielle hunter
 
John Carmack’s Slides From His Upper Bound 2025 Talk
John Carmack’s Slides From His Upper Bound 2025 TalkJohn Carmack’s Slides From His Upper Bound 2025 Talk
John Carmack’s Slides From His Upper Bound 2025 Talk
Razin Mustafiz
 
Talk: On an adventure into the depths of Maven - Kaya Weers
Talk: On an adventure into the depths of Maven - Kaya WeersTalk: On an adventure into the depths of Maven - Kaya Weers
Talk: On an adventure into the depths of Maven - Kaya Weers
Kaya Weers
 
"AI in the browser: predicting user actions in real time with TensorflowJS", ...
"AI in the browser: predicting user actions in real time with TensorflowJS", ..."AI in the browser: predicting user actions in real time with TensorflowJS", ...
"AI in the browser: predicting user actions in real time with TensorflowJS", ...
Fwdays
 
MCP Dev Summit - Pragmatic Scaling of Enterprise GenAI with MCP
MCP Dev Summit - Pragmatic Scaling of Enterprise GenAI with MCPMCP Dev Summit - Pragmatic Scaling of Enterprise GenAI with MCP
MCP Dev Summit - Pragmatic Scaling of Enterprise GenAI with MCP
Sambhav Kothari
 
Gihbli AI and Geo sitution |use/misuse of Ai Technology
Gihbli AI and Geo sitution |use/misuse of Ai TechnologyGihbli AI and Geo sitution |use/misuse of Ai Technology
Gihbli AI and Geo sitution |use/misuse of Ai Technology
zainkhurram1111
 
Maxx nft market place new generation nft marketing place
Maxx nft market place new generation nft marketing placeMaxx nft market place new generation nft marketing place
Maxx nft market place new generation nft marketing place
usersalmanrazdelhi
 
Introducing the OSA 3200 SP and OSA 3250 ePRC
Introducing the OSA 3200 SP and OSA 3250 ePRCIntroducing the OSA 3200 SP and OSA 3250 ePRC
Introducing the OSA 3200 SP and OSA 3250 ePRC
Adtran
 
Droidal: AI Agents Revolutionizing Healthcare
Droidal: AI Agents Revolutionizing HealthcareDroidal: AI Agents Revolutionizing Healthcare
Droidal: AI Agents Revolutionizing Healthcare
Droidal LLC
 

Client approaches to successfully navigate through the big data storm

  • 1. © 2014 IBM Corporation Client Approaches to Successfully Navigate through the Big Data Storm June 2014
  • 2. © 2014 IBM Corporation2 Does Your Big Data Project Look Like This? IBM Presentation Template Full Version You need cost predictability, together with a solution that can quickly take you places!  Hadoop is a fascinating, exciting engine. However, it is:  Ungoverned  All custom, all the time  Requires expensive, constantly changing skills  Includes no concept of quality, governance or lineage And, MapReduce was originally designed for finely grained fault tolerance, which makes it slow for big data integration processing Hadoop is just not a solution for big data integration
  • 3. © 2014 IBM Corporation3 If so, that’s because 80% of the development work for a big data project is to address Big Data Integration challenges IBM Presentation Template Full Version “By most accounts, 80 percent of the development effort in a big data project goes into data integration and only 20 percent goes towards data analysis.” Intel Corporation: Extract, Transform, and Load Big Data With Apache Hadoop (White Paper) Most Hadoop initiatives end up achieving garbage in, garbage out faster, against larger data volumes and:  MapReduce was not designed to accommodate the processing all the logic necessary for big data integration  Teams forget that Hadoop initiatives require: collecting, moving, transforming, cleansing, integrating, exploring & analyzing volumes of disparate data (of various types, from various sources) --- AKA Data Integration To succeed, you need Data Integration capabilities that create consumable data by:  Collecting, moving, transforming, cleansing, governing, integrating, exploring & analyzing volumes of disparate data  Providing simplicity, speed, scalability and reduced risk
  • 4. © 2014 IBM Corporation4 A large US Bank needed to reduce total cost of ownership … IBM Presentation Template Full Version Business Problem Challenges  Primary: Reduce Teradata total cost of ownership  Secondary: Allow for new analytic exploration & asset optimization  Create a Data Distribution Hub / Big Data platform to cut costs  Move front-end processing from Teradata to the Data Distrubion Hub  Needed to offload ELT workload in a cost-effective, efficient way
  • 5. © 2014 IBM Corporation5 … and successfully offloaded ELT workloads to reduce costs IBM Presentation Template Full Version Approach Outcome  Reduce costs by offloading ELT workloads from Teradata to a Big Data platform  Leverage existing InfoSphere Information Server data integration skills and assets (jobs)  Hand coding: Client would not consider hand coding for data integration capabilities  Client decides to deploy IBM PureData for Hadoop  Client uses InfoSphere Information Server as their single scalable & flexible Big Data Integration solution  Client successfully migrated their Teradata ELT and now uses InfoSphere Information Server to exploit the lower cost of running data integration on Hadoop
  • 6. © 2014 IBM Corporation6 A government entity anticipated the need to support 10x increase in incoming data volumes over 3-5 years … IBM Presentation Template Full Version Business Problem Project Challenges  This Master Data Management (MDM) client compares frequently updated records to identify potential national security threats. They needed to: – Support a 10X increase in incoming data volumes (in the next 3-5 years) – Reduce high software and hardware costs  Create a solution that could support scalable probabilistic matching for up to 10X data growth  Modernize ETL practices and remove bottlenecks
  • 7. © 2014 IBM Corporation7 … and replaced an expensive and failing hand-coding approach with a massively scalable Big Data Integration solution IBM Presentation Template Full Version Approach Outcome  Eliminate hand coding for data integration to significantly reduce software costs  Deploy a data integration solution that can scale fast enough to feed the MDM system  Reduce high costs of ELT running in their database  Removed hand coding & replaced it with InfoSphere InfoSphere Information Server for massively scalable data integration processing  Stopped running ELT in the database, leveraging Hadoop instead  Client purchased an end-to-end Big Data solution from IBM – across MDM, Hadoop, and Information Integration areas
  • 8. © 2014 IBM Corporation8 A large European telco wants to leverage big data to increase revenue and customer satisfaction … IBM Presentation Template Full Version Business Problem Project Challenges  Increase revenue & customer satisfaction by analyzing usage patterns of mobile devices to match user demand  Needed a comprehensive Big Data platform that could keep up with analytics requirements  Reduce costs by reducing inventory  Client used Informatica for ETL, generally, and planned to extend use to the Big Data effort. They asked Informatica to improve (existing) Netezza loading performance in support of their goals and: – The ETL process broke with a small sample of jobs – They switched to an ELT approach and encountered technical problems
  • 9. © 2014 IBM Corporation9 … and learned that ELT only was not sufficient to support Big Data Integration IBM Presentation Template Full Version Approach Outcome  Leverage a worldwide predictive solution to anticipate customer requirements  Add a Hadoop layer to enrich predictive models with unstructured social media data  Expand existing IBM Netezza footprint to keep pace with new data volumes  Client requested a full-workload data integration POC with IBM  Client realized ELT only was not sufficient for Big Data Integration (all data integration logic cannot be pushed into IBM Neteeza or Hadoop)  Client found InfoSphere Information Server can often run data integration faster than either Neteeza or Hadoop  Client selected InfoSphere Information Server over Informatica for Big Data Integration and InfoSphere BigInsights over Cloudera
  • 10. © 2014 IBM Corporation10 Plan for Success! Successfully navigate the big data maze IBM Presentation Template Full Version Hadoop is not a Data Integration platform, 80% of the work is around Big Data Integration, and MapReduce is slow To move into production successfully, you need to plan ahead and make sure you have accounted for your Big Data Integration needs: Hand coding does not meet Big Data Integration scalability, flexibility, or performance requirements Get more information about Big Data Integration requirements and key success factors ELT only is NOT sufficient to meet most Big Data Integration requirements, because you cannot push ALL the data integration logic into the data warehouse or into Hadoop