© Hortonworks Inc. 2011 – 2014. All Rights Reserved
Q&A box is available for your questions
Webinar will be recorded for future viewing
Thank you for joining!
We’ll get started soon…
Customer Analytics and Risk Management
in Financial Services
We do Hadoop.
Your speakers…
Mark Lochbihler, Partner Solutions Engineer
Hortonworks
@MarkLochbihler
Bob Welshmer, Technical Director,
Strategic Accounts
Platfora
@BDubya22
Our Mission:
Power your Modern Data Architecture
with HDP and Enterprise Apache Hadoop
Customer Momentum
•  300+ customers in seven quarters, growing at 75+/quarter
•  Two thirds of customers come from F1000
Hortonworks and Hadoop at Scale
•  HDP in production on largest clusters on planet
•  Multiple +1000 node clusters, including 35,000 nodes at
Yahoo!, 800 nodes at Spotify
•  Founded in 2011
•  Original 24 architects, developers,
operators of Hadoop from Yahoo!
•  We are leaders in Hadoop community
•  500+ employees
The Forrester Wave™
Big Data Hadoop Solutions
Q1 2014
“Hortonworks loves and lives
open source innovation”
World Class Support and Services.
Hortonworks' Customer Support received a
maximum score and was significantly higher
than both Cloudera and MapR
A Leader in Hadoop
Hortonworks Approach
1 Innovate the Core
Architect and build innovation at the core of Hadoop
•  YARN: Data Operating System
•  HDFS as the storage layer
•  Key processing engines
2 Extend Hadoop as an Enterprise Data Platform
Extend Hadoop with enterprise capabilities for governance, security & operations
Apply enterprise software rigor to the open source development process
3 Enable the Ecosystem
Enable the leaders in the data center to easily adopt & extend their platforms
•  Establish Hadoop as standard component of a modern data architecture
•  Joint engineering
Diagram: HDP 2.2
YARN: Data Operating System
Script: Pig
SQL: Hive/Tez, HCatalog
NoSQL: HBase, Accumulo
Stream: Storm
In-Memory: Spark
Batch: MapReduce
HDFS (Hadoop Distributed File System)
Layers: Governance & Integration, Security, Operations, Data Access, Data Management
Enabling a Modern Data Architecture
with HDP and Apache Hadoop
Hortonworks. We do Hadoop.
APPLICATIONS
DATA SYSTEM
Business
Analytics
Custom
Applications
Packaged
Applications
Traditional systems under pressure
•  Silos of Data
•  Costly to Scale
•  Constrained Schemas
Clickstream
Geolocation
Sentiment, Web Data
Sensor, Machine Data
Unstructured docs, emails
Server logs
SOURCES
Existing Sources
(CRM, ERP,…)
RDBMS EDW MPP
New Data Types
…and difficult to
manage new data
HDP2 and YARN enable the Modern Data Architecture
Hortonworks architected and led development of YARN
Common data set, multiple applications
•  Optionally land all data in a single cluster
•  Batch, interactive & real-time use cases
•  Support multi-tenant access, processing
& segmentation of data
YARN: Architectural center of Hadoop
•  Consistent security, governance & operations
•  Ecosystem applications certified by Hortonworks to run natively in Hadoop
SOURCES
Existing Systems
Clickstream
Web & Social
Geolocation
Sensor & Machine
Server Logs
Unstructured
APPLICATIONS
DATA SYSTEM
Business
Analytics
Custom
Applications
Packaged
Applications
RDBMS EDW MPP
YARN: Data Operating System
HDFS (Hadoop Distributed File System)
Batch, Interactive, Real-Time
1. Unlock New Applications from New Types of Data
INDUSTRY / USE CASE
Data types: Sentiment & Web; Clickstream & Behavior; Machine & Sensor; Geographic; Server Logs; Structured & Unstructured
Financial Services
New Account Risk Screens ✔ ✔
Trading Risk ✔
Insurance Underwriting ✔ ✔ ✔
Telecom
Call Detail Records (CDR) ✔ ✔
Infrastructure Investment ✔ ✔
Real-time Bandwidth Allocation ✔ ✔ ✔
Retail
360° View of the Customer ✔ ✔ ✔
Localized, Personalized Promotions ✔
Website Optimization ✔
Manufacturing
Supply Chain and Logistics ✔
Assembly Line Quality Assurance ✔
Crowd-sourced Quality Assurance ✔
Healthcare
Use Genomic Data in Medical Trials ✔ ✔ ✔
Monitor Patient Vitals in Real-Time ✔ ✔
Pharmaceuticals
Recruit and Retain Patients for Drug Trials ✔ ✔
Improve Prescription Adherence ✔ ✔ ✔ ✔
Oil & Gas
Unify Exploration & Production Data ✔ ✔ ✔ ✔
Monitor Rig Safety in Real-Time ✔ ✔ ✔
Government
ETL Offload/Federal Budgetary Pressures ✔ ✔
Sentiment Analysis for Government Programs ✔
…to shift from reactive to proactive interactions
HDP and Hadoop allow
organizations to shift
interactions from…
Reactive
Post Transaction
Proactive
Pre Decision
From static branding… to Real-time Personalization
From break then fix… to repair before break
From mass treatment… to Designer Medicine
From Educated Investing… to Automated Algorithms
From mass branding… to 1x1 Targeting
A shift in Advertising
A shift in Financial Services
A shift in Healthcare
A shift in Retail
A shift in Telco
2. Or to realize a dramatic cost savings…
EDW Optimization
Current EDW workload: Operations 50%, Analytics 20%, ETL Process 30%
EDW workload augmented with Hadoop: Operations 50%, Analytics 50%
Current Reality
EDW at capacity: some usage
from low value workloads
Older data archived, unavailable
for ongoing exploration
Source data often discarded
Augment w/ Hadoop
Free up EDW resources from
low value tasks
Keep 100% of source data and historical
data for ongoing exploration
Mine data for value after loading it
because of schema-on-read
Hadoop
Parse, Cleanse
Apply Structure, Transform
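The schema-on-read idea above can be illustrated with a small sketch. This is not Hive or any HDP API: the sample log lines, the field names in SCHEMA, and both helper functions are hypothetical, chosen only to show raw data being kept as-is while parsing, cleansing, and structure are applied at read time.

```python
import csv
import io

# Raw events are stored exactly as they arrived (hypothetical sample data).
RAW_LINES = [
    "2014-11-03T09:15:00,acct-17,login,",
    "2014-11-03T09:16:30,acct-17,trade,2500.00",
    "malformed line without enough fields",
    "2014-11-03T09:20:10,acct-42,trade,130.50",
]

# The schema lives with the reader, not with the stored data (assumed field names).
SCHEMA = ("timestamp", "account", "event", "amount")


def read_with_schema(lines, schema):
    """Yield dict records, applying the schema only at read time."""
    for row in csv.reader(io.StringIO("\n".join(lines))):
        if len(row) != len(schema):
            continue  # cleanse: skip rows that do not fit this schema
        record = dict(zip(schema, row))
        # transform: cast the amount, treating an empty field as missing
        record["amount"] = float(record["amount"]) if record["amount"] else None
        yield record


def total_traded(lines):
    """Sum trade amounts per account, reading straight from the raw lines."""
    totals = {}
    for rec in read_with_schema(lines, SCHEMA):
        if rec["event"] == "trade":
            totals[rec["account"]] = totals.get(rec["account"], 0.0) + rec["amount"]
    return totals


if __name__ == "__main__":
    print(total_traded(RAW_LINES))
```

Because the raw lines are never rewritten, a different question later only needs a different reader or schema, not a reload of the source data.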
Chart: Fully-loaded Cost Per Raw TB of Data (Min–Max Cost)
Options compared: MPP, SAN, Engineered System, NAS, Hadoop, Cloud Storage
Cost axis: $0 to $180,000
Commodity Compute & Storage
Hadoop Enables Scalable Compute & Storage at a Compelling Cost Structure
Storage Costs/Compute Costs from $19/GB to $0.23/GB
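As a quick check on what the two per-GB figures imply, the arithmetic can be written out. The two prices are taken from the slide; the function names are illustrative, and the slide does not say which systems or cost components each figure covers, so this is only a ratio of the quoted numbers.

```python
# Figures quoted on the slide, in US dollars per GB.
COST_BEFORE_PER_GB = 19.00
COST_AFTER_PER_GB = 0.23


def cost_ratio(before, after):
    """How many times lower the new per-GB cost is than the old one."""
    return before / after


def savings_fraction(before, after):
    """Fraction of the old per-GB cost that is no longer spent."""
    return (before - after) / before


if __name__ == "__main__":
    ratio = cost_ratio(COST_BEFORE_PER_GB, COST_AFTER_PER_GB)
    saved = savings_fraction(COST_BEFORE_PER_GB, COST_AFTER_PER_GB)
    print(f"{ratio:.1f}x lower, {saved:.1%} saved per GB")
```

On the quoted figures this works out to a per-GB cost roughly 83 times lower, or a saving of about 98.8% per GB.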
3. Data Lake: An architectural shift
SCALE
SCOPE
Unlocking the Data Lake
RDBMS
MPP
EDW
Data Lake
Enabled by YARN
•  Single data repository,
shared infrastructure
•  Multiple biz apps
accessing all the data
•  Enable a shift from
reactive to proactive
interactions
•  Gain new insight across
the entire enterprise
New Analytic Apps
or IT Optimization
HDP 2.2: Governance & Integration, Security, Operations, Data Access, Data Management, YARN
Banking Data Lake for 100s of Use Cases
Problem
Architecture unsuited to capitalize on server log data
•  A huge investment company generates valuable data assets
•  Current EDW solutions are appropriate for some data workloads but too expensive
for others
•  Financial log data is difficult to aggregate & analyze at scale
•  Short retention hampers price history & performance analysis
•  Limited visibility into cost of acquiring customers
Solution
Multi-tenant Hadoop cluster to merge data across groups
•  Server log data merged with structured data to uncover trends across traders
•  ETL offload saves money for Hadoop-appropriate workloads
•  Longer data retention enables price history analysis
•  Joining data sets for insight into customer acquisition costs
•  Accumulo enforces read permissions on individual data cells
Investment
Services
Global investments company
> $1.5 trillion assets under
management
> $14 billion in revenue
~ 50K employees
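The point about Accumulo enforcing read permissions on individual data cells can be sketched in a few lines. This is a simplified illustration and not the Accumulo client API: each cell carries a visibility expression built from labels, "&", "|", and parentheses, and a reader sees a cell only if their authorizations satisfy it. The labels, the sample cells, and the functions tokenize, is_visible, and scan are all hypothetical.

```python
import re

# A label, or one of the operators & | ( )
TOKEN = re.compile(r"\s*([A-Za-z0-9_.:/-]+|[&|()])")


def tokenize(expression):
    """Split a visibility expression into labels and operators."""
    tokens, pos = [], 0
    expression = expression.strip()
    while pos < len(expression):
        match = TOKEN.match(expression, pos)
        if not match:
            raise ValueError(f"bad character at position {pos}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


def is_visible(expression, authorizations):
    """Return True if the authorizations satisfy the visibility expression."""
    tokens = tokenize(expression)
    if not tokens:
        return True  # an empty expression is readable by everyone
    auths = set(authorizations)
    pos = 0

    def parse_term():
        nonlocal pos
        if pos >= len(tokens):
            raise ValueError("unexpected end of expression")
        tok = tokens[pos]
        pos += 1
        if tok == "(":
            value = parse_expr()
            if pos >= len(tokens) or tokens[pos] != ")":
                raise ValueError("missing closing parenthesis")
            pos += 1
            return value
        if tok in ("&", "|", ")"):
            raise ValueError(f"unexpected token {tok!r}")
        return tok in auths

    def parse_expr():
        nonlocal pos
        value = parse_term()
        operator = None
        while pos < len(tokens) and tokens[pos] in ("&", "|"):
            if operator is None:
                operator = tokens[pos]
            elif tokens[pos] != operator:
                raise ValueError("mixing & and | requires parentheses")
            pos += 1
            right = parse_term()
            value = (value and right) if operator == "&" else (value or right)
        return value

    result = parse_expr()
    if pos != len(tokens):
        raise ValueError("unexpected trailing tokens")
    return result


# Hypothetical cells: (row, column, visibility expression, value)
CELLS = [
    ("trade-1001", "price", "", "101.25"),
    ("trade-1001", "trader", "desk_a|compliance", "jsmith"),
    ("trade-1001", "client", "compliance&(desk_a|desk_b)", "ACME Corp"),
]


def scan(cells, authorizations):
    """Return only the cells the given authorizations are allowed to read."""
    return [(row, col, val) for row, col, vis, val in cells
            if is_visible(vis, authorizations)]
```

For example, scan(CELLS, {"desk_a"}) returns the price and trader cells but not the client cell, while a reader holding both "compliance" and "desk_b" sees all three. In a multi-tenant cluster this lets several groups share one table without each group seeing every column.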
HDP delivers a comprehensive data management platform
Hortonworks Data Platform 2.2
YARN: Data Operating System (Cluster Resource Management)
HDFS (Hadoop Distributed File System)
BATCH, INTERACTIVE & REAL-TIME DATA ACCESS
Script: Pig
SQL: Hive
Java, Scala: Cascading
Stream: Storm
Search: Solr
NoSQL: HBase, Accumulo
In-Memory: Spark
Others: ISV Engines
Frameworks on YARN: Tez, Slider
GOVERNANCE
Data Workflow, Lifecycle & Governance: Falcon, Sqoop, Flume, Kafka, NFS, WebHDFS
SECURITY
Authentication, Authorization, Accounting, Data Protection
Storage: HDFS
Resources: YARN
Access: Hive, …
Pipeline: Falcon
Cluster: Knox
Cluster: Ranger
OPERATIONS
Provision, Manage & Monitor: Ambari, Zookeeper
Scheduling: Oozie
Deployment Choice: Linux, Windows, On-Premises, Cloud
YARN
is the architectural
center of HDP
Enables batch, interactive
and real-time workloads
Provides comprehensive
enterprise capabilities
The widest range of
deployment options
Delivered Completely in the OPEN
Introducing Platfora
LEAD THE INDUSTRY TRANSITION FROM
BUSINESS INTELLIGENCE TO BIG DATA
ANALYTICS.
#1 Big Data Analytics
platform native on
Hadoop
MISSION
End-to-end platform built for
Multi-Structured Data
Self-service, iterative,
interactive, and fast
WORLD CLASS CUSTOMERS
Proven Leader in Big Data Analytics for Hadoop
PROVEN COMPANY TO WATCH
Ones to Watch in Big Data
April 2014: 10 Hot Hadoop Startups to Watch
CRN 10 Coolest Big Data Products of 2013
RAISED $65M BY LEADING INVESTORS
•  Launched 9 product versions with
feature innovations
•  Grew customers by 4x and employees
by 2x
MOMENTUM OVER THE
PAST 12 MONTHS
The Way We Access and Interact with Data has
Changed
Data Warehousing: 1980s Technology that Still Exists Today
1. 1980s: DATA WAREHOUSING 2. EARLY 2000s: HADOOP 3. PLATFORA’S DISRUPTION
3-6 Months
ETL
Personnel Needed
ETL Programmer
RAW DATA
Customer Interactions
Machine Data
Transactions
90% of copy trashed
Data
Warehouse
Expensive $$$
Personnel Needed
Data Warehouse
Architect & Admin
BI Tool
Personnel Needed
BI Architect
& Admin
Business
Analyst
Outcome
Business Analyst gets
the data after 3-6
months with little control.
Rinse and repeat.
The Way We Access and Interact with Data has
Changed
Hadoop: Early 2000s New Data Storage Technology Arrives
RAW
DATA
1. 1980s: DATA WAREHOUSING 2. EARLY 2000s: HADOOP 3. PLATFORA’S DISRUPTION
3-6 Months + +
HADOOP
MapReduce
Hive
Pig
Personnel Needed
Hadoop Expert
Data Scientist
ETL
SQL
on
Hadoop
Or Data Warehouse
Personnel Needed
Data Warehouse
Architect & Admin
BI Tool
Personnel Needed
BI Architect
& Admin
Business
Analyst
Outcome
Business Analyst gets
the data after 3-6
months with little control.
Rinse and repeat.
The Fastest Way to go from Raw Data to Analytics
Platfora is Leading the Transition from Business Intelligence to Big Data
Analytics
RAW
DATA
1. 1980s: DATA WAREHOUSING 2. EARLY 2000s: HADOOP 3. PLATFORA’S DISRUPTION
HADOOP
Minutes
Business Analyst iterates & repeats
No Additional
Personnel Needed
Easily accessible by
Data Admins & Data
Scientists
Business
Analyst
Outcome
Business Analyst gets
the data in minutes,
collaborates with team,
and asks new questions
quickly.
The Only Platform that Offers an End-to-End
Architecture
Business
Analyst
Interactive Big Data
Analytics
Data Preparation
HDFS & Other Data
Sources
RAW DATA &
DATA
CONNECTORS
Transactions Customer
Interactions
Machine
Security
Data Catalog
API/Extensions
A Truly Scalable Platform
MapReduce/ Spark
Lens
Raw data at PB+ scale Accessible data at TB+ scale
HADOOP PLATFORA
Platfora natively leverages the deep processing, scalability, and limitless data
storage of Hadoop and combines it with a scale-out in-memory data
processing engine to make access extremely fast across infinite nodes.
Unlocking Big Data Analytics Solutions
Security Analytics
•  Data Exfiltration Monitoring
•  Advanced Threat Analysis
•  Patch and Version Coverage
Customer Analytics
•  Omni-channel Conversion Analysis
•  Audience Segmentation
•  Behavior Analysis
Internet of Things
•  Consumer Devices and Monitoring
•  Utility Usage Monitoring
•  Product Telemetry
Open Solution Platform
•  No limit to new use cases / verticals
•  Highly leveraged platform services
•  Partners can add custom IP
Detecting Advanced Persistent Cybersecurity
Attacks
“Platfora was built from the ground up for Hadoop.
Other players have been around longer and they are
all trying to shoehorn themselves into the Hadoop
infrastructure. And being architected for Hadoop is
very different from creating a Hadoop connector.
That’s a big differentiator for Platfora.”
Chief Information Security Officer
Multiple Large Financial Organizations
The Business Challenge
•  MicroStrategy and other traditional BI tools
could not handle the volume of data that this
financial org was ingesting
•  Needed to be able to respond dynamically
and instantly to any risk of malicious
in-network activity
The Solution
•  Identified malicious in-network activity that had
stayed under the radar of traditional security
solutions
•  Combined internal, netflow, firewall, IDS,
clickstream, and behavioral datasets for a wider
perspective
•  As a result, security analysts could see patterns
of exfiltration and infiltration, and iterate to
details without IT’s help
Platfora in Financial Services
•  Retail
•  360 degree view of customer
•  Marketing – Identify relevant customers for marketing campaigns (cross-sells and up-sells)
•  Digital Marketing – consolidation of channel analysis
•  Investment Banking
•  Identify common customer investment/product behaviors and build strategies to leverage insights
•  Identify/track trends in stock performance
•  Risk & Fraud
•  Comprehensive view of enterprise risk profile - Financial Risk (Credit, Market, Liquidity) Operational Risk
(Internal Audit, Vendor, Systems, Human Capital)
•  Potential Fraud identification
•  IT Management & Cyber Security
•  Track IT events over time by user directly from logs
•  Analyze threat detection data for anomalies or outliers
•  Malicious email identification and threat resolution
DEMO
Next Steps…
Download the Hortonworks Sandbox
Learn Hadoop
Build Your Analytic App
Try Hadoop 2
More about Platfora & Hortonworks
http://hortonworks.com/partner/platfora/
Contact us: events@hortonworks.com

Hortonworks and Platfora in Financial Services - Webinar

  • 1. © Hortonworks Inc. 2011 – 2014. All Rights Reserved Q&A box is available for your questions Webinar will be recorded for future viewing Thank you for joining! We’ll get started soon…
  • 2. © Hortonworks Inc. 2011 – 2014. All Rights Reserved Customer Analytics and Risk Management in Financial Services We do Hadoop.
  • 3. © Hortonworks Inc. 2011 – 2014. All Rights Reserved Your speakers… Mark Lochbihler, Partner Solutions Engineer Hortonworks @MarkLochbihler Bob Welshmer, Technical Director, Strategic Accounts Platfora @BDubya22
  • 4. © Hortonworks Inc. 2011 – 2014. All Rights Reserved Our Mission: Power your Modern Data Architecture with HDP and Enterprise Apache Hadoop Customer Momentum •  300+ customers in seven quarters, growing at 75+/quarter •  Two thirds of customers come from F1000 Hortonworks and Hadoop at Scale •  HDP in production on largest clusters on planet •  Multiple +1000 node clusters, including 35,000 nodes at Yahoo!, 800 nodes at Spotify •  Founded in 2011 •  Original 24 architects, developers, operators of Hadoop from Yahoo! •  We are leaders in Hadoop community •  500+ employees
  • 5. © Hortonworks Inc. 2011 – 2014. All Rights Reserved The Forrester Wave™ Big Data Hadoop Solutions Q1 2014 “Hortonworks loves and lives open source innovation” World Class Support and Services. Hortonworks' Customer Support received a maximum score and was significantly higher than both Cloudera and MapR A Leader in Hadoop
  • 6. © Hortonworks Inc. 2011 – 2014. All Rights Reserved Hortonworks Approach Innovate the Core1 Architect and build innovation at the core of Hadoop •  YARN: Data Operating System •  HDFS as the storage layer •  Key processing engines Extend Hadoop as an Enterprise Data Platform 2 Enable the Ecosystem3 Extend Hadoop with enterprise capabilities for governance, security & operations Apply enterprise software rigor to the open source development process Enable the leaders in the data center to easily adopt & extend their platforms •  Establish Hadoop as standard component of a modern data architecture •  Joint engineering YARN  :  Data  Opera.ng  System   Script     Pig       SQL     Hive/Tez,   HCatalog       NoSQL     HBase   Accumulo       Stream       Storm         Batch     Map   Reduce       HDFS     (Hadoop  Distributed  File  System)   HDP 2.2 Governance &Integration Security Operations Data Access Data Management YARN Memory     Spark      
  • 7. © Hortonworks Inc. 2011 – 2014. All Rights Reserved Enabling a Modern Data Architecture with HDP and Apache Hadoop Hortonworks. We do Hadoop.
  • 8. © Hortonworks Inc. 2011 – 2014. All Rights Reserved APPLICATIONSDATASYSTEM Business Analytics Custom Applications Packaged Applications Traditional systems under pressure •  Silos of Data •  Costly to Scale •  Constrained Schemas Clickstream Geolocation Sentiment, Web Data Sensor. Machine Data Unstructured docs, emails Server logs SOURCES Existing Sources (CRM, ERP,…) RDBMS EDW MPP New Data Types …and difficult to manage new data
  • 9. © Hortonworks Inc. 2011 – 2014. All Rights Reserved HDP2 and YARN enable the Modern Data Architecture Hortonworks architected and 
 led development of YARN Common data set, multiple applications •  Optionally land all data in a single cluster •  Batch, interactive & real-time use cases •  Support multi-tenant access, processing & segmentation of data YARN: Architectural center of Hadoop •  Consistent security, governance & operations •  Ecosystem applications certified 
 by Hortonworks to run natively in Hadoop SOURCES EXISTING   Systems   Clickstream   Web     &Social   Geoloca.on   Sensor     &  Machine   Server     Logs   Unstructured   APPLICATIONSDATASYSTEM Business Analytics Custom Applications Packaged Applications RDBMS EDW MPP YARN: Data Operating System 1 ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° N HDFS (Hadoop Distributed File System) Interactive Real-TimeBatch
  • 10. © Hortonworks Inc. 2011 – 2014. All Rights Reserved 1. Unlock New Applications from New Types of Data INDUSTRY USE CASE Sentiment & Web Clickstream & Behavior Machine & Sensor Geographic Server Logs Structured & Unstructured Financial Services New Account Risk Screens ✔ ✔ Trading Risk ✔ Insurance Underwriting ✔ ✔ ✔ Telecom Call Detail Records (CDR) ✔ ✔ Infrastructure Investment ✔ ✔ Real-time Bandwidth Allocation ✔ ✔ ✔ Retail 360° View of the Customer ✔ ✔ ✔ Localized, Personalized Promotions ✔ Website Optimization ✔ Manufacturing Supply Chain and Logistics ✔ Assembly Line Quality Assurance ✔ Crowd-sourced Quality Assurance ✔ Healthcare Use Genomic Data in Medial Trials ✔ ✔ ✔ Monitor Patient Vitals in Real-Time ✔ ✔ Pharmaceuticals Recruit and Retain Patients for Drug Trials ✔ ✔ Improve Prescription Adherence ✔ ✔ ✔ ✔ Oil & Gas Unify Exploration & Production Data ✔ ✔ ✔ ✔ Monitor Rig Safety in Real-Time ✔ ✔ ✔ Government ETL Offload/Federal Budgetary Pressures ✔ ✔ Sentiment Analysis for Government Programs ✔
  • 11. © Hortonworks Inc. 2011 – 2014. All Rights Reserved ..to shift from reactive to proactive interactions HDP and Hadoop allow organizations to shift interactions from… Reactive Post Transaction Proactive Pre Decision …to Real-time PersonalizationFrom static branding …to repair before breakFrom break then fix …to Designer MedicineFrom mass treatment …to Automated AlgorithmsFrom Educated Investing …to 1x1 TargetingFrom mass branding A shift in Advertising A shift in Financial Services A shift in Healthcare A shift in Retail A shift in Telco
  • 12. © Hortonworks Inc. 2011 – 2014. All Rights Reserved 2. Or to realize a dramatic cost savings… ✚ EDW Optimization OPERATIONS 50% ANALYTICS 20% ETL PROCESS 30% OPERATIONS 50% ANALYTICS 50% Current Reality EDW at capacity: some usage from low value workloads Older data archived, unavailable for ongoing exploration Source data often discarded Augment w/ Hadoop Free up EDW resources from low value tasks Keep 100% of source data and historical data for ongoing exploration Mine data for value after loading it because of schema-on-read Hadoop Parse, Cleanse Apply Structure, Transform
  • 13. 2. Or to realize dramatic cost savings… EDW Optimization. Current Reality (Operations 50%, Analytics 20%, ETL Process 30%): • EDW at capacity: some usage from low value workloads • Older data archived, unavailable for ongoing exploration • Source data often discarded. Augment w/ Hadoop (Operations 50%, Analytics 50%): • Free up EDW resources from low value tasks • Keep 100% of source data and historical data for ongoing exploration • Mine data for value after loading it because of schema-on-read. Hadoop: Parse, Cleanse, Apply Structure, Transform. Commodity Compute & Storage: Hadoop Enables Scalable Compute & Storage at a Compelling Cost Structure. [Chart: Fully-loaded Cost Per Raw TB of Data (Min–Max Cost), axis $0 to $180,000, comparing MPP, SAN, Engineered System, NAS, Hadoop, and Cloud Storage.] Storage Costs/Compute Costs from $19/GB to $0.23/GB.
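As a rough worked example, the two per-GB figures quoted on the slide above can be compared directly. This sketch takes the $19/GB and $0.23/GB numbers at face value; the `cost_reduction` helper is a hypothetical name used only for illustration.

```python
# Figures quoted on the slide, in dollars per GB.
COST_BEFORE_PER_GB = 19.00
COST_AFTER_PER_GB = 0.23


def cost_reduction(before, after):
    """Return (how many times cheaper, percent saved) for two unit costs."""
    ratio = before / after
    percent_saved = (1 - after / before) * 100
    return ratio, percent_saved


ratio, percent_saved = cost_reduction(COST_BEFORE_PER_GB, COST_AFTER_PER_GB)
print(f"{ratio:.1f}x cheaper, {percent_saved:.1f}% saved per GB")
# prints "82.6x cheaper, 98.8% saved per GB"
```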
  • 14. 3. Data Lake: An architectural shift. Unlocking the Data Lake [diagram axes: Scale, Scope; stages: RDBMS, MPP, EDW, Data Lake]. Data Lake Enabled by YARN: • Single data repository, shared infrastructure • Multiple biz apps accessing all the data • Enable a shift from reactive to proactive interactions • Gain new insight across the entire enterprise. New Analytic Apps or IT Optimization. HDP 2.2: Governance & Integration, Security, Operations, Data Access, Data Management, YARN.
  • 15. Banking Data Lake for 100s of Use Cases. Problem: Architecture unsuited to capitalize on server log data • Huge investments company generates valuable data assets • Current EDW solutions are appropriate for some data workloads but too expensive for others • Financial log data is difficult to aggregate & analyze at scale • Short retention hampers price history & performance analysis • Limited visibility into cost of acquiring customers. Solution: Multi-tenant Hadoop cluster to merge data across groups • Server log data merged with structured data to uncover trends across traders • ETL offload saves money for Hadoop-appropriate workloads • Longer data retention enables price history analysis • Joining data sets for insight into customer acquisition costs • Accumulo enforces read permissions on individual data cells. Investment Services: Global investments company • > $1.5 trillion assets under management • > $14 billion in revenue • ~ 50K employees
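The cell-level read permissions mentioned on the slide above can be pictured with a small sketch. In Accumulo, each cell carries a visibility label (a boolean expression over tokens joined with `&` and `|`), and a scan returns a cell only when the reader's authorizations satisfy that label. The code below is a simplified stand-in written in plain Python, not the Accumulo API; the `is_visible` and `scan` helpers, the label names, and the sample cells are all hypothetical, and the evaluator simply reads operators left to right.

```python
import re


def is_visible(expression, authorizations):
    """Return True if the authorizations satisfy the label expression."""
    if expression == "":
        return True  # an unlabeled cell is visible to everyone
    tokens = re.findall(r"[A-Za-z0-9_]+|[&|()]", expression)
    pos = 0

    def parse_expr():
        nonlocal pos
        value = parse_term()
        # Operators are applied left to right; use parentheses to group.
        while pos < len(tokens) and tokens[pos] in ("&", "|"):
            op = tokens[pos]
            pos += 1
            right = parse_term()
            value = (value and right) if op == "&" else (value or right)
        return value

    def parse_term():
        nonlocal pos
        token = tokens[pos]
        pos += 1
        if token == "(":
            value = parse_expr()
            pos += 1  # skip the closing ")"
            return value
        return token in authorizations

    return parse_expr()


# Hypothetical cells: (row, column, visibility label, value).
CELLS = [
    ("trade-001", "price", "trader", "101.25"),
    ("trade-001", "counterparty", "trader&compliance", "ACME"),
    ("trade-001", "notes", "audit|compliance", "reviewed"),
]


def scan(cells, authorizations):
    """Return only the cells the given authorizations are allowed to read."""
    return [
        (row, col, value)
        for row, col, label, value in cells
        if is_visible(label, authorizations)
    ]


print(scan(CELLS, {"trader"}))
print(scan(CELLS, {"trader", "compliance"}))
```

Two readers scanning the same row see different columns, which is what lets several groups share one multi-tenant cluster without each seeing the other's sensitive fields.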
  • 16. HDP delivers a comprehensive data management platform. [Diagram: Hortonworks Data Platform 2.2] YARN: Data Operating System (Cluster Resource Management); HDFS (Hadoop Distributed File System). Batch, Interactive & Real-Time Data Access: Script (Pig), SQL (Hive) and Java/Scala (Cascading), each on Tez; Stream (Storm); Search (Solr); NoSQL (HBase, Accumulo); Slider; In-Memory (Spark); Others (ISV Engines). Governance: Data Workflow, Lifecycle & Governance (Falcon, Sqoop, Flume, Kafka, NFS, WebHDFS). Security: Authentication, Authorization, Accounting, Data Protection (Storage: HDFS; Resources: YARN; Access: Hive, …; Pipeline: Falcon; Cluster: Knox; Cluster: Ranger). Operations: Provision, Manage & Monitor (Ambari, Zookeeper); Scheduling (Oozie). Deployment Choice: Linux, Windows, On-Premises, Cloud. • YARN is the architectural center of HDP • Enables batch, interactive and real-time workloads • Provides comprehensive enterprise capabilities • The widest range of deployment options • Delivered Completely in the OPEN
  • 17. Introducing Platfora. Mission: Lead the industry transition from business intelligence to big data analytics. • #1 Big Data Analytics platform native on Hadoop • End-to-end platform built for Multi-Structured Data • Self-service, iterative, interactive, and fast
  • 18. Proven Leader in Big Data Analytics for Hadoop. World Class Customers. Proven Company to Watch: Ones to Watch in Big Data, April 2014; 10 Hot Hadoop Startups to Watch; CRN 10 Coolest Big Data Products of 2013. Raised $65M by Leading Investors. Momentum Over the Past 12 Months: • Launched 9 product versions with feature innovations • Grew customers by 4x and employees by 2x
  • 19. The Way We Access and Interact with Data has Changed. Data Warehousing: 1980s Technology that Still Exists Today. Timeline: 1. 1980s: Data Warehousing; 2. Early 2000s: Hadoop; 3. Platfora's Disruption. Pipeline (3-6 Months): Raw Data (Customer Interactions, Machine Data, Transactions), 90% of copy trashed; ETL (Personnel Needed: ETL Programmer); Data Warehouse, Expensive $$$ (Personnel Needed: Data Warehouse Architect & Admin); BI Tool (Personnel Needed: BI Architect & Admin); Business Analyst. Outcome: Business Analyst gets the data after 3-6 months with little control. Rinse and repeat.
  • 20. The Way We Access and Interact with Data has Changed. Hadoop: Early 2000s New Data Storage Technology Arrives. Timeline: 1. 1980s: Data Warehousing; 2. Early 2000s: Hadoop; 3. Platfora's Disruption. Pipeline (3-6 Months): Raw Data; Hadoop with MapReduce, Hive, Pig (Personnel Needed: Hadoop Expert, Data Scientist); ETL; SQL on Hadoop or Data Warehouse (Personnel Needed: Data Warehouse Architect & Admin); BI Tool (Personnel Needed: BI Architect & Admin); Business Analyst. Outcome: Business Analyst gets the data after 3-6 months with little control. Rinse and repeat.
  • 21. The Fastest Way to go from Raw Data to Analytics. Platfora is Leading the Transition from Business Intelligence to Big Data Analytics. Timeline: 1. 1980s: Data Warehousing; 2. Early 2000s: Hadoop; 3. Platfora's Disruption. Pipeline (Minutes): Raw Data; Hadoop; Business Analyst iterates & repeats. No Additional Personnel Needed; easily accessible by Data Admins & Data Scientists. Outcome: Business Analyst gets the data in minutes, collaborates with the team, and asks new questions quickly.
  • 22. The Only Platform that Offers an End-to-End Architecture. Business Analyst; Interactive Big Data Analytics; Data Preparation; HDFS & Other Data Sources; Raw Data & Data Connectors (Transactions, Customer Interactions, Machine). Security; Data Catalog; API/Extensions.
  • 23. A Truly Scalable Platform. Hadoop: raw data at PB+ scale (MapReduce/Spark). Platfora: accessible data at TB+ scale (Lens). Platfora natively leverages the deep processing, scalability, and limitless data storage of Hadoop and combines it with a scale-out in-memory data processing engine to make access extremely fast across infinite nodes.
  • 24. Unlocking Big Data Analytics Solutions. Security Analytics: • Data Exfiltration Monitoring • Advanced Threat Analysis • Patch and Version Coverage. Customer Analytics: • Omni-channel Conversion Analysis • Audience Segmentation • Behavior Analysis. Internet of Things: • Consumer Devices and Monitoring • Utility Usage Monitoring • Product Telemetry. Open Solution Platform: • No limit to new use cases / verticals • Highly leveraged platform services • Partners can add custom IP
  • 25. Detecting Advanced Persistent Cybersecurity Attacks. “Platfora was built from the ground up for Hadoop. Other players have been around longer and they are all trying to shoehorn themselves into the Hadoop infrastructure. And being architected for Hadoop is very different from creating a Hadoop connector. That’s a big differentiator for Platfora.” Chief Information Security Officer, Multiple Large Financial Organizations. The Business Challenge: • MicroStrategy and other traditional BI tools could not handle the volume of data that this financial org was ingesting • Needed to be able to respond dynamically and instantly to any risk of malicious in-network activity. The Solution: • Identified malicious in-network activity that had stayed under the radar of traditional security solutions • Combined internal, netflow, firewall, IDS, clickstream, and behavioral datasets for a wider perspective • As a result, security analysts could see patterns of exfiltration and infiltration, and iterate to details without IT’s help
  • 26. Platfora in Financial Services. Retail: • 360 degree view of customer • Marketing: identify relevant customers for marketing campaigns (cross/up-sells) • Digital Marketing: consolidation of channel analysis. Investment Banking: • Identify common customer investment/product behaviors and build strategies to leverage insights • Identify/track trends in stock performance. Risk & Fraud: • Comprehensive view of enterprise risk profile: Financial Risk (Credit, Market, Liquidity), Operational Risk (Internal Audit, Vendor, Systems, Human Capital) • Potential Fraud identification. IT Management & Cyber Security: • Track IT events over time by user directly from logs • Analyze threat detection data for anomalies or outliers • Malicious email identification and threat resolution
  • 28. Next Steps… Download the Hortonworks Sandbox: • Learn Hadoop • Build Your Analytic App • Try Hadoop 2. More about Platfora & Hortonworks: http://hortonworks.com/partner/platfora/ Contact us: [email protected]