SlideShare a Scribd company logo
Page 1 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
A Comprehensive Approach to Building
Your Big Data Solution
We do Hadoop.
Page 2 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
Speakers
	
 ย โ€ฏ Hortonworks
โ—ฆโ€ฏ Ali Bajwa, Senior Partner Solution Engineer
	
 ย โ€ฏ Red Hat
โ—ฆโ€ฏ Irshad Raihan, Senior Principal, Product Marketing
	
 ย โ€ฏ Cisco
โ—ฆโ€ฏ Ron Graham, Big Data Analytics Engineer
Page 3 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
Partnership
100%	
 ย open	
 ย source	
 ย Hadoop	
 ย Distribu5on,	
 ย 	
 ย 
Support	
 ย and	
 ย Training	
 ย 
	
 ย 
Middleware,	
 ย Storage,	
 ย PaaS,	
 ย IaaS	
 ย 
UCS	
 ย Integrated	
 ย Infrastructure	
 ย 
For	
 ย Big	
 ย Data	
 ย 
CISCO,	
 ย HORTONWORKS	
 ย AND	
 ย RED	
 ย HAT	
 ย ARE	
 ย PARTNERING	
 ย TO	
 ย HELP	
 ย YOU	
 ย 
BUILD	
 ย YOUR	
 ย BIG	
 ย DATA	
 ย SOLUTION	
 ย AND	
 ย REACH	
 ย MASSIVE	
 ย SCALABILITY,	
 ย 
SUPERIOR	
 ย EFFICIENCY	
 ย AND	
 ย DRAMATICALLY	
 ย LOWER	
 ย TOTAL	
 ย COST	
 ย OF	
 ย 
OWNERSHIP	
 ย THANKS	
 ย TO	
 ย A	
 ย VALIDATED	
 ย JOINT	
 ย ARCHITECTURE.
Page 4 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
Traditional systems under pressure
Challenges
โ€ขโ€ฏ Constrains data to app
โ€ขโ€ฏ Canโ€™t manage new data
โ€ขโ€ฏ Costly to Scale
Business Value
Clickstream
Geolocation
Web Data
Internet of Things
Docs, emails
Server logs
2012
2.8 Zettabytes
2020
40 Zettabytes
LAGGARDS
INDUSTRY
LEADERS
1
2 New Data
ERP CRM SCM
New
Traditional
Page 5 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
Modern Data Architecture emerges to unify data & processing
Modern Data Architecture
โ€ขโ€ฏ Enable applications to have access to
all your enterprise data through an
efficient centralized platform
โ€ขโ€ฏ Supported with a centralized
approach governance, security and
operations
โ€ขโ€ฏ Versatile to handle any applications
and datasets no matter the size or
type
Clickstream	
 ย  Web	
 ย 	
 ย 
&	
 ย Social	
 ย 
Geoloca3on	
 ย  Sensor	
 ย 	
 ย 
&	
 ย Machine	
 ย 
Server	
 ย 	
 ย 
Logs	
 ย 
Unstructured	
 ย 
SOURCES
Existing Systems
ERP	
 ย  CRM	
 ย  SCM	
 ย 
ANALYTICS
Data
Marts
Business
Analytics
Visualization
& Dashboards
ANALYTICS
Applications
Business
Analytics
Visualization
& Dashboards
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
HDFS
(Hadoop Distributed File System)
YARN: Data Operating System
Interactive Real-TimeBatch Partner ISVBatch BatchMP
P	
 ย 
EDW	
 ย 
Page 6 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
Hadoop Driver: Cost optimization
Archive Data off EDW
Move rarely used data to Hadoop as active
archive, store more data longer
Offload costly ETL process
Free your EDW to perform high-value functions
like analytics & operations, not ETL
Enrich the value of your EDW
Use Hadoop to refine new data sources, such as
web and machine data for new analytical context
ANALYTICS
Data
Marts
Business
Analytics
Visualization
& Dashboards
HDP helps you reduce costs and optimize the value associated with your EDW
ANALYTICSDATASYSTEMS
Data
Marts
Business
Analytics
Visualization
& Dashboards
HDP 2.2
ELT
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
ยฐ
N
Cold Data,
Deeper Archive
& New Sources
Enterprise
Data
Warehouse
Hot
MPP
In-Memory
Clickstream	
 ย  Web	
 ย 	
 ย 
&	
 ย Social	
 ย 
Geoloca3on	
 ย  Sensor	
 ย 	
 ย 
&	
 ย Machine	
 ย 
Server	
 ย 	
 ย 
Logs	
 ย 
Unstructured	
 ย 
Existing Systems
ERP	
 ย  CRM	
 ย  SCM	
 ย 
SOURCES
Page 7 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
Hadoop Driver: Enabling the data lakeSCALE
SCOPE
Data Lake Definition
โ€ขโ€ฏ Centralized Architecture
Multiple applications on a shared data set
with consistent levels of service
โ€ขโ€ฏ Any App, Any Data
Multiple applications accessing all data
affording new insights and opportunities.
โ€ขโ€ฏ Unlocks โ€˜Systems of Insightโ€™
Advanced algorithms and applications
used to derive new value and optimize
existing value.
Drivers:
1.โ€ฏ Cost Optimization
2.โ€ฏ Advanced Analytic Apps
Goal:
โ€ขโ€ฏ Centralized Architecture
โ€ขโ€ฏ Data-driven Business
DATA
LAKE
Journey to the Data Lake with Hadoop
Systems of Insight
Page 8 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
Only HDP delivers a Centralized Architecture
HDP is uniquely built around YARN serving as a data operating system that provides multi-tenant Resource
Management, consistent Governance & Security and efficient Operations services across Hadoop applications.
Hortonworks Data Platform
YARN
Data Operating System
โ€ขโ€ฏ A centralized architecture of
consistent enterprise
services for resource
management, security,
operations, and
governance.
โ€ขโ€ฏ The versatility to support
multiple applications and
diverse workloads from
batch to interactive to real-
time, open source and
commercial.
Key Benefits
โ€ขโ€ฏ Multiple applications on a
shared data set with consistent
levels of service: a multitenant
data platform.
โ€ขโ€ฏ Provides a shared platform to
enable new analytic
applications.
โ€ขโ€ฏ Delivers maximum cost
efficiency for cluster resource
management. Fewer servers
fewer nodes.
Storage
YARN: Data Operating System
Governance Security
Operations
Resource Management
Existing
Applications
New
Analytics
Partner
Applications
Data Access: Batch, Interactive & Real-time
Page 9 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
HDP delivers a completely open data platform
Hortonworks Data Platform 2.2
Hortonworks Data Platform provides Hadoop for the Enterprise: a centralized architecture
of core enterprise services, for any application and any data.
Completely Open
โ€ขโ€ฏ HDP incorporates every element
required of an enterprise data
platform: data storage, data access,
governance, security, operations
โ€ขโ€ฏ All components are developed in
open source and then rigorously
tested, certified, and delivered as
an integrated open source platform
thatโ€™s easy to consume and use by
the enterprise and ecosystem.
YARN: Data Operating System
(Cluster Resource Management)
1 ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ
ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ
ApachePig
ยฐ ยฐ
ยฐ ยฐ
ยฐ ยฐ ยฐ
ยฐ ยฐ ยฐ
HDFS
(Hadoop Distributed File System)
GOVERNANCE BATCH, INTERACTIVE & REAL-TIME DATA ACCESS
Apache Falcon
ApacheHive
Cascading
ApacheHBase
ApacheAccumulo
ApacheSolr
ApacheSpark
ApacheStorm
Apache Sqoop
Apache Flume
Apache Kafka
SECURITY
Apache Ranger
Apache Knox
Apache Falcon
OPERATIONS
Apache Ambari
Apache
Zookeeper
Apache Oozie
Page 10 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
HDP: Any Data, Any Application, Anywhere
Any Application
โ€ขโ€ฏ Deep integration with ecosystem
partners to extend existing
investments and skills
โ€ขโ€ฏ Broadest set of applications through
the stable of YARN-Ready applications
Any Data
Deploy applications fueled by clickstream, sensor,
social, mobile, geo-location, server log, and other
new paradigm datasets with existing legacy
datasets.
Anywhere
Implement HDP naturally across the
complete range of deployment options
Clickstream	
 ย  Web	
 ย 	
 ย 
&	
 ย Social	
 ย 
Geoloca3on	
 ย  Internet	
 ย of	
 ย 
Things	
 ย 
Server	
 ย 	
 ย 
Logs	
 ย 
Files,	
 ย emails	
 ย ERP	
 ย  CRM	
 ย  SCM	
 ย 
hybrid
commodity appliance cloud
Over 70 Hortonworks Certified YARN Apps
Page 11 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved
Open Source IS the standard for platform technology
Modern platform standards are defined by open communities
For Hadoop, the ASF provides guidelines and
a governance framework and the open
community defines the standards for Hadoop.
Roadmap matches user
requirements not vendor
monetization requirements
Hortonworks Open Source Development Model yields unmatched
efficiency
โ€ขโ€ฏ Infinite number of developers under governance of ASF applied to problem
โ€ขโ€ฏ End users motivated to contribute to Apache Hadoop as they are consumers
โ€ขโ€ฏ IT vendors motivated to align with Apache Hadoop to capture adjacent opportunities
Hortonworks Open Source Business Model de-risks investments
โ€ขโ€ฏ Buying behavior changed: enterprise wants support subscription license
โ€ขโ€ฏ Vendor needs to earn your business, every year is an election year
โ€ขโ€ฏ Equitable balance of power between vendor and consumer
โ€ขโ€ฏ IT vendors want platform technologies to be open source to avoid lock-in
TITLE SLIDE: HEADLINE
Presenter name
Title, Red Hat
Date
Red	
 ย Hat	
 ย Big	
 ย Data	
 ย 
Open	
 ย the	
 ย possibili5es	
 ย of	
 ย your	
 ย data	
 ย 
13
Big	
 ย Data	
 ย innova3on	
 ย cannot	
 ย happen	
 ย in	
 ย a	
 ย bubble	
 ย 
Strong	
 ย partnerships	
 ย with	
 ย industry	
 ย leaders	
 ย and	
 ย open	
 ย source	
 ย communi5es	
 ย 
14
Business	
 ย User	
 ย Architect	
 ย Data	
 ย Center	
 ย Operator	
 ย  App	
 ย Developer	
 ย 
Mul5ple	
 ย Silos.	
 ย Mul5ple	
 ย Views.	
 ย Mul5ple	
 ย Goals.	
 ย 
The	
 ย Old	
 ย Data	
 ย Lifecycle	
 ย 
Manage	
 ย 	
 ย  Build	
 ย 	
 ย  Code	
 ย  Query	
 ย 
15
Business	
 ย User	
 ย 
Architect	
 ย 
Data	
 ย Center	
 ย Operator	
 ย 
App	
 ย Developer	
 ย 
One	
 ย Language.	
 ย One	
 ย View.	
 ย One	
 ย Goal.	
 ย 
The	
 ย New	
 ย Data	
 ย Lifecycle	
 ย 
Ingest	
 ย  Integrate	
 ย 
Act	
 ย  Discover	
 ย 
16
Lack	
 ย of	
 ย agile,	
 ย open,	
 ย and	
 ย cost	
 ย e๏ฌ€ec5ve	
 ย enterprise-ยญโ€grade	
 ย solu5ons	
 ย 
Barriers	
 ย to	
 ย Big	
 ย Data	
 ย Success	
 ย 
I	
 ย want	
 ย more	
 ย than	
 ย 
canned	
 ย BI	
 ย queries	
 ย 
I	
 ย am	
 ย locked	
 ย into	
 ย a	
 ย 
vendor	
 ย stack	
 ย 
I	
 ย want	
 ย to	
 ย use	
 ย my	
 ย favorite	
 ย 
dev	
 ย framework	
 ย 
I	
 ย need	
 ย to	
 ย integrate	
 ย 
data	
 ย across	
 ย silos	
 ย 
Business	
 ย User	
 ย 
Architect	
 ย 
Data	
 ย Center	
 ย Operator	
 ย 
App	
 ย Developer	
 ย 
17
Business	
 ย User	
 ย 
Architect	
 ย 
Data	
 ย Center	
 ย Operator	
 ย 
App	
 ย Developer	
 ย 
Ingest	
 ย 
Integrate	
 ย 
Act	
 ย 
Discover	
 ย 
Big	
 ย Data	
 ย Solu3ons	
 ย from	
 ย Red	
 ย Hat	
 ย 
Integrated	
 ย Big	
 ย Data	
 ย PlaOorm	
 ย 
	
 ย 
Cisco UCS Integrated Infrastructure for Big Data
Hadoop
Compatible
File System
Red Hat
Storage
Hadoop Data Processing
Map/Reduce YARN
Analytics
Operating System
Red Hat Enterprise Linux
Cloud
Red Hat Enterprise Linux
OpenStack Platform
Operating EnvironmentData Integration & Application Development
Application Platform-
as-a-Service
OpenShift by Red Hat
Data Integration and Data
Services
Red Hat JBoss Data
Virtualization
Data Caching
Red Hat JBoss
Data Grid
Business Rules Mgmt
Red Hat JBoss BRMS
Development
Red Hat JBoss
Developer
Studio Hadoop
Distributed
File
System
Management
HortonworksCisco Red Hat
Data Integration
and Data
Services
Composite
Cloud
Cisco OpenStack
Pig Spark Storm
HBase Tez Hive
Cisco Security Suite
CiscoUCSDirectoryExpress
CiscoUnifiedManagement
Ambari
Virtualization
Red Hat Enterprise
Virtualization
Software and Solutions Innovation
Empowering Whatโ€™s Next
Ron Graham
Big Data Analytics Engineer
Hardware Architecture
Cisco UCS with Big Data
20ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Software and Solutions Innovation
Empowering Whatโ€™s Next
Why Cisco UCS for Big Data?
โ€ขโ€ฏ Manageability
โ€ขโ€ฏ Save time with UCS Manager
โ€ขโ€ฏ Enables consistent and rapid
deployments using UCS Service profiles
โ€ขโ€ฏ Offers operational simplification
โ€ขโ€ฏ Delivers a modular solution
โ€ขโ€ฏ Scalability
โ€ขโ€ฏ Performance
SIM Card
Identity for a phone
Service Profile
Identity for a server
21ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Software and Solutions Innovation
Empowering Whatโ€™s Next
โ€ขโ€ฏ End to end provisioning, installation, and
monitoring tool for Hadoop Clusters
โ€ขโ€ฏ Better business outcomes with faster time to
value from Big Data
โ€ขโ€ฏ Provides appliance like experience with out
inflexibilities
โ€ขโ€ฏ Centralized visibility across Hadoop and
physical infrastructure
โ€ขโ€ฏ Powerful interface for further integration into
third party tools and services
UCS Director Express for Big Data
End to end solution for Hadoop
22ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Software and Solutions Innovation
Empowering Whatโ€™s Next
Powering Big Data and Analytics
UCS	
 ย B200	
 ย 
Scale-ยญโ€out	
 ย Analy5cs	
 ย 
Big	
 ย Data	
 ย 
with	
 ย 
EMC	
 ย 
Isilon	
 ย 
and	
 ย VCE	
 ย 
Invicta	
 ย 
(Fast	
 ย 
Data)	
 ย 
UCS	
 ย C240	
 ย 
(Hadoop,	
 ย NoSQL	
 ย 
MPP)	
 ย 
UCS	
 ย Manager,	
 ย Director,	
 ย Express,	
 ย Central,	
 ย Redhat	
 ย 	
 ย 
ACl	
 ย 
C/B460	
 ย (In-ยญโ€
memory	
 ย 
Analy5cs)	
 ย 
UCS	
 ย C3160,	
 ย 
C3260	
 ย 
(Hadoop)	
 ย 
UCS	
 ย C220	
 ย 
(real-ยญโ€5me,	
 ย streaming)	
 ย 
FlexPod	
 ย 
Select	
 ย 
with	
 ย 
NetApp	
 ย 
E-ยญโ€Series	
 ย UCS	
 ย Mini	
 ย (All-ยญโ€in-ยญโ€one	
 ย 
at	
 ย Edge)	
 ย 
UCS	
 ย M-ยญโ€Series	
 ย (Massive	
 ย 
scale-ยญโ€out)	
 ย 
Ac5an,	
 ย DataStax,	
 ย Hortonworks,	
 ย MongoDB,	
 ย Pivotal,SAP,	
 ย SAS,	
 ย Splunk	
 ย 	
 ย 
Cisco,	
 ย Elas5c	
 ย Search,	
 ย IBM,	
 ย Informa5ca,	
 ย MicrosoZ,	
 ย MicroStrategy	
 ย ,	
 ย Oracle,	
 ย SAP,	
 ย 
SAS	
 ย 	
 ย and	
 ย others	
 ย 
Complete	
 ย 
and	
 ย Industry	
 ย 
leading	
 ย 
Por[olio	
 ย 
Ecosystem	
 ย 
Partners	
 ย 
ISV	
 ย Partners	
 ย 
Infrastructure	
 ย 
Management	
 ย 
Data	
 ย Management	
 ย 
Applica5ons	
 ย 
23ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Software and Solutions Innovation
Empowering Whatโ€™s Next
DESIGNS
Big Data
Cisco Validated Designs
for leading big data
platforms can be found
at:
www.cisco.com/go/bigdata
Cisco Validated Designs
Accelerate Deployment
24ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Software and Solutions Innovation
Empowering Whatโ€™s Next
Server 8x UCS C220 M4
CPU 2 x Intel Xeon
E5-2620 v3 (15M
Cache, 2.40 GHz)
Memory 256GB
Storage 8 1.2-TB 10K SAS
SFF HDD
Starter High Performance
Server 8x UCS C220 M4
CPU 2 x Intel Xeon
E5-2680 v3 (30M
Cache, 2.50 GHz)
Memory 384GB
Storage 2 1.2-TB 10K SAS
SFF HDD, 6 400-
GB SAS SSD
Performance Optimized Capacity Optimized Extreme Capacity
Server 16x UCS C240 M4
CPU 2 x Intel Xeon E5-2680
v3 (30M Cache, 2.50
GHz)
Memory 256GB
Storage 2 120-GB SATA SSD,
24 1.2-TB 10K SAS
SFF HDD
Server 16x UCS C240 M4
CPU 2 x Intel Xeon
E5-2620 v3 (15M
Cache, 2.40 GHz)
Memory 128GB
Storage 2 120-GB SATA
SSD. 12 4-TB 7.2K
SAS SFF HDD
Server 2x UCS C3160
CPU 2 x Intel Xeon
E5-2695 v2 (30M
Cache, 2.40 GHz)
Memory 256GB
Storage 2 120-GB SATA
SSD, 60 4-TB 7.2K
SAS SFF HDD
Cisco UCS CPA for Big Data v3
Reference Architecture and Bundles
25ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Software and Solutions Innovation
Empowering Whatโ€™s Next
2x UCS 6296 Series
Fabric Interconnect
UCS Manager
โ€ขโ€ฏ UCS Domain (68 Servers)
โ€ขโ€ฏ Manage by UCS Manager
โ€ขโ€ฏ 2.8 PB of storage
โ€ขโ€ฏ HDP 2.2
โ€ขโ€ฏ Tiered Storage
โ€ขโ€ฏ Tez
โ€ขโ€ฏ RHEL 6.5
โ€ขโ€ฏ Dual 10G Network
โ€ขโ€ฏ 17 Servers Per Rack
UCS C240 M4
2x E5-2680 v3
256GB Memory
Cisco 12Gb/s SAS Raid Controller
2x 120GB STAT SSD
24x 1.2TB 10k SAS
2x Cisco UCS VIC 1227
UCS C3160
2x E5-2695 v2
256GB Memory
Cisco 12Gb/s SAS Raid Controller
2x 120GB SATA SSD
60x 4TB 7.2k SAS SFF
2x Cisco UCS VIC 1227
/ 17 10Gb Ethernet
/ 17 10Gb Ethernet
64 Node Cluster Configuration
26ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Software and Solutions Innovation
Empowering Whatโ€™s Next
UCSD Express
UCS 6200 Series
Fabric Interconnect
UCS Manager
UCS C240 M4 Series
Rack Server
UCS C3160 Rack
Server
Apache Ambari
Unified Management
Programmability, Scalability and Automation
27ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Software and Solutions Innovation
Empowering Whatโ€™s Next
UCS 6200 Series
Fabric Interconnect
UCS C240 M4 Series
Rack Server
UCS C3160 Rack
Server
Data
Data
Data
Cold
n replicas on
Archive
Warm
1 replicas on Disk,
n-1 on Archive
Hot
All (n) replicas on
Disk
Cold
Hot
Policy
Hot - for both storage and compute. The data that
is popular and still being used for processing will
stay in this policy. When a block is hot, all replicas
are stored in DISK.
Warm - partially hot and partially cold. When a
block is warm, some of its replicas are stored in
DISK and the remaining replicas are stored in
ARCHIVE.
Cold - only for storage with limited compute. The
data that is no longer being used, or data that
needs to be archived is moved from hot storage to
cold storage. When a block is cold, all replicas are
stored in ARCHIVE.
Multi-tiered Storage Architecture
Multi-temperature Policy
28ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Software and Solutions Innovation
Empowering Whatโ€™s Next
UCS 6200 Series
Fabric Interconnect
UCS C240 M4 Series
Rack Server
UCS C3160 Rack
Server
Data
Data
Data
Cold
n replicas on
Archive
Warm
1 replicas on Disk,
n-1 on Archive
Hot
All (n) replicas on
Disk
Cold
Hot
Mover โ€“ A new data migration tool
It periodically scans the files in HDFS to
check if the block placement satisfies the
storage policy. For the blocks violating the
storage policy, it moves the replicas to a
different storage type in order to fulfill the
storage policy requirement.
A
C
D
A
C
D
E
A
C
D
E
N
N
N
N
E
Multi-tiered Storage Architecture
Multi-temperature Policy
Page 29 ยฉ Hortonworks Inc. 2011 โ€“ 2015. All Rights Reserved
Next Stepsโ€ฆ
Download the Hortonworks Sandbox
Learn Hadoop
Build Your Analytic App
Try Hadoop
Learn more with our partnerships
http://hortonworks.com/partner/cisco/
http://hortonworks.com/partner/redhat/
Joint CVD bit.ly/Cisco-CVD
30ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
โ€ขโ€ฏ Cisco Live! in San Diego โ€“ June 7 - 11
โ€ขโ€ฏ Hadoop Summit in San Jose โ€“ June 9 โ€“ 11
โ€ขโ€ฏ Red Hat Summit in Boston - June 23-26
More information about Red Hatโ€™s Big Data solutions please visit:
โ€ขโ€ฏ redhat.com/bigdata
โ€ขโ€ฏ redhatstorage.redhat.com/category/big-data
โ€ขโ€ฏ redhat.com/en/insights/big-data
More information about Ciscoโ€™s Big Data and Analytics Offers please visit:
โ€ขโ€ฏ www.cisco.com/go/bigdata and www.cisco.com/go/bigdata_design
โ€ขโ€ฏ http://blogs.cisco.com/author/raghunathnambiar
โ€ขโ€ฏ bit.ly/Cisco-CVD
30
Meet us in person!

More Related Content

What's hot (20)

PDF
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks
ย 
PDF
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Hortonworks
ย 
PPTX
Falcon Meetup
Hortonworks
ย 
PDF
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
Hortonworks
ย 
PPTX
Log Analytics Optimization
Hortonworks
ย 
PDF
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
Hortonworks
ย 
PDF
Predictive Analytics and Machine Learning โ€ฆwith SAS and Apache Hadoop
Hortonworks
ย 
PDF
HDP Advanced Security: Comprehensive Security for Enterprise Hadoop
Hortonworks
ย 
PPTX
Hortonworks Data In Motion Series Part 4
Hortonworks
ย 
PDF
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
Hortonworks
ย 
PDF
Discover.hdp2.2.storm and kafka.final
Hortonworks
ย 
PPTX
Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks
ย 
PPTX
Enabling the Real Time Analytical Enterprise
Hortonworks
ย 
PDF
Discover.hdp2.2.h base.final[2]
Hortonworks
ย 
PDF
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Hortonworks
ย 
PDF
Implementing a Data Lake with Enterprise Grade Data Governance
Hortonworks
ย 
PPTX
YARN Ready: Integrating to YARN with Tez
Hortonworks
ย 
PPTX
Don't Let Security Be The 'Elephant in the Room'
Hortonworks
ย 
PPTX
Spark Summit EMEA - Arun Murthy's Keynote
Hortonworks
ย 
PDF
Discover HDP 2.1: Interactive SQL Query in Hadoop with Apache Hive
Hortonworks
ย 
Hortonworks - What's Possible with a Modern Data Architecture?
Hortonworks
ย 
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Hortonworks
ย 
Falcon Meetup
Hortonworks
ย 
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
Hortonworks
ย 
Log Analytics Optimization
Hortonworks
ย 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
Hortonworks
ย 
Predictive Analytics and Machine Learning โ€ฆwith SAS and Apache Hadoop
Hortonworks
ย 
HDP Advanced Security: Comprehensive Security for Enterprise Hadoop
Hortonworks
ย 
Hortonworks Data In Motion Series Part 4
Hortonworks
ย 
Discover HDP 2.1: Apache Hadoop 2.4.0, YARN & HDFS
Hortonworks
ย 
Discover.hdp2.2.storm and kafka.final
Hortonworks
ย 
Hortonworks Data In Motion Webinar Series Pt. 2
Hortonworks
ย 
Enabling the Real Time Analytical Enterprise
Hortonworks
ย 
Discover.hdp2.2.h base.final[2]
Hortonworks
ย 
Webinar - Accelerating Hadoop Success with Rapid Data Integration for the Mod...
Hortonworks
ย 
Implementing a Data Lake with Enterprise Grade Data Governance
Hortonworks
ย 
YARN Ready: Integrating to YARN with Tez
Hortonworks
ย 
Don't Let Security Be The 'Elephant in the Room'
Hortonworks
ย 
Spark Summit EMEA - Arun Murthy's Keynote
Hortonworks
ย 
Discover HDP 2.1: Interactive SQL Query in Hadoop with Apache Hive
Hortonworks
ย 

Viewers also liked (20)

PDF
Leverage Big Data to Enhance Customer Experience in Telecommunications โ€“ with...
Hortonworks
ย 
PDF
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks
ย 
PPTX
Digital Transformation and Data Protection in Automotive Industry
ร‡ukur & Yฤฑlmaz Law Firm
ย 
PDF
Smarter commerce partner presentation final
Ben Andre Heyerdahl
ย 
PDF
Smarter commerce overview
Harikrishnan M
ย 
PDF
Bringing Big Data Analytics to Network Monitoring
Savvius, Inc
ย 
PDF
Hadoop Summit 2013 : Continuous Integration on top of hadoop
Wisely chen
ย 
PDF
Case study - Automotive DMS Connection to Salesforce.com
Rodney Birch
ย 
PDF
Discover HDP 2.1: Apache Falcon for Data Governance in Hadoop
Hortonworks
ย 
PDF
Qrious about Insights -- Big Data in the Real World
Guy K. Kloss
ย 
PPTX
Leveraging SAP, Hadoop, and Big Data to Redefine Business
DataWorks Summit
ย 
PDF
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Hortonworks
ย 
PDF
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks
ย 
PDF
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Hortonworks
ย 
PDF
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
Hortonworks
ย 
PDF
Hortonworks and Voltage Security webinar
Hortonworks
ย 
PDF
Hortonworks, Novetta and Noble Energy Webinar
Hortonworks
ย 
PDF
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
Hortonworks
ย 
PDF
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
Hortonworks
ย 
PDF
Adoption de Hadoop : des Possibilitรฉs Illimitรฉes - Hortonworks and Talend
Hortonworks
ย 
Leverage Big Data to Enhance Customer Experience in Telecommunications โ€“ with...
Hortonworks
ย 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks
ย 
Digital Transformation and Data Protection in Automotive Industry
ร‡ukur & Yฤฑlmaz Law Firm
ย 
Smarter commerce partner presentation final
Ben Andre Heyerdahl
ย 
Smarter commerce overview
Harikrishnan M
ย 
Bringing Big Data Analytics to Network Monitoring
Savvius, Inc
ย 
Hadoop Summit 2013 : Continuous Integration on top of hadoop
Wisely chen
ย 
Case study - Automotive DMS Connection to Salesforce.com
Rodney Birch
ย 
Discover HDP 2.1: Apache Falcon for Data Governance in Hadoop
Hortonworks
ย 
Qrious about Insights -- Big Data in the Real World
Guy K. Kloss
ย 
Leveraging SAP, Hadoop, and Big Data to Redefine Business
DataWorks Summit
ย 
Rescue your Big Data from Downtime with HP Operations Bridge and Apache Hadoop
Hortonworks
ย 
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks
ย 
Getting to What Matters: Accelerating Your Path Through the Big Data Lifecycl...
Hortonworks
ย 
3 CTOs Discuss the Shift to Next-Gen Analytic Ecosystems
Hortonworks
ย 
Hortonworks and Voltage Security webinar
Hortonworks
ย 
Hortonworks, Novetta and Noble Energy Webinar
Hortonworks
ย 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
Hortonworks
ย 
How to Become an Analytics Ready Insurer - with Informatica and Hortonworks
Hortonworks
ย 
Adoption de Hadoop : des Possibilitรฉs Illimitรฉes - Hortonworks and Talend
Hortonworks
ย 
Ad

Similar to A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks and Red Hat (20)

PPTX
Supporting Financial Services with a More Flexible Approach to Big Data
WANdisco Plc
ย 
PDF
Hortonworks & Bilot Data Driven Transformations with Hadoop
Mats Johansson
ย 
PDF
Meetup oslo hortonworks HDP
Alexander Bakos Leirvรฅg
ย 
PDF
Hortonworks Hadoop @ Oslo Hadoop User Group
Mats Johansson
ย 
PDF
Introduction to Hadoop
POSSCON
ย 
PDF
Storm Demo Talk - Colorado Springs May 2015
Mac Moore
ย 
PDF
Eliminating the Challenges of Big Data Management Inside Hadoop
Hortonworks
ย 
PDF
Eliminating the Challenges of Big Data Management Inside Hadoop
Hortonworks
ย 
PDF
Discover hdp 2.2: Data storage innovations in Hadoop Distributed Filesystem (...
Hortonworks
ย 
PDF
Discover hdp 2.2 hdfs - final
Hortonworks
ย 
PPTX
Yahoo! Hack Europe
Hortonworks
ย 
PDF
Webinar turbo charging_data_science_hawq_on_hdp_final
Hortonworks
ย 
PDF
Webinar turbo charging_data_science_hawq_on_hdp_final
Hortonworks
ย 
PDF
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
Hortonworks
ย 
PDF
IoT Crash Course Hadoop Summit SJ
Daniel Madrigal
ย 
PDF
Solving Big Data Problems using Hortonworks
DataWorks Summit/Hadoop Summit
ย 
PPTX
Mrinal devadas, Hortonworks Making Sense Of Big Data
PatrickCrompton
ย 
PDF
Azure Cafe Marketplace with Hortonworks March 31 2016
Joan Novino
ย 
PDF
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Hortonworks
ย 
PPTX
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
skumpf
ย 
Supporting Financial Services with a More Flexible Approach to Big Data
WANdisco Plc
ย 
Hortonworks & Bilot Data Driven Transformations with Hadoop
Mats Johansson
ย 
Meetup oslo hortonworks HDP
Alexander Bakos Leirvรฅg
ย 
Hortonworks Hadoop @ Oslo Hadoop User Group
Mats Johansson
ย 
Introduction to Hadoop
POSSCON
ย 
Storm Demo Talk - Colorado Springs May 2015
Mac Moore
ย 
Eliminating the Challenges of Big Data Management Inside Hadoop
Hortonworks
ย 
Eliminating the Challenges of Big Data Management Inside Hadoop
Hortonworks
ย 
Discover hdp 2.2: Data storage innovations in Hadoop Distributed Filesystem (...
Hortonworks
ย 
Discover hdp 2.2 hdfs - final
Hortonworks
ย 
Yahoo! Hack Europe
Hortonworks
ย 
Webinar turbo charging_data_science_hawq_on_hdp_final
Hortonworks
ย 
Webinar turbo charging_data_science_hawq_on_hdp_final
Hortonworks
ย 
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
Hortonworks
ย 
IoT Crash Course Hadoop Summit SJ
Daniel Madrigal
ย 
Solving Big Data Problems using Hortonworks
DataWorks Summit/Hadoop Summit
ย 
Mrinal devadas, Hortonworks Making Sense Of Big Data
PatrickCrompton
ย 
Azure Cafe Marketplace with Hortonworks March 31 2016
Joan Novino
ย 
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Hortonworks
ย 
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
skumpf
ย 
Ad

More from Hortonworks (20)

PDF
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks
ย 
PDF
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
Hortonworks
ย 
PDF
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Hortonworks
ย 
PDF
Johns Hopkins - Using Hadoop to Secure Access Log Events
Hortonworks
ย 
PDF
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Hortonworks
ย 
PDF
HDF 3.2 - What's New
Hortonworks
ย 
PPTX
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Hortonworks
ย 
PDF
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Hortonworks
ย 
PDF
IBM+Hortonworks = Transformation of the Big Data Landscape
Hortonworks
ย 
PDF
Premier Inside-Out: Apache Druid
Hortonworks
ย 
PDF
Accelerating Data Science and Real Time Analytics at Scale
Hortonworks
ย 
PDF
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
Hortonworks
ย 
PDF
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Hortonworks
ย 
PDF
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Hortonworks
ย 
PDF
Making Enterprise Big Data Small with Ease
Hortonworks
ย 
PDF
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Hortonworks
ย 
PDF
Driving Digital Transformation Through Global Data Management
Hortonworks
ย 
PPTX
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
Hortonworks
ย 
PDF
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks
ย 
PDF
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Hortonworks
ย 
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks
ย 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
Hortonworks
ย 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Hortonworks
ย 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Hortonworks
ย 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Hortonworks
ย 
HDF 3.2 - What's New
Hortonworks
ย 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Hortonworks
ย 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Hortonworks
ย 
IBM+Hortonworks = Transformation of the Big Data Landscape
Hortonworks
ย 
Premier Inside-Out: Apache Druid
Hortonworks
ย 
Accelerating Data Science and Real Time Analytics at Scale
Hortonworks
ย 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
Hortonworks
ย 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Hortonworks
ย 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Hortonworks
ย 
Making Enterprise Big Data Small with Ease
Hortonworks
ย 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Hortonworks
ย 
Driving Digital Transformation Through Global Data Management
Hortonworks
ย 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
Hortonworks
ย 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks
ย 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Hortonworks
ย 

Recently uploaded (20)

PDF
Understanding the EU Cyber Resilience Act
ICS
ย 
PDF
Instantiations Company Update (ESUG 2025)
ESUG
ย 
PPTX
MiniTool Partition Wizard Crack 12.8 + Serial Key Download Latest [2025]
filmoracrack9001
ย 
PPTX
SAP Public Cloud PPT , SAP PPT, Public Cloud PPT
sonawanekundan2024
ย 
PPTX
Transforming Lending with IntelliGrow โ€“ Advanced Loan Software Solutions
Intelli grow
ย 
PDF
Ready Layer One: Intro to the Model Context Protocol
mmckenna1
ย 
PDF
ESUG 2025: Pharo 13 and Beyond (Stephane Ducasse)
ESUG
ย 
PPTX
Cutting Optimization Pro 5.18.2 Crack With Free Download
cracked shares
ย 
PPT
Brief History of Python by Learning Python in three hours
adanechb21
ย 
PPTX
BB FlashBack Pro 5.61.0.4843 With Crack Free Download
cracked shares
ย 
PDF
How Attendance Management Software is Revolutionizing Education.pdf
Pikmykid
ย 
PDF
Message Level Status (MLS): The Instant Feedback Mechanism for UAE e-Invoicin...
Prachi Desai
ย 
PPTX
TexSender Pro 8.9.1 Crack Full Version Download
cracked shares
ย 
PDF
Windows 10 Professional Preactivated.pdf
asghxhsagxjah
ย 
PPTX
UI5con_2025_Accessibility_Ever_Evolving_
gerganakremenska1
ย 
PDF
Infrastructure planning and resilience - Keith Hastings.pptx.pdf
Safe Software
ย 
PPTX
prodad heroglyph crack 2.0.214.2 Full Free Download
cracked shares
ย 
PDF
How to Download and Install ADT (ABAP Development Tools) for Eclipse IDE | SA...
SAP Vista, an A L T Z E N Company
ย 
PDF
Introduction to Apache Icebergโ„ข & Tableflow
Alluxio, Inc.
ย 
PPTX
API DOCUMENTATION | API INTEGRATION PLATFORM
philipnathen82
ย 
Understanding the EU Cyber Resilience Act
ICS
ย 
Instantiations Company Update (ESUG 2025)
ESUG
ย 
MiniTool Partition Wizard Crack 12.8 + Serial Key Download Latest [2025]
filmoracrack9001
ย 
SAP Public Cloud PPT , SAP PPT, Public Cloud PPT
sonawanekundan2024
ย 
Transforming Lending with IntelliGrow โ€“ Advanced Loan Software Solutions
Intelli grow
ย 
Ready Layer One: Intro to the Model Context Protocol
mmckenna1
ย 
ESUG 2025: Pharo 13 and Beyond (Stephane Ducasse)
ESUG
ย 
Cutting Optimization Pro 5.18.2 Crack With Free Download
cracked shares
ย 
Brief History of Python by Learning Python in three hours
adanechb21
ย 
BB FlashBack Pro 5.61.0.4843 With Crack Free Download
cracked shares
ย 
How Attendance Management Software is Revolutionizing Education.pdf
Pikmykid
ย 
Message Level Status (MLS): The Instant Feedback Mechanism for UAE e-Invoicin...
Prachi Desai
ย 
TexSender Pro 8.9.1 Crack Full Version Download
cracked shares
ย 
Windows 10 Professional Preactivated.pdf
asghxhsagxjah
ย 
UI5con_2025_Accessibility_Ever_Evolving_
gerganakremenska1
ย 
Infrastructure planning and resilience - Keith Hastings.pptx.pdf
Safe Software
ย 
prodad heroglyph crack 2.0.214.2 Full Free Download
cracked shares
ย 
How to Download and Install ADT (ABAP Development Tools) for Eclipse IDE | SA...
SAP Vista, an A L T Z E N Company
ย 
Introduction to Apache Icebergโ„ข & Tableflow
Alluxio, Inc.
ย 
API DOCUMENTATION | API INTEGRATION PLATFORM
philipnathen82
ย 

A Comprehensive Approach to Building your Big Data - with Cisco, Hortonworks and Red Hat

  • 1. Page 1 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved A Comprehensive Approach to Building Your Big Data Solution We do Hadoop.
  • 2. Page 2 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved Speakers ย โ€ฏ Hortonworks โ—ฆโ€ฏ Ali Bajwa, Senior Partner Solution Engineer ย โ€ฏ Red Hat โ—ฆโ€ฏ Irshad Raihan, Senior Principal, Product Marketing ย โ€ฏ Cisco โ—ฆโ€ฏ Ron Graham, Big Data Analytics Engineer
  • 3. Page 3 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved Partnership 100% ย open ย source ย Hadoop ย Distribu5on, ย  ย  Support ย and ย Training ย  ย  Middleware, ย Storage, ย PaaS, ย IaaS ย  UCS ย Integrated ย Infrastructure ย  For ย Big ย Data ย  CISCO, ย HORTONWORKS ย AND ย RED ย HAT ย ARE ย PARTNERING ย TO ย HELP ย YOU ย  BUILD ย YOUR ย BIG ย DATA ย SOLUTION ย AND ย REACH ย MASSIVE ย SCALABILITY, ย  SUPERIOR ย EFFICIENCY ย AND ย DRAMATICALLY ย LOWER ย TOTAL ย COST ย OF ย  OWNERSHIP ย THANKS ย TO ย A ย VALIDATED ย JOINT ย ARCHITECTURE.
  • 4. Page 4 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved Traditional systems under pressure Challenges โ€ขโ€ฏ Constrains data to app โ€ขโ€ฏ Canโ€™t manage new data โ€ขโ€ฏ Costly to Scale Business Value Clickstream Geolocation Web Data Internet of Things Docs, emails Server logs 2012 2.8 Zettabytes 2020 40 Zettabytes LAGGARDS INDUSTRY LEADERS 1 2 New Data ERP CRM SCM New Traditional
  • 5. Page 5 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved Modern Data Architecture emerges to unify data & processing Modern Data Architecture โ€ขโ€ฏ Enable applications to have access to all your enterprise data through an efficient centralized platform โ€ขโ€ฏ Supported with a centralized approach governance, security and operations โ€ขโ€ฏ Versatile to handle any applications and datasets no matter the size or type Clickstream ย  Web ย  ย  & ย Social ย  Geoloca3on ย  Sensor ย  ย  & ย Machine ย  Server ย  ย  Logs ย  Unstructured ย  SOURCES Existing Systems ERP ย  CRM ย  SCM ย  ANALYTICS Data Marts Business Analytics Visualization & Dashboards ANALYTICS Applications Business Analytics Visualization & Dashboards ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ HDFS (Hadoop Distributed File System) YARN: Data Operating System Interactive Real-TimeBatch Partner ISVBatch BatchMP P ย  EDW ย 
  • 6. Page 6 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved Hadoop Driver: Cost optimization Archive Data off EDW Move rarely used data to Hadoop as active archive, store more data longer Offload costly ETL process Free your EDW to perform high-value functions like analytics & operations, not ETL Enrich the value of your EDW Use Hadoop to refine new data sources, such as web and machine data for new analytical context ANALYTICS Data Marts Business Analytics Visualization & Dashboards HDP helps you reduce costs and optimize the value associated with your EDW ANALYTICSDATASYSTEMS Data Marts Business Analytics Visualization & Dashboards HDP 2.2 ELT ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ N Cold Data, Deeper Archive & New Sources Enterprise Data Warehouse Hot MPP In-Memory Clickstream ย  Web ย  ย  & ย Social ย  Geoloca3on ย  Sensor ย  ย  & ย Machine ย  Server ย  ย  Logs ย  Unstructured ย  Existing Systems ERP ย  CRM ย  SCM ย  SOURCES
  • 7. Page 7 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved Hadoop Driver: Enabling the data lakeSCALE SCOPE Data Lake Definition โ€ขโ€ฏ Centralized Architecture Multiple applications on a shared data set with consistent levels of service โ€ขโ€ฏ Any App, Any Data Multiple applications accessing all data affording new insights and opportunities. โ€ขโ€ฏ Unlocks โ€˜Systems of Insightโ€™ Advanced algorithms and applications used to derive new value and optimize existing value. Drivers: 1.โ€ฏ Cost Optimization 2.โ€ฏ Advanced Analytic Apps Goal: โ€ขโ€ฏ Centralized Architecture โ€ขโ€ฏ Data-driven Business DATA LAKE Journey to the Data Lake with Hadoop Systems of Insight
  • 8. Page 8 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved Only HDP delivers a Centralized Architecture HDP is uniquely built around YARN serving as a data operating system that provides multi-tenant Resource Management, consistent Governance & Security and efficient Operations services across Hadoop applications. Hortonworks Data Platform YARN Data Operating System โ€ขโ€ฏ A centralized architecture of consistent enterprise services for resource management, security, operations, and governance. โ€ขโ€ฏ The versatility to support multiple applications and diverse workloads from batch to interactive to real- time, open source and commercial. Key Benefits โ€ขโ€ฏ Multiple applications on a shared data set with consistent levels of service: a multitenant data platform. โ€ขโ€ฏ Provides a shared platform to enable new analytic applications. โ€ขโ€ฏ Delivers maximum cost efficiency for cluster resource management. Fewer servers fewer nodes. Storage YARN: Data Operating System Governance Security Operations Resource Management Existing Applications New Analytics Partner Applications Data Access: Batch, Interactive & Real-time
  • 9. Page 9 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved HDP delivers a completely open data platform Hortonworks Data Platform 2.2 Hortonworks Data Platform provides Hadoop for the Enterprise: a centralized architecture of core enterprise services, for any application and any data. Completely Open โ€ขโ€ฏ HDP incorporates every element required of an enterprise data platform: data storage, data access, governance, security, operations โ€ขโ€ฏ All components are developed in open source and then rigorously tested, certified, and delivered as an integrated open source platform thatโ€™s easy to consume and use by the enterprise and ecosystem. YARN: Data Operating System (Cluster Resource Management) 1 ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ApachePig ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ ยฐ HDFS (Hadoop Distributed File System) GOVERNANCE BATCH, INTERACTIVE & REAL-TIME DATA ACCESS Apache Falcon ApacheHive Cascading ApacheHBase ApacheAccumulo ApacheSolr ApacheSpark ApacheStorm Apache Sqoop Apache Flume Apache Kafka SECURITY Apache Ranger Apache Knox Apache Falcon OPERATIONS Apache Ambari Apache Zookeeper Apache Oozie
  • 10. Page 10 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved HDP: Any Data, Any Application, Anywhere Any Application โ€ขโ€ฏ Deep integration with ecosystem partners to extend existing investments and skills โ€ขโ€ฏ Broadest set of applications through the stable of YARN-Ready applications Any Data Deploy applications fueled by clickstream, sensor, social, mobile, geo-location, server log, and other new paradigm datasets with existing legacy datasets. Anywhere Implement HDP naturally across the complete range of deployment options Clickstream ย  Web ย  ย  & ย Social ย  Geoloca3on ย  Internet ย of ย  Things ย  Server ย  ย  Logs ย  Files, ย emails ย ERP ย  CRM ย  SCM ย  hybrid commodity appliance cloud Over 70 Hortonworks Certified YARN Apps
  • 11. Page 11 ยฉ Hortonworks Inc. 2011 โ€“ 2014. All Rights Reserved Open Source IS the standard for platform technology Modern platform standards are defined by open communities For Hadoop, the ASF provides guidelines and a governance framework and the open community defines the standards for Hadoop. Roadmap matches user requirements not vendor monetization requirements Hortonworks Open Source Development Model yields unmatched efficiency โ€ขโ€ฏ Infinite number of developers under governance of ASF applied to problem โ€ขโ€ฏ End users motivated to contribute to Apache Hadoop as they are consumers โ€ขโ€ฏ IT vendors motivated to align with Apache Hadoop to capture adjacent opportunities Hortonworks Open Source Business Model de-risks investments โ€ขโ€ฏ Buying behavior changed: enterprise wants support subscription license โ€ขโ€ฏ Vendor needs to earn your business, every year is an election year โ€ขโ€ฏ Equitable balance of power between vendor and consumer โ€ขโ€ฏ IT vendors want platform technologies to be open source to avoid lock-in
  • 12. TITLE SLIDE: HEADLINE Presenter name Title, Red Hat Date Red ย Hat ย Big ย Data ย  Open ย the ย possibili5es ย of ย your ย data ย 
  • 13. 13 Big ย Data ย innova3on ย cannot ย happen ย in ย a ย bubble ย  Strong ย partnerships ย with ย industry ย leaders ย and ย open ย source ย communi5es ย 
  • 14. 14 Business ย User ย Architect ย Data ย Center ย Operator ย  App ย Developer ย  Mul5ple ย Silos. ย Mul5ple ย Views. ย Mul5ple ย Goals. ย  The ย Old ย Data ย Lifecycle ย  Manage ย  ย  Build ย  ย  Code ย  Query ย 
  • 15. 15 Business ย User ย  Architect ย  Data ย Center ย Operator ย  App ย Developer ย  One ย Language. ย One ย View. ย One ย Goal. ย  The ย New ย Data ย Lifecycle ย  Ingest ย  Integrate ย  Act ย  Discover ย 
  • 16. 16 Lack ย of ย agile, ย open, ย and ย cost ย e๏ฌ€ec5ve ย enterprise-ยญโ€grade ย solu5ons ย  Barriers ย to ย Big ย Data ย Success ย  I ย want ย more ย than ย  canned ย BI ย queries ย  I ย am ย locked ย into ย a ย  vendor ย stack ย  I ย want ย to ย use ย my ย favorite ย  dev ย framework ย  I ย need ย to ย integrate ย  data ย across ย silos ย  Business ย User ย  Architect ย  Data ย Center ย Operator ย  App ย Developer ย 
  • 17. 17 Business ย User ย  Architect ย  Data ย Center ย Operator ย  App ย Developer ย  Ingest ย  Integrate ย  Act ย  Discover ย  Big ย Data ย Solu3ons ย from ย Red ย Hat ย 
  • 18. Integrated ย Big ย Data ย PlaOorm ย  ย  Cisco UCS Integrated Infrastructure for Big Data Hadoop Compatible File System Red Hat Storage Hadoop Data Processing Map/Reduce YARN Analytics Operating System Red Hat Enterprise Linux Cloud Red Hat Enterprise Linux OpenStack Platform Operating EnvironmentData Integration & Application Development Application Platform- as-a-Service OpenShift by Red Hat Data Integration and Data Services Red Hat JBoss Data Virtualization Data Caching Red Hat JBoss Data Grid Business Rules Mgmt Red Hat JBoss BRMS Development Red Hat JBoss Developer Studio Hadoop Distributed File System Management HortonworksCisco Red Hat Data Integration and Data Services Composite Cloud Cisco OpenStack Pig Spark Storm HBase Tez Hive Cisco Security Suite CiscoUCSDirectoryExpress CiscoUnifiedManagement Ambari Virtualization Red Hat Enterprise Virtualization
  • 19. Software and Solutions Innovation Empowering Whatโ€™s Next Ron Graham Big Data Analytics Engineer Hardware Architecture Cisco UCS with Big Data
  • 20. 20ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Software and Solutions Innovation Empowering Whatโ€™s Next Why Cisco UCS for Big Data? โ€ขโ€ฏ Manageability โ€ขโ€ฏ Save time with UCS Manager โ€ขโ€ฏ Enables consistent and rapid deployments using UCS Service profiles โ€ขโ€ฏ Offers operational simplification โ€ขโ€ฏ Delivers a modular solution โ€ขโ€ฏ Scalability โ€ขโ€ฏ Performance SIM Card Identity for a phone Service Profile Identity for a server
  • 21. 21ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Software and Solutions Innovation Empowering Whatโ€™s Next โ€ขโ€ฏ End to end provisioning, installation, and monitoring tool for Hadoop Clusters โ€ขโ€ฏ Better business outcomes with faster time to value from Big Data โ€ขโ€ฏ Provides appliance like experience with out inflexibilities โ€ขโ€ฏ Centralized visibility across Hadoop and physical infrastructure โ€ขโ€ฏ Powerful interface for further integration into third party tools and services UCS Director Express for Big Data End to end solution for Hadoop
  • 22. 22ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Software and Solutions Innovation Empowering Whatโ€™s Next Powering Big Data and Analytics UCS ย B200 ย  Scale-ยญโ€out ย Analy5cs ย  Big ย Data ย  with ย  EMC ย  Isilon ย  and ย VCE ย  Invicta ย  (Fast ย  Data) ย  UCS ย C240 ย  (Hadoop, ย NoSQL ย  MPP) ย  UCS ย Manager, ย Director, ย Express, ย Central, ย Redhat ย  ย  ACl ย  C/B460 ย (In-ยญโ€ memory ย  Analy5cs) ย  UCS ย C3160, ย  C3260 ย  (Hadoop) ย  UCS ย C220 ย  (real-ยญโ€5me, ย streaming) ย  FlexPod ย  Select ย  with ย  NetApp ย  E-ยญโ€Series ย UCS ย Mini ย (All-ยญโ€in-ยญโ€one ย  at ย Edge) ย  UCS ย M-ยญโ€Series ย (Massive ย  scale-ยญโ€out) ย  Ac5an, ย DataStax, ย Hortonworks, ย MongoDB, ย Pivotal,SAP, ย SAS, ย Splunk ย  ย  Cisco, ย Elas5c ย Search, ย IBM, ย Informa5ca, ย MicrosoZ, ย MicroStrategy ย , ย Oracle, ย SAP, ย  SAS ย  ย and ย others ย  Complete ย  and ย Industry ย  leading ย  Por[olio ย  Ecosystem ย  Partners ย  ISV ย Partners ย  Infrastructure ย  Management ย  Data ย Management ย  Applica5ons ย 
  • 23. 23ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Software and Solutions Innovation Empowering Whatโ€™s Next DESIGNS Big Data Cisco Validated Designs for leading big data platforms can be found at: www.cisco.com/go/bigdata Cisco Validated Designs Accelerate Deployment
  • 24. 24ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Software and Solutions Innovation Empowering Whatโ€™s Next Server 8x UCS C220 M4 CPU 2 x Intel Xeon E5-2620 v3 (15M Cache, 2.40 GHz) Memory 256GB Storage 8 1.2-TB 10K SAS SFF HDD Starter High Performance Server 8x UCS C220 M4 CPU 2 x Intel Xeon E5-2680 v3 (30M Cache, 2.50 GHz) Memory 384GB Storage 2 1.2-TB 10K SAS SFF HDD, 6 400- GB SAS SSD Performance Optimized Capacity Optimized Extreme Capacity Server 16x UCS C240 M4 CPU 2 x Intel Xeon E5-2680 v3 (30M Cache, 2.50 GHz) Memory 256GB Storage 2 120-GB SATA SSD, 24 1.2-TB 10K SAS SFF HDD Server 16x UCS C240 M4 CPU 2 x Intel Xeon E5-2620 v3 (15M Cache, 2.40 GHz) Memory 128GB Storage 2 120-GB SATA SSD. 12 4-TB 7.2K SAS SFF HDD Server 2x UCS C3160 CPU 2 x Intel Xeon E5-2695 v2 (30M Cache, 2.40 GHz) Memory 256GB Storage 2 120-GB SATA SSD, 60 4-TB 7.2K SAS SFF HDD Cisco UCS CPA for Big Data v3 Reference Architecture and Bundles
  • 25. 25ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Software and Solutions Innovation Empowering Whatโ€™s Next 2x UCS 6296 Series Fabric Interconnect UCS Manager โ€ขโ€ฏ UCS Domain (68 Servers) โ€ขโ€ฏ Manage by UCS Manager โ€ขโ€ฏ 2.8 PB of storage โ€ขโ€ฏ HDP 2.2 โ€ขโ€ฏ Tiered Storage โ€ขโ€ฏ Tez โ€ขโ€ฏ RHEL 6.5 โ€ขโ€ฏ Dual 10G Network โ€ขโ€ฏ 17 Servers Per Rack UCS C240 M4 2x E5-2680 v3 256GB Memory Cisco 12Gb/s SAS Raid Controller 2x 120GB STAT SSD 24x 1.2TB 10k SAS 2x Cisco UCS VIC 1227 UCS C3160 2x E5-2695 v2 256GB Memory Cisco 12Gb/s SAS Raid Controller 2x 120GB SATA SSD 60x 4TB 7.2k SAS SFF 2x Cisco UCS VIC 1227 / 17 10Gb Ethernet / 17 10Gb Ethernet 64 Node Cluster Configuration
  • 26. 26ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Software and Solutions Innovation Empowering Whatโ€™s Next UCSD Express UCS 6200 Series Fabric Interconnect UCS Manager UCS C240 M4 Series Rack Server UCS C3160 Rack Server Apache Ambari Unified Management Programmability, Scalability and Automation
  • 27. 27ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Software and Solutions Innovation Empowering Whatโ€™s Next UCS 6200 Series Fabric Interconnect UCS C240 M4 Series Rack Server UCS C3160 Rack Server Data Data Data Cold n replicas on Archive Warm 1 replicas on Disk, n-1 on Archive Hot All (n) replicas on Disk Cold Hot Policy Hot - for both storage and compute. The data that is popular and still being used for processing will stay in this policy. When a block is hot, all replicas are stored in DISK. Warm - partially hot and partially cold. When a block is warm, some of its replicas are stored in DISK and the remaining replicas are stored in ARCHIVE. Cold - only for storage with limited compute. The data that is no longer being used, or data that needs to be archived is moved from hot storage to cold storage. When a block is cold, all replicas are stored in ARCHIVE. Multi-tiered Storage Architecture Multi-temperature Policy
  • 28. 28ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Software and Solutions Innovation Empowering Whatโ€™s Next UCS 6200 Series Fabric Interconnect UCS C240 M4 Series Rack Server UCS C3160 Rack Server Data Data Data Cold n replicas on Archive Warm 1 replicas on Disk, n-1 on Archive Hot All (n) replicas on Disk Cold Hot Mover โ€“ A new data migration tool It periodically scans the files in HDFS to check if the block placement satisfies the storage policy. For the blocks violating the storage policy, it moves the replicas to a different storage type in order to fulfill the storage policy requirement. A C D A C D E A C D E N N N N E Multi-tiered Storage Architecture Multi-temperature Policy
  • 29. Page 29 ยฉ Hortonworks Inc. 2011 โ€“ 2015. All Rights Reserved Next Stepsโ€ฆ Download the Hortonworks Sandbox Learn Hadoop Build Your Analytic App Try Hadoop Learn more with our partnerships http://hortonworks.com/partner/cisco/ http://hortonworks.com/partner/redhat/ Joint CVD bit.ly/Cisco-CVD
  • 30. 30ยฉ 2014 Cisco and/or its affiliates. All rights reserved. Cisco Confidential โ€ขโ€ฏ Cisco Live! in San Diego โ€“ June 7 - 11 โ€ขโ€ฏ Hadoop Summit in San Jose โ€“ June 9 โ€“ 11 โ€ขโ€ฏ Red Hat Summit in Boston - June 23-26 More information about Red Hatโ€™s Big Data solutions please visit: โ€ขโ€ฏ redhat.com/bigdata โ€ขโ€ฏ redhatstorage.redhat.com/category/big-data โ€ขโ€ฏ redhat.com/en/insights/big-data More information about Ciscoโ€™s Big Data and Analytics Offers please visit: โ€ขโ€ฏ www.cisco.com/go/bigdata and www.cisco.com/go/bigdata_design โ€ขโ€ฏ http://blogs.cisco.com/author/raghunathnambiar โ€ขโ€ฏ bit.ly/Cisco-CVD 30 Meet us in person!