Copyright © 2013, Oracle and/or its affiliates. All rights reserved.
Big Data
Jean-Pierre Dijcks
Team Lead – Big Data Product Management
Agenda
• Big Data Implementation Patterns
• Big Data Products
• Q&A
Big Data Implementations
Big Data Usage Pattern
ETL and Batch Processing Workloads on Hadoop
Integrate
SQL
SQL
NoSQL
• Scalable
• Flexible
• Cost Effective
DW & BI
Analytics
Web
Ad-hoc
Big Data Usage Pattern
Scale-out Information Discovery
• Online
• Scalable
• Flexible
• Cost Effective
Data Factory
Continuous On-Demand
Big Data Usage Pattern
Expand Data Warehouse with Granular Data Store
Marts
Data Warehouse
Σ Σ
Business Intelligence
Archiving
• Online
• Scalable
• Flexible
• Cost Effective
Data Factory
Big Data Usage Pattern
Instant Responses to Streaming Data based on Historical Analysis
Data Warehouse
Business Intelligence
• Online
• Scalable
• Flexible
• Cost Effective
Data Factory
Event Decisions
NoSQL
Oracle Big Data Solution
Stream Acquire – Organize – Analyze
In-Database Analytics
Data Warehouse
Oracle Advanced Analytics
Oracle Database
Oracle BI Enterprise Edition
Oracle Real-Time Decisions
Endeca Information Discovery
Decide
Oracle Event Processing
Apache Flume
Applications
Oracle NoSQL Database
Cloudera Hadoop
Oracle R Distribution
Oracle Big Data Connectors
Oracle Data Integrator
• Complete
• Integrated
• Scalable
Big Data Products
Big Data Appliance X3-2
Sun Oracle X3-2L servers, each with:
• 2 × 8-core Intel Xeon E5 processors
• 64 GB memory
• 36 TB disk space
Totals per Full Rack:
• 288 Processor Cores
• 1152 GB of Memory
• 648TB Available Disk space
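The full-rack totals follow directly from the per-server figures. A minimal sketch of that arithmetic, assuming 18 servers per Full Rack (a count not stated on this slide, inferred here from 288 cores divided by 16 cores per server); the function name is illustrative:

```python
# Per-server figures taken from the slide.
CORES_PER_SERVER = 2 * 8        # two 8-core Xeon E5 processors
MEMORY_GB_PER_SERVER = 64
DISK_TB_PER_SERVER = 36

# Assumption: 18 servers per Full Rack, inferred from 288 cores / 16 cores per server.
SERVERS_PER_FULL_RACK = 18


def rack_totals(servers: int) -> dict:
    """Return aggregate cores, memory (GB), and disk (TB) for a number of servers."""
    return {
        "cores": servers * CORES_PER_SERVER,
        "memory_gb": servers * MEMORY_GB_PER_SERVER,
        "disk_tb": servers * DISK_TB_PER_SERVER,
    }


full_rack = rack_totals(SERVERS_PER_FULL_RACK)
print(full_rack)  # {'cores': 288, 'memory_gb': 1152, 'disk_tb': 648}
```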
Big Data Appliance Software Stack
Integrated Software:
• Oracle Linux 5.8 with UEK
• Cloudera CDH 4.2 & Cloudera Manager 4.5
• Big Data Appliance Enterprise Manager Plug-In
• Oracle R Distribution
All integrated software is supported as part of Premier Support for Systems and Premier Support for Operating Systems
Optional Software:
• Oracle NoSQL Database 2.x
• Oracle Big Data Connectors 2.x
BDA in Infrastructure as a Service
• Procurement option for H/W
• Low monthly fee spread out over 3 to 5 years
• Ownership of the system stays with Oracle
• Applies to all Engineered Systems
• BDA Full Racks only
Big Data Appliance Product Family
• Starter Rack is fully cabled and configured for growth, with 6 servers
• In-Rack Expansion delivers a 6-server modular expansion block
• Full Rack delivers an optimal blend of capacity and expansion options
• Grow by adding racks – up to 18 racks without additional switches
Big Data Appliance X3-2 Starter Rack
• 6 nodes fully cabled in Starter Rack
– 96 Intel® Xeon® E5 processor cores
– 384 GB total memory
– 216 TB total raw storage capacity
• 6 nodes added in-rack with In-Rack Expansion
– 96 Intel® Xeon® E5 processor cores
– 384 GB total memory
– 216 TB total raw storage capacity
Start and grow in increments of six servers
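Raw capacity is not the same as usable HDFS capacity, because HDFS stores several copies of each block. The sketch below steps through the six-server increments and estimates usable space. It assumes the HDFS default replication factor of 3 and ignores space reserved for the operating system and MapReduce intermediate data; the function name and the estimate are illustrative, not Oracle sizing guidance.

```python
DISK_TB_PER_SERVER = 36     # raw disk per server, from the X3-2 slide
EXPANSION_STEP = 6          # servers per Starter Rack or In-Rack Expansion
HDFS_REPLICATION = 3        # assumption: HDFS default replication factor


def capacity_by_step(max_servers: int = 18):
    """Yield (servers, raw_tb, approx_usable_tb) for each six-server increment."""
    for servers in range(EXPANSION_STEP, max_servers + 1, EXPANSION_STEP):
        raw_tb = servers * DISK_TB_PER_SERVER
        yield servers, raw_tb, raw_tb / HDFS_REPLICATION


for servers, raw_tb, usable_tb in capacity_by_step():
    print(f"{servers:2d} servers: {raw_tb} TB raw, ~{usable_tb:.0f} TB at 3x replication")
```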
Why Oracle Big Data Appliance?
• Beats DIY Clusters on:
– Initial Cost and Time to Value
– Performance and Scalability
• Pre-configured with leading Hadoop Distribution
– Proven at large scale
– Contributors across all components for better support
• Better Integration with your Oracle ecosystem with:
– High-performance connectivity to Exadata
– Unified analytics API (SQL, R, MapReduce etc.)
– Single Enterprise Manager Framework
Flexible Configurations
• Divide a Full Rack BDA into multiple clusters
• Provide more flexible configurations for customers
• Automatic reconfiguration when expanding the cluster
Example Configuration: 6 Node Cluster, 12 Node Cluster
Engineered for Quicker Time to Value at Lower Cost
http://www.oracle.com/us/corporate/analystreports/industries/esg-big-data-wp-1914112.pdf
ESG believes that a "buy" versus "do-it-yourself" approach will yield roughly one-third faster time-to-market benefit improvement...
[Chart: Time to Market (Weeks), Oracle Big Data Appliance vs. Build it yourself]
[Chart: Cost: Initial Infrastructure/Tasks, Oracle Big Data Appliance vs. Build it yourself]
[…] nearly 40% cost savings versus IT architecting, designing, procuring, configuring, and implementing its own big data infrastructure.
Engineered for Performance
Compared with a DIY Cluster
[Chart: Time (hours), Big Data Appliance vs. DIY Hadoop Cluster, annotated "6x"]
• Configured for exceptional performance on delivery
• 6x faster than custom 20-node Hadoop cluster for large batch transformation jobs
• Engineering done by Oracle and Cloudera:
– OS and File System Tuning
– Java Virtual Machine Tuning
– Hadoop Configuration and Setup
Engineered by Oracle and Cloudera
Why Cloudera and Cloudera CDH?
• Proven Track Record with the largest Hadoop Installed Base
• Proven in large scale enterprise implementations
• Demonstrated Leadership in Hadoop Community
– Breadth and Depth across the Hadoop ecosystem and products
– Fast evolution in critical features
• Managed Distribution
– Components certified to work together and on Oracle Big Data Appliance in regular updates
– Industry Leading Management Framework for Hadoop integrated with Oracle Enterprise Manager
Engineered by Oracle and Cloudera
Market Leading Hadoop Distribution Pre-configured
• Cloudera’s Hadoop Knowledge Engineered into the system:
– Master service lay-out designed for large clusters based on experience with many large systems
– Optimized data block size for MapReduce workloads
– Optimized number of Map and Reduce slots fitting the system capacity
– Optimized settings for a large number of Hadoop parameters
• Tested at Oracle and Cloudera on the same hardware/software stack as our customers
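For readers unfamiliar with where block size and slot counts live, the sketch below renders them as Hadoop site configuration XML. The property names are the standard HDFS and MapReduce v1 ones; the values are placeholders chosen for illustration and are not the appliance's actual settings, which the slides do not give. The helper name `render_site_xml` is hypothetical.

```python
import xml.etree.ElementTree as ET


def render_site_xml(properties: dict) -> str:
    """Render name/value pairs as a Hadoop *-site.xml <configuration> document."""
    root = ET.Element("configuration")
    for name, value in properties.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = str(value)
    return ET.tostring(root, encoding="unicode")


# Placeholder values for illustration only, not the appliance's tuned settings.
hdfs_site = {"dfs.blocksize": 256 * 1024 * 1024}      # HDFS block size in bytes
mapred_site = {
    "mapred.tasktracker.map.tasks.maximum": 12,       # map slots per node
    "mapred.tasktracker.reduce.tasks.maximum": 6,     # reduce slots per node
}

print(render_site_xml(hdfs_site))
print(render_site_xml(mapred_site))
```

Block size belongs in hdfs-site.xml and the slot limits in mapred-site.xml, which is why the sketch keeps them in separate dictionaries.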
Engineered by Oracle and Cloudera
Driving Enterprise Class Requirements for Hadoop
• Multi-Homing for Hadoop
– To leverage BDA’s InfiniBand and 10GigE network, Hadoop needed to be able to support multiple networks and IP addresses
– Committed to Apache Hadoop by Cloudera
• Highly Available NameNode Solution
– Remove dependency on an HA Filer to enable HA without requiring additional hardware
– Build a journaling-based HA solution for NameNode with automatic fail-over
• System Administration
– Tight integration between Oracle Enterprise Manager (Hardware and High-Level Software Monitoring) and Cloudera Manager (Hadoop Details)
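In Apache Hadoop 2, one journaling-based NameNode HA design is the Quorum Journal Manager, where NameNodes share edit logs through a set of JournalNodes instead of a shared filer. The sketch below assembles the main hdfs-site.xml properties for such a setup. Whether the appliance is configured exactly this way is an assumption, and the nameservice, NameNode IDs, and host names are hypothetical.

```python
def qjournal_uri(journal_hosts, nameservice, port=8485):
    """Build the shared-edits URI that points NameNodes at a JournalNode quorum."""
    hosts = ";".join(f"{host}:{port}" for host in journal_hosts)
    return f"qjournal://{hosts}/{nameservice}"


def ha_properties(nameservice, namenode_ids, journal_hosts):
    """Return hdfs-site.xml properties for journal-based HA with automatic failover."""
    return {
        "dfs.nameservices": nameservice,
        f"dfs.ha.namenodes.{nameservice}": ",".join(namenode_ids),
        "dfs.namenode.shared.edits.dir": qjournal_uri(journal_hosts, nameservice),
        "dfs.ha.automatic-failover.enabled": "true",
    }


# Hypothetical cluster layout: two NameNodes and three JournalNodes.
props = ha_properties("bdacluster", ["nn1", "nn2"], ["node01", "node02", "node03"])
for name, value in props.items():
    print(f"{name} = {value}")
```

Automatic failover additionally relies on a ZooKeeper quorum (set through `ha.zookeeper.quorum` in core-site.xml), which this sketch leaves out.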
Integrated Management Framework
Management Infrastructure combines EM and CM
Quick view of Hardware and Software status in Oracle Enterprise Manager
Big Data Connectors
Optimized integration of Hadoop with Oracle Database and Oracle Exadata
• Oracle Loader for Hadoop
• Oracle SQL Connector for Hadoop Distributed File System (HDFS)
• Oracle Data Integrator Application Adapter for Hadoop
• Oracle R Connector for Hadoop
• Does not require Big Data Appliance – can be licensed for Hadoop running on non-Oracle hardware
Analyze Data across your Oracle Systems
SQL Analytics on ALL data
SQL
Hadoop Oracle Database
IB
• Expand the data pool for analytics leveraging Hadoop
• Stream Hadoop resident data through Big Data Connectors for SQL processing
• Use the full power of Oracle SQL on all data
• Or use Oracle Loader for Hadoop to integrate data in Oracle Database
Analyze Data across your Oracle Systems
R Analytics on ALL data
R
Hadoop Oracle Database
IB
• Expand the data pool for analytics leveraging Hadoop
• Improve scalability and performance for R without changes to your programs
• Dynamically leverage Hadoop through Big Data Connectors to execute R analytics
Oracle Data Integrator
Simplify MapReduce
OLH & OSCH
Oracle Data Integrator
• Automatically generates MapReduce code
• High performance loads into Data Warehouse leveraging both OLH and OSCH
• Manages the process across platforms
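To make concrete what kind of program is being generated on the user's behalf, here is the MapReduce programming model in miniature. This is a pure-Python toy with made-up input data; it is not ODI output and does not use the Hadoop API.

```python
from collections import defaultdict


def map_phase(records, mapper):
    """Apply the mapper to every record, yielding (key, value) pairs."""
    for record in records:
        yield from mapper(record)


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups, reducer):
    """Apply the reducer to each key and its grouped values."""
    return {key: reducer(key, values) for key, values in groups.items()}


def parse_sale(line):
    """Mapper: turn a 'region,amount' line into a (region, amount) pair."""
    region, amount = line.split(",")
    yield region, int(amount)


# Hypothetical input records.
lines = ["east,10", "west,5", "east,7"]
totals = reduce_phase(shuffle(map_phase(lines, parse_sale)),
                      lambda key, values: sum(values))
print(totals)  # {'east': 17, 'west': 5}
```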
Oracle NoSQL Database
Scalable, Highly Available, Key-Value Database
Application
Storage Nodes
Datacenter B
Storage Nodes
Datacenter A
Application
NoSQL DB Driver
Application
NoSQL DB Driver
Application
• Simple Key-Value Data Model
• Horizontally Scalable
• Highly Available
• Simple administration
• ACID Transactions at scale
• Transparent load balancing
• Elastic Configuration
• Commercial grade software and support
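A rough picture of how a key-value model scales horizontally: records are spread over shards by hashing part of the key. The sketch below is an in-memory toy, not the Oracle NoSQL Database API. It assumes keys split into a major and a minor part and hashes only the major part, so related records land on the same shard; the class and method names are hypothetical.

```python
import hashlib


class ToyKeyValueStore:
    """In-memory stand-in for a sharded key-value store (not a real driver)."""

    def __init__(self, num_shards=3):
        self.shards = [{} for _ in range(num_shards)]

    def shard_for(self, major_key):
        # Hash only the major key so related records are co-located.
        digest = hashlib.md5(major_key.encode("utf-8")).hexdigest()
        return int(digest, 16) % len(self.shards)

    def put(self, major_key, minor_key, value):
        self.shards[self.shard_for(major_key)][(major_key, minor_key)] = value

    def get(self, major_key, minor_key):
        return self.shards[self.shard_for(major_key)].get((major_key, minor_key))


store = ToyKeyValueStore()
store.put("user/42", "profile", {"name": "Ada"})
store.put("user/42", "cart", ["book"])
print(store.get("user/42", "profile"))  # {'name': 'Ada'}
```

Keeping all records for one major key on one shard is what makes it practical to update them together without coordinating across nodes.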
Oracle NoSQL Database Use Cases
NoSQL DB Driver
Application
Oracle Event
Processor
Event
Stream
Web Scale Transaction Processing
• High velocity, High volume, High variety, Low information density data capture
• Uses Hadoop and/or Data Warehouse for analytics
• Applications: Web browsing, Web Retail, CDR processing, Sensor data capture
Last Mile Content Delivery
• Platform for real-time content delivery
• Content & market segmentation acquired and analyzed in Hadoop & RDBMS
• NoSQL provides low latency content lookup and delivery to end-customers
Real Time Event Processing
• Real time events trigger rule execution in Oracle Event Processing
• OEP rules perform low latency lookups to Oracle NoSQL DB for additional data
• OEP actions are triggered
• Applications: Medical Monitoring, Factory Automation, Oil & Gas, Geo-location
Rule Action
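The real-time flow described above (event arrives, rule fires, rule looks up extra data, action is triggered) can be sketched in a few lines. This is a plain-Python illustration with hypothetical device names and thresholds; a dictionary stands in for the NoSQL lookup, and nothing here is Oracle Event Processing syntax.

```python
# Stand-in for the low-latency lookup store: device id -> reference data (hypothetical).
reference_data = {"pump-7": {"max_pressure": 120}}


def pressure_rule(event, lookup):
    """Return an action name if the event breaches the looked-up threshold."""
    profile = lookup.get(event["device"])
    if profile and event["pressure"] > profile["max_pressure"]:
        return "raise_alarm"
    return None


def process(events, rule, lookup):
    """Run the rule against each event and collect the triggered actions."""
    actions = []
    for event in events:
        action = rule(event, lookup)
        if action is not None:
            actions.append((event["device"], action))
    return actions


stream = [
    {"device": "pump-7", "pressure": 95},
    {"device": "pump-7", "pressure": 130},
]
print(process(stream, pressure_rule, reference_data))  # [('pump-7', 'raise_alarm')]
```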
Build Your Own Copilot & Agents For DevsBuild Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For Devs
Brian McKeiver
 
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveDesigning Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-UmgebungenHCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
panagenda
 
Rusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond SparkRusty Waters: Elevating Lakehouses Beyond Spark
Rusty Waters: Elevating Lakehouses Beyond Spark
carlyakerly1
 
2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx
Samuele Fogagnolo
 
HCL Nomad Web – Best Practices and Managing Multiuser Environments
HCL Nomad Web – Best Practices and Managing Multiuser EnvironmentsHCL Nomad Web – Best Practices and Managing Multiuser Environments
HCL Nomad Web – Best Practices and Managing Multiuser Environments
panagenda
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
Greenhouse_Monitoring_Presentation.pptx.
Greenhouse_Monitoring_Presentation.pptx.Greenhouse_Monitoring_Presentation.pptx.
Greenhouse_Monitoring_Presentation.pptx.
hpbmnnxrvb
 
Technology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data AnalyticsTechnology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data Analytics
InData Labs
 
Generative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in BusinessGenerative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in Business
Dr. Tathagat Varma
 
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdfSAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
Precisely
 
Drupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy ConsumptionDrupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy Consumption
Exove
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
 
AI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global TrendsAI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global Trends
InData Labs
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
Build Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For DevsBuild Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For Devs
Brian McKeiver
 
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveDesigning Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 

2013 05 Oracle big_dataapplianceoverview

  • 1. Big Data
      • Jean-Pierre Dijcks, Team Lead – Big Data Product Management
      • Copyright © 2013, Oracle and/or its affiliates. All rights reserved.
  • 2. Agenda
      • Big Data Implementation Patterns
      • Big Data Products
      • Q&A
  • 3. Big Data Implementations
  • 4. Big Data Usage Pattern: ETL and Batch Processing Workloads on Hadoop
      • Scalable
      • Flexible
      • Cost Effective
      • [Diagram labels: Integrate, SQL, SQL, NoSQL, DW & BI, Analytics, Web]
  • 5. Big Data Usage Pattern: Scale-out Information Discovery
      • Online
      • Scalable
      • Flexible
      • Cost Effective
      • [Diagram labels: Ad-hoc, Data Factory, Continuous, On-Demand]
  • 6. Big Data Usage Pattern: Expand Data Warehouse with Granular Data Store
      • Online
      • Scalable
      • Flexible
      • Cost Effective
      • [Diagram labels: Marts, Data Warehouse, Σ, Σ, Business Intelligence, Archiving, Data Factory]
  • 7. Big Data Usage Pattern: Instant Responses to Streaming Data based on Historical Analysis
      • Online
      • Scalable
      • Flexible
      • Cost Effective
      • [Diagram labels: Data Warehouse, Business Intelligence, Data Factory, Event, Decisions, NoSQL]
  • 8. Oracle Big Data Solution
      • Complete
      • Integrated
      • Scalable
      • [Diagram labels: Stream; Acquire – Organize – Analyze; Decide; In-Database Analytics; Data Warehouse; Oracle Advanced Analytics; Oracle Database; Oracle BI Enterprise Edition; Oracle Real-Time Decisions; Endeca Information Discovery; Oracle Event Processing; Apache Flume; Applications; Oracle NoSQL Database; Cloudera Hadoop; Oracle R Distribution; Oracle Big Data Connectors; Oracle Data Integrator]
  • 9. Big Data Products
  • 10. Big Data Appliance X3-2
      • Sun Oracle X3-2L servers, each with:
          – 2 × 8-core Intel Xeon E5 processors
          – 64 GB memory
          – 36 TB disk space
      • Totals per full rack:
          – 288 processor cores
          – 1,152 GB of memory
          – 648 TB available disk space
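The full-rack totals on slide 10 follow directly from the per-server figures. The sketch below reproduces that arithmetic. The number of servers per full rack is not stated on the slide; it is inferred here as 288 cores divided by 16 cores per server, and all names in the code are illustrative.

```python
# Per-server figures taken from slide 10.
SOCKETS_PER_SERVER = 2
CORES_PER_SOCKET = 8
MEMORY_GB_PER_SERVER = 64
DISK_TB_PER_SERVER = 36

# Assumption: the server count is inferred from the stated core total,
# because the slide does not give it explicitly.
SERVERS_PER_FULL_RACK = 288 // (SOCKETS_PER_SERVER * CORES_PER_SOCKET)


def rack_totals(servers: int) -> dict:
    """Scale the per-server figures to a given number of servers."""
    return {
        "cores": servers * SOCKETS_PER_SERVER * CORES_PER_SOCKET,
        "memory_gb": servers * MEMORY_GB_PER_SERVER,
        "disk_tb": servers * DISK_TB_PER_SERVER,
    }


full_rack = rack_totals(SERVERS_PER_FULL_RACK)
print(SERVERS_PER_FULL_RACK, full_rack)
```

Calling `rack_totals(6)` gives 96 cores, 384 GB and 216 TB, which is the size of one six-server block.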
  • 11. Big Data Appliance Software Stack
      • Integrated Software:
          – Oracle Linux 5.8 with UEK
          – Cloudera CDH 4.2 & Cloudera Manager 4.5
          – Big Data Appliance Enterprise Manager Plug-In
          – Oracle R Distribution
      • All integrated software is supported as part of Premier Support for Systems and Premier Support for Operating Systems
      • Optional Software:
          – Oracle NoSQL Database 2.x
          – Oracle Big Data Connectors 2.x
  • 12. BDA in Infrastructure as a Service
      • Procurement option for H/W
      • Low monthly fee spread out over 3 to 5 years
      • Ownership of the system stays with Oracle
      • Applies to all Engineered Systems
      • BDA Full Racks only
  • 13. Big Data Appliance Product Family
      • Starter Rack is fully cabled and configured for growth, with 6 servers
      • In-Rack Expansion delivers a 6-server modular expansion block
      • Full Rack delivers an optimal blend of capacity and expansion options
      • Grow by adding racks: up to 18 racks without additional switches
  • 14. Big Data Appliance X3-2 Starter Rack
      • 6 nodes fully cabled in Starter Rack
          – 96 Intel® Xeon® E5 processor cores
          – 384 GB total memory
          – 216 TB total raw storage capacity
      • 6 nodes of In-Rack Expansion added in-rack
          – 96 Intel® Xeon® E5 processor cores
          – 384 GB total memory
          – 216 TB total raw storage capacity
      • Start and grow in increments of six servers
  • 15. Why Oracle Big Data Appliance?
      • Beats DIY Clusters on:
          – Initial Cost and Time to Value
          – Performance and Scalability
      • Pre-configured with leading Hadoop Distribution
          – Proven at large scale
          – Contributors across all components for better support
      • Better Integration with your Oracle ecosystem with:
          – High-performance connectivity to Exadata
          – Unified analytics API (SQL, R, MapReduce etc.)
          – Single Enterprise Manager Framework
  • 16. Flexible Configurations
      • Divide a Full Rack BDA into multiple clusters
      • Provide more flexible configurations for customers
      • Automatic reconfiguration when expanding the cluster
      • Example Configuration: 6 Node Cluster and 12 Node Cluster
  • 17. Engineered for Quicker Time to Value at Lower Cost
      • Source: http://www.oracle.com/us/corporate/analystreports/industries/esg-big-data-wp-1914112.pdf
      • ESG believes that a "buy" versus "do-it-yourself" approach will yield roughly one-third faster time-to-market benefit improvement...
      • […] nearly 40% cost savings versus IT architecting, designing, procuring, configuring, and implementing its own big data infrastructure.
      • [Chart: Time to Market (Weeks), Oracle Big Data Appliance vs. Build it yourself]
      • [Chart: Cost: Initial Infrastructure/Tasks, Oracle Big Data Appliance vs. Build it yourself]
  • 18. Engineered for Performance: Compared with a DIY Cluster
      • Configured for exceptional performance on delivery
      • 6x faster than custom 20-node Hadoop cluster for large batch transformation jobs
      • Engineering done by Oracle and Cloudera:
          – OS and File System Tuning
          – Java Virtual Machine Tuning
          – Hadoop Configuration and Setup
      • [Chart: Time (hours), Big Data Appliance vs. DIY Hadoop Cluster, annotated "6x"]
  • 19. Engineered by Oracle and Cloudera: Why Cloudera and Cloudera CDH?
      • Proven Track Record with the largest Hadoop Installed Base
      • Proven in large scale enterprise implementations
      • Demonstrated Leadership in Hadoop Community
          – Breadth and Depth across the Hadoop ecosystem and products
          – Fast evolution in critical features
      • Managed Distribution
          – Components certified to work together and on Oracle Big Data Appliance in regular updates
          – Industry Leading Management Framework for Hadoop integrated with Oracle Enterprise Manager
  • 20. Engineered by Oracle and Cloudera: Market Leading Hadoop Distribution Pre-configured
      • Cloudera’s Hadoop Knowledge Engineered into the system:
          – Master service lay-out designed for large clusters based on experience with many large systems
          – Optimized data block size for MapReduce workloads
          – Optimized number of Map and Reduce slots fitting the system capacity
          – Optimized settings for a large number of Hadoop parameters
      • Tested at Oracle and Cloudera on the same hardware/software stack as our customers
  • 21. Engineered by Oracle and Cloudera: Driving Enterprise Class Requirements for Hadoop
      • Multi-Homing for Hadoop
          – To leverage BDA’s InfiniBand and 10GigE network, Hadoop needed to be able to support multiple networks and IP addresses
          – Committed to Apache Hadoop by Cloudera
      • Highly Available NameNode Solution
          – Remove dependency on an HA Filer to enable HA without requiring additional hardware
          – Build a journaling-based HA solution for NameNode with automatic fail-over
      • System Administration
          – Tight integration between Oracle Enterprise Manager (Hardware and High-Level Software Monitoring) and Cloudera Manager (Hadoop Details)
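Slide 21 names a journaling-based HA solution for the NameNode but does not describe how it works. One way to build such a journal without a shared filer is quorum-based: the active node sends each edit to several journal daemons and treats it as durable once a strict majority has stored it. The toy sketch below illustrates only that idea. The class and function names are hypothetical, and it is not Hadoop's implementation or API.

```python
class JournalNode:
    """Toy stand-in for a journal daemon that stores edit-log entries."""

    def __init__(self, name, available=True):
        self.name = name
        self.available = available
        self.edits = []

    def append(self, txid, edit):
        # An unavailable node stores nothing and sends no acknowledgement.
        if not self.available:
            return False
        self.edits.append((txid, edit))
        return True


def write_edit(journals, txid, edit):
    """Return True if a strict majority of journal nodes stored the edit."""
    acks = sum(1 for j in journals if j.append(txid, edit))
    return acks > len(journals) // 2


def recover_edits(journals):
    """Edits a standby could replay: those held by a majority of nodes."""
    counts = {}
    for j in journals:
        for entry in j.edits:
            counts[entry] = counts.get(entry, 0) + 1
    quorum = len(journals) // 2 + 1
    return sorted(e for e, n in counts.items() if n >= quorum)


journals = [JournalNode("jn1"), JournalNode("jn2"), JournalNode("jn3")]
ok1 = write_edit(journals, 1, "mkdir /data")      # all three nodes up
journals[2].available = False
ok2 = write_edit(journals, 2, "create /data/a")   # two of three still up
journals[1].available = False
ok3 = write_edit(journals, 3, "delete /data/a")   # only one node up
print(ok1, ok2, ok3, recover_edits(journals))
```

With three journal nodes, the loss of any single node does not block writes, which is why no additional shared storage hardware is needed in this kind of design.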
  • 22. Integrated Management Framework
      • Management infrastructure combines Oracle Enterprise Manager (EM) and Cloudera Manager (CM)
      • Quick view of hardware and software status in Oracle Enterprise Manager
  • 23. Big Data Connectors
      • Optimized integration of Hadoop with Oracle Database and Oracle Exadata
      • Oracle Loader for Hadoop
      • Oracle SQL Connector for Hadoop Distributed File System (HDFS)
      • Oracle Data Integrator Application Adapter for Hadoop
      • Oracle R Connector for Hadoop
      • Does not require Big Data Appliance: can be licensed for Hadoop running on non-Oracle hardware
  • 24. Analyze Data across your Oracle Systems: SQL Analytics on ALL data
      • Expand the data pool for analytics leveraging Hadoop
      • Stream Hadoop resident data through Big Data Connectors for SQL processing
      • Use the full power of Oracle SQL on all data
      • Or use Oracle Loader for Hadoop to integrate data in Oracle Database
      • [Diagram labels: SQL, Hadoop, Oracle Database, IB]
  • 25. Analyze Data across your Oracle Systems: R Analytics on ALL data
      • Expand the data pool for analytics leveraging Hadoop
      • Improve scalability and performance for R without changes to your programs
      • Dynamically leverage Hadoop through Big Data Connectors to execute R analytics
      • [Diagram labels: R, Hadoop, Oracle Database, IB]
  • 26. Oracle Data Integrator: Simplify Map Reduce
      • Automatically generates MapReduce code
      • High performance loads into Data Warehouse leveraging both OLH and OSCH
      • Manages the process across platforms
      • [Diagram labels: OLH & OSCH, Oracle Data Integrator]
  • 27. Oracle NoSQL Database: Scalable, Highly Available, Key-Value Database
      • Simple Key-Value Data Model
      • Horizontally Scalable
      • Highly Available
      • Simple administration
      • ACID Transactions at scale
      • Transparent load balancing
      • Elastic Configuration
      • Commercial grade software and support
      • [Diagram labels: Application, NoSQL DB Driver, Storage Nodes (Datacenter A), Storage Nodes (Datacenter B)]
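The key-value model and horizontal scaling listed on slide 27 can be illustrated with a generic hash-partitioned store. This is a minimal sketch of the general technique, not the Oracle NoSQL Database driver API. The class `ShardedKVStore`, the node names and the hashing scheme are all assumptions made for illustration.

```python
import hashlib


class ShardedKVStore:
    """Toy key-value store that spreads keys over several storage nodes."""

    def __init__(self, node_names):
        # Each "storage node" is just an in-memory dict in this sketch.
        self.nodes = {name: {} for name in node_names}
        self.names = sorted(node_names)

    def _node_for(self, key: str) -> str:
        # A stable hash keeps a given key on the same node across runs.
        digest = hashlib.md5(key.encode("utf-8")).digest()
        index = int.from_bytes(digest[:8], "big") % len(self.names)
        return self.names[index]

    def put(self, key: str, value: bytes) -> None:
        self.nodes[self._node_for(key)][key] = value

    def get(self, key: str):
        # Returns None when the key is absent.
        return self.nodes[self._node_for(key)].get(key)


store = ShardedKVStore(["sn1", "sn2", "sn3"])
for i in range(300):
    store.put(f"user/{i}", f"profile-{i}".encode())

print(store.get("user/42"))
print({name: len(data) for name, data in store.nodes.items()})
```

The application only calls `put` and `get`; routing to a node happens inside the store, which is the sense in which load balancing is transparent. Plain modulo hashing moves most keys when a node is added, so a store offering elastic configuration would need a more careful partition mapping than this sketch uses.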
  • 28. Oracle NoSQL Database Use Cases
      • Web Scale Transaction Processing
          – High velocity, High volume, High variety, Low information density data capture
          – Uses Hadoop and/or Data Warehouse for analytics
          – Applications: Web browsing, Web Retail, CDR processing, Sensor data capture
      • Last Mile Content Delivery
          – Platform for real-time content delivery
          – Content & market segmentation Acquired and Analyzed in Hadoop & RDBMS
          – NoSQL provides low latency content lookup and delivery to end-customers
          – OEP rules perform low latency lookups to Oracle NoSQL DB for additional data
      • Real Time Event Processing
          – Real time events trigger rule execution in Oracle Event Processing
          – OEP rules perform low latency lookups to Oracle NoSQL DB for additional data
          – OEP actions are triggered
          – Applications: Medical Monitoring, Factory Automation, Oil & Gas, Geo-location
      • [Diagram labels: NoSQL DB Driver, Application, Oracle Event Processor, Event Stream, Rule, Action]
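The real-time event processing flow on slide 28 (an event arrives, a rule looks up additional data by key, an action is triggered) can be sketched as follows. A plain dictionary stands in for the key-value store. The event fields, threshold values and alert format are invented for illustration and do not come from Oracle Event Processing or Oracle NoSQL Database.

```python
from dataclasses import dataclass


@dataclass
class Event:
    sensor_id: str
    reading: float


# Stand-in for the key-value store: sensor id -> reference data.
reference_data = {
    "pump-1": {"max_reading": 80.0, "site": "plant-a"},
    "pump-2": {"max_reading": 120.0, "site": "plant-b"},
}


def apply_rule(event, lookup):
    """Enrich the event via a key lookup, then decide whether to act."""
    profile = lookup.get(event.sensor_id)
    if profile is None:
        return None  # no reference data, so the rule cannot fire
    if event.reading > profile["max_reading"]:
        return f"ALERT {profile['site']}: {event.sensor_id} at {event.reading}"
    return None


stream = [Event("pump-1", 75.0), Event("pump-2", 130.5), Event("pump-9", 5.0)]
actions = [a for a in (apply_rule(e, reference_data) for e in stream) if a]
print(actions)  # ['ALERT plant-b: pump-2 at 130.5']
```

Each event costs one key lookup, so the latency of the rule is bounded by the latency of the store, which is why a low-latency key-value database suits this pattern.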