SlideShare a Scribd company logo
BlueData EPIC™
on AWS
Big Data Analytics on AWS
The BlueData EPIC (Elastic Private Instant Clusters) software platform makes deployment of Big
Data infrastructure and applications easier, faster, and more cost-effective—whether on-premises
or on the public cloud.
With BlueData EPIC on AWS, you can quickly and easily deploy your preferred Big Data
applications, distributions, and tools; leverage enterprise-class security and cost controls for multi-
tenant deployments on the Amazon cloud; and tap into both Amazon S3 and on-premises storage
for your Big Data analytics.
Key Features
The BlueData EPIC software platform enables simplified, consistent, and repeatable deployment
of multiple Big Data applications and tools in a multi-tenant architecture.
BlueData EPIC on AWS provides the ability to spin up your own unique Big Data environments
for a range of different data science and analytics use cases within minutes—without requiring
DevOps or AWS-specific skills. BlueData EPIC delivers a single unified Big-Data-as-a-Service
platform to build, run, and manage a variety of different Big Data frameworks including Hadoop,
Spark, Kafka, Cassandra, and more on AWS—instead of managing each deployment with different
processes and tools.
This common platform minimizes the refactoring work required for Big Data developers and data
science teams to get up and running with their preferred Big Data applications and tools on AWS—
along with the flexibility to quickly and easily upgrade to the latest releases and new versions.
DevOps Agility for Big Data
BlueData EPIC software is deployed into your AWS account with a streamlined, wizard-based process
that takes only 20-25 minutes. The EPIC software is automatically installed in a single AWS controller
instance within a private Amazon VPC subnet or (optionally) a new VPC subnet.
Once installed, BlueData EPIC provides a simple and easy-to-use interface that abstracts both
administrators and end users from the inner-workings of AWS. As an administrator, you can quickly
and easily onboard one or more user groups (i.e. tenants) and assign them different AWS usage quotas
in a multi-tenant environment.
Your end users can then log into BlueData EPIC’s self-service interface to spin up instant clusters on
Amazon EC2 and scale up or scale down in just a few mouse click—with built-in elasticity for all cluster
types. Amazon S3 connectivity is pre-integrated into all clusters, using tenant-level access credentials.
Ultimate flexibility and choice for Big-Data-as-a-Service on Amazon Web Services
www.bluedata.com
Spec Sheet
BENEFITS
	Simplified user experience
for both administrators
and data science teams,
abstracting the AWS-specific
infrastructure so they can
focus on their Big Data needs.
	Faster AWS onboarding for
multiple teams and Big Data
workloads, eliminating the
need for DevOps expertise
and reducing the cost and
time involved.
	Greater agility and flexibility,
with self-service clusters
pre-configured on Amazon
EC2 for Spark, Hadoop, Kafka,
Cassandra, and other Big
Data applications.
	Reduced AWS costs through
the use of fine-grained
resource quotas, start/stop
controls, and cost reporting
in a multi-tenant environment.
	Faster time to insights
with pre-built cluster
integrations to Amazon S3
and in-place analytics against
on-premises data.
	Improved data governance
with integrations to Amazon
VPC (including site-to-site
VPN), Active Directory, and
Kerberos for authentication.
on
Minimum Requirements
An AWS account is required for BlueData EPIC on AWS.
The one time setup of the BlueData EPIC controller requires an AWS
IAM user account with the following AWS permissions:
•	 AmazonEC2FullAccess
•	 AmazonS3FullAccess
•	 AmazonVPCFullAccess
•	 AWS CloudFormation (ability to “Create Stack”, “Describe”,
“List”, “Get”, “Delete”, “Validate”)
An individual “root” (i.e. personal AWS user account) will need
to have each of the above privileges. Larger enterprises will typically
have multiple AWS IAM user accounts for different business groups.
The BlueData EPIC controller must be activated via a license key.
The unique controller ID (available in the UI) must be provided to
BlueData Support in order to obtain the license key.
If deploying into an existing Amazon VPC:
•	 The subnet for site-to-site VPN (if used) must be provided;
•	 The security group for the subnet must permit the BlueData EPIC
controller instance to access the internet temporarily, in order
to finish deployment of the stack.
•	 Amazon S3 endpoints must be configured for the subnet so S3
buckets may be accessed by BlueData-provisioned clusters.
To ensure optimal performance when tapping into on-premises
storage, Amazon Direct Connect and/or a site-to-site VPN between
the Amazon VPC and your site is recommended.
BlueData EPIC is currently available in all North America AWS regions;
other regions may be available upon request. The BlueData EPIC
controller cannot manage instances across different AWS regions.
Pricing
BlueData EPIC on AWS starts at $499 per month for 15 managed
Amazon EC2 instances, including standard support and training.
Premium support is available at 20% of the monthly license.
Application Flexibility and Choice
BlueData EPIC for AWS offers a pre-populated App Store with
Docker-based application images for multiple Big Data frameworks
and tools. You can install these images within minutes and make
them available to your end users to spin up their own clusters—
eliminating the complexity and time associated with configuring and
tuning each of these applications to run on AWS. Each cluster runs
in an embedded and fully-managed Docker container, providing
abstraction from the underlying AWS infrastructure.
The standard App Store includes the following ready-to-run images:
•	 Cloudera CDH 5.7 with Cloudera Manager (CDH 5.8 and 5.9 are
available upon request)
•	 Hortonworks HDP 2.4 with Ambari 2.2
•	 MapR 5.1 with MapR Control System
•	 Apache Spark 2.0.1 with Zeppelin Notebook
•	 Apache Spark 1.6.1 with Job Server and Zeppelin Notebook
•	 DataStax Distribution of Apache Cassandra 3.9
•	 DataStax Distribution of Apache Cassandra 2.1.10
•	 Apache Kafka 0.9.0.1
•	 Ubuntu 16.04 (Xenial Xerus)
•	 CentOS 7 / CentOS 6
•	 RHEL 7 / RHEL 6
These same images can be used on AWS or for on-premises
deployments of the BlueData EPIC platform, providing portability
and a common user experience in a hybrid architecture. In addition
to the above, pre-configured images for other Big Data applications
(e.g. Splunk) and tools (e.g. Jupyter Notebook) may be available
upon request.
BlueData EPIC also provides an App Workbench that allows your
administers to easily modify and update these images (e.g. to
upgrade to the latest version) within your own App Store—or create
new Docker-based application images for other Big Data frameworks
and tools preferred by your data scientists, developers, and analysts.
www.bluedata.com
Sign up for a free two-week trial at bluedata.com/aws
Cost Savings, Visibility, and Control
Administrators can segregate different teams or user groups as
separate tenants, with visibility into their usage and costs. Each
tenant can be assigned resource quotas to limit their CPU, memory,
and storage usage; you can also set restrictions on which Amazon
EC2 instances are available for these users.
To control AWS costs, your users can stop a cluster when it’s no
longer in use; they can easily restart the cluster later with the same
metadata, data, and networking configurations.
You can also drill down into the usage and billing by tenant, using
standard AWS Cost Explorer reports; all Amazon EC2 instances
launched by BlueData EPIC are tagged with the tenant name and
tenant ID.
Authentication, Security, and User Management
You can deploy BlueData EPIC into your existing Amazon VPC,
ensuring logical isolation for your EC2 instances from other virtual
networks in the AWS cloud. And you can easily integrate with your
existing enterprise security model through Active Directory/LDAP
integration available within BlueData EPIC.
You can also automatically associate an AWS user account for each
tenant. This eliminates the time and potential errors of manually
configuring this for every cluster in the tenant, and controls their
access to data in Amazon S3.
And you automate the Kerberos configuration setup for each cluster,
ensuring security and data governance.
Access to Amazon S3 and On-Premises Storage
With BlueData EPIC, you can tap into data stored on your
on-premises HDFS and network file systems—using our
proprietary DataTap™ functionality. This means users can analyze
data either in Amazon S3 or on-premises, avoiding the time
and cost of moving data to AWS. BlueData is the only platform
for Big-Data-as-a-Service that supports access to both S3 and
cloud storage for Big Data analytics.
© 2016 bluedata
Ad

More Related Content

What's hot (18)

Spark Infrastructure Made Easy
Spark Infrastructure Made EasySpark Infrastructure Made Easy
Spark Infrastructure Made Easy
BlueData, Inc.
 
Logical Data Warehouse: How to Build a Virtualized Data Services Layer
Logical Data Warehouse: How to Build a Virtualized Data Services LayerLogical Data Warehouse: How to Build a Virtualized Data Services Layer
Logical Data Warehouse: How to Build a Virtualized Data Services Layer
DataWorks Summit
 
Implementing SharePoint on Azure, Lessons Learnt from a Real World Project
Implementing SharePoint on Azure, Lessons Learnt from a Real World ProjectImplementing SharePoint on Azure, Lessons Learnt from a Real World Project
Implementing SharePoint on Azure, Lessons Learnt from a Real World Project
K.Mohamed Faizal
 
Azure Hd insigth news
Azure Hd insigth newsAzure Hd insigth news
Azure Hd insigth news
nnakasone
 
Hadoop on Docker
Hadoop on DockerHadoop on Docker
Hadoop on Docker
Rakesh Saha
 
Hadoop Virtualization - Intel White Paper
Hadoop Virtualization - Intel White PaperHadoop Virtualization - Intel White Paper
Hadoop Virtualization - Intel White Paper
BlueData, Inc.
 
Google Cloud Platform Certification Cloud Architect Exam Prep Review Virtual ...
Google Cloud Platform Certification Cloud Architect Exam Prep Review Virtual ...Google Cloud Platform Certification Cloud Architect Exam Prep Review Virtual ...
Google Cloud Platform Certification Cloud Architect Exam Prep Review Virtual ...
Joseph Holbrook, Chief Learning Officer (CLO)
 
Enabling the Active Data Warehouse with Apache Kudu
Enabling the Active Data Warehouse with Apache KuduEnabling the Active Data Warehouse with Apache Kudu
Enabling the Active Data Warehouse with Apache Kudu
Grant Henke
 
Power of OpenStack & Hadoop
Power of OpenStack & HadoopPower of OpenStack & Hadoop
Power of OpenStack & Hadoop
Tuan Yang
 
Hadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural PatternsHadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural Patterns
DataWorks Summit
 
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data MeetupOne Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data Meetup
Andrei Savu
 
BlueData EPIC 2.0 Overview
BlueData EPIC 2.0 OverviewBlueData EPIC 2.0 Overview
BlueData EPIC 2.0 Overview
BlueData, Inc.
 
Spark and Couchbase– Augmenting the Operational Database with Spark
Spark and Couchbase– Augmenting the Operational Database with SparkSpark and Couchbase– Augmenting the Operational Database with Spark
Spark and Couchbase– Augmenting the Operational Database with Spark
Matt Ingenthron
 
Apache Accumulo Overview
Apache Accumulo OverviewApache Accumulo Overview
Apache Accumulo Overview
Bill Havanki
 
Introduction to AWS Services
Introduction to AWS ServicesIntroduction to AWS Services
Introduction to AWS Services
Klearchos Klearchou
 
Camel Riders in the Cloud
Camel Riders in the CloudCamel Riders in the Cloud
Camel Riders in the Cloud
Red Hat Developers
 
Running High Availability Websites with Acquia and AWS
Running High Availability Websites with Acquia and AWSRunning High Availability Websites with Acquia and AWS
Running High Availability Websites with Acquia and AWS
Acquia
 
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Cloudera, Inc.
 
Spark Infrastructure Made Easy
Spark Infrastructure Made EasySpark Infrastructure Made Easy
Spark Infrastructure Made Easy
BlueData, Inc.
 
Logical Data Warehouse: How to Build a Virtualized Data Services Layer
Logical Data Warehouse: How to Build a Virtualized Data Services LayerLogical Data Warehouse: How to Build a Virtualized Data Services Layer
Logical Data Warehouse: How to Build a Virtualized Data Services Layer
DataWorks Summit
 
Implementing SharePoint on Azure, Lessons Learnt from a Real World Project
Implementing SharePoint on Azure, Lessons Learnt from a Real World ProjectImplementing SharePoint on Azure, Lessons Learnt from a Real World Project
Implementing SharePoint on Azure, Lessons Learnt from a Real World Project
K.Mohamed Faizal
 
Azure Hd insigth news
Azure Hd insigth newsAzure Hd insigth news
Azure Hd insigth news
nnakasone
 
Hadoop on Docker
Hadoop on DockerHadoop on Docker
Hadoop on Docker
Rakesh Saha
 
Hadoop Virtualization - Intel White Paper
Hadoop Virtualization - Intel White PaperHadoop Virtualization - Intel White Paper
Hadoop Virtualization - Intel White Paper
BlueData, Inc.
 
Enabling the Active Data Warehouse with Apache Kudu
Enabling the Active Data Warehouse with Apache KuduEnabling the Active Data Warehouse with Apache Kudu
Enabling the Active Data Warehouse with Apache Kudu
Grant Henke
 
Power of OpenStack & Hadoop
Power of OpenStack & HadoopPower of OpenStack & Hadoop
Power of OpenStack & Hadoop
Tuan Yang
 
Hadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural PatternsHadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural Patterns
DataWorks Summit
 
One Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data MeetupOne Hadoop, Multiple Clouds - NYC Big Data Meetup
One Hadoop, Multiple Clouds - NYC Big Data Meetup
Andrei Savu
 
BlueData EPIC 2.0 Overview
BlueData EPIC 2.0 OverviewBlueData EPIC 2.0 Overview
BlueData EPIC 2.0 Overview
BlueData, Inc.
 
Spark and Couchbase– Augmenting the Operational Database with Spark
Spark and Couchbase– Augmenting the Operational Database with SparkSpark and Couchbase– Augmenting the Operational Database with Spark
Spark and Couchbase– Augmenting the Operational Database with Spark
Matt Ingenthron
 
Apache Accumulo Overview
Apache Accumulo OverviewApache Accumulo Overview
Apache Accumulo Overview
Bill Havanki
 
Running High Availability Websites with Acquia and AWS
Running High Availability Websites with Acquia and AWSRunning High Availability Websites with Acquia and AWS
Running High Availability Websites with Acquia and AWS
Acquia
 
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Hive, Impala, and Spark, Oh My: SQL-on-Hadoop in Cloudera 5.5
Cloudera, Inc.
 

Viewers also liked (6)

BlueData DataSheet
BlueData DataSheetBlueData DataSheet
BlueData DataSheet
Greg Kirchoff
 
BlueData Hunk Integration: Splunk Analytics for Hadoop
BlueData Hunk Integration: Splunk Analytics for HadoopBlueData Hunk Integration: Splunk Analytics for Hadoop
BlueData Hunk Integration: Splunk Analytics for Hadoop
BlueData, Inc.
 
How to deploy Apache Spark in a multi-tenant, on-premises environment
How to deploy Apache Spark in a multi-tenant, on-premises environmentHow to deploy Apache Spark in a multi-tenant, on-premises environment
How to deploy Apache Spark in a multi-tenant, on-premises environment
BlueData, Inc.
 
End-to-End Security and Auditing in a Big Data as a Service Deployment
End-to-End Security and Auditing in a Big Data as a Service DeploymentEnd-to-End Security and Auditing in a Big Data as a Service Deployment
End-to-End Security and Auditing in a Big Data as a Service Deployment
DataWorks Summit/Hadoop Summit
 
Big Data Case Study: Fortune 100 Telco
Big Data Case Study: Fortune 100 TelcoBig Data Case Study: Fortune 100 Telco
Big Data Case Study: Fortune 100 Telco
BlueData, Inc.
 
Bare-metal performance for Big Data workloads on Docker containers
Bare-metal performance for Big Data workloads on Docker containersBare-metal performance for Big Data workloads on Docker containers
Bare-metal performance for Big Data workloads on Docker containers
BlueData, Inc.
 
BlueData Hunk Integration: Splunk Analytics for Hadoop
BlueData Hunk Integration: Splunk Analytics for HadoopBlueData Hunk Integration: Splunk Analytics for Hadoop
BlueData Hunk Integration: Splunk Analytics for Hadoop
BlueData, Inc.
 
How to deploy Apache Spark in a multi-tenant, on-premises environment
How to deploy Apache Spark in a multi-tenant, on-premises environmentHow to deploy Apache Spark in a multi-tenant, on-premises environment
How to deploy Apache Spark in a multi-tenant, on-premises environment
BlueData, Inc.
 
End-to-End Security and Auditing in a Big Data as a Service Deployment
End-to-End Security and Auditing in a Big Data as a Service DeploymentEnd-to-End Security and Auditing in a Big Data as a Service Deployment
End-to-End Security and Auditing in a Big Data as a Service Deployment
DataWorks Summit/Hadoop Summit
 
Big Data Case Study: Fortune 100 Telco
Big Data Case Study: Fortune 100 TelcoBig Data Case Study: Fortune 100 Telco
Big Data Case Study: Fortune 100 Telco
BlueData, Inc.
 
Bare-metal performance for Big Data workloads on Docker containers
Bare-metal performance for Big Data workloads on Docker containersBare-metal performance for Big Data workloads on Docker containers
Bare-metal performance for Big Data workloads on Docker containers
BlueData, Inc.
 
Ad

Similar to BlueData EPIC on AWS - Spec Sheet (20)

Azure bootcamp (1)
Azure bootcamp (1)Azure bootcamp (1)
Azure bootcamp (1)
AmnaHussain26
 
AWS Summit 2018 Summary
AWS Summit 2018 SummaryAWS Summit 2018 Summary
AWS Summit 2018 Summary
Ashish Mrig
 
Azure diario de abordo
Azure diario de abordoAzure diario de abordo
Azure diario de abordo
José Ángel Bolaño Rucabado
 
Cloudformation & VPC, EC2, RDS
Cloudformation & VPC, EC2, RDSCloudformation & VPC, EC2, RDS
Cloudformation & VPC, EC2, RDS
Can Abacıgil
 
Aws coi7
Aws coi7Aws coi7
Aws coi7
Jeevan Dongre
 
Citrix cloud platform 4.2 data sheet
Citrix cloud platform 4.2 data sheetCitrix cloud platform 4.2 data sheet
Citrix cloud platform 4.2 data sheet
Nuno Alves
 
Amazon Web Service.pdf
Amazon Web Service.pdfAmazon Web Service.pdf
Amazon Web Service.pdf
Pyingkodi Maran
 
Case study on Cloud Platforms
Case study on Cloud PlatformsCase study on Cloud Platforms
Case study on Cloud Platforms
nik_053
 
Azure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish KalamatiAzure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish Kalamati
Girish Kalamati
 
Wi t containerizemicroservices
Wi t containerizemicroservicesWi t containerizemicroservices
Wi t containerizemicroservices
Dipali Kulshrestha
 
Aws certified solutions architect
Aws certified solutions architectAws certified solutions architect
Aws certified solutions architect
Syed Measum Haider Bokhari
 
AWS re:Invent re:Cap 2015
AWS re:Invent re:Cap 2015AWS re:Invent re:Cap 2015
AWS re:Invent re:Cap 2015
Mark Bate
 
Aws overview (Amazon Web Services)
Aws overview (Amazon Web Services)Aws overview (Amazon Web Services)
Aws overview (Amazon Web Services)
Jatinder Randhawa
 
Hybridní cloud s F5 v prostředí kontejnerů
Hybridní cloud s F5 v prostředí kontejnerůHybridní cloud s F5 v prostředí kontejnerů
Hybridní cloud s F5 v prostředí kontejnerů
MarketingArrowECS_CZ
 
Introduction to Microsoft Azure 101
Introduction to Microsoft Azure 101Introduction to Microsoft Azure 101
Introduction to Microsoft Azure 101
R M Shahidul Islam Shahed
 
Scaling drupal horizontally and in cloud
Scaling drupal horizontally and in cloudScaling drupal horizontally and in cloud
Scaling drupal horizontally and in cloud
Vladimir Ilic
 
Aws101
Aws101Aws101
Aws101
Shaimaa Esmaeil
 
Aws class demo
Aws class demoAws class demo
Aws class demo
Animesh Roy
 
Azure Cloud complete administration document
Azure Cloud complete administration documentAzure Cloud complete administration document
Azure Cloud complete administration document
sanjeeva11
 
Aws interview questions and answers
Aws interview questions and answersAws interview questions and answers
Aws interview questions and answers
kavinilavuG
 
AWS Summit 2018 Summary
AWS Summit 2018 SummaryAWS Summit 2018 Summary
AWS Summit 2018 Summary
Ashish Mrig
 
Cloudformation & VPC, EC2, RDS
Cloudformation & VPC, EC2, RDSCloudformation & VPC, EC2, RDS
Cloudformation & VPC, EC2, RDS
Can Abacıgil
 
Citrix cloud platform 4.2 data sheet
Citrix cloud platform 4.2 data sheetCitrix cloud platform 4.2 data sheet
Citrix cloud platform 4.2 data sheet
Nuno Alves
 
Case study on Cloud Platforms
Case study on Cloud PlatformsCase study on Cloud Platforms
Case study on Cloud Platforms
nik_053
 
Azure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish KalamatiAzure from scratch part 3 By Girish Kalamati
Azure from scratch part 3 By Girish Kalamati
Girish Kalamati
 
Wi t containerizemicroservices
Wi t containerizemicroservicesWi t containerizemicroservices
Wi t containerizemicroservices
Dipali Kulshrestha
 
AWS re:Invent re:Cap 2015
AWS re:Invent re:Cap 2015AWS re:Invent re:Cap 2015
AWS re:Invent re:Cap 2015
Mark Bate
 
Aws overview (Amazon Web Services)
Aws overview (Amazon Web Services)Aws overview (Amazon Web Services)
Aws overview (Amazon Web Services)
Jatinder Randhawa
 
Hybridní cloud s F5 v prostředí kontejnerů
Hybridní cloud s F5 v prostředí kontejnerůHybridní cloud s F5 v prostředí kontejnerů
Hybridní cloud s F5 v prostředí kontejnerů
MarketingArrowECS_CZ
 
Scaling drupal horizontally and in cloud
Scaling drupal horizontally and in cloudScaling drupal horizontally and in cloud
Scaling drupal horizontally and in cloud
Vladimir Ilic
 
Azure Cloud complete administration document
Azure Cloud complete administration documentAzure Cloud complete administration document
Azure Cloud complete administration document
sanjeeva11
 
Aws interview questions and answers
Aws interview questions and answersAws interview questions and answers
Aws interview questions and answers
kavinilavuG
 
Ad

More from BlueData, Inc. (12)

Introduction to KubeDirector - SF Kubernetes Meetup
Introduction to KubeDirector - SF Kubernetes MeetupIntroduction to KubeDirector - SF Kubernetes Meetup
Introduction to KubeDirector - SF Kubernetes Meetup
BlueData, Inc.
 
Dell EMC Ready Solutions for Big Data
Dell EMC Ready Solutions for Big DataDell EMC Ready Solutions for Big Data
Dell EMC Ready Solutions for Big Data
BlueData, Inc.
 
BlueData and Hortonworks Data Platform (HDP)
BlueData and Hortonworks Data Platform (HDP)BlueData and Hortonworks Data Platform (HDP)
BlueData and Hortonworks Data Platform (HDP)
BlueData, Inc.
 
How to Protect Big Data in a Containerized Environment
How to Protect Big Data in a Containerized EnvironmentHow to Protect Big Data in a Containerized Environment
How to Protect Big Data in a Containerized Environment
BlueData, Inc.
 
BlueData EPIC datasheet (en Français)
BlueData EPIC datasheet (en Français)BlueData EPIC datasheet (en Français)
BlueData EPIC datasheet (en Français)
BlueData, Inc.
 
Best Practices for Running Kafka on Docker Containers
Best Practices for Running Kafka on Docker ContainersBest Practices for Running Kafka on Docker Containers
Best Practices for Running Kafka on Docker Containers
BlueData, Inc.
 
Lessons Learned from Dockerizing Spark Workloads
Lessons Learned from Dockerizing Spark WorkloadsLessons Learned from Dockerizing Spark Workloads
Lessons Learned from Dockerizing Spark Workloads
BlueData, Inc.
 
Lessons Learned Running Hadoop and Spark in Docker Containers
Lessons Learned Running Hadoop and Spark in Docker ContainersLessons Learned Running Hadoop and Spark in Docker Containers
Lessons Learned Running Hadoop and Spark in Docker Containers
BlueData, Inc.
 
The Time Has Come for Big-Data-as-a-Service
The Time Has Come for Big-Data-as-a-ServiceThe Time Has Come for Big-Data-as-a-Service
The Time Has Come for Big-Data-as-a-Service
BlueData, Inc.
 
Solution Brief: Real-Time Pipeline Accelerator
Solution Brief: Real-Time Pipeline AcceleratorSolution Brief: Real-Time Pipeline Accelerator
Solution Brief: Real-Time Pipeline Accelerator
BlueData, Inc.
 
Solution Brief: Big Data Lab Accelerator
Solution Brief: Big Data Lab AcceleratorSolution Brief: Big Data Lab Accelerator
Solution Brief: Big Data Lab Accelerator
BlueData, Inc.
 
BlueData Integration with Cloudera Manager
BlueData Integration with Cloudera ManagerBlueData Integration with Cloudera Manager
BlueData Integration with Cloudera Manager
BlueData, Inc.
 
Introduction to KubeDirector - SF Kubernetes Meetup
Introduction to KubeDirector - SF Kubernetes MeetupIntroduction to KubeDirector - SF Kubernetes Meetup
Introduction to KubeDirector - SF Kubernetes Meetup
BlueData, Inc.
 
Dell EMC Ready Solutions for Big Data
Dell EMC Ready Solutions for Big DataDell EMC Ready Solutions for Big Data
Dell EMC Ready Solutions for Big Data
BlueData, Inc.
 
BlueData and Hortonworks Data Platform (HDP)
BlueData and Hortonworks Data Platform (HDP)BlueData and Hortonworks Data Platform (HDP)
BlueData and Hortonworks Data Platform (HDP)
BlueData, Inc.
 
How to Protect Big Data in a Containerized Environment
How to Protect Big Data in a Containerized EnvironmentHow to Protect Big Data in a Containerized Environment
How to Protect Big Data in a Containerized Environment
BlueData, Inc.
 
BlueData EPIC datasheet (en Français)
BlueData EPIC datasheet (en Français)BlueData EPIC datasheet (en Français)
BlueData EPIC datasheet (en Français)
BlueData, Inc.
 
Best Practices for Running Kafka on Docker Containers
Best Practices for Running Kafka on Docker ContainersBest Practices for Running Kafka on Docker Containers
Best Practices for Running Kafka on Docker Containers
BlueData, Inc.
 
Lessons Learned from Dockerizing Spark Workloads
Lessons Learned from Dockerizing Spark WorkloadsLessons Learned from Dockerizing Spark Workloads
Lessons Learned from Dockerizing Spark Workloads
BlueData, Inc.
 
Lessons Learned Running Hadoop and Spark in Docker Containers
Lessons Learned Running Hadoop and Spark in Docker ContainersLessons Learned Running Hadoop and Spark in Docker Containers
Lessons Learned Running Hadoop and Spark in Docker Containers
BlueData, Inc.
 
The Time Has Come for Big-Data-as-a-Service
The Time Has Come for Big-Data-as-a-ServiceThe Time Has Come for Big-Data-as-a-Service
The Time Has Come for Big-Data-as-a-Service
BlueData, Inc.
 
Solution Brief: Real-Time Pipeline Accelerator
Solution Brief: Real-Time Pipeline AcceleratorSolution Brief: Real-Time Pipeline Accelerator
Solution Brief: Real-Time Pipeline Accelerator
BlueData, Inc.
 
Solution Brief: Big Data Lab Accelerator
Solution Brief: Big Data Lab AcceleratorSolution Brief: Big Data Lab Accelerator
Solution Brief: Big Data Lab Accelerator
BlueData, Inc.
 
BlueData Integration with Cloudera Manager
BlueData Integration with Cloudera ManagerBlueData Integration with Cloudera Manager
BlueData Integration with Cloudera Manager
BlueData, Inc.
 

Recently uploaded (20)

chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
Ch3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendencyCh3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendency
ayeleasefa2
 
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
ThanushsaranS
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
computer organization and assembly language.docx
computer organization and assembly language.docxcomputer organization and assembly language.docx
computer organization and assembly language.docx
alisoftwareengineer1
 
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptxmd-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
fatimalazaar2004
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
Data Analytics Overview and its applications
Data Analytics Overview and its applicationsData Analytics Overview and its applications
Data Analytics Overview and its applications
JanmejayaMishra7
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
FPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptxFPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptx
ssuser4ef83d
 
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
gmuir1066
 
Classification_in_Machinee_Learning.pptx
Classification_in_Machinee_Learning.pptxClassification_in_Machinee_Learning.pptx
Classification_in_Machinee_Learning.pptx
wencyjorda88
 
GenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.aiGenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.ai
Inspirient
 
Medical Dataset including visualizations
Medical Dataset including visualizationsMedical Dataset including visualizations
Medical Dataset including visualizations
vishrut8750588758
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136
illuminati Agent uganda call+256776963507/0741506136
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
Ch3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendencyCh3MCT24.pptx measure of central tendency
Ch3MCT24.pptx measure of central tendency
ayeleasefa2
 
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
ThanushsaranS
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
computer organization and assembly language.docx
computer organization and assembly language.docxcomputer organization and assembly language.docx
computer organization and assembly language.docx
alisoftwareengineer1
 
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptxmd-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
fatimalazaar2004
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
Data Analytics Overview and its applications
Data Analytics Overview and its applicationsData Analytics Overview and its applications
Data Analytics Overview and its applications
JanmejayaMishra7
 
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbbEDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
EDU533 DEMO.pptxccccvbnjjkoo jhgggggbbbb
JessaMaeEvangelista2
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
FPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptxFPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptx
ssuser4ef83d
 
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
Adobe Analytics NOAM Central User Group April 2025 Agent AI: Uncovering the S...
gmuir1066
 
Classification_in_Machinee_Learning.pptx
Classification_in_Machinee_Learning.pptxClassification_in_Machinee_Learning.pptx
Classification_in_Machinee_Learning.pptx
wencyjorda88
 
GenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.aiGenAI for Quant Analytics: survey-analytics.ai
GenAI for Quant Analytics: survey-analytics.ai
Inspirient
 
Medical Dataset including visualizations
Medical Dataset including visualizationsMedical Dataset including visualizations
Medical Dataset including visualizations
vishrut8750588758
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 

BlueData EPIC on AWS - Spec Sheet

  • 1. BlueData EPIC™ on AWS Big Data Analytics on AWS The BlueData EPIC (Elastic Private Instant Clusters) software platform makes deployment of Big Data infrastructure and applications easier, faster, and more cost-effective—whether on-premises or on the public cloud. With BlueData EPIC on AWS, you can quickly and easily deploy your preferred Big Data applications, distributions, and tools; leverage enterprise-class security and cost controls for multi- tenant deployments on the Amazon cloud; and tap into both Amazon S3 and on-premises storage for your Big Data analytics. Key Features The BlueData EPIC software platform enables simplified, consistent, and repeatable deployment of multiple Big Data applications and tools in a multi-tenant architecture. BlueData EPIC on AWS provides the ability to spin up your own unique Big Data environments for a range of different data science and analytics use cases within minutes—without requiring DevOps or AWS-specific skills. BlueData EPIC delivers a single unified Big-Data-as-a-Service platform to build, run, and manage a variety of different Big Data frameworks including Hadoop, Spark, Kafka, Cassandra, and more on AWS—instead of managing each deployment with different processes and tools. This common platform minimizes the refactoring work required for Big Data developers and data science teams to get up and running with their preferred Big Data applications and tools on AWS— along with the flexibility to quickly and easily upgrade to the latest releases and new versions. DevOps Agility for Big Data BlueData EPIC software is deployed into your AWS account with a streamlined, wizard-based process that takes only 20-25 minutes. The EPIC software is automatically installed in a single AWS controller instance within a private Amazon VPC subnet or (optionally) a new VPC subnet. Once installed, BlueData EPIC provides a simple and easy-to-use interface that abstracts both administrators and end users from the inner-workings of AWS. As an administrator, you can quickly and easily onboard one or more user groups (i.e. tenants) and assign them different AWS usage quotas in a multi-tenant environment. Your end users can then log into BlueData EPIC’s self-service interface to spin up instant clusters on Amazon EC2 and scale up or scale down in just a few mouse click—with built-in elasticity for all cluster types. Amazon S3 connectivity is pre-integrated into all clusters, using tenant-level access credentials. Ultimate flexibility and choice for Big-Data-as-a-Service on Amazon Web Services www.bluedata.com Spec Sheet BENEFITS Simplified user experience for both administrators and data science teams, abstracting the AWS-specific infrastructure so they can focus on their Big Data needs. Faster AWS onboarding for multiple teams and Big Data workloads, eliminating the need for DevOps expertise and reducing the cost and time involved. Greater agility and flexibility, with self-service clusters pre-configured on Amazon EC2 for Spark, Hadoop, Kafka, Cassandra, and other Big Data applications. Reduced AWS costs through the use of fine-grained resource quotas, start/stop controls, and cost reporting in a multi-tenant environment. Faster time to insights with pre-built cluster integrations to Amazon S3 and in-place analytics against on-premises data. Improved data governance with integrations to Amazon VPC (including site-to-site VPN), Active Directory, and Kerberos for authentication. on
  • 2. Minimum Requirements An AWS account is required for BlueData EPIC on AWS. The one time setup of the BlueData EPIC controller requires an AWS IAM user account with the following AWS permissions: • AmazonEC2FullAccess • AmazonS3FullAccess • AmazonVPCFullAccess • AWS CloudFormation (ability to “Create Stack”, “Describe”, “List”, “Get”, “Delete”, “Validate”) An individual “root” (i.e. personal AWS user account) will need to have each of the above privileges. Larger enterprises will typically have multiple AWS IAM user accounts for different business groups. The BlueData EPIC controller must be activated via a license key. The unique controller ID (available in the UI) must be provided to BlueData Support in order to obtain the license key. If deploying into an existing Amazon VPC: • The subnet for site-to-site VPN (if used) must be provided; • The security group for the subnet must permit the BlueData EPIC controller instance to access the internet temporarily, in order to finish deployment of the stack. • Amazon S3 endpoints must be configured for the subnet so S3 buckets may be accessed by BlueData-provisioned clusters. To ensure optimal performance when tapping into on-premises storage, Amazon Direct Connect and/or a site-to-site VPN between the Amazon VPC and your site is recommended. BlueData EPIC is currently available in all North America AWS regions; other regions may be available upon request. The BlueData EPIC controller cannot manage instances across different AWS regions. Pricing BlueData EPIC on AWS starts at $499 per month for 15 managed Amazon EC2 instances, including standard support and training. Premium support is available at 20% of the monthly license. Application Flexibility and Choice BlueData EPIC for AWS offers a pre-populated App Store with Docker-based application images for multiple Big Data frameworks and tools. You can install these images within minutes and make them available to your end users to spin up their own clusters— eliminating the complexity and time associated with configuring and tuning each of these applications to run on AWS. Each cluster runs in an embedded and fully-managed Docker container, providing abstraction from the underlying AWS infrastructure. The standard App Store includes the following ready-to-run images: • Cloudera CDH 5.7 with Cloudera Manager (CDH 5.8 and 5.9 are available upon request) • Hortonworks HDP 2.4 with Ambari 2.2 • MapR 5.1 with MapR Control System • Apache Spark 2.0.1 with Zeppelin Notebook • Apache Spark 1.6.1 with Job Server and Zeppelin Notebook • DataStax Distribution of Apache Cassandra 3.9 • DataStax Distribution of Apache Cassandra 2.1.10 • Apache Kafka 0.9.0.1 • Ubuntu 16.04 (Xenial Xerus) • CentOS 7 / CentOS 6 • RHEL 7 / RHEL 6 These same images can be used on AWS or for on-premises deployments of the BlueData EPIC platform, providing portability and a common user experience in a hybrid architecture. In addition to the above, pre-configured images for other Big Data applications (e.g. Splunk) and tools (e.g. Jupyter Notebook) may be available upon request. BlueData EPIC also provides an App Workbench that allows your administers to easily modify and update these images (e.g. to upgrade to the latest version) within your own App Store—or create new Docker-based application images for other Big Data frameworks and tools preferred by your data scientists, developers, and analysts. www.bluedata.com Sign up for a free two-week trial at bluedata.com/aws Cost Savings, Visibility, and Control Administrators can segregate different teams or user groups as separate tenants, with visibility into their usage and costs. Each tenant can be assigned resource quotas to limit their CPU, memory, and storage usage; you can also set restrictions on which Amazon EC2 instances are available for these users. To control AWS costs, your users can stop a cluster when it’s no longer in use; they can easily restart the cluster later with the same metadata, data, and networking configurations. You can also drill down into the usage and billing by tenant, using standard AWS Cost Explorer reports; all Amazon EC2 instances launched by BlueData EPIC are tagged with the tenant name and tenant ID. Authentication, Security, and User Management You can deploy BlueData EPIC into your existing Amazon VPC, ensuring logical isolation for your EC2 instances from other virtual networks in the AWS cloud. And you can easily integrate with your existing enterprise security model through Active Directory/LDAP integration available within BlueData EPIC. You can also automatically associate an AWS user account for each tenant. This eliminates the time and potential errors of manually configuring this for every cluster in the tenant, and controls their access to data in Amazon S3. And you automate the Kerberos configuration setup for each cluster, ensuring security and data governance. Access to Amazon S3 and On-Premises Storage With BlueData EPIC, you can tap into data stored on your on-premises HDFS and network file systems—using our proprietary DataTap™ functionality. This means users can analyze data either in Amazon S3 or on-premises, avoiding the time and cost of moving data to AWS. BlueData is the only platform for Big-Data-as-a-Service that supports access to both S3 and cloud storage for Big Data analytics. © 2016 bluedata