SlideShare a Scribd company logo
Page1 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
What’s New in Ambari?
June 2015
Yusaku Sako @ Hortonworks (Ambari PMC Chair)
Sumit Mohanty @ Hortonworks (Ambari PMC)
Page2 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
What’s Apache Ambari?
100% open-source
platform for simplifying
Hadoop cluster
management and use.
Highly extensible.
Page3 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Open Source Activity
Page4 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Inception: AMBARI-1 (Sept, 2011)
Page5 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Fast forward 4 years to today… (June, 2015)
• Latest JIRA: AMBARI-11854
• 100+ Contributors
• 50 Committers
• ~12k JIRAs filed
• ~11k JIRAs resolved
At 1.5 day per JIRA -> 66 person years! (probably more)
• Used by hundreds of companies
Page6 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Ambari – 4th Biggest Project* @ Apache
* Based on total JIRAs filed on a project basis out of 162 projects as of June 10, 2015
#2: Hadoop at ~28k as it is split across multiple JIRA Projects
#1
#3
#4
#5
Page7 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Timeline: Past 1 Year
Ambari 1.6.*
May 2014
907 JIRAs
Ambari 1.5.*
Apr 2014
1218 JIRAs
Ambari 1.7.*
Dec 2014
1620 JIRAs
Ambari 2.0.*
April 2015
1784 JIRAs
Current GA Version (2.0.1)
Ambari 2.1
Coming Soon
1520+ JIRAs
Focus of today’s talk
Resolution of 7k+ JIRAs
Page8 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
What’s new?
• Rolling Upgrade
• Alerts
• Metrics
• Enhanced Dashboard
• Smart Configurations
• Views
• Kerberos Automation
• Blueprints
Page9 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade
Page10 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade of Stack
Side-by-Side Bits and Configs
Bits:
/usr/hdp/2.2.0.0-2041
/usr/hdp/2.2.4.2-2
/usr/hdp/2.3.0.0-3000
Configs:
/etc/hive/conf/ (initial)
/etc/hive/conf/v0 (HDP 2.2.4.2)
/etc/hive/conf/v1 (HDP 2.3)
2.2.0.0 2.2.4.2 2.3.0.0minor jump major jump
Page11 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Manage Versions
Install bits in parallel on all agents
No down-time
Page12 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Orchestration
Not necessarily “one-click” but fully guided
Services are up the entire time
Upgrade one component at a time
Robust and fault-tolerant
Service-checks performed throughout
2.3.0.0-2283 2.3.0.0-2283
Page13 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Upgrade Catalog
Grouping and order
Page14 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Upgrade Catalog
Run custom scripts (python and Server-side)
Page15 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Upgrade Catalog
Mark steps are skippable, retryable
All service checks are skippable, all steps retryable
Page16 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Upgrade Catalog
Set, move, delete, transform configurations
Page17 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Rolling Upgrade – Downgrade
Page18 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Alerts
Page19 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Alert – Types
Type Description Status
Thresholds
Configurable?
PORT
Watches a port based on a configuration property such as
the URI.
OK, WARN, CRIT Yes (seconds)
WEB
Watches an HTTP or HTTPS endpoint and determines
connectivity and HTTP status code.
OK, WARN, CRIT No
AGGREGATE Aggregate of status for another alert definition. OK, WARN, CRIT Yes (percentage)
METRIC
Watches a metric or series of metrics in JMX and compares
a mathematical result against a threshold.
OK, WARN, CRIT Yes (variable)
SCRIPT Uses a custom script to handle checking. OK or CRIT No
Page20 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
UI – Current Alerts
Configured by default; managed via the the web client
Page21 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
UI – Host Alerts
Automatically refreshes
Query alert history
Page22 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
UI– Customization & Instances
Status text, thresholds, and interval
Page23 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Metrics
Page24 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Ambari Metrics Service (AMS) - Goals
Ability to collect metrics from Hadoop and other Stack services
Ability to retain metrics at a high precision for a configurable time period
Ability to automatically purge metrics after retention period
At collection time, provide clear integration point for external system
At purge time, provide clear integration point for metrics retention by
external system
Should provide default options for external metrics retention
Provide tools / utilities for analyzing metrics in retention system
Page25 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Aggregators
Metrics Collector
HTTP REST endpoint
Metrics API
Query Layer
HBASE
Phoenix server
Phoenix client
Namenode
Datanode
Nodemanager
Regionserver
Nimbus
Flume Agent
Kafka worker
Metrics Sinks Metrics Monitor
AMBARI
DashboardsViews REST API
Ambari Metrics System - Architecture
Page26 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Sample Stats
Total number of raw uncompressed Hadoop metrics written per day on a 300 node cluster =
100 GB
Rows in Phoenix table ~ 100 million
Raw query time: 500 rows selected (1.923 seconds)
Aggregate query time: 204 rows selected (0.19 seconds)
SELECT METRIC_NAME, APP_ID, INSTANCE_ID, TIMESTAMP, METRIC_SUM, HOSTS_COUNT, METRIC_MAX,
METRIC_MIN FROM METRIC_AGGREGATE WHERE METRIC_NAME IN ('dfs.datanode.BytesWritten',
'dfs.datanode.BytesRead') AND APP_ID = 'datanode' AND TIMESTAMP >= 1409770831000 and TIMESTAMP <
1409774431000;
SELECT METRIC_NAME, HOSTNAME, APP_ID, INSTANCE_ID, START_TIME, METRICS FROM METRIC_RECORD
WHERE METRIC_NAME IN ('dfs.datanode.BytesWritten','dfs.datanode.BytesRead') AND APP_ID = 'datanode'
AND START_TIME >= 1409770831000 AND START_TIME < 1409774431000 ORDER BY METRIC_NAME,
START_TIME LIMIT 500;
Page27 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Key takeaways
Using Phoenix query hints to avoid full table scans
PHOENIX-914 – Use Native Hbase timestamp to skip HFiles
Client side buffering and aggregation built into Sinks and Monitor
Cluster and Host level aggregations across various time dimensions
Table schema optimized for reads and Hbase tuned to support heavy write
loads
Page28 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Enhanced Dashboard
Page29 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Customizable Service Dashboards
Service dashboards are now customizable in Ambari 2.1
• Create new widgets
• Graphs, Dial Gauge, Number, Template
• Customize layout
• Share widgets
Future:
• Make Layouts shareable
Page30 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Recorded Demo
Page31 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Easy to expose widgets for new services
Out-of-the-box widgets are defined in the stack as JSON files
No frontend code changes necessary
Page32 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Smart Configurations
Page33 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Hadoop Configuration Challenges
• Too many configurations
• which ones are important?
• Too easy to mess up
• What are valid/reasonable values?
• What are the units?
• Ok, what about dependencies?
• Gets harder with combinations of services, host assignments, enabled
features, CPU/RAM/disks, etc
• Any recommendations? What am I doing wrong?
• Smart Configs to the rescue! (Ambari 2.1)
Page34 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Smart Configs Demo
Page35 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Ambari Smart Configs UI
Customizable layout
- Tabs
- Sections
- Sub-sections
- Simple grid layout
(Advanced Tab contains
remaining configurations)
New Widgets
- Sliders
- Recommended
- Minimum
- Maximum
- Increment Step
- Combos
- Enumerated values
- Toggles
- Binary options
- Spinners
- Splits value into multiple
controls. Time in
milliseconds split into days,
hours, minutes.
- Lists
- Enumerated values
- Single select
- Multi select
Implemented
- HDFS
- YARN
- MapReduce
- Hive
- HBase
Page36 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Stack Driven Layouts
{
"name": "default",
"description": "Default theme for HBASE service",
"configuration": {
"layouts": [
{
"name": "default",
"tabs": [
{
"name": "settings",
"display-name": "Settings",
"layout": {
"tab-columns": "3",
"tab-rows": "3",
"sections": [
...
]
}
}
]
}
],
"placement": {
"configuration-layout": "default",
"configs": [...]
},
"widgets": [
{
"config": "hbase-env/hbase_master_heapsize",
"widget": {
"type": "slider",
"units": [
{
"unit-name": "GB"
}
]
}
},
...
]
}
}
• Stacks has theme.json file
• Layout
– Tabs
– Sections
– Sub-sections
• Placement
– Configs placement in sub-sections
• Widgets
– Widget type
– Optional Units
- Bytes (B, KB, MB, GB, TB, PB)
- Time (Millis, Seconds, Minutes, Hours, Days, Months, Years)
Page37 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Config Metadata and Dependencies
{
"StackConfigurations": {
"final": "false",
"property_depends_on": [
{
"type": "yarn-site",
"name": "yarn.nodemanager.resource.memory-mb"
}
],
"property_description": “The minimum allocation for every",
"property_display_name": "Minimum Container Size (Memory)",
"property_name": "yarn.scheduler.minimum-allocation-mb",
"property_type": [],
"property_value": "512",
"property_value_attributes": {
"type": "int",
"maximum": "5120",
"minimum": "0",
"unit": "MB",
"increment_step": "256"
},
"type": "yarn-site.xml"
},
"dependencies": [
{
"StackConfigurationDependency": {
"dependency_name": "hive.tez.container.size",
"property_name": "yarn.scheduler.minimum-allocation-mb”
}
},
{
"StackConfigurationDependency": {
"dependency_name": "mapreduce.map.memory.mb",
"property_name": "yarn.scheduler.minimum-allocation-mb”
}
},
{
"StackConfigurationDependency": {
"dependency_name": "mapreduce.reduce.memory.mb",
"property_name": "yarn.scheduler.minimum-allocation-mb”
}
}…
]
}
• Extended Metadata
• Defined in property_value_attributes
• Hold non-UI metadata about value range, increment,
unit, etc
• Dependencies
• Models bi-directional relationship between configs
• Depends On (property_depends_on)
– Answers ‘which configs do I depend on?’
• Depended By (dependencies)
– Answers ‘which configs are dependent on me?’
• Ambari automatically updates dependencies
Page38 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Views
Page39 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
What are Views?
View Framework
• Provide various applications accessible from Ambari Web UI – interact
with the cluster via a browser from a single place for all users (cluster
operators, data analysis, developers, etc)
Easy to develop
• No need to understand Ambari core code – view development is just
like creating any other web application
Easy to deploy
• Packaged as a single jar file
• Auto create / auto configure
Page40 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
CS Queue Manager for Cluster Operators
Capacity Scheduler
Queue Manager
Page41 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
HDFS File Browser for General Users
HDFS File Browser
Page42 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Job Analysis for Developers
Troubleshoot Tez JobsTroubleshoot / Improve Hive queries
Page43 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Query Editors for Data Analysts
Create, edit, execute, and analyze Hive queries Create, edit, and execute Pig scripts
Page44 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Ambari Server in Views-Only mode
• Use Views on existing clusters not under Ambari’s management
• Can use Views against multiple clusters
Ambari
Server
Cluster managed by Ambari
Ambari
Server “Views-only” mode
(aka “Stand-alone” mode)
Cluster not managed by Ambari
Management
Use Views
Use Views
Use Views
Page45 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Kerberos Automation
Page46 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Kerberos Automation
New in Ambari 2.0:
• Have Ambari manage Kerberos principals and keytabs
• Once Kerberized, seamlessly handle:
• Adding new hosts
• Adding new components to existing hosts
• Adding new services
• Moving components to different hosts
• Works with existing MIT KDC or Active Directory
Page47 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Blueprints
Page48 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Automated Cluster Deployment
Simple
• Making two REST calls is all it takes to provision a cluster
Who uses it?
• Cloudbreak
• Microsoft Azure Marketplace (portal.azure.com)
• Hortonworks QA
• and many many others
Page49 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Cluster Replication
Export blueprint of source cluster
Import blueprint to replicate clusters
Page50 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Example: Create a 100-node Cluster
{
"configurations" : [
{
”hdfs-site" : {
"dfs.datanode.data.dir" : ”/hadoop/1,/hadoop/2,/hadoop/3"
}
}
],
"host_groups" : [
{
"name" : ”master-host",
"components" : [
{ "name" : "NAMENODE” },
{ "name" : "RESOURCEMANAGER” },
…
],
"cardinality" : "1"
},
{
"name" : ”worker-host",
"components" : [
{ "name" : ”DATANODE” },
{ "name" : ”NODEMANAGER” },
…
],
"cardinality" : "1+"
},
],
"Blueprints" : {
"blueprint_name" : ”multi-node-hdfs-yarn",
"stack_name" : "HDP",
"stack_version" : "2.0"
}
}
{
"blueprint" : ”multi-node-hdfs-yarn",
"host_groups" :[
{
"name" : ”master-host",
"hosts" : [
{
"fqdn" : ”master001.ambari.apache.org”
}
]
},
{
"name" : ”worker-host",
"hosts" : [
{
"fqdn" : ”worker001.ambari.apache.org”
},
{
"fqdn" : ”worker002.ambari.apache.org”
},
…
{
"fqdn" : ”worker099.ambari.apache.org”
}
]
}
]
}
1. POST /blueprints/my-blueprint 2. POST /clusters/MyCluster
Page51 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
What’s New in Blueprint
New in Ambari 2.0:
• Supports various HA configurations:
• NameNode, ResourceManager, RegionServer, Oozie Server, Hive Metastore, HiveServer2, WebHCat
Server
• Adding hosts “blueprint style” (AMBARI-8458)
New in Ambari 2.1:
• Advanced Cluster Creation – more flexible, scalable, and robust
(AMBARI-10750)
Page52 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Blueprint Caveats
As of Ambari 2.1:
• Stack advisor (smart/dynamic config recommendation/validation) is not
used (yet)
• Can’t create Kerberized Cluster (yet)
Page53 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Thank You!
Try Ambari
• Follow the Ambari Quick Start Guide (search online for “Ambari Quick Start Guide”)
Learn more
• Visit the project website (search online for “Ambari”)
Get Involved
• User Mailing List: user-subscribe@ambari.apache.org
• Developer Mailing List: dev-subscribe@ambari.apache.org
• Use JIRA to file bugs and improvement requests (search online for “Ambari JIRA”)
Page54 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
Q & A
Yusaku Sako yusaku@hortonworks.com (Ambari PMC)
Sumit Mohanty smohanty@hortonworks.com (Ambari PMC)

More Related Content

PPTX
Hadoop crash course workshop at Hadoop Summit
PPTX
Big Data Simplified - Is all about Ab'strakSHeN
PPTX
Evolving HDFS to a Generalized Storage Subsystem
PPTX
Sharing metadata across the data lake and streams
PPTX
Analyzing the World's Largest Security Data Lake!
PPTX
End-to-End Security and Auditing in a Big Data as a Service Deployment
PDF
Ingesting Data at Blazing Speed Using Apache Orc
PDF
Hortonworks Technical Workshop - HDP Search
Hadoop crash course workshop at Hadoop Summit
Big Data Simplified - Is all about Ab'strakSHeN
Evolving HDFS to a Generalized Storage Subsystem
Sharing metadata across the data lake and streams
Analyzing the World's Largest Security Data Lake!
End-to-End Security and Auditing in a Big Data as a Service Deployment
Ingesting Data at Blazing Speed Using Apache Orc
Hortonworks Technical Workshop - HDP Search

What's hot (20)

PPTX
Format Wars: from VHS and Beta to Avro and Parquet
PPTX
YARN Ready: Apache Spark
PPTX
Internet of things Crash Course Workshop
PDF
Combine SAS High-Performance Capabilities with Hadoop YARN
PPTX
Hadoop & Cloud Storage: Object Store Integration in Production
PPTX
Apache Hive 2.0: SQL, Speed, Scale
PDF
Hortonworks tech workshop in-memory processing with spark
PPTX
Hadoop first ETL on Apache Falcon
PDF
Hortonworks Technical Workshop - Operational Best Practices Workshop
PDF
Coexistence and Migration of Vendor HPC based infrastructure to Hadoop Ecosys...
PDF
Hortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
PDF
Visualizing Big Data in Realtime
PPTX
Introduction to the Hortonworks YARN Ready Program
PPTX
What's new in apache hive
PDF
Realizing the Promise of Portable Data Processing with Apache Beam
PPTX
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)
PDF
Spark Uber Development Kit
PPTX
Hadoop in the Cloud - The what, why and how from the experts
PPTX
Data Regions: Modernizing your company's data ecosystem
PPTX
LLAP: Sub-Second Analytical Queries in Hive
Format Wars: from VHS and Beta to Avro and Parquet
YARN Ready: Apache Spark
Internet of things Crash Course Workshop
Combine SAS High-Performance Capabilities with Hadoop YARN
Hadoop & Cloud Storage: Object Store Integration in Production
Apache Hive 2.0: SQL, Speed, Scale
Hortonworks tech workshop in-memory processing with spark
Hadoop first ETL on Apache Falcon
Hortonworks Technical Workshop - Operational Best Practices Workshop
Coexistence and Migration of Vendor HPC based infrastructure to Hadoop Ecosys...
Hortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
Visualizing Big Data in Realtime
Introduction to the Hortonworks YARN Ready Program
What's new in apache hive
Realizing the Promise of Portable Data Processing with Apache Beam
Apache Falcon : 22 Sept 2014 for Hadoop User Group France (@Criteo)
Spark Uber Development Kit
Hadoop in the Cloud - The what, why and how from the experts
Data Regions: Modernizing your company's data ecosystem
LLAP: Sub-Second Analytical Queries in Hive
Ad

Viewers also liked (20)

PPTX
Internet of Things Crash Course Workshop at Hadoop Summit
PPTX
Spark crash course workshop at Hadoop Summit
PPTX
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
PPTX
Millions of Regions in HBase: Size Matters
PPTX
Authoring and Hosting Applications on YARN using Slider
PPTX
NextGen Apache Hadoop MapReduce
PPTX
Hadoop Operations - Best Practices from the Field
PPTX
Rocking the World of Big Data at Centrica
PPTX
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
PPTX
Overview of Apache Flink: the 4G of Big Data Analytics Frameworks
PPTX
Real-Time Clinical Analytics
PPTX
Ingest and Stream Processing - What will you choose?
PPTX
Apache Tez - A unifying Framework for Hadoop Data Processing
PDF
Scaling HDFS to Manage Billions of Files with Key-Value Stores
PPTX
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
PPTX
June 10 145pm hortonworks_tan & welch_v2
PPT
Running Spark in Production
PPTX
Securing Hadoop with Apache Ranger
PPTX
YARN and the Docker container runtime
Internet of Things Crash Course Workshop at Hadoop Summit
Spark crash course workshop at Hadoop Summit
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Millions of Regions in HBase: Size Matters
Authoring and Hosting Applications on YARN using Slider
NextGen Apache Hadoop MapReduce
Hadoop Operations - Best Practices from the Field
Rocking the World of Big Data at Centrica
Data Science: Driving Smarter Finance and Workforce Decsions for the Enterprise
Overview of Apache Flink: the 4G of Big Data Analytics Frameworks
Real-Time Clinical Analytics
Ingest and Stream Processing - What will you choose?
Apache Tez - A unifying Framework for Hadoop Data Processing
Scaling HDFS to Manage Billions of Files with Key-Value Stores
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
June 10 145pm hortonworks_tan & welch_v2
Running Spark in Production
Securing Hadoop with Apache Ranger
YARN and the Docker container runtime
Ad

Similar to What's new in Ambari (20)

PPTX
Apache Ambari: Past, Present, Future
PPTX
Apache Ambari - What's New in 2.1
PPTX
Managing Enterprise Hadoop Clusters with Apache Ambari
PPTX
Managing Enterprise Hadoop Clusters with Apache Ambari
PDF
Hortonworks Technical Workshop: What's New in HDP 2.3
PPTX
Hadoop Operations - Past, Present, and Future
PPTX
SAM—streaming analytics made easy
PDF
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
PDF
Hortonworks technical workshop operations with ambari
PDF
Fast SQL on Hadoop, really?
PPTX
Apache Ambari - What's New in 2.4
PPTX
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
PPTX
Streaming analytics manager
PPTX
SAM - Streaming Analytics Made Easy
PPTX
Hive present-and-feature-shanghai
PPTX
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
PDF
Pivotal cf for_devops_mkim_20141209
PPTX
Hive Performance Dataworks Summit Melbourne February 2019
PDF
Fast SQL on Hadoop, Really?
PPTX
Manage Add-on Services in Apache Ambari
Apache Ambari: Past, Present, Future
Apache Ambari - What's New in 2.1
Managing Enterprise Hadoop Clusters with Apache Ambari
Managing Enterprise Hadoop Clusters with Apache Ambari
Hortonworks Technical Workshop: What's New in HDP 2.3
Hadoop Operations - Past, Present, and Future
SAM—streaming analytics made easy
Apache Hadoop YARN – Multi-Tenancy, Capacity Scheduler & Preemption - Stamped...
Hortonworks technical workshop operations with ambari
Fast SQL on Hadoop, really?
Apache Ambari - What's New in 2.4
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Streaming analytics manager
SAM - Streaming Analytics Made Easy
Hive present-and-feature-shanghai
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Pivotal cf for_devops_mkim_20141209
Hive Performance Dataworks Summit Melbourne February 2019
Fast SQL on Hadoop, Really?
Manage Add-on Services in Apache Ambari

More from DataWorks Summit (20)

PPTX
Data Science Crash Course
PPTX
Floating on a RAFT: HBase Durability with Apache Ratis
PPTX
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
PDF
HBase Tales From the Trenches - Short stories about most common HBase operati...
PPTX
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
PPTX
Managing the Dewey Decimal System
PPTX
Practical NoSQL: Accumulo's dirlist Example
PPTX
HBase Global Indexing to support large-scale data ingestion at Uber
PPTX
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
PPTX
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
PPTX
Supporting Apache HBase : Troubleshooting and Supportability Improvements
PPTX
Security Framework for Multitenant Architecture
PDF
Presto: Optimizing Performance of SQL-on-Anything Engine
PPTX
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
PPTX
Extending Twitter's Data Platform to Google Cloud
PPTX
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
PPTX
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
PPTX
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
PDF
Computer Vision: Coming to a Store Near You
PPTX
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Data Science Crash Course
Floating on a RAFT: HBase Durability with Apache Ratis
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
HBase Tales From the Trenches - Short stories about most common HBase operati...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Managing the Dewey Decimal System
Practical NoSQL: Accumulo's dirlist Example
HBase Global Indexing to support large-scale data ingestion at Uber
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Security Framework for Multitenant Architecture
Presto: Optimizing Performance of SQL-on-Anything Engine
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Extending Twitter's Data Platform to Google Cloud
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Computer Vision: Coming to a Store Near You
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark

Recently uploaded (20)

PDF
NewMind AI Monthly Chronicles - July 2025
PDF
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Event Presentation Google Cloud Next Extended 2025
PDF
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
PDF
Building High-Performance Oracle Teams: Strategic Staffing for Database Manag...
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Chapter 2 Digital Image Fundamentals.pdf
PPTX
Belt and Road Supply Chain Finance Blockchain Solution
PDF
Sensors and Actuators in IoT Systems using pdf
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
PPTX
CroxyProxy Instagram Access id login.pptx
PDF
How Onsite IT Support Drives Business Efficiency, Security, and Growth.pdf
PDF
Enable Enterprise-Ready Security on IBM i Systems.pdf
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
CIFDAQ's Token Spotlight: SKY - A Forgotten Giant's Comeback?
PDF
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
PDF
CIFDAQ's Teaching Thursday: Moving Averages Made Simple
NewMind AI Monthly Chronicles - July 2025
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Chapter 3 Spatial Domain Image Processing.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Event Presentation Google Cloud Next Extended 2025
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
Building High-Performance Oracle Teams: Strategic Staffing for Database Manag...
NewMind AI Weekly Chronicles - August'25 Week I
Chapter 2 Digital Image Fundamentals.pdf
Belt and Road Supply Chain Finance Blockchain Solution
Sensors and Actuators in IoT Systems using pdf
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
CroxyProxy Instagram Access id login.pptx
How Onsite IT Support Drives Business Efficiency, Security, and Growth.pdf
Enable Enterprise-Ready Security on IBM i Systems.pdf
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
CIFDAQ's Token Spotlight: SKY - A Forgotten Giant's Comeback?
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
CIFDAQ's Teaching Thursday: Moving Averages Made Simple

What's new in Ambari

  • 1. Page1 © Hortonworks Inc. 2011 – 2015. All Rights Reserved What’s New in Ambari? June 2015 Yusaku Sako @ Hortonworks (Ambari PMC Chair) Sumit Mohanty @ Hortonworks (Ambari PMC)
  • 2. Page2 © Hortonworks Inc. 2011 – 2015. All Rights Reserved What’s Apache Ambari? 100% open-source platform for simplifying Hadoop cluster management and use. Highly extensible.
  • 3. Page3 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Open Source Activity
  • 4. Page4 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Inception: AMBARI-1 (Sept, 2011)
  • 5. Page5 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Fast forward 4 years to today… (June, 2015) • Latest JIRA: AMBARI-11854 • 100+ Contributors • 50 Committers • ~12k JIRAs filed • ~11k JIRAs resolved At 1.5 day per JIRA -> 66 person years! (probably more) • Used by hundreds of companies
  • 6. Page6 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Ambari – 4th Biggest Project* @ Apache * Based on total JIRAs filed on a project basis out of 162 projects as of June 10, 2015 #2: Hadoop at ~28k as it is split across multiple JIRA Projects #1 #3 #4 #5
  • 7. Page7 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Timeline: Past 1 Year Ambari 1.6.* May 2014 907 JIRAs Ambari 1.5.* Apr 2014 1218 JIRAs Ambari 1.7.* Dec 2014 1620 JIRAs Ambari 2.0.* April 2015 1784 JIRAs Current GA Version (2.0.1) Ambari 2.1 Coming Soon 1520+ JIRAs Focus of today’s talk Resolution of 7k+ JIRAs
  • 8. Page8 © Hortonworks Inc. 2011 – 2015. All Rights Reserved What’s new? • Rolling Upgrade • Alerts • Metrics • Enhanced Dashboard • Smart Configurations • Views • Kerberos Automation • Blueprints
  • 9. Page9 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade
  • 10. Page10 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade of Stack Side-by-Side Bits and Configs Bits: /usr/hdp/2.2.0.0-2041 /usr/hdp/2.2.4.2-2 /usr/hdp/2.3.0.0-3000 Configs: /etc/hive/conf/ (initial) /etc/hive/conf/v0 (HDP 2.2.4.2) /etc/hive/conf/v1 (HDP 2.3) 2.2.0.0 2.2.4.2 2.3.0.0minor jump major jump
  • 11. Page11 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Manage Versions Install bits in parallel on all agents No down-time
  • 12. Page12 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Orchestration Not necessarily “one-click” but fully guided Services are up the entire time Upgrade one component at a time Robust and fault-tolerant Service-checks performed throughout 2.3.0.0-2283 2.3.0.0-2283
  • 13. Page13 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Upgrade Catalog Grouping and order
  • 14. Page14 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Upgrade Catalog Run custom scripts (python and Server-side)
  • 15. Page15 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Upgrade Catalog Mark steps are skippable, retryable All service checks are skippable, all steps retryable
  • 16. Page16 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Upgrade Catalog Set, move, delete, transform configurations
  • 17. Page17 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Rolling Upgrade – Downgrade
  • 18. Page18 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Alerts
  • 19. Page19 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Alert – Types Type Description Status Thresholds Configurable? PORT Watches a port based on a configuration property such as the URI. OK, WARN, CRIT Yes (seconds) WEB Watches an HTTP or HTTPS endpoint and determines connectivity and HTTP status code. OK, WARN, CRIT No AGGREGATE Aggregate of status for another alert definition. OK, WARN, CRIT Yes (percentage) METRIC Watches a metric or series of metrics in JMX and compares a mathematical result against a threshold. OK, WARN, CRIT Yes (variable) SCRIPT Uses a custom script to handle checking. OK or CRIT No
  • 20. Page20 © Hortonworks Inc. 2011 – 2015. All Rights Reserved UI – Current Alerts Configured by default; managed via the the web client
  • 21. Page21 © Hortonworks Inc. 2011 – 2015. All Rights Reserved UI – Host Alerts Automatically refreshes Query alert history
  • 22. Page22 © Hortonworks Inc. 2011 – 2015. All Rights Reserved UI– Customization & Instances Status text, thresholds, and interval
  • 23. Page23 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Metrics
  • 24. Page24 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Ambari Metrics Service (AMS) - Goals Ability to collect metrics from Hadoop and other Stack services Ability to retain metrics at a high precision for a configurable time period Ability to automatically purge metrics after retention period At collection time, provide clear integration point for external system At purge time, provide clear integration point for metrics retention by external system Should provide default options for external metrics retention Provide tools / utilities for analyzing metrics in retention system
  • 25. Page25 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Aggregators Metrics Collector HTTP REST endpoint Metrics API Query Layer HBASE Phoenix server Phoenix client Namenode Datanode Nodemanager Regionserver Nimbus Flume Agent Kafka worker Metrics Sinks Metrics Monitor AMBARI DashboardsViews REST API Ambari Metrics System - Architecture
  • 26. Page26 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Sample Stats Total number of raw uncompressed Hadoop metrics written per day on a 300 node cluster = 100 GB Rows in Phoenix table ~ 100 million Raw query time: 500 rows selected (1.923 seconds) Aggregate query time: 204 rows selected (0.19 seconds) SELECT METRIC_NAME, APP_ID, INSTANCE_ID, TIMESTAMP, METRIC_SUM, HOSTS_COUNT, METRIC_MAX, METRIC_MIN FROM METRIC_AGGREGATE WHERE METRIC_NAME IN ('dfs.datanode.BytesWritten', 'dfs.datanode.BytesRead') AND APP_ID = 'datanode' AND TIMESTAMP >= 1409770831000 and TIMESTAMP < 1409774431000; SELECT METRIC_NAME, HOSTNAME, APP_ID, INSTANCE_ID, START_TIME, METRICS FROM METRIC_RECORD WHERE METRIC_NAME IN ('dfs.datanode.BytesWritten','dfs.datanode.BytesRead') AND APP_ID = 'datanode' AND START_TIME >= 1409770831000 AND START_TIME < 1409774431000 ORDER BY METRIC_NAME, START_TIME LIMIT 500;
  • 27. Page27 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Key takeaways Using Phoenix query hints to avoid full table scans PHOENIX-914 – Use Native Hbase timestamp to skip HFiles Client side buffering and aggregation built into Sinks and Monitor Cluster and Host level aggregations across various time dimensions Table schema optimized for reads and Hbase tuned to support heavy write loads
  • 28. Page28 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Enhanced Dashboard
  • 29. Page29 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Customizable Service Dashboards Service dashboards are now customizable in Ambari 2.1 • Create new widgets • Graphs, Dial Gauge, Number, Template • Customize layout • Share widgets Future: • Make Layouts shareable
  • 30. Page30 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Recorded Demo
  • 31. Page31 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Easy to expose widgets for new services Out-of-the-box widgets are defined in the stack as JSON files No frontend code changes necessary
  • 32. Page32 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Smart Configurations
  • 33. Page33 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Hadoop Configuration Challenges • Too many configurations • which ones are important? • Too easy to mess up • What are valid/reasonable values? • What are the units? • Ok, what about dependencies? • Gets harder with combinations of services, host assignments, enabled features, CPU/RAM/disks, etc • Any recommendations? What am I doing wrong? • Smart Configs to the rescue! (Ambari 2.1)
  • 34. Page34 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Smart Configs Demo
  • 35. Page35 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Ambari Smart Configs UI Customizable layout - Tabs - Sections - Sub-sections - Simple grid layout (Advanced Tab contains remaining configurations) New Widgets - Sliders - Recommended - Minimum - Maximum - Increment Step - Combos - Enumerated values - Toggles - Binary options - Spinners - Splits value into multiple controls. Time in milliseconds split into days, hours, minutes. - Lists - Enumerated values - Single select - Multi select Implemented - HDFS - YARN - MapReduce - Hive - HBase
  • 36. Page36 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Stack Driven Layouts { "name": "default", "description": "Default theme for HBASE service", "configuration": { "layouts": [ { "name": "default", "tabs": [ { "name": "settings", "display-name": "Settings", "layout": { "tab-columns": "3", "tab-rows": "3", "sections": [ ... ] } } ] } ], "placement": { "configuration-layout": "default", "configs": [...] }, "widgets": [ { "config": "hbase-env/hbase_master_heapsize", "widget": { "type": "slider", "units": [ { "unit-name": "GB" } ] } }, ... ] } } • Stacks has theme.json file • Layout – Tabs – Sections – Sub-sections • Placement – Configs placement in sub-sections • Widgets – Widget type – Optional Units - Bytes (B, KB, MB, GB, TB, PB) - Time (Millis, Seconds, Minutes, Hours, Days, Months, Years)
  • 37. Page37 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Config Metadata and Dependencies { "StackConfigurations": { "final": "false", "property_depends_on": [ { "type": "yarn-site", "name": "yarn.nodemanager.resource.memory-mb" } ], "property_description": “The minimum allocation for every", "property_display_name": "Minimum Container Size (Memory)", "property_name": "yarn.scheduler.minimum-allocation-mb", "property_type": [], "property_value": "512", "property_value_attributes": { "type": "int", "maximum": "5120", "minimum": "0", "unit": "MB", "increment_step": "256" }, "type": "yarn-site.xml" }, "dependencies": [ { "StackConfigurationDependency": { "dependency_name": "hive.tez.container.size", "property_name": "yarn.scheduler.minimum-allocation-mb” } }, { "StackConfigurationDependency": { "dependency_name": "mapreduce.map.memory.mb", "property_name": "yarn.scheduler.minimum-allocation-mb” } }, { "StackConfigurationDependency": { "dependency_name": "mapreduce.reduce.memory.mb", "property_name": "yarn.scheduler.minimum-allocation-mb” } }… ] } • Extended Metadata • Defined in property_value_attributes • Hold non-UI metadata about value range, increment, unit, etc • Dependencies • Models bi-directional relationship between configs • Depends On (property_depends_on) – Answers ‘which configs do I depend on?’ • Depended By (dependencies) – Answers ‘which configs are dependent on me?’ • Ambari automatically updates dependencies
  • 38. Page38 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Views
  • 39. Page39 © Hortonworks Inc. 2011 – 2015. All Rights Reserved What are Views? View Framework • Provide various applications accessible from Ambari Web UI – interact with the cluster via a browser from a single place for all users (cluster operators, data analysis, developers, etc) Easy to develop • No need to understand Ambari core code – view development is just like creating any other web application Easy to deploy • Packaged as a single jar file • Auto create / auto configure
  • 40. Page40 © Hortonworks Inc. 2011 – 2015. All Rights Reserved CS Queue Manager for Cluster Operators Capacity Scheduler Queue Manager
  • 41. Page41 © Hortonworks Inc. 2011 – 2015. All Rights Reserved HDFS File Browser for General Users HDFS File Browser
  • 42. Page42 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Job Analysis for Developers Troubleshoot Tez JobsTroubleshoot / Improve Hive queries
  • 43. Page43 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Query Editors for Data Analysts Create, edit, execute, and analyze Hive queries Create, edit, and execute Pig scripts
  • 44. Page44 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Ambari Server in Views-Only mode • Use Views on existing clusters not under Ambari’s management • Can use Views against multiple clusters Ambari Server Cluster managed by Ambari Ambari Server “Views-only” mode (aka “Stand-alone” mode) Cluster not managed by Ambari Management Use Views Use Views Use Views
  • 45. Page45 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Kerberos Automation
  • 46. Page46 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Kerberos Automation New in Ambari 2.0: • Have Ambari manage Kerberos principals and keytabs • Once Kerberized, seamlessly handle: • Adding new hosts • Adding new components to existing hosts • Adding new services • Moving components to different hosts • Works with existing MIT KDC or Active Directory
  • 47. Page47 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Blueprints
  • 48. Page48 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Automated Cluster Deployment Simple • Making two REST calls is all it takes to provision a cluster Who uses it? • Cloudbreak • Microsoft Azure Marketplace (portal.azure.com) • Hortonworks QA • and many many others
  • 49. Page49 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Cluster Replication Export blueprint of source cluster Import blueprint to replicate clusters
  • 50. Page50 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Example: Create a 100-node Cluster { "configurations" : [ { ”hdfs-site" : { "dfs.datanode.data.dir" : ”/hadoop/1,/hadoop/2,/hadoop/3" } } ], "host_groups" : [ { "name" : ”master-host", "components" : [ { "name" : "NAMENODE” }, { "name" : "RESOURCEMANAGER” }, … ], "cardinality" : "1" }, { "name" : ”worker-host", "components" : [ { "name" : ”DATANODE” }, { "name" : ”NODEMANAGER” }, … ], "cardinality" : "1+" }, ], "Blueprints" : { "blueprint_name" : ”multi-node-hdfs-yarn", "stack_name" : "HDP", "stack_version" : "2.0" } } { "blueprint" : ”multi-node-hdfs-yarn", "host_groups" :[ { "name" : ”master-host", "hosts" : [ { "fqdn" : ”master001.ambari.apache.org” } ] }, { "name" : ”worker-host", "hosts" : [ { "fqdn" : ”worker001.ambari.apache.org” }, { "fqdn" : ”worker002.ambari.apache.org” }, … { "fqdn" : ”worker099.ambari.apache.org” } ] } ] } 1. POST /blueprints/my-blueprint 2. POST /clusters/MyCluster
  • 51. Page51 © Hortonworks Inc. 2011 – 2015. All Rights Reserved What’s New in Blueprint New in Ambari 2.0: • Supports various HA configurations: • NameNode, ResourceManager, RegionServer, Oozie Server, Hive Metastore, HiveServer2, WebHCat Server • Adding hosts “blueprint style” (AMBARI-8458) New in Ambari 2.1: • Advanced Cluster Creation – more flexible, scalable, and robust (AMBARI-10750)
  • 52. Page52 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Blueprint Caveats As of Ambari 2.1: • Stack advisor (smart/dynamic config recommendation/validation) is not used (yet) • Can’t create Kerberized Cluster (yet)
  • 53. Page53 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Thank You! Try Ambari • Follow the Ambari Quick Start Guide (search online for “Ambari Quick Start Guide”) Learn more • Visit the project website (search online for “Ambari”) Get Involved • User Mailing List: [email protected] • Developer Mailing List: [email protected] • Use JIRA to file bugs and improvement requests (search online for “Ambari JIRA”)
  • 54. Page54 © Hortonworks Inc. 2011 – 2015. All Rights Reserved Q & A Yusaku Sako [email protected] (Ambari PMC) Sumit Mohanty [email protected] (Ambari PMC)

Editor's Notes

  • #7: Hadoop projects combined is: ~28k HDFS: 8247 MapReduce: 6117 YARN: 3664 Hadoop Common: 10072
  • #9: Too many configurations – which ones are important? 2 Configurations from 1 section and 2 from another section might be most important No easy way to group across sections Majority Text fields Configs almost always shown as text fields Can be shown in more intuitive controls No units help Configs might shown to user in one unit (days, GB), and be saved in a different unit (milliseconds, B) What are acceptable values? Open ended text fields don’t help when values have to been within a minimum/maximum values No support for a enum of values No configuration dependencies After install if you change one config, you have to remember to change others
  • #11: Notice that can upgrade in either same stack e.g., 2.2.*, or 2.2 -> 2.3
  • #25: OpenTSDB is popular solution on top of HBASE. Time Series DB
  • #26: -
  • #34: Too many configurations – which ones are important? 2 Configurations from 1 section and 2 from another section might be most important No easy way to group across sections Majority Text fields Configs almost always shown as text fields Can be shown in more intuitive controls No units help Configs might shown to user in one unit (days, GB), and be saved in a different unit (milliseconds, B) What are acceptable values? Open ended text fields don’t help when values have to been within a minimum/maximum values No support for a enum of values No configuration dependencies After install if you change one config, you have to remember to change others
  • #40: Introduced in Ambari 1.7
  • #52: Allow cluster creation or scaling to be started via the REST API prior to all/any hosts being available. As hosts register with Ambari server they will be matched to request host groups and provisioned according to the requested topology Allow host predicates to be specified along with host count to provide more flexibility in matching hosts to host groups. This will allow for host flavors where different host groups are matched to different host flavors Break up the current monolithic provisioning request into a request for each host operation. For example, install on host A, start on host A, install on hostB, etc. This will allow hosts to make progress even when another host encounters a failure. Allow a host count to be specified in the cluster creation template instead of host names. This is documented in https://issues.apache.org/jira/browse/AMBARI-6275