SlideShare a Scribd company logo
Ambari Metrics System
Apache Ambari Meetup @
DataWorks Summit 2017
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
Introduction
New Features
Horizontal Scalability
Future Work
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Introduction & Architecture
 Metrics Collector – API daemon
 Sinks – Service daemons configured to publish metrics
 Metric Monitors – Lightweight daemon for system metrics
 Managed HBase (Embedded / Distributed)
 Phoenix schema designed for fast reads
 Grafana Integration (Ambari 2.2.2)
High level component arch
Ambari
Collector API
Grafana
Phoenix
HDP
Services
System
MONITORSSINKS
Metrics Collector
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Feature Highlights
AMSSimple POST
API
Rich GET API
Aggregation and
Down sampling
Metadata API
Highly Tunable
Abstract Sink
Impl.
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Metric Sinks
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
 Sid Wagle (Hortonworks)
 Aravindan Vijayan (Hortonworks)
 Dmitry Sen (Hortonworks)
 Prajwal Rao (ITRenew)
 Myroslav Papyrkovskyy (Hortonworks)
 Yusaku Sako (Hortonworks)
 Qin Liu (IBM)
 Tim Thorpe (IBM)
 Jungtaek Lim (Hortonworks)
 Jameel Naina (Microsoft)
 Masahiro Tanaka (Ntt data)
Contributors (Karma)
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
New Features
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Features (2.2.2 -> 2.5.x)
• Ability to query for Top N series from a set of series
• Top N ‘Hosts’ vs Top N ‘Metrics’
• Multiple Top N functions supported (max, avg, sum)
Top N
(Request)
• Ability to run a custom downsampling function on a
set of metrics.
• Use Case : HDFS top users, operations
Top N
(Source)
• Ability to aggregate a set of series on the GET path.
• Helps with ad-hoc aggregation on the fly.
• Use Case : Aggregate across Storm topologies or Kafka
topics
Series
Aggregation
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Features (2.2.2 -> 2.5.x) continued
• Https Support for AMS and Grafana
• SSLV23 and TLSV1 supportHttps Support
• Service alerts based on metrics from AMS.
• Example : Deviation in Daily/Weekly Namenode RPC
queue latency
Metric Based
Alerts
• Multi Cluster support in AMS and Grafana.
• Blueprint defined.
Multi Cluster
Support
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Horizontal Scalability (Ambari 2.5.0)
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Operational Statistics
Disk Usage Write Load
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Horizontal Scalability (AMBARI-15901)
 Horizontal scalability proportional to cluster size
 High availability for metrics service
 Restart resiliency for metrics service discovery
 Distribute heavy-weight operations
 Automatic failover
 Usability – Easy addition of extra collectors
 Distributed mode – HBase writing to HDFS
 Cluster Zookeeper – HBase and Collector dependency
 First release still requires a restart to get new sink bits
Motivation
Operational Requirements
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Horizontal Scalability (AMBARI-15901)
 Distributed lock problem
– Leader election
– Ephemeral state for discovery
– Partition tolerance
 Service discovery
 Persistent distributed storage – aggregator checkpoints
Architectural Requirements
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Why Helix vs Curator
Limitations of using curator
 Leader election
 Shared Lock
 Service discovery
Curator Recipes applicable
 Every solution looks different, HMaster / Kafka etc..
 Need to build abstractions for FSM and resources
 Lack of primitives for cluster management
 Possibly needs a couple of application versions to achieve stability
Soft drawbacks
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Apache Helix– High level
 Controller – Co-ordinate transitions and try to maintain Ideal State
 Participant – Process hosting distributed resources
 Spectator – Observer / Router
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Under the hood
 Controller models a STATE machine based on the various partitions in the
cluster.
 Uses ZK to maintain cluster state and as a notification system.
 States
– IdealState: All nodes are up and running.
– CurrentState: Actual current state of each node in the cluster
– ExternalView: The combined view of the CurrentState of all nodes
Helix architecture
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
AMS – Helix usage - Primitives
Instance Resource Partitions / Replicas
H
I II
Host
Cluster
Aggregators
O
M
State Model
Online
Offline
Host Cluster
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Distributed Writes
Bootstrapped
with a initial set
of collectors
DONE
Failed
counter >
threshold
Collector
supplier is
ok
Push metrics to
collector
Find list of live collectors
from configured collectors
If unable to find live
collector, ask Zookeeper
for list of live collectors
Choose a collector based
on hostname with expiry
Expire collector
supplier
Timer
Sinks
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Distributed Writes
Monitors
Hostname based sharding strategy similar to sinks
Initial configured collector list
When a collector is inferred to be down, it is blacklisted.
Needs a restart if no known collector is live
No ZK fallback.
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Reads
 Ambari Server handles multiple collectors seamlessly.
 Event based notification whenever a collector host down situation is sensed.
– Using AmbariEvent framework
Ambari Server
 Configured with 1 collector during startup.
 If that collector goes down,
– Manually update the datasource to another collector
– (or) Restart Grafana.
Grafana
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Cluster Zookeeper
METRICS MONITOR
YARN
KAFKA
FLUME
METRICS SINKS
HBASE
STORM
HIVE
NIFI
HDFS
METRICS COLLECTOR
HBASE
Master + RS
PHOENIX
Aggregators
Collector API
Helix
Participant
METRICS COLLECTOR
HBASE
Master + RS
PHOENIX
Aggregators
Collector API
Helix
Participant
AMS Multiple Collectors Architecture
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Future Work (Ambari 3.0.0)
 Major revamp in the storage and access layer in AMS.
 Moving to a UUID based Row key instead of the current long and redundant
key. (AMBARI-20773)
 Aggregation V2 to tackle scale issues in AMS
– Time aggregation (down sampling) handed off to the individual monitors (AMBARI-
20758)
– Cluster aggregation done online only for metrics which need it.
 Tee to external storage by providing pluggable sink interface
 Metric Based Anomaly Detection using Statistical and Machine learning
technique (AMBARI-21105)
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Thank You
Last Meetup Slides :
https://www.slideshare.net/prajrao/apache-ambari-meetup-ams-grafana

More Related Content

What's hot (20)

PDF
Turning the Heat up on DevOps: Providing a web-based editing experience aroun...
Michael Elder
 
PPTX
Manage democratization of the data - Data Replication in Hadoop
DataWorks Summit
 
PPTX
Apache Accumulo 1.8.0 Overview
Josh Elser
 
PPTX
Apache Ambari - What's New in 2.1
Hortonworks
 
PPTX
Apache Ambari - What's New in 1.7.0
Hortonworks
 
PDF
Spark Summit EU talk by Mikhail Semeniuk Hollin Wilkins
Spark Summit
 
POTX
Schema Registry & Stream Analytics Manager
Sriharsha Chintalapani
 
PPTX
SAM - Streaming Analytics Made Easy
DataWorks Summit
 
PDF
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
Big Data Spain
 
PDF
Hortonworks Technical Workshop: Apache Ambari
Hortonworks
 
PDF
Integrating Apache NiFi and Apache Apex
Apache Apex
 
PDF
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...
Hortonworks
 
PDF
The Role of Elastic Load Balancer - Apache Stratos
Imesh Gunaratne
 
PPTX
Kafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARN
DataWorks Summit
 
PPTX
Hive & HBase For Transaction Processing
DataWorks Summit
 
PDF
Kafka Connect by Datio
Datio Big Data
 
PDF
Autoscaler architecture of apache stratos 4.0.0
Lahiru Sandaruwan
 
PDF
CEP Integration for Apache Stratos 4.0.0
Manula Thantriwatte
 
PDF
Cooperative Data Exploration with iPython Notebook
DataWorks Summit/Hadoop Summit
 
PPTX
Ambari Meetup: Architecture and Demo
Hortonworks
 
Turning the Heat up on DevOps: Providing a web-based editing experience aroun...
Michael Elder
 
Manage democratization of the data - Data Replication in Hadoop
DataWorks Summit
 
Apache Accumulo 1.8.0 Overview
Josh Elser
 
Apache Ambari - What's New in 2.1
Hortonworks
 
Apache Ambari - What's New in 1.7.0
Hortonworks
 
Spark Summit EU talk by Mikhail Semeniuk Hollin Wilkins
Spark Summit
 
Schema Registry & Stream Analytics Manager
Sriharsha Chintalapani
 
SAM - Streaming Analytics Made Easy
DataWorks Summit
 
Apache Hive 2.0 SQL, Speed, Scale by Alan Gates
Big Data Spain
 
Hortonworks Technical Workshop: Apache Ambari
Hortonworks
 
Integrating Apache NiFi and Apache Apex
Apache Apex
 
Hortonworks Technical Workshop: HDP everywhere - cloud considerations using...
Hortonworks
 
The Role of Elastic Load Balancer - Apache Stratos
Imesh Gunaratne
 
Kafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARN
DataWorks Summit
 
Hive & HBase For Transaction Processing
DataWorks Summit
 
Kafka Connect by Datio
Datio Big Data
 
Autoscaler architecture of apache stratos 4.0.0
Lahiru Sandaruwan
 
CEP Integration for Apache Stratos 4.0.0
Manula Thantriwatte
 
Cooperative Data Exploration with iPython Notebook
DataWorks Summit/Hadoop Summit
 
Ambari Meetup: Architecture and Demo
Hortonworks
 

Similar to Ambari metrics system - Apache ambari meetup (DataWorks Summit 2017) (20)

PPTX
Sharing metadata across the data lake and streams
DataWorks Summit
 
PPTX
Standalone metastore-dws-sjc-june-2018
alanfgates
 
PPTX
Running Cloudbreak on Kubernetes
Future of Data Meetup
 
PPTX
Running Cloudbreak on Kubernetes
Krisztián Horváth
 
PPTX
Managing Enterprise Hadoop Clusters with Apache Ambari
Jayush Luniya
 
PPTX
Managing Enterprise Hadoop Clusters with Apache Ambari
Hortonworks
 
PPTX
Hive edw-dataworks summit-eu-april-2017
alanfgates
 
PPTX
An Apache Hive Based Data Warehouse
DataWorks Summit
 
PPTX
Apache Hadoop YARN: Past, Present and Future
DataWorks Summit/Hadoop Summit
 
PPTX
Sharing metadata across the data lake and streams
DataWorks Summit
 
PPTX
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Why is my Hadoop cluster s...
Data Con LA
 
PPTX
Hive acid and_2.x new_features
Alberto Romero
 
PPTX
Streamline - Stream Analytics for Everyone
DataWorks Summit/Hadoop Summit
 
PPTX
Apache Hadoop 3.0 What's new in YARN and MapReduce
DataWorks Summit/Hadoop Summit
 
PPTX
Why is My Hadoop Job Slow?
Bikas Saha
 
PPTX
Why is my Hadoop cluster slow?
DataWorks Summit/Hadoop Summit
 
PDF
Why is My Hadoop Job Slow?
Bikas Saha
 
PPTX
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Hortonworks
 
PDF
Paris FOD meetup - Streams Messaging Manager
Abdelkrim Hadjidj
 
PPTX
Micro services vs hadoop
Gergely Devenyi
 
Sharing metadata across the data lake and streams
DataWorks Summit
 
Standalone metastore-dws-sjc-june-2018
alanfgates
 
Running Cloudbreak on Kubernetes
Future of Data Meetup
 
Running Cloudbreak on Kubernetes
Krisztián Horváth
 
Managing Enterprise Hadoop Clusters with Apache Ambari
Jayush Luniya
 
Managing Enterprise Hadoop Clusters with Apache Ambari
Hortonworks
 
Hive edw-dataworks summit-eu-april-2017
alanfgates
 
An Apache Hive Based Data Warehouse
DataWorks Summit
 
Apache Hadoop YARN: Past, Present and Future
DataWorks Summit/Hadoop Summit
 
Sharing metadata across the data lake and streams
DataWorks Summit
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Why is my Hadoop cluster s...
Data Con LA
 
Hive acid and_2.x new_features
Alberto Romero
 
Streamline - Stream Analytics for Everyone
DataWorks Summit/Hadoop Summit
 
Apache Hadoop 3.0 What's new in YARN and MapReduce
DataWorks Summit/Hadoop Summit
 
Why is My Hadoop Job Slow?
Bikas Saha
 
Why is my Hadoop cluster slow?
DataWorks Summit/Hadoop Summit
 
Why is My Hadoop Job Slow?
Bikas Saha
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Hortonworks
 
Paris FOD meetup - Streams Messaging Manager
Abdelkrim Hadjidj
 
Micro services vs hadoop
Gergely Devenyi
 
Ad

Recently uploaded (20)

PDF
this idjfk sgfdhgdhgdbhgbgrbdrwhrgbbhtgdt
WaleedAziz7
 
PDF
mbse_An_Introduction_to_Arcadia_20150115.pdf
henriqueltorres1
 
PDF
Module - 5 Machine Learning-22ISE62.pdf
Dr. Shivashankar
 
PPTX
DATA BASE MANAGEMENT AND RELATIONAL DATA
gomathisankariv2
 
PDF
20ES1152 Programming for Problem Solving Lab Manual VRSEC.pdf
Ashutosh Satapathy
 
PDF
Artificial Neural Network-Types,Perceptron,Problems
Sharmila Chidaravalli
 
PDF
Bayesian Learning - Naive Bayes Algorithm
Sharmila Chidaravalli
 
PDF
aAn_Introduction_to_Arcadia_20150115.pdf
henriqueltorres1
 
PDF
Tesia Dobrydnia - An Avid Hiker And Backpacker
Tesia Dobrydnia
 
PPTX
2025 CGI Congres - Surviving agile v05.pptx
Derk-Jan de Grood
 
PDF
methodology-driven-mbse-murphy-july-hsv-huntsville6680038572db67488e78ff00003...
henriqueltorres1
 
PDF
Module - 4 Machine Learning -22ISE62.pdf
Dr. Shivashankar
 
PDF
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
PPT
Footbinding.pptmnmkjkjkknmnnjkkkkkkkkkkkkkk
mamadoundiaye42742
 
PPTX
Basics of Electrical Engineering and electronics .pptx
PrabhuNarayan6
 
PPTX
Numerical-Solutions-of-Ordinary-Differential-Equations.pptx
SAMUKTHAARM
 
PPTX
OCS353 DATA SCIENCE FUNDAMENTALS- Unit 1 Introduction to Data Science
A R SIVANESH M.E., (Ph.D)
 
PPTX
Seminar Description: YOLO v1 (You Only Look Once).pptx
abhijithpramod20002
 
PPTX
Unit_I Functional Units, Instruction Sets.pptx
logaprakash9
 
PDF
3rd International Conference on Machine Learning and IoT (MLIoT 2025)
ClaraZara1
 
this idjfk sgfdhgdhgdbhgbgrbdrwhrgbbhtgdt
WaleedAziz7
 
mbse_An_Introduction_to_Arcadia_20150115.pdf
henriqueltorres1
 
Module - 5 Machine Learning-22ISE62.pdf
Dr. Shivashankar
 
DATA BASE MANAGEMENT AND RELATIONAL DATA
gomathisankariv2
 
20ES1152 Programming for Problem Solving Lab Manual VRSEC.pdf
Ashutosh Satapathy
 
Artificial Neural Network-Types,Perceptron,Problems
Sharmila Chidaravalli
 
Bayesian Learning - Naive Bayes Algorithm
Sharmila Chidaravalli
 
aAn_Introduction_to_Arcadia_20150115.pdf
henriqueltorres1
 
Tesia Dobrydnia - An Avid Hiker And Backpacker
Tesia Dobrydnia
 
2025 CGI Congres - Surviving agile v05.pptx
Derk-Jan de Grood
 
methodology-driven-mbse-murphy-july-hsv-huntsville6680038572db67488e78ff00003...
henriqueltorres1
 
Module - 4 Machine Learning -22ISE62.pdf
Dr. Shivashankar
 
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
Footbinding.pptmnmkjkjkknmnnjkkkkkkkkkkkkkk
mamadoundiaye42742
 
Basics of Electrical Engineering and electronics .pptx
PrabhuNarayan6
 
Numerical-Solutions-of-Ordinary-Differential-Equations.pptx
SAMUKTHAARM
 
OCS353 DATA SCIENCE FUNDAMENTALS- Unit 1 Introduction to Data Science
A R SIVANESH M.E., (Ph.D)
 
Seminar Description: YOLO v1 (You Only Look Once).pptx
abhijithpramod20002
 
Unit_I Functional Units, Instruction Sets.pptx
logaprakash9
 
3rd International Conference on Machine Learning and IoT (MLIoT 2025)
ClaraZara1
 
Ad

Ambari metrics system - Apache ambari meetup (DataWorks Summit 2017)

  • 1. Ambari Metrics System Apache Ambari Meetup @ DataWorks Summit 2017
  • 2. 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda Introduction New Features Horizontal Scalability Future Work
  • 3. 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Introduction & Architecture  Metrics Collector – API daemon  Sinks – Service daemons configured to publish metrics  Metric Monitors – Lightweight daemon for system metrics  Managed HBase (Embedded / Distributed)  Phoenix schema designed for fast reads  Grafana Integration (Ambari 2.2.2) High level component arch Ambari Collector API Grafana Phoenix HDP Services System MONITORSSINKS Metrics Collector
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Feature Highlights AMSSimple POST API Rich GET API Aggregation and Down sampling Metadata API Highly Tunable Abstract Sink Impl.
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Metric Sinks
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved  Sid Wagle (Hortonworks)  Aravindan Vijayan (Hortonworks)  Dmitry Sen (Hortonworks)  Prajwal Rao (ITRenew)  Myroslav Papyrkovskyy (Hortonworks)  Yusaku Sako (Hortonworks)  Qin Liu (IBM)  Tim Thorpe (IBM)  Jungtaek Lim (Hortonworks)  Jameel Naina (Microsoft)  Masahiro Tanaka (Ntt data) Contributors (Karma)
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved New Features
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Features (2.2.2 -> 2.5.x) • Ability to query for Top N series from a set of series • Top N ‘Hosts’ vs Top N ‘Metrics’ • Multiple Top N functions supported (max, avg, sum) Top N (Request) • Ability to run a custom downsampling function on a set of metrics. • Use Case : HDFS top users, operations Top N (Source) • Ability to aggregate a set of series on the GET path. • Helps with ad-hoc aggregation on the fly. • Use Case : Aggregate across Storm topologies or Kafka topics Series Aggregation
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Features (2.2.2 -> 2.5.x) continued • Https Support for AMS and Grafana • SSLV23 and TLSV1 supportHttps Support • Service alerts based on metrics from AMS. • Example : Deviation in Daily/Weekly Namenode RPC queue latency Metric Based Alerts • Multi Cluster support in AMS and Grafana. • Blueprint defined. Multi Cluster Support
  • 10. 10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Horizontal Scalability (Ambari 2.5.0)
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Operational Statistics Disk Usage Write Load
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Horizontal Scalability (AMBARI-15901)  Horizontal scalability proportional to cluster size  High availability for metrics service  Restart resiliency for metrics service discovery  Distribute heavy-weight operations  Automatic failover  Usability – Easy addition of extra collectors  Distributed mode – HBase writing to HDFS  Cluster Zookeeper – HBase and Collector dependency  First release still requires a restart to get new sink bits Motivation Operational Requirements
  • 13. 13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Horizontal Scalability (AMBARI-15901)  Distributed lock problem – Leader election – Ephemeral state for discovery – Partition tolerance  Service discovery  Persistent distributed storage – aggregator checkpoints Architectural Requirements
  • 14. 14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Why Helix vs Curator Limitations of using curator  Leader election  Shared Lock  Service discovery Curator Recipes applicable  Every solution looks different, HMaster / Kafka etc..  Need to build abstractions for FSM and resources  Lack of primitives for cluster management  Possibly needs a couple of application versions to achieve stability Soft drawbacks
  • 15. 15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Apache Helix– High level  Controller – Co-ordinate transitions and try to maintain Ideal State  Participant – Process hosting distributed resources  Spectator – Observer / Router
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Under the hood  Controller models a STATE machine based on the various partitions in the cluster.  Uses ZK to maintain cluster state and as a notification system.  States – IdealState: All nodes are up and running. – CurrentState: Actual current state of each node in the cluster – ExternalView: The combined view of the CurrentState of all nodes Helix architecture
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved AMS – Helix usage - Primitives Instance Resource Partitions / Replicas H I II Host Cluster Aggregators O M State Model Online Offline Host Cluster
  • 18. 18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Distributed Writes Bootstrapped with a initial set of collectors DONE Failed counter > threshold Collector supplier is ok Push metrics to collector Find list of live collectors from configured collectors If unable to find live collector, ask Zookeeper for list of live collectors Choose a collector based on hostname with expiry Expire collector supplier Timer Sinks
  • 19. 19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Distributed Writes Monitors Hostname based sharding strategy similar to sinks Initial configured collector list When a collector is inferred to be down, it is blacklisted. Needs a restart if no known collector is live No ZK fallback.
  • 20. 20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Reads  Ambari Server handles multiple collectors seamlessly.  Event based notification whenever a collector host down situation is sensed. – Using AmbariEvent framework Ambari Server  Configured with 1 collector during startup.  If that collector goes down, – Manually update the datasource to another collector – (or) Restart Grafana. Grafana
  • 21. 21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Cluster Zookeeper METRICS MONITOR YARN KAFKA FLUME METRICS SINKS HBASE STORM HIVE NIFI HDFS METRICS COLLECTOR HBASE Master + RS PHOENIX Aggregators Collector API Helix Participant METRICS COLLECTOR HBASE Master + RS PHOENIX Aggregators Collector API Helix Participant AMS Multiple Collectors Architecture
  • 22. 22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Future Work (Ambari 3.0.0)  Major revamp in the storage and access layer in AMS.  Moving to a UUID based Row key instead of the current long and redundant key. (AMBARI-20773)  Aggregation V2 to tackle scale issues in AMS – Time aggregation (down sampling) handed off to the individual monitors (AMBARI- 20758) – Cluster aggregation done online only for metrics which need it.  Tee to external storage by providing pluggable sink interface  Metric Based Anomaly Detection using Statistical and Machine learning technique (AMBARI-21105)
  • 23. 23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Thank You Last Meetup Slides : https://www.slideshare.net/prajrao/apache-ambari-meetup-ams-grafana

Editor's Notes

  • #12: Resources used by AMS Write load 500 Sinks – 867M rows Disk Usage 500 sinks – 81GB
  • #22: Ambari-2.5.0 will support AMS HA Helix – task partitioning & service discovery Restarting daemons not needed. Auto discovered Client side