SlideShare a Scribd company logo
Season 3 Episode 2
Oct 14, 2015
Welcome!
Agenda
NetflixOSS Website Relaunch @aspyker
Fenzo @podila
Vector @spiermar
Linux Java perf support @brendangregg
FIDO, Sleepy Puppy, Lemur @chanjbs
Falcor @jhusain
Website Relaunch
http://netflix.github.io
Goals of the Relaunch
● Show how the pieces fit together
○ Projects now discussed with each other in context
● OSS categories mirror internal teams
○ No artificial categories, focal points for each area
● Focus on projects that are core to Netflix
○ Projects mentioned are core and strategic
● Adding project-branded websites
High Level Categories
Big Data
Tools and services for (big) data
Build and Delivery Tools
Taking code from desktop to the cloud
Common Runtimes Service & Libraries
Runtime containers, libraries & services that power
microservices
High Level Categories
Data Persistence
Storing and serving data in the cloud
Insight, Reliability and Performance
Providing actionable insight at massive scale
High Level Categories
Security
Security for dynamic and distributed environments
User Interface
Libraries to help you build rich client applications
Fenzo
A generic, plug-ins based scheduling library for
Apache Mesos frameworks
Fenzo scheduling library
Heterogeneous
resources
Autoscaling
of cluster
Visibility of
scheduler
actions
Plugins for
Constraints, Fitness
High speed
Heterogeneous
task requests
Fenzo: scheduling model
Fitness
Pending
Assigned
Urgency
Fenzo: scheduling optimizations
Speed Accuracy
First fit assignment Optimal assignment
Real world tradeoffs
~ O (1) ~ O (N * M)1
1
Assuming tasks are not reassigned
Fenzo: fitness, constraints plugins
● Fitness value (0.0 - 1.0)
○ Degree of fitness - first fit, best fit, worst fit
○ Composable evaluators
○ e.g., bin packing
● Constraints
○ Hard constraints filter appropriate resources
○ Soft constraints specify preferences
○ e.g., zone balancing, instance type preferences
Fenzo: bin packing experiment
Bin pack tasks using Fenzo’s built-in CPU bin packer
Fenzo: cluster autoscaling
ASG/Cluster:
mantisagent
MinIdle: 8
MaxIdle: 20
CooldownSecs:
360
ASG/Cluster:
mantisagent
MinIdle: 8
MaxIdle: 20
CooldownSecs:
360
ASG/cluster:
computeCluster
MinIdle: 8
MaxIdle: 20
CooldownSecs: 360
Fenzo
ScaleUp
action:
Cluster, N
ScaleDown
action:
Cluster,
HostList
Fenzo: what’s next
● Task management SLAs
● Support for newer Mesos features
● Collaboration
NetflixOSS Meetup season 3 episode 2
Why?
● Easier way for users to troubleshoot
performance issues
● Access to low-level and specialized metrics
● Easier way to visualize and understand
● High-resolution data to detect anomalies
● Real-time and on-demand
● No additional overhead when not in use
● Something easier than SSH
● And simpler than full-fledged monitoring
solution
What?
● Is a Performance Monitoring tool
● Host-Level, On-Demand, High-Resolution Metrics (1 second)
● Client-side Application, User-friendly web UI
● Configurable dashboards and widgets
● Leverages Performance Co-Pilot (PCP)
● Stateless and Lightweight Metric Collection
● No persistence
● System Metrics: CPU, Memory, Network, Disk, ...
● Application Metrics*: Java, Memcached, C*, ElasticSearch, Apache
● Extensible. Custom metric agents and widgets.
* Agents are available, but not included by default.
NetflixOSS Meetup season 3 episode 2
NetflixOSS Meetup season 3 episode 2
What’s Next?
● Interface for different backends
● Better support for containers;
○ With container-specific dashboard and widgets.
● Native flame graph integration;
○ With our d3.js flame graph plugin.
CPU Flame Graphs
Java Mixed-Mode Flame Graphs
● Needs JDK8u60+ with
-XX:+PreserveFramePointer
○ May have some cost
● Lets Linux perf (perf_events)
see Java method frames
● Use with perf-map-agent for
symbols
● http://techblog.netflix.
com/2015/07/java-in-flames.
html
Java
Kernel
JVMGC
See all the things...
D3.js Flame Graph Plugin
Netflix Security OSS
FIDO - Security Response Orchestration
● Centralize alerts
● Enrich with data
○ User, machine
○ Threat
● Prioritize response
● Automate first
actions
Netflix's FIDO is not a part of or service of the FIDO Alliance
Cross-Site Scripting
Sleepy Puppy - XSS Testing Framework
● Visibility for non-
targeted vulnerable
apps
● Assessment
management over
time
Sleepy Puppy - Assessments and Payloads
TLS Certificate Management
Lemur - x.509 Certificate Orchestration
● Pluggable CA
support
● Private key
management and
distribution
● Expiry monitoring
Lemur Certificate Request
NetflixOSS Meetup season 3 episode 2
Every user wants to believe the entire cloud is
sitting right on their device.
Falcor let’s you code that way.
Let's talk about REST.
The Web used to be a place to get things.
Today, the Web is a place to do things.
Web pages use a small
number of large resources.
Web apps use large numbers of
small resources.
What is ?
Falcor is not a replacement for your Database,
MVC Framework, or your Web Server.
Falcor fits into your existing stack,
allowing the layers to communicate
more efficiently.
model.json
Demo
NetflixOSS Meetup season 3 episode 2
Falcor
● Designed for needs of Web Apps
● Model domain with JSON Graph
● Optimizes Data Access using...
○ caching
○ batching
○ path optimization
Falcor Roadmap
● netflix.github.io/falcor
● Java version of Router coming
● iOS client coming
Wrapup
● Thanks for attending!
● Join us in the courtyard for food and drinks

More Related Content

PDF
Netflix Open Source Meetup Season 4 Episode 2
PDF
Netflix Open Source Meetup Season 3 Episode 2
PDF
Netflix Open Source Meetup Season 4 Episode 1
PDF
Netflix oss season 1 episode 3
PDF
Netflix oss season 2 episode 1 - meetup Lightning talks
PDF
NetflixOSS Meetup season 3 episode 1
PDF
20140708 - Jeremy Edberg: How Netflix Delivers Software
PDF
Netflix and Containers: Not A Stranger Thing
Netflix Open Source Meetup Season 4 Episode 2
Netflix Open Source Meetup Season 3 Episode 2
Netflix Open Source Meetup Season 4 Episode 1
Netflix oss season 1 episode 3
Netflix oss season 2 episode 1 - meetup Lightning talks
NetflixOSS Meetup season 3 episode 1
20140708 - Jeremy Edberg: How Netflix Delivers Software
Netflix and Containers: Not A Stranger Thing

What's hot (20)

PPTX
Netflix Story of Embracing the Cloud
PDF
Triangle Devops Meetup 10/2015
PDF
Dev309 from asgard to zuul - netflix oss-final
PDF
CS80A Foothill College Open Source Talk
PDF
NetflixOSS Meetup S6E1 - Titus & Containers
PDF
Netflix Cloud Platform and Open Source
PDF
Netflix Container Scheduling and Execution - QCon New York 2016
PDF
NetflixOSS and ZeroToDocker Talk
PDF
QConSF18 - Disenchantment: Netflix Titus, its Feisty Team, and Daemons
PDF
CMP376 - Another Week, Another Million Containers on Amazon EC2
PDF
An approach for migrating enterprise apps into open stack
PDF
NetflixOSS Meetup S6E2 - Spinnaker, Kayenta
PDF
Timed Text At Netflix
PDF
Netflix OSS Meetup Season 4 Episode 4
PDF
Season 7 Episode 1 - Tools for Data Scientists
PPTX
Distributed architecture in a cloud native microservices ecosystem
PDF
Neutron high availability open stack architecture openstack israel event 2015
PDF
Kafka on Kubernetes—From Evaluation to Production at Intuit
PDF
Netflix: From Zero to Production-Ready in Minutes (QCon 2017)
PDF
QCon NYC: Distributed systems in practice, in theory
Netflix Story of Embracing the Cloud
Triangle Devops Meetup 10/2015
Dev309 from asgard to zuul - netflix oss-final
CS80A Foothill College Open Source Talk
NetflixOSS Meetup S6E1 - Titus & Containers
Netflix Cloud Platform and Open Source
Netflix Container Scheduling and Execution - QCon New York 2016
NetflixOSS and ZeroToDocker Talk
QConSF18 - Disenchantment: Netflix Titus, its Feisty Team, and Daemons
CMP376 - Another Week, Another Million Containers on Amazon EC2
An approach for migrating enterprise apps into open stack
NetflixOSS Meetup S6E2 - Spinnaker, Kayenta
Timed Text At Netflix
Netflix OSS Meetup Season 4 Episode 4
Season 7 Episode 1 - Tools for Data Scientists
Distributed architecture in a cloud native microservices ecosystem
Neutron high availability open stack architecture openstack israel event 2015
Kafka on Kubernetes—From Evaluation to Production at Intuit
Netflix: From Zero to Production-Ready in Minutes (QCon 2017)
QCon NYC: Distributed systems in practice, in theory
Ad

Viewers also liked (14)

PPTX
2012 re:Invent Netflix: embracing the cloud final
PPTX
Surge 2013: Maximizing Scalability, Resiliency, and Engineering Velocity in t...
PDF
Engineering Velocity: Shifting the Curve at Netflix
PPTX
From Code to the Monkeys: Continuous Delivery at Netflix
PDF
QConSF 2014 talk on Netflix Mantis, a stream processing system
PPTX
Engineering Tools at Netflix: Enabling Continuous Delivery
PDF
OTT & The Future of Connected TV
PDF
Continuous Delivery at Netflix, and beyond
PPTX
Implementing DevOps
PPTX
Application Networks: Microservices and APIs at Netflix
PPTX
Splitting the Check on Compliance and Security
PDF
How Netflix thinks of DevOps. Spoiler: we don’t.
PPTX
Hadoop and HBase experiences in perf log project
PPTX
Culture
2012 re:Invent Netflix: embracing the cloud final
Surge 2013: Maximizing Scalability, Resiliency, and Engineering Velocity in t...
Engineering Velocity: Shifting the Curve at Netflix
From Code to the Monkeys: Continuous Delivery at Netflix
QConSF 2014 talk on Netflix Mantis, a stream processing system
Engineering Tools at Netflix: Enabling Continuous Delivery
OTT & The Future of Connected TV
Continuous Delivery at Netflix, and beyond
Implementing DevOps
Application Networks: Microservices and APIs at Netflix
Splitting the Check on Compliance and Security
How Netflix thinks of DevOps. Spoiler: we don’t.
Hadoop and HBase experiences in perf log project
Culture
Ad

Similar to NetflixOSS Meetup season 3 episode 2 (20)

PDF
Netflix container scheduling talk at stanford final
PDF
Resource Scheduling using Apache Mesos in Cloud Native Environments
PDF
Prezo at-mesos con2015-final
PDF
Podila mesos con-northamerica_sep2017
KEY
Real World Cloud Application Security
PDF
Podila mesos con europe keynote aug sep 2016
PDF
Aws uk ug #8 not everything that happens in vegas stay in vegas
PDF
Txlf2012
PDF
NetflixOSS meetup lightning talks and roadmap
PDF
Scaling capacity while saving cash
PDF
HPC on OpenStack
PDF
NetflixOSS Open House Lightning talks
PDF
Practice and challenges from building IaaS
PPT
Automating Your CloudStack Cloud with Puppet
PDF
QNIBTerminal: Understand your datacenter by overlaying multiple information l...
PDF
Automating CloudStack with Puppet - David Nalley
PPTX
Industrialization, be fast be furious!
PDF
Netflix Architecture and Open Source
PPTX
To Build My Own Cloud with Blackjack…
PDF
Infrastructure Around Hadoop
Netflix container scheduling talk at stanford final
Resource Scheduling using Apache Mesos in Cloud Native Environments
Prezo at-mesos con2015-final
Podila mesos con-northamerica_sep2017
Real World Cloud Application Security
Podila mesos con europe keynote aug sep 2016
Aws uk ug #8 not everything that happens in vegas stay in vegas
Txlf2012
NetflixOSS meetup lightning talks and roadmap
Scaling capacity while saving cash
HPC on OpenStack
NetflixOSS Open House Lightning talks
Practice and challenges from building IaaS
Automating Your CloudStack Cloud with Puppet
QNIBTerminal: Understand your datacenter by overlaying multiple information l...
Automating CloudStack with Puppet - David Nalley
Industrialization, be fast be furious!
Netflix Architecture and Open Source
To Build My Own Cloud with Blackjack…
Infrastructure Around Hadoop

More from Ruslan Meshenberg (7)

PDF
Netflix oss past-present-future
PDF
NetflixOSS season 2 episode 2 - Reactive / Async
PDF
OSS Think Tank - NetflixOSS - OSS as a Competitive Differentiator
PPTX
Arc305 how netflix leverages multiple regions to increase availability an i...
PPTX
Svc 202-netflix-open-source
PPTX
re:Invent 2012 Optimizing Cassandra
PDF
The Netflix Open Source Platform
Netflix oss past-present-future
NetflixOSS season 2 episode 2 - Reactive / Async
OSS Think Tank - NetflixOSS - OSS as a Competitive Differentiator
Arc305 how netflix leverages multiple regions to increase availability an i...
Svc 202-netflix-open-source
re:Invent 2012 Optimizing Cassandra
The Netflix Open Source Platform

Recently uploaded (20)

PDF
B.Tech (Electrical Engineering ) 2024 syllabus.pdf
PPTX
MET 305 MODULE 1 KTU 2019 SCHEME 25.pptx
PPTX
Internship_Presentation_Final engineering.pptx
PDF
BRKDCN-2613.pdf Cisco AI DC NVIDIA presentation
PPTX
Fluid Mechanics, Module 3: Basics of Fluid Mechanics
PDF
A Framework for Securing Personal Data Shared by Users on the Digital Platforms
PPTX
Lesson 3_Tessellation.pptx finite Mathematics
PPT
Drone Technology Electronics components_1
PDF
Chad Ayach - A Versatile Aerospace Professional
PPTX
Ship’s Structural Components.pptx 7.7 Mb
PDF
Introduction to Data Science: data science process
PDF
ETO & MEO Certificate of Competency Questions and Answers
PDF
International Journal of Information Technology Convergence and Services (IJI...
PDF
dse_final_merit_2025_26 gtgfffffcjjjuuyy
PPTX
436813905-LNG-Process-Overview-Short.pptx
PPTX
Road Safety tips for School Kids by a k maurya.pptx
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
ANIMAL INTERVENTION WARNING SYSTEM (4).pptx
PPTX
TE-AI-Unit VI notes using planning model
PPTX
Unit 5 BSP.pptxytrrftyyydfyujfttyczcgvcd
B.Tech (Electrical Engineering ) 2024 syllabus.pdf
MET 305 MODULE 1 KTU 2019 SCHEME 25.pptx
Internship_Presentation_Final engineering.pptx
BRKDCN-2613.pdf Cisco AI DC NVIDIA presentation
Fluid Mechanics, Module 3: Basics of Fluid Mechanics
A Framework for Securing Personal Data Shared by Users on the Digital Platforms
Lesson 3_Tessellation.pptx finite Mathematics
Drone Technology Electronics components_1
Chad Ayach - A Versatile Aerospace Professional
Ship’s Structural Components.pptx 7.7 Mb
Introduction to Data Science: data science process
ETO & MEO Certificate of Competency Questions and Answers
International Journal of Information Technology Convergence and Services (IJI...
dse_final_merit_2025_26 gtgfffffcjjjuuyy
436813905-LNG-Process-Overview-Short.pptx
Road Safety tips for School Kids by a k maurya.pptx
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
ANIMAL INTERVENTION WARNING SYSTEM (4).pptx
TE-AI-Unit VI notes using planning model
Unit 5 BSP.pptxytrrftyyydfyujfttyczcgvcd

NetflixOSS Meetup season 3 episode 2

  • 1. Season 3 Episode 2 Oct 14, 2015
  • 3. Agenda NetflixOSS Website Relaunch @aspyker Fenzo @podila Vector @spiermar Linux Java perf support @brendangregg FIDO, Sleepy Puppy, Lemur @chanjbs Falcor @jhusain
  • 5. Goals of the Relaunch ● Show how the pieces fit together ○ Projects now discussed with each other in context ● OSS categories mirror internal teams ○ No artificial categories, focal points for each area ● Focus on projects that are core to Netflix ○ Projects mentioned are core and strategic ● Adding project-branded websites
  • 6. High Level Categories Big Data Tools and services for (big) data Build and Delivery Tools Taking code from desktop to the cloud Common Runtimes Service & Libraries Runtime containers, libraries & services that power microservices
  • 7. High Level Categories Data Persistence Storing and serving data in the cloud Insight, Reliability and Performance Providing actionable insight at massive scale
  • 8. High Level Categories Security Security for dynamic and distributed environments User Interface Libraries to help you build rich client applications
  • 9. Fenzo A generic, plug-ins based scheduling library for Apache Mesos frameworks
  • 10. Fenzo scheduling library Heterogeneous resources Autoscaling of cluster Visibility of scheduler actions Plugins for Constraints, Fitness High speed Heterogeneous task requests
  • 12. Fenzo: scheduling optimizations Speed Accuracy First fit assignment Optimal assignment Real world tradeoffs ~ O (1) ~ O (N * M)1 1 Assuming tasks are not reassigned
  • 13. Fenzo: fitness, constraints plugins ● Fitness value (0.0 - 1.0) ○ Degree of fitness - first fit, best fit, worst fit ○ Composable evaluators ○ e.g., bin packing ● Constraints ○ Hard constraints filter appropriate resources ○ Soft constraints specify preferences ○ e.g., zone balancing, instance type preferences
  • 14. Fenzo: bin packing experiment Bin pack tasks using Fenzo’s built-in CPU bin packer
  • 15. Fenzo: cluster autoscaling ASG/Cluster: mantisagent MinIdle: 8 MaxIdle: 20 CooldownSecs: 360 ASG/Cluster: mantisagent MinIdle: 8 MaxIdle: 20 CooldownSecs: 360 ASG/cluster: computeCluster MinIdle: 8 MaxIdle: 20 CooldownSecs: 360 Fenzo ScaleUp action: Cluster, N ScaleDown action: Cluster, HostList
  • 16. Fenzo: what’s next ● Task management SLAs ● Support for newer Mesos features ● Collaboration
  • 18. Why? ● Easier way for users to troubleshoot performance issues ● Access to low-level and specialized metrics ● Easier way to visualize and understand ● High-resolution data to detect anomalies ● Real-time and on-demand ● No additional overhead when not in use ● Something easier than SSH ● And simpler than full-fledged monitoring solution
  • 19. What? ● Is a Performance Monitoring tool ● Host-Level, On-Demand, High-Resolution Metrics (1 second) ● Client-side Application, User-friendly web UI ● Configurable dashboards and widgets ● Leverages Performance Co-Pilot (PCP) ● Stateless and Lightweight Metric Collection ● No persistence ● System Metrics: CPU, Memory, Network, Disk, ... ● Application Metrics*: Java, Memcached, C*, ElasticSearch, Apache ● Extensible. Custom metric agents and widgets. * Agents are available, but not included by default.
  • 22. What’s Next? ● Interface for different backends ● Better support for containers; ○ With container-specific dashboard and widgets. ● Native flame graph integration; ○ With our d3.js flame graph plugin.
  • 24. Java Mixed-Mode Flame Graphs ● Needs JDK8u60+ with -XX:+PreserveFramePointer ○ May have some cost ● Lets Linux perf (perf_events) see Java method frames ● Use with perf-map-agent for symbols ● http://techblog.netflix. com/2015/07/java-in-flames. html Java Kernel JVMGC
  • 25. See all the things...
  • 28. FIDO - Security Response Orchestration ● Centralize alerts ● Enrich with data ○ User, machine ○ Threat ● Prioritize response ● Automate first actions Netflix's FIDO is not a part of or service of the FIDO Alliance
  • 30. Sleepy Puppy - XSS Testing Framework ● Visibility for non- targeted vulnerable apps ● Assessment management over time
  • 31. Sleepy Puppy - Assessments and Payloads
  • 33. Lemur - x.509 Certificate Orchestration ● Pluggable CA support ● Private key management and distribution ● Expiry monitoring
  • 36. Every user wants to believe the entire cloud is sitting right on their device.
  • 37. Falcor let’s you code that way.
  • 39. The Web used to be a place to get things.
  • 40. Today, the Web is a place to do things.
  • 41. Web pages use a small number of large resources.
  • 42. Web apps use large numbers of small resources.
  • 44. Falcor is not a replacement for your Database, MVC Framework, or your Web Server.
  • 45. Falcor fits into your existing stack, allowing the layers to communicate more efficiently.
  • 47. Demo
  • 49. Falcor ● Designed for needs of Web Apps ● Model domain with JSON Graph ● Optimizes Data Access using... ○ caching ○ batching ○ path optimization
  • 50. Falcor Roadmap ● netflix.github.io/falcor ● Java version of Router coming ● iOS client coming
  • 51. Wrapup ● Thanks for attending! ● Join us in the courtyard for food and drinks