SlideShare a Scribd company logo
A closer look to
 Locaweb Iaas
        Gleicon Moraes
 Engineering Manager PaaS/IaaS
  @gleicon - http://blog.7co.cc
Agenda
Agenda
•   Engineering Team
Agenda
•   Engineering Team
•   IaaS
Agenda
•   Engineering Team
•   IaaS
•   Virtual/Physical servers
Agenda
•   Engineering Team
•   IaaS
•   Virtual/Physical servers
•   Architecture
Agenda
•   Engineering Team
•   IaaS
•   Virtual/Physical servers
•   Architecture
•   OSS
Agenda
•   Engineering Team
•   IaaS
•   Virtual/Physical servers
•   Architecture
•   OSS
•   Provisioning
Agenda
•   Engineering Team
•   IaaS
•   Virtual/Physical servers
•   Architecture
•   OSS
•   Provisioning
•   CMDB/Closed Loop
Agenda
•   Engineering Team
•   IaaS
•   Virtual/Physical servers
•   Architecture
•   OSS
•   Provisioning
•   CMDB/Closed Loop
•   Resource usage gathering
Agenda
•   Engineering Team
•   IaaS
•   Virtual/Physical servers
•   Architecture
•   OSS
•   Provisioning
•   CMDB/Closed Loop
•   Resource usage gathering
•   Software defined networks
Engineering Team
• We aim to be efficient
• DC and IaaS Automation
• IaaS and PaaS products
• Email and Domain Registration products
• Coffee/psychological help/counseling
• 40 people team (devs/architects/1 master
  devops)
IaaS - NIST definition
           “ The capability provided to the consumer is to provision
           processing, storage, networks, and other fundamental
           computing resources where the consumer is able to deploy
           and run arbitrary software, which can include operating
           systems and applications.
           The consumer does not manage or control the underlying
           cloud infrastructure but has control over operating systems,
           storage, and deployed applications; and possibly limited control
           of select networking components (e.g., host firewalls).”



* http://csrc.nist.gov/publications/nistpubs/800-145/SP800-145.pdf
IaaS - Wikipedia
           “In this most basic cloud service model, cloud providers offer computers, as
           physical or more often as virtual machines, and other resources. The virtual
           machines are run as guests by a hypervisor, such as Xen or KVM. Management
           of pools of hypervisors by the cloud operational support system leads to the
           ability to scale to support a large number of virtual machines. Other resources
           in IaaS clouds include images in a virtual machine image library, raw (block) and
           file-based storage, firewalls, load balancers, IP addresses, virtual local area
           networks (VLANs), and software bundles.[46] IaaS cloud providers supply
           these resources on demand from their large pools installed in data centers. For
           wide area connectivity, the Internet can be used or—in carrier clouds --
           dedicated virtual private networks can be configured.”




* http://en.wikipedia.org/wiki/Infrastructure_as_a_service#Service_models
IaaS - tl;dr


“Automate infrastructure such as the customer will not
know the underlying details, will not manage them and can
provision services automagically.”
IaaS - building blocks

• Servers: virtual and physical
• Storage area
• Network devices: firewall, switches, load
  balancer
IaaS - High Level

• Automation
• Resource Management
• Install, Uninstall, Migrate
• High Availability, Scalability, Capacity
  Planning
IaaS at Locaweb
•   3 DCs, 6k Servers (physical), 1k storages 6PB area,
    12K network equipments/ports, > 100 Km of
    cables
•   10k VMs, 3.2M email accounts, 250k hosting
    customers, ~500k sites, ~600k DB
•   130 people at day to day 24/7 Operations team
    (from DC basics to managing apps and platforms),
    < 40 sysadmins
•   Currently ~ 18 people from Engineering team
    taking care of IaaS
Virtual and Physical

•   Single tenant per Physical Server

•   Single tenant per VM

•   Multiple tenants per VM

•   Multiple tenants per Physical Server

•   Multiple VMs per Physical Server
Virtual and Physical

• Single tenant per Physical Server
• Single tenant per VM
• Multiple tenants per VM
• Multiple tenants per Physical Server
• Multiple VMs per Physical Server = Cloud
Cloud
•   Check back NIST definition
•   Hypervisor + set of servers + set of storages
    + network = time sharing
•   Distinct capacity planning than physical servers
•   Flexible configuration options
•   Vertical Scaling
•   Horizontal scaling
Architecture - Cloud
                                          Internet


                                        Main Network




                      Network
Physical Servers                         Firewall
 hypervisor   ovs      Gear



Simplestack         SimpleNet/Quantum
Architecture - Physical
                                         Internet


                                      Main Network


   Why not ?

                   Network
Physical Servers                        Firewall
                    Gear



  Simplestack            SimpleNet/Quantum
OSS
OSS
•   Ruby, Rails, Python, CFEngine, PostgreSQL, MySQL,
    Cassandra, Redis, XEN, KVM, Haskell, Cyclone.io,
    bottle.py, Quantum, R, EjabberD, Resque, lots of
    gem, eggs
OSS
•   Ruby, Rails, Python, CFEngine, PostgreSQL, MySQL,
    Cassandra, Redis, XEN, KVM, Haskell, Cyclone.io,
    bottle.py, Quantum, R, EjabberD, Resque, lots of
    gem, eggs

•   Up-to-date technology
OSS
•   Ruby, Rails, Python, CFEngine, PostgreSQL, MySQL,
    Cassandra, Redis, XEN, KVM, Haskell, Cyclone.io,
    bottle.py, Quantum, R, EjabberD, Resque, lots of
    gem, eggs

•   Up-to-date technology

•   No lock-ins
OSS
•   Ruby, Rails, Python, CFEngine, PostgreSQL, MySQL,
    Cassandra, Redis, XEN, KVM, Haskell, Cyclone.io,
    bottle.py, Quantum, R, EjabberD, Resque, lots of
    gem, eggs

•   Up-to-date technology

•   No lock-ins

•   Vendor neutral
OSS
•   Ruby, Rails, Python, CFEngine, PostgreSQL, MySQL,
    Cassandra, Redis, XEN, KVM, Haskell, Cyclone.io,
    bottle.py, Quantum, R, EjabberD, Resque, lots of
    gem, eggs

• Up-to-date technology
• No lock-ins
• Vendor neutral
• We contribute back
Our projects


http://locaweb.github.com
Our projects
Our projects
•   Leela - Data collection monster
Our projects
•   Leela - Data collection monster
•   SimpleStack - Provisioning made easy
Our projects
•   Leela - Data collection monster
•   SimpleStack - Provisioning made easy
•   SimpleNet - OVS and FW controller
Our projects
•   Leela - Data collection monster
•   SimpleStack - Provisioning made easy
•   SimpleNet - OVS and FW controller
•   NET/L2 - Controller/Inventory for network
    equipment
Our projects
•   Leela - Data collection monster
•   SimpleStack - Provisioning made easy
•   SimpleNet - OVS and FW controller
•   NET/L2 - Controller/Inventory for network
    equipment
•   BrickLayer - packaging for normal people
Our projects
•   Leela - Data collection monster
•   SimpleStack - Provisioning made easy
•   SimpleNet - OVS and FW controller
•   NET/L2 - Controller/Inventory for network
    equipment
•   BrickLayer - packaging for normal people
•   Logix - Graylog2 message bus for log streams
Our projects
•   Leela - Data collection monster
•   SimpleStack - Provisioning made easy
•   SimpleNet - OVS and FW controller
•   NET/L2 - Controller/Inventory for network
    equipment
•   BrickLayer - packaging for normal people
•   Logix - Graylog2 message bus for log streams
•   xenapi-ruby - XEN API bindings for Ruby
Our projects
•   Leela - Data collection monster
•   SimpleStack - Provisioning made easy
•   SimpleNet - OVS and FW controller
•   NET/L2 - Controller/Inventory for network
    equipment
•   BrickLayer - packaging for normal people
•   Logix - Graylog2 message bus for log streams
•   xenapi-ruby - XEN API bindings for Ruby
•   otto, debundler, bpmachine and more each week
Our Contributions
Our Contributions

•   Contributed to Quantum, from Openstack
Our Contributions

•   Contributed to Quantum, from Openstack
•   Snorby/snort contributions
Our Contributions

•   Contributed to Quantum, from Openstack
•   Snorby/snort contributions
•   Mod_security for Nginx and helping on IIS
Our Contributions

•   Contributed to Quantum, from Openstack
•   Snorby/snort contributions
•   Mod_security for Nginx and helping on IIS
•   hired consulting from grsecurity and dovecot
    teams - we support OSS companies
Bricklayer
Bricklayer

•   First opensource project from Locaweb
Bricklayer

•   First opensource project from Locaweb
•   Package builder (deb + rpm) straight from git
Bricklayer

•   First opensource project from Locaweb
•   Package builder (deb + rpm) straight from git
•   150+ projects, 500+ builds/day
Bricklayer

•   First opensource project from Locaweb
•   Package builder (deb + rpm) straight from git
•   150+ projects, 500+ builds/day
•   tag your project, get the packages done and on
    repositories
Logix
Logix

•   We have lots of logs. Everything broke.
Logix

•   We have lots of logs. Everything broke.
•   26.753.205.474 lines of log/day
Logix

•   We have lots of logs. Everything broke.
•   26.753.205.474 lines of log/day
•   Highly distributed: local syslog daemon to
    RabbitMQ
Logix

•   We have lots of logs. Everything broke.
•   26.753.205.474 lines of log/day
•   Highly distributed: local syslog daemon to
    RabbitMQ
•   Elastic search + graylog2 to store, search and filter
Provisioning


• Ruby: Panel, Control panel, Scheduler
• Python: Provisioning, Server management,
  Metric collection
• REST APIs to Hypervisor, Network,
  Firewall, XMPP
Provisioning - Cloud


                                                                           Internet
Cloud
 Control Panel   API     Sales                                           Main Network




                                                       Network
                                 Physical Servers                         Firewall
                                  hypervisor   ovs      Gear
                  Provisioner

                                 Simplestack         SimpleNet/Quantum
Provisioning - Servers

   Dedicated Servers

     Control Panel

          API            Closed Loop
                                           Racked Servers
         Sales
                                                                  Internet


                                                                Main Network




                                              Network
                       Physical Servers                          Firewall
                        hypervisor   ovs       Gear



                       Simplestack          SimpleNet/Quantum
Provisioning - Managed Servers
Managed Servers

  Control Panel      Sales
                                         Dedicated Servers

                                           Control Panel
                       PaaS
                                                   API         Closed Loop
                    Provisioner                                                  Racked Servers
                                                  Sales
                                                                                                        Internet
                  Cloud
                   Control Panel   API     Sales                                                      Main Network




                                                                                    Network
                                                             Physical Servers                          Firewall
                                                              hypervisor   ovs       Gear
                                    Provisioner

                                                             Simplestack          SimpleNet/Quantum
Cloud provisioner
                                                      Jobs


                                                     DHCP

                          CMDB

                                                      FW

       API

                                                   quantum/
Control Panel              core           Resque
                                                   simplenet

      Sales

                                                   simplestack



                Notifications      Leela
                                                    console
Closed loop

The closed loop process



      CMDB                API   Conductor   Network



     Futurama                    Cobbler    Hardware
Closed loop
Closed loop

•   All servers get racked, wired, tested and
    configured
Closed loop

•   All servers get racked, wired, tested and
    configured
•   Power management discovery
Closed loop

•   All servers get racked, wired, tested and
    configured
•   Power management discovery
•   Network configuration
Closed loop

•   All servers get racked, wired, tested and
    configured
•   Power management discovery
•   Network configuration
•   OS install: Windows, Linux and OpenSolaris aware
Closed loop

•   All servers get racked, wired, tested and
    configured
•   Power management discovery
•   Network configuration
•   OS install: Windows, Linux and OpenSolaris aware
•   Server life cycle: once deactivated it gets back to
    the pool to be used again
CMDB
    Futurama

   Power audit
                        Ops
     NET/L2           Frontend
                                     Controllers         Database
                        API



     Product                           Resque
   provisioners

  IP provisioning


Server provisioning
                                IT chg
                                                   SAP
                              management
Futurama
                                            Management


                                                 CFEngine


Server side                                         Planet
                                                   Express

                                                   Leela-
                  CF-Agent                         Server


     Conductor-                             CMDB
                  Leela-agent   bkp-agent
       audit
                                                 Cegonha

                                                 Asdrubal

                                                   CFTools
Resource Metering and
  Monitoring - Leela
                                      Cassandra
                Leela-
                          Cassandra               Cassandra
               Lasergun

 Leela-agent
               Leela-
               Reader     Cassandra               Cassandra

                                      Cassandra

                 API
Resource Metering and
  Monitoring - Leela
Resource Metering and
  Monitoring - Leela
•   18k writes/sec
Resource Metering and
  Monitoring - Leela
•   18k writes/sec
•   6 TB total per cluster
Resource Metering and
  Monitoring - Leela
•   18k writes/sec
•   6 TB total per cluster
•   13 baseline metrics + 68 distinct metrics
Resource Metering and
  Monitoring - Leela
•   18k writes/sec
•   6 TB total per cluster
•   13 baseline metrics + 68 distinct metrics
•   ~600GB/mo
Resource Metering and
  Monitoring - Leela
•   18k writes/sec
•   6 TB total per cluster
•   13 baseline metrics + 68 distinct metrics
•   ~600GB/mo
•   1M keys (~5k servers)
Resource Metering and
  Monitoring - Leela
•   18k writes/sec
•   6 TB total per cluster
•   13 baseline metrics + 68 distinct metrics
•   ~600GB/mo
•   1M keys (~5k servers)
•   Write latency: 15 us
Resource Metering and
  Monitoring - Leela
•   18k writes/sec
•   6 TB total per cluster
•   13 baseline metrics + 68 distinct metrics
•   ~600GB/mo
•   1M keys (~5k servers)
•   Write latency: 15 us
•   Read latency: 1s to read 1mo worth of data
Resource Metering and
  Monitoring - Leela
•   18k writes/sec
•   6 TB total per cluster
•   13 baseline metrics + 68 distinct metrics
•   ~600GB/mo
•   1M keys (~5k servers)
•   Write latency: 15 us
•   Read latency: 1s to read 1mo worth of data
•   Down to minute resolution
Resource Metering and
  Monitoring - Leela
•   18k writes/sec
•   6 TB total per cluster
•   13 baseline metrics + 68 distinct metrics
•   ~600GB/mo
•   1M keys (~5k servers)
•   Write latency: 15 us
•   Read latency: 1s to read 1mo worth of data
•   Down to minute resolution
•   http://leela.readthedocs.org/en/latest/intro/archnut.html
Resource Metering and
  Monitoring - Leela
Resource Metering and
  Monitoring - Leela

•   Map/Reduce with SQL like interface:
Resource Metering and
  Monitoring - Leela

•   Map/Reduce with SQL like interface:
    -SELECT mov_avg_samples = 7 (function)
Resource Metering and
  Monitoring - Leela

•   Map/Reduce with SQL like interface:
    -SELECT mov_avg_samples = 7 (function)
    -FROM cpro9559.cpu.cpu8.idle (metric)
Resource Metering and
  Monitoring - Leela

•   Map/Reduce with SQL like interface:
    -SELECT mov_avg_samples = 7 (function)
    -FROM cpro9559.cpu.cpu8.idle (metric)
    -WHERE timestamp >= 1346279003 (timeframe)
Resource Metering and
  Monitoring - Leela
Resource Metering and
  Monitoring - Leela

•   Create charts
Resource Metering and
  Monitoring - Leela

•   Create charts
    - var widget = LEELA.widget(jQuery.(“#target”));
Resource Metering and
  Monitoring - Leela

•   Create charts
    - var widget = LEELA.widget(jQuery.(“#target”));
    - jQuery.ajax(“/v1/pastweek/
      cpro9559.cpu.cpu8.idle”, {dataType: “jsonp”,
      success: widget.render});
Software defined
    network
Software defined
           network

•   Traditional equipment: local config and controller
Software defined
           network

•   Traditional equipment: local config and controller
•   SDN: flows (commands), openflow 1.0, central
    controller, distributed data plane
Software defined
           network

•   Traditional equipment: local config and controller
•   SDN: flows (commands), openflow 1.0, central
    controller, distributed data plane
•   Abstraction over VLANs with ACLs, Tunnels or
    even VLAN QoQ
Software defined
      network
Switch Vendor A                            Switch Vendor B

       Data path (hardware)                       Data path (hardware)

     Control path      Openflow                  Control path        Openflow




         API                  Controller                       OpenVSwitch
Software defined
    network
      Cisco   Force 10    HP    OpenVSwitch   Firewalls




API           Net/L2             Quantum




                         CMDB
Ruby @ Locaweb
Not only for front-end
?
Thanks !

More Related Content

PDF
Por trás da infraestrutura do Cloud - Campus Party 2014
Gleicon Moraes
 
PDF
Locaweb cloud and sdn
Gleicon Moraes
 
PPTX
Azure reference architectures
Masashi Narumoto
 
PDF
WSO2Con USA 2017: Scalable Real-time Complex Event Processing at Uber
WSO2
 
PDF
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
HostedbyConfluent
 
PPTX
NextGen IBM Cloud Monitoring and Logging
Nagesh Ramamoorthy
 
PDF
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
HostedbyConfluent
 
PDF
RedisConf17 - Explosion of Data at the Edge in Equinix
Redis Labs
 
Por trás da infraestrutura do Cloud - Campus Party 2014
Gleicon Moraes
 
Locaweb cloud and sdn
Gleicon Moraes
 
Azure reference architectures
Masashi Narumoto
 
WSO2Con USA 2017: Scalable Real-time Complex Event Processing at Uber
WSO2
 
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
HostedbyConfluent
 
NextGen IBM Cloud Monitoring and Logging
Nagesh Ramamoorthy
 
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
HostedbyConfluent
 
RedisConf17 - Explosion of Data at the Edge in Equinix
Redis Labs
 

What's hot (20)

PDF
Mainframe Integration, Offloading and Replacement with Apache Kafka | Kai Wae...
HostedbyConfluent
 
PPTX
How to Set Up ApsaraDB for RDS on Alibaba Cloud
Alibaba Cloud
 
PDF
Introducing the WSO2 Elastic Load Balancer
WSO2
 
PPTX
Building Lightweight Microservices With Redis & Hydra
Redis Labs
 
PPTX
Big Data Quickstart Series 3: Perform Data Integration
Alibaba Cloud
 
PPTX
Migration to Alibaba Cloud
Alibaba Cloud
 
PPTX
AliCloud Object Storage Service (OSS) Core Features
Alibaba Cloud
 
PDF
War Stories: DIY Kafka
confluent
 
PDF
Using Kafka as a Database For Real-Time Transaction Processing | Chad Preisle...
HostedbyConfluent
 
PPTX
Big Data on Cloud Native Platform
Sunil Govindan
 
PPTX
Leveraging ApsaraDB to Deploy Business Data on the Cloud
Oliver Theobald
 
PPTX
Responding to Digital Transformation With RDS Database Technology
Alibaba Cloud
 
PPTX
Banking and Docker Datacenter - How Containers Drive Agility
Sendachi
 
PDF
Building Applications with Carbon Studio on Premise and Cloud
WSO2
 
PPTX
Zabbix at scale with Elasticsearch
Leandro Totino Pereira
 
PPTX
Discovering Cloud Networking: VPC, VPN, Express Connect & Server Load Balancer
Alibaba Cloud
 
PPTX
Launch and Scale Your E-commerce Website with Magento
Alibaba Cloud
 
PPTX
RedisConf18 - Redis Enterprise on Cloud Native Platforms
Redis Labs
 
PPTX
Choosing the right Cloud Database
Janakiram MSV
 
PPTX
Designing microservices part2
Masashi Narumoto
 
Mainframe Integration, Offloading and Replacement with Apache Kafka | Kai Wae...
HostedbyConfluent
 
How to Set Up ApsaraDB for RDS on Alibaba Cloud
Alibaba Cloud
 
Introducing the WSO2 Elastic Load Balancer
WSO2
 
Building Lightweight Microservices With Redis & Hydra
Redis Labs
 
Big Data Quickstart Series 3: Perform Data Integration
Alibaba Cloud
 
Migration to Alibaba Cloud
Alibaba Cloud
 
AliCloud Object Storage Service (OSS) Core Features
Alibaba Cloud
 
War Stories: DIY Kafka
confluent
 
Using Kafka as a Database For Real-Time Transaction Processing | Chad Preisle...
HostedbyConfluent
 
Big Data on Cloud Native Platform
Sunil Govindan
 
Leveraging ApsaraDB to Deploy Business Data on the Cloud
Oliver Theobald
 
Responding to Digital Transformation With RDS Database Technology
Alibaba Cloud
 
Banking and Docker Datacenter - How Containers Drive Agility
Sendachi
 
Building Applications with Carbon Studio on Premise and Cloud
WSO2
 
Zabbix at scale with Elasticsearch
Leandro Totino Pereira
 
Discovering Cloud Networking: VPC, VPN, Express Connect & Server Load Balancer
Alibaba Cloud
 
Launch and Scale Your E-commerce Website with Magento
Alibaba Cloud
 
RedisConf18 - Redis Enterprise on Cloud Native Platforms
Redis Labs
 
Choosing the right Cloud Database
Janakiram MSV
 
Designing microservices part2
Masashi Narumoto
 
Ad

Viewers also liked (20)

PDF
Tender Service Romania 2015 Achizitii + Licitatii Publice, pe Email
licitatie-publica.ro Tender Service
 
DOCX
Segundo informe de prácticas.
Roberto Robles Gonzalez
 
PDF
Premio al mejor cuguete del año - categoría juguetes conectados
Oscar Valero Serna
 
PDF
Oportunidades de I+D+i para las empresas. Jose Luis Llerena Ruiz. CTAEX
CTAEX
 
PDF
Mobile Trends and Learning - Learning Without Frontiers - Jason DaPonte, THE ...
THE SWARM
 
PDF
Rda web régionale 2014
Trophée des Roses
 
PPT
Staff blogging
SlipSliding
 
PDF
Centrul Istoric al Municipiului Cluj-Napoca/ Brosura Turnuri
cluj. travel
 
PPTX
Comunicacionhumana 130825230112-phpapp02
Jonathan Jiim
 
DOC
jitendra_sonar_R___D[2].
Jitendra Sonar
 
PDF
BFA aprueba la cotización en bolsa de Bankia en julio
financia 2
 
ODP
El gato en casa.
elvkbf
 
PPTX
Forces on an Incline
Alexey Reztsov
 
PDF
Market research at the speed of light
Len Starnes
 
PDF
Barquisimeto archivo
edumass2014
 
PDF
Escolar y besse Epistemologia-fronteriza
CARLALURATI
 
PPTX
Santa fe
Fernando Cisf
 
PPSX
Los medios y la realidad
Profeblog SB
 
PDF
Titulo vapor
Wagner Branco
 
PPS
Musica para recordar
lacuentaslidedejulian
 
Tender Service Romania 2015 Achizitii + Licitatii Publice, pe Email
licitatie-publica.ro Tender Service
 
Segundo informe de prácticas.
Roberto Robles Gonzalez
 
Premio al mejor cuguete del año - categoría juguetes conectados
Oscar Valero Serna
 
Oportunidades de I+D+i para las empresas. Jose Luis Llerena Ruiz. CTAEX
CTAEX
 
Mobile Trends and Learning - Learning Without Frontiers - Jason DaPonte, THE ...
THE SWARM
 
Rda web régionale 2014
Trophée des Roses
 
Staff blogging
SlipSliding
 
Centrul Istoric al Municipiului Cluj-Napoca/ Brosura Turnuri
cluj. travel
 
Comunicacionhumana 130825230112-phpapp02
Jonathan Jiim
 
jitendra_sonar_R___D[2].
Jitendra Sonar
 
BFA aprueba la cotización en bolsa de Bankia en julio
financia 2
 
El gato en casa.
elvkbf
 
Forces on an Incline
Alexey Reztsov
 
Market research at the speed of light
Len Starnes
 
Barquisimeto archivo
edumass2014
 
Escolar y besse Epistemologia-fronteriza
CARLALURATI
 
Santa fe
Fernando Cisf
 
Los medios y la realidad
Profeblog SB
 
Titulo vapor
Wagner Branco
 
Musica para recordar
lacuentaslidedejulian
 
Ad

Similar to A closer look to locaweb IaaS (20)

PDF
Directions for CloudStack Networking
Chiradeep Vittal
 
PDF
The Future of SDN in CloudStack by Chiradeep Vittal
buildacloud
 
PPTX
State of the Container Ecosystem
Vinay Rao
 
PPTX
Project RedDwarf - Database Services in the Cloud.pptx
OpenStack Foundation
 
PPTX
What are clouds made from
John Garbutt
 
PDF
Webinar - DreamObjects/Ceph Case Study
Ceph Community
 
PPTX
Climb Technical Overview
Arif Ali
 
PPTX
OpenStack and Windows
Alessandro Pilotti
 
PPTX
Laying OpenStack Cinder Block Services
Kenneth Hui
 
PPTX
Integrating OpenStack To Existing Infrastructure
Hui Cheng
 
PDF
Better, faster, cheaper infrastructure with apache cloud stack and riak cs redux
John Burwell
 
PDF
Cloud Native Camel Riding
Christian Posta
 
PPT
Scalable networking in Apache CloudStack
Chiradeep Vittal
 
PDF
Achieving Infrastructure Portability with Chef
Matt Ray
 
PPT
Openstack presentation
Sankalp Jain
 
PDF
OpenStack Block Storage 101
NetApp
 
PDF
Getting started with Riak in the Cloud
Ines Sombra
 
PPTX
20191201 kubernetes managed weblogic revival - part 1
makker_nl
 
PPTX
Meetup #3: Migrate a fast scale system to AWS
AWS Vietnam Community
 
PPTX
Intro to Docker October 2013
Docker, Inc.
 
Directions for CloudStack Networking
Chiradeep Vittal
 
The Future of SDN in CloudStack by Chiradeep Vittal
buildacloud
 
State of the Container Ecosystem
Vinay Rao
 
Project RedDwarf - Database Services in the Cloud.pptx
OpenStack Foundation
 
What are clouds made from
John Garbutt
 
Webinar - DreamObjects/Ceph Case Study
Ceph Community
 
Climb Technical Overview
Arif Ali
 
OpenStack and Windows
Alessandro Pilotti
 
Laying OpenStack Cinder Block Services
Kenneth Hui
 
Integrating OpenStack To Existing Infrastructure
Hui Cheng
 
Better, faster, cheaper infrastructure with apache cloud stack and riak cs redux
John Burwell
 
Cloud Native Camel Riding
Christian Posta
 
Scalable networking in Apache CloudStack
Chiradeep Vittal
 
Achieving Infrastructure Portability with Chef
Matt Ray
 
Openstack presentation
Sankalp Jain
 
OpenStack Block Storage 101
NetApp
 
Getting started with Riak in the Cloud
Ines Sombra
 
20191201 kubernetes managed weblogic revival - part 1
makker_nl
 
Meetup #3: Migrate a fast scale system to AWS
AWS Vietnam Community
 
Intro to Docker October 2013
Docker, Inc.
 

More from Gleicon Moraes (18)

PDF
Como arquiteturas de dados quebram
Gleicon Moraes
 
PDF
Arquitetura emergente - sobre cultura devops
Gleicon Moraes
 
PDF
API Gateway report
Gleicon Moraes
 
PDF
DNAD 2015 - Como a arquitetura emergente de sua aplicação pode jogar contra ...
Gleicon Moraes
 
PDF
QCon SP 2015 - Advogados do diabo: como a arquitetura emergente de sua aplica...
Gleicon Moraes
 
PDF
Semi Automatic Sentiment Analysis
Gleicon Moraes
 
PPT
L'esprit de l'escalier
Gleicon Moraes
 
PPT
OSCon - Performance vs Scalability
Gleicon Moraes
 
PDF
Architectural Anti Patterns - Notes on Data Distribution and Handling Failures
Gleicon Moraes
 
PDF
Architecture by Accident
Gleicon Moraes
 
PDF
Patterns of fail
Gleicon Moraes
 
PDF
Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)
Gleicon Moraes
 
PDF
Architectural anti-patterns for data handling
Gleicon Moraes
 
PDF
Architectural anti patterns_for_data_handling
Gleicon Moraes
 
PDF
RestMQ - HTTP/Redis based Message Queue
Gleicon Moraes
 
PDF
NoSQL and SQL Anti Patterns
Gleicon Moraes
 
PDF
Redis
Gleicon Moraes
 
PDF
NoSql Introduction
Gleicon Moraes
 
Como arquiteturas de dados quebram
Gleicon Moraes
 
Arquitetura emergente - sobre cultura devops
Gleicon Moraes
 
API Gateway report
Gleicon Moraes
 
DNAD 2015 - Como a arquitetura emergente de sua aplicação pode jogar contra ...
Gleicon Moraes
 
QCon SP 2015 - Advogados do diabo: como a arquitetura emergente de sua aplica...
Gleicon Moraes
 
Semi Automatic Sentiment Analysis
Gleicon Moraes
 
L'esprit de l'escalier
Gleicon Moraes
 
OSCon - Performance vs Scalability
Gleicon Moraes
 
Architectural Anti Patterns - Notes on Data Distribution and Handling Failures
Gleicon Moraes
 
Architecture by Accident
Gleicon Moraes
 
Patterns of fail
Gleicon Moraes
 
Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)
Gleicon Moraes
 
Architectural anti-patterns for data handling
Gleicon Moraes
 
Architectural anti patterns_for_data_handling
Gleicon Moraes
 
RestMQ - HTTP/Redis based Message Queue
Gleicon Moraes
 
NoSQL and SQL Anti Patterns
Gleicon Moraes
 
NoSql Introduction
Gleicon Moraes
 

Recently uploaded (20)

PDF
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PPTX
IoT Sensor Integration 2025 Powering Smart Tech and Industrial Automation.pptx
Rejig Digital
 
PDF
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
Artjoker Software Development Company
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
Doc9.....................................
SofiaCollazos
 
PPTX
Coupa-Overview _Assumptions presentation
annapureddyn
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PDF
Software Development Methodologies in 2025
KodekX
 
PPT
Coupa-Kickoff-Meeting-Template presentai
annapureddyn
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
PDF
REPORT: Heating appliances market in Poland 2024
SPIUG
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
IoT Sensor Integration 2025 Powering Smart Tech and Industrial Automation.pptx
Rejig Digital
 
Cloud-Migration-Best-Practices-A-Practical-Guide-to-AWS-Azure-and-Google-Clou...
Artjoker Software Development Company
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
Doc9.....................................
SofiaCollazos
 
Coupa-Overview _Assumptions presentation
annapureddyn
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Software Development Methodologies in 2025
KodekX
 
Coupa-Kickoff-Meeting-Template presentai
annapureddyn
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 
REPORT: Heating appliances market in Poland 2024
SPIUG
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 

A closer look to locaweb IaaS

  • 1. A closer look to Locaweb Iaas Gleicon Moraes Engineering Manager PaaS/IaaS @gleicon - http://blog.7co.cc
  • 3. Agenda • Engineering Team
  • 4. Agenda • Engineering Team • IaaS
  • 5. Agenda • Engineering Team • IaaS • Virtual/Physical servers
  • 6. Agenda • Engineering Team • IaaS • Virtual/Physical servers • Architecture
  • 7. Agenda • Engineering Team • IaaS • Virtual/Physical servers • Architecture • OSS
  • 8. Agenda • Engineering Team • IaaS • Virtual/Physical servers • Architecture • OSS • Provisioning
  • 9. Agenda • Engineering Team • IaaS • Virtual/Physical servers • Architecture • OSS • Provisioning • CMDB/Closed Loop
  • 10. Agenda • Engineering Team • IaaS • Virtual/Physical servers • Architecture • OSS • Provisioning • CMDB/Closed Loop • Resource usage gathering
  • 11. Agenda • Engineering Team • IaaS • Virtual/Physical servers • Architecture • OSS • Provisioning • CMDB/Closed Loop • Resource usage gathering • Software defined networks
  • 12. Engineering Team • We aim to be efficient • DC and IaaS Automation • IaaS and PaaS products • Email and Domain Registration products • Coffee/psychological help/counseling • 40 people team (devs/architects/1 master devops)
  • 13. IaaS - NIST definition “ The capability provided to the consumer is to provision processing, storage, networks, and other fundamental computing resources where the consumer is able to deploy and run arbitrary software, which can include operating systems and applications. The consumer does not manage or control the underlying cloud infrastructure but has control over operating systems, storage, and deployed applications; and possibly limited control of select networking components (e.g., host firewalls).” * http://csrc.nist.gov/publications/nistpubs/800-145/SP800-145.pdf
  • 14. IaaS - Wikipedia “In this most basic cloud service model, cloud providers offer computers, as physical or more often as virtual machines, and other resources. The virtual machines are run as guests by a hypervisor, such as Xen or KVM. Management of pools of hypervisors by the cloud operational support system leads to the ability to scale to support a large number of virtual machines. Other resources in IaaS clouds include images in a virtual machine image library, raw (block) and file-based storage, firewalls, load balancers, IP addresses, virtual local area networks (VLANs), and software bundles.[46] IaaS cloud providers supply these resources on demand from their large pools installed in data centers. For wide area connectivity, the Internet can be used or—in carrier clouds -- dedicated virtual private networks can be configured.” * http://en.wikipedia.org/wiki/Infrastructure_as_a_service#Service_models
  • 15. IaaS - tl;dr “Automate infrastructure such as the customer will not know the underlying details, will not manage them and can provision services automagically.”
  • 16. IaaS - building blocks • Servers: virtual and physical • Storage area • Network devices: firewall, switches, load balancer
  • 17. IaaS - High Level • Automation • Resource Management • Install, Uninstall, Migrate • High Availability, Scalability, Capacity Planning
  • 18. IaaS at Locaweb • 3 DCs, 6k Servers (physical), 1k storages 6PB area, 12K network equipments/ports, > 100 Km of cables • 10k VMs, 3.2M email accounts, 250k hosting customers, ~500k sites, ~600k DB • 130 people at day to day 24/7 Operations team (from DC basics to managing apps and platforms), < 40 sysadmins • Currently ~ 18 people from Engineering team taking care of IaaS
  • 19. Virtual and Physical • Single tenant per Physical Server • Single tenant per VM • Multiple tenants per VM • Multiple tenants per Physical Server • Multiple VMs per Physical Server
  • 20. Virtual and Physical • Single tenant per Physical Server • Single tenant per VM • Multiple tenants per VM • Multiple tenants per Physical Server • Multiple VMs per Physical Server = Cloud
  • 21. Cloud • Check back NIST definition • Hypervisor + set of servers + set of storages + network = time sharing • Distinct capacity planning than physical servers • Flexible configuration options • Vertical Scaling • Horizontal scaling
  • 22. Architecture - Cloud Internet Main Network Network Physical Servers Firewall hypervisor ovs Gear Simplestack SimpleNet/Quantum
  • 23. Architecture - Physical Internet Main Network Why not ? Network Physical Servers Firewall Gear Simplestack SimpleNet/Quantum
  • 24. OSS
  • 25. OSS • Ruby, Rails, Python, CFEngine, PostgreSQL, MySQL, Cassandra, Redis, XEN, KVM, Haskell, Cyclone.io, bottle.py, Quantum, R, EjabberD, Resque, lots of gem, eggs
  • 26. OSS • Ruby, Rails, Python, CFEngine, PostgreSQL, MySQL, Cassandra, Redis, XEN, KVM, Haskell, Cyclone.io, bottle.py, Quantum, R, EjabberD, Resque, lots of gem, eggs • Up-to-date technology
  • 27. OSS • Ruby, Rails, Python, CFEngine, PostgreSQL, MySQL, Cassandra, Redis, XEN, KVM, Haskell, Cyclone.io, bottle.py, Quantum, R, EjabberD, Resque, lots of gem, eggs • Up-to-date technology • No lock-ins
  • 28. OSS • Ruby, Rails, Python, CFEngine, PostgreSQL, MySQL, Cassandra, Redis, XEN, KVM, Haskell, Cyclone.io, bottle.py, Quantum, R, EjabberD, Resque, lots of gem, eggs • Up-to-date technology • No lock-ins • Vendor neutral
  • 29. OSS • Ruby, Rails, Python, CFEngine, PostgreSQL, MySQL, Cassandra, Redis, XEN, KVM, Haskell, Cyclone.io, bottle.py, Quantum, R, EjabberD, Resque, lots of gem, eggs • Up-to-date technology • No lock-ins • Vendor neutral • We contribute back
  • 32. Our projects • Leela - Data collection monster
  • 33. Our projects • Leela - Data collection monster • SimpleStack - Provisioning made easy
  • 34. Our projects • Leela - Data collection monster • SimpleStack - Provisioning made easy • SimpleNet - OVS and FW controller
  • 35. Our projects • Leela - Data collection monster • SimpleStack - Provisioning made easy • SimpleNet - OVS and FW controller • NET/L2 - Controller/Inventory for network equipment
  • 36. Our projects • Leela - Data collection monster • SimpleStack - Provisioning made easy • SimpleNet - OVS and FW controller • NET/L2 - Controller/Inventory for network equipment • BrickLayer - packaging for normal people
  • 37. Our projects • Leela - Data collection monster • SimpleStack - Provisioning made easy • SimpleNet - OVS and FW controller • NET/L2 - Controller/Inventory for network equipment • BrickLayer - packaging for normal people • Logix - Graylog2 message bus for log streams
  • 38. Our projects • Leela - Data collection monster • SimpleStack - Provisioning made easy • SimpleNet - OVS and FW controller • NET/L2 - Controller/Inventory for network equipment • BrickLayer - packaging for normal people • Logix - Graylog2 message bus for log streams • xenapi-ruby - XEN API bindings for Ruby
  • 39. Our projects • Leela - Data collection monster • SimpleStack - Provisioning made easy • SimpleNet - OVS and FW controller • NET/L2 - Controller/Inventory for network equipment • BrickLayer - packaging for normal people • Logix - Graylog2 message bus for log streams • xenapi-ruby - XEN API bindings for Ruby • otto, debundler, bpmachine and more each week
  • 41. Our Contributions • Contributed to Quantum, from Openstack
  • 42. Our Contributions • Contributed to Quantum, from Openstack • Snorby/snort contributions
  • 43. Our Contributions • Contributed to Quantum, from Openstack • Snorby/snort contributions • Mod_security for Nginx and helping on IIS
  • 44. Our Contributions • Contributed to Quantum, from Openstack • Snorby/snort contributions • Mod_security for Nginx and helping on IIS • hired consulting from grsecurity and dovecot teams - we support OSS companies
  • 46. Bricklayer • First opensource project from Locaweb
  • 47. Bricklayer • First opensource project from Locaweb • Package builder (deb + rpm) straight from git
  • 48. Bricklayer • First opensource project from Locaweb • Package builder (deb + rpm) straight from git • 150+ projects, 500+ builds/day
  • 49. Bricklayer • First opensource project from Locaweb • Package builder (deb + rpm) straight from git • 150+ projects, 500+ builds/day • tag your project, get the packages done and on repositories
  • 50. Logix
  • 51. Logix • We have lots of logs. Everything broke.
  • 52. Logix • We have lots of logs. Everything broke. • 26.753.205.474 lines of log/day
  • 53. Logix • We have lots of logs. Everything broke. • 26.753.205.474 lines of log/day • Highly distributed: local syslog daemon to RabbitMQ
  • 54. Logix • We have lots of logs. Everything broke. • 26.753.205.474 lines of log/day • Highly distributed: local syslog daemon to RabbitMQ • Elastic search + graylog2 to store, search and filter
  • 55. Provisioning • Ruby: Panel, Control panel, Scheduler • Python: Provisioning, Server management, Metric collection • REST APIs to Hypervisor, Network, Firewall, XMPP
  • 56. Provisioning - Cloud Internet Cloud Control Panel API Sales Main Network Network Physical Servers Firewall hypervisor ovs Gear Provisioner Simplestack SimpleNet/Quantum
  • 57. Provisioning - Servers Dedicated Servers Control Panel API Closed Loop Racked Servers Sales Internet Main Network Network Physical Servers Firewall hypervisor ovs Gear Simplestack SimpleNet/Quantum
  • 58. Provisioning - Managed Servers Managed Servers Control Panel Sales Dedicated Servers Control Panel PaaS API Closed Loop Provisioner Racked Servers Sales Internet Cloud Control Panel API Sales Main Network Network Physical Servers Firewall hypervisor ovs Gear Provisioner Simplestack SimpleNet/Quantum
  • 59. Cloud provisioner Jobs DHCP CMDB FW API quantum/ Control Panel core Resque simplenet Sales simplestack Notifications Leela console
  • 60. Closed loop The closed loop process CMDB API Conductor Network Futurama Cobbler Hardware
  • 62. Closed loop • All servers get racked, wired, tested and configured
  • 63. Closed loop • All servers get racked, wired, tested and configured • Power management discovery
  • 64. Closed loop • All servers get racked, wired, tested and configured • Power management discovery • Network configuration
  • 65. Closed loop • All servers get racked, wired, tested and configured • Power management discovery • Network configuration • OS install: Windows, Linux and OpenSolaris aware
  • 66. Closed loop • All servers get racked, wired, tested and configured • Power management discovery • Network configuration • OS install: Windows, Linux and OpenSolaris aware • Server life cycle: once deactivated it gets back to the pool to be used again
  • 67. CMDB Futurama Power audit Ops NET/L2 Frontend Controllers Database API Product Resque provisioners IP provisioning Server provisioning IT chg SAP management
  • 68. Futurama Management CFEngine Server side Planet Express Leela- CF-Agent Server Conductor- CMDB Leela-agent bkp-agent audit Cegonha Asdrubal CFTools
  • 69. Resource Metering and Monitoring - Leela Cassandra Leela- Cassandra Cassandra Lasergun Leela-agent Leela- Reader Cassandra Cassandra Cassandra API
  • 70. Resource Metering and Monitoring - Leela
  • 71. Resource Metering and Monitoring - Leela • 18k writes/sec
  • 72. Resource Metering and Monitoring - Leela • 18k writes/sec • 6 TB total per cluster
  • 73. Resource Metering and Monitoring - Leela • 18k writes/sec • 6 TB total per cluster • 13 baseline metrics + 68 distinct metrics
  • 74. Resource Metering and Monitoring - Leela • 18k writes/sec • 6 TB total per cluster • 13 baseline metrics + 68 distinct metrics • ~600GB/mo
  • 75. Resource Metering and Monitoring - Leela • 18k writes/sec • 6 TB total per cluster • 13 baseline metrics + 68 distinct metrics • ~600GB/mo • 1M keys (~5k servers)
  • 76. Resource Metering and Monitoring - Leela • 18k writes/sec • 6 TB total per cluster • 13 baseline metrics + 68 distinct metrics • ~600GB/mo • 1M keys (~5k servers) • Write latency: 15 us
  • 77. Resource Metering and Monitoring - Leela • 18k writes/sec • 6 TB total per cluster • 13 baseline metrics + 68 distinct metrics • ~600GB/mo • 1M keys (~5k servers) • Write latency: 15 us • Read latency: 1s to read 1mo worth of data
  • 78. Resource Metering and Monitoring - Leela • 18k writes/sec • 6 TB total per cluster • 13 baseline metrics + 68 distinct metrics • ~600GB/mo • 1M keys (~5k servers) • Write latency: 15 us • Read latency: 1s to read 1mo worth of data • Down to minute resolution
  • 79. Resource Metering and Monitoring - Leela • 18k writes/sec • 6 TB total per cluster • 13 baseline metrics + 68 distinct metrics • ~600GB/mo • 1M keys (~5k servers) • Write latency: 15 us • Read latency: 1s to read 1mo worth of data • Down to minute resolution • http://leela.readthedocs.org/en/latest/intro/archnut.html
  • 80. Resource Metering and Monitoring - Leela
  • 81. Resource Metering and Monitoring - Leela • Map/Reduce with SQL like interface:
  • 82. Resource Metering and Monitoring - Leela • Map/Reduce with SQL like interface: -SELECT mov_avg_samples = 7 (function)
  • 83. Resource Metering and Monitoring - Leela • Map/Reduce with SQL like interface: -SELECT mov_avg_samples = 7 (function) -FROM cpro9559.cpu.cpu8.idle (metric)
  • 84. Resource Metering and Monitoring - Leela • Map/Reduce with SQL like interface: -SELECT mov_avg_samples = 7 (function) -FROM cpro9559.cpu.cpu8.idle (metric) -WHERE timestamp >= 1346279003 (timeframe)
  • 85. Resource Metering and Monitoring - Leela
  • 86. Resource Metering and Monitoring - Leela • Create charts
  • 87. Resource Metering and Monitoring - Leela • Create charts - var widget = LEELA.widget(jQuery.(“#target”));
  • 88. Resource Metering and Monitoring - Leela • Create charts - var widget = LEELA.widget(jQuery.(“#target”)); - jQuery.ajax(“/v1/pastweek/ cpro9559.cpu.cpu8.idle”, {dataType: “jsonp”, success: widget.render});
  • 89. Software defined network
  • 90. Software defined network • Traditional equipment: local config and controller
  • 91. Software defined network • Traditional equipment: local config and controller • SDN: flows (commands), openflow 1.0, central controller, distributed data plane
  • 92. Software defined network • Traditional equipment: local config and controller • SDN: flows (commands), openflow 1.0, central controller, distributed data plane • Abstraction over VLANs with ACLs, Tunnels or even VLAN QoQ
  • 93. Software defined network Switch Vendor A Switch Vendor B Data path (hardware) Data path (hardware) Control path Openflow Control path Openflow API Controller OpenVSwitch
  • 94. Software defined network Cisco Force 10 HP OpenVSwitch Firewalls API Net/L2 Quantum CMDB
  • 95. Ruby @ Locaweb Not only for front-end
  • 96. ?