Stacki Lab
Joe Kaiser
Director of Open Source Engineering
Open Source Stack Installer
Stacki is a very fast and ultra-reliable Linux server provisioning tool … at scale, with zero prerequisites for taking systems from bare metal to a ping and a prompt.
Stuff it does
• Installs to bare metal or VMs that look like bare metal
• Kickstart based
• Parallel formatting of disks
• Parallel sharing of RPMs
• CentOS/RHEL
• Networking
  ◦ Multiple subnets, VLANs, bonding
• Storage
  ◦ Controller config
  ◦ Partitioning
Whatever you can do in Linux, you can do with Stacki, only clustered.
Stacki and Hortonworks Data Platform
Hortonworks – stacki-hdp-bridge pallet
• Add/enable/run
• Stacki creates an Ambari appliance
• Download software as pallets (ISOs)
• Add/enable HDP and Ambari
• Run the gethdp script
• Or download (links in documentation)
• Assign a node to be the Ambari deployment node.
• Partitions and preps backend nodes for Hadoop.
• Install all.
• Go to the Ambari interface to deploy Hadoop.
• Current versions (update /export/HDP/hdp.cfg for new versions):
  ◦ distribution = 2.x
  ◦ os = centos7
  ◦ ambari = 2.4.2.0
  ◦ hdp = 2.5.3.0
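To check which versions a frontend is set up to fetch, the values above can be read back with a few lines of Python. This is only a sketch: it assumes hdp.cfg holds plain "key = value" lines like the ones on the slide (the real file layout is not shown in the deck), and `SAMPLE_CFG` and `parse_versions` are illustrative names, not part of Stacki.

```python
# Assumption: hdp.cfg contains simple "key = value" lines as on the slide.
# SAMPLE_CFG stands in for the contents of /export/HDP/hdp.cfg.
SAMPLE_CFG = """\
distribution = 2.x
os = centos7
ambari = 2.4.2.0
hdp = 2.5.3.0
"""


def parse_versions(text):
    """Return a dict built from 'key = value' lines, skipping blanks and comments."""
    versions = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        versions[key.strip()] = value.strip()
    return versions


versions = parse_versions(SAMPLE_CFG)
print(versions["ambari"], versions["hdp"])
```

On a real frontend the same function could be fed the text of /export/HDP/hdp.cfg instead of the sample string, provided the file really uses this layout.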
Augment the “default” Box
1) Replace the “os” pallet with the “CentOS” and “CentOS-Updates” pallets
2) Add the Hortonworks pallets: “HDP”, “HDP-UTILS”, “Updates-ambari” or run /export/HDP/gethdp.py
3) Add a pallet to glue the two layers together: “stacki-hdp-bridge”
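The three steps above can be sketched as a list of commands to review before typing them on the frontend. The pallet names come from the slide; the `stack disable pallet` / `stack enable pallet` syntax with a `box=` argument is an assumption to verify against the Stacki documentation, and the sketch assumes the pallets have already been added to the frontend. `box_commands` is a hypothetical helper that only builds strings and runs nothing.

```python
import shlex

# Pallet names as listed on the slide.
OS_PALLETS = ["CentOS", "CentOS-Updates"]
HDP_PALLETS = ["HDP", "HDP-UTILS", "Updates-ambari"]
BRIDGE_PALLET = "stacki-hdp-bridge"


def box_commands(box="default"):
    """Build (but do not run) the commands that augment the given box.

    Assumption: 'stack disable pallet' and 'stack enable pallet' accept a
    box= argument; check the exact syntax in the Stacki docs.
    """
    # Step 1: swap the stock "os" pallet out of the box.
    commands = [["stack", "disable", "pallet", "os", f"box={box}"]]
    # Steps 1-3: enable the CentOS, Hortonworks, and bridge pallets.
    for pallet in OS_PALLETS + HDP_PALLETS + [BRIDGE_PALLET]:
        commands.append(["stack", "enable", "pallet", pallet, f"box={box}"])
    return [shlex.join(command) for command in commands]


for command in box_commands():
    print(command)
```

Printing rather than executing keeps the sketch safe to run anywhere; the output is meant to be checked by hand first.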
Specify One Host as the “Ambari” appliance
“backend-0-0” will be automatically configured with Ambari
Install the node and then point your web browser at it
Reinstall All Backend Nodes
Wipe all hardware disk array configuration and rebuild all the LUNs:
◦ stack set host attr ambari backend attr=nukecontroller value=true
Remove all partitions then repartition and reformat the disks:
◦ stack set host attr ambari backend attr=nukedisks value=true
Instruct nodes to install on next PXE boot:
◦ stack set host boot ambari backend action=install
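Because the first two attributes wipe controller and disk configuration, it can help to generate the commands and read them over before running anything. The sketch below rebuilds the three slide commands for a list of appliance names; `reinstall_commands` is a hypothetical helper, and it only prints the commands.

```python
import shlex


def reinstall_commands(targets=("ambari", "backend")):
    """Build the three slide commands for the given appliance names.

    Nothing is executed; the strings are returned for review.
    """
    targets = list(targets)
    return [
        # Wipe the hardware disk array configuration and rebuild the LUNs.
        shlex.join(["stack", "set", "host", "attr", *targets,
                    "attr=nukecontroller", "value=true"]),
        # Remove all partitions, then repartition and reformat the disks.
        shlex.join(["stack", "set", "host", "attr", *targets,
                    "attr=nukedisks", "value=true"]),
        # Install on the next PXE boot.
        shlex.join(["stack", "set", "host", "boot", *targets,
                    "action=install"]),
    ]


for command in reinstall_commands():
    print(command)
```

Passing a narrower list, for example `reinstall_commands(["backend"])`, limits the generated commands to the backend nodes.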
Try It
• Website
  www.stacki.com
• Vagrant tire kick
  https://github.com/rfkrocktk/vagrant-stacki
• Source code and docs
  github.com/stackiq/stacki
  github.com/StackIQ/stacki-hdp-bridge
• Slack channel (because everyone)
• Google Groups
  groups.google.com/forum/#!forum/stacki
Finis
Thanks

Building a Hadoop Cluster with Stacki


Editor's Notes

  • #3:
    ◦ Linux: focused on RedHat-ish distributions (Kickstart/Anaconda)
    ◦ Provisioning: bare metal (total stack control)
    ◦ Scale: solve the 1000+ servers problem, then scale down
    ◦ Ping and prompt: get the machine up to a known base OS, fully configured (RAID, disk, networking, SSH access on)
    ◦ Nothing else … no agent left on the server