SlideShare a Scribd company logo
4
Most read
7
Most read
20
Most read
1
Elastic (ELK) Stack
By Vikram Shinde
2
Agenda
• Introduction
• Elastic Stack Overview
• Components of Elastic Stack
• Role of Elastic Stack in Big Data Analysis
• Demo
• ElasticSearch configurations
• Logstash pipelines
• Kibana Dashboards
• Beats example
• Twitter trend example
• Q & A
3
Elastic (ELK) Stack
Elastic Stack is a group of open source products from Elastic designed to help users take
data from any type of source and in any format and search, analyze, and visualize that data
in real time. It uses Logstash for log aggregation, Elasticsearch for searching, and Kibana
for visualizing and analyzing data.
• ElasticSearch: Store, Search, and Analyze
• Logstash: Collect logs and events data, Parse and Transform
• Kibana: Explore, Visualize, and Share
• Beats: Data shipper.
4
Elastic (ELK) Stack Architecture
5
ElasticSearch
Elasticsearch is a highly available and distributed search engine.
• Built on top of Apache Lucene
• NoSQL Datastore
• Schema-free
• JSON Document
• RESTful APIs
Relational Database ElasticSearch
Database Index
Table Type
Row Document
Column Field
Schema Mapping
• Node
• Cluster
6
ElasticSearch
Elasticsearch is distributed, which means that indices can be divided into shards and each
shard can have zero or more replicas. By default, an index is created with 5 shards and 1
replica per shard (5/1). Rebalancing and routing of shards are done automatically.
Features
• Distributed
• Scalable
• Highly available
• Near Real Time (NRT) search
• Full Text Search
• Java, .NET, PHP, Python, Curl, Perl, Ruby
• HADOOP & SPARK -- Elasticsearch-Hadoop (ES-Hadoop)
7
ElasticSearch RESTful API
HTTP Based CRUD Operations
Operation CURL command
Create curl –XPUT “http://localhost:9200/<index>/<type>/<id>”
Read curl –XGET “http://localhost:9200/<index>/<type>/<id>”
Update curl –XPOST “http://localhost:9200/<index>/<type>/<id>”
Delete curl –XDELETE “http://localhost:9200/<index>/<type>/<id>”
8
GitHub Casestudy
Challenge : How do you satisfy the search needs of GitHub's 4 million users while
simultaneously providing tactical operational insights that help you iteratively
improve customer service?
Solution: GitHub uses Elasticsearch to continually index the data from an ever-
growing store of over 8 million code repositories, comprising over 2 billion
documents.
GitHub uses Elasticsearch to index new code as soon as users push it to a
repository on GitHub.
"Search is at the core of GitHub"
Other customers includes Facebook, Netflix, ebay, Wikimedia, etc.
ebay : Searching across 800 million listings in subseconds
9
Logstash
Logstash can collect logs from a variety of sources (using input plugins), process the data
into a common format using filters, and stream data to a variety of sources (using output
plugins). Multiple filters can be chained to parse the data into a common format. Together,
they build a Logstash Processing Pipeline.
10
Logstash Plug-ins
Input Plugins
• Beats
• Elasticsearch
• File
• Graphite
• Heartbeat
• Tttp
• Jdbc
• Kafka
• Log4j
• Redis
• Stdin
• TCP
• Twitter
Output Plugins
• CSV
• Elasticsearch
• Email
• File
• Graphite
• Http
• Jira
• Kafka
• Nagios
• Redis
• Stdout
• S3
• Tcp
• Udp
Filter Plugins
• Aggregate
• csv
• Date
• geoip
• Grok
• Json
• sleep
• urlencode
• UUID
• xml
Logstash has a rich collections of input, filter and output plugins. You can now create
your own Logstash plugin and add it into community plugins.
11
Logstash Pipeline
Basic Configuration of Logstash Pipeline
12
Kibana
• Discover
• Visualise
• Dashboards
• Put Geo Data on Any Map
• Insert dashboards into your
internal wiki or webpage
• Send your coworker a URL to
a dashboard.
Kibana gives you the freedom to select the way you give shape to your data.
13
Beats
Lightweight Data Shippers.
Beats is the platform for single-purpose data shippers. They install as lightweight agents and
send data from hundreds or thousands of machines to Logstash or Elasticsearch.
14
Elastic Stack for Big Data Analysis
Connect the massive data storage and deep processing power of Hadoop with the real-time
search and analytics of Elasticsearch.
ES-Hadoop lets you index Hadoop data into the Elastic Stack to take full advantage of the
speedy Elasticsearch engine and beautiful Kibana visualizations.
Elasticsearch for Apache Hadoop
15
Splunk VS ELKStack
Popularity Trend
A head to head comparison is always a tough call, especially when there’s no clear
winner and the tool you choose can potentially have a huge impact on the business
Splunk and the ELK stack are dominating the interest in the log management space
with the most comprehensive and customizable solutions.
16
Demo !!
17
Basic Example
18
Demo – Twitter Example
19
X-Pack & Elastic Cloud
20
Summary
• Elastic Stack
• Components of Elastic Stack
• Configurations
• ES-Hadoop plugin for Big Data Analysis
• ElasticSearch : Store, Search , Analysis
• Logstash: ETL
• Kibana: Visualisation
• Beats: Data Shipper
• Elastic Cloud
21
Thank You !!
Contact me @vikshinde

More Related Content

What's hot (20)

PDF
Introduction to elasticsearch
hypto
 
ODP
Deep Dive Into Elasticsearch
Knoldus Inc.
 
PDF
ELK Stack
Eberhard Wolff
 
PPTX
Log management with ELK
Geert Pante
 
PPTX
ElasticSearch Basic Introduction
Mayur Rathod
 
PPTX
The Elastic ELK Stack
enterprisesearchmeetup
 
PDF
Elk - An introduction
Hossein Shemshadi
 
PPTX
Elastic search overview
ABC Talks
 
ODP
Elasticsearch for beginners
Neil Baker
 
PDF
Log analysis with the elk stack
Vikrant Chauhan
 
PDF
Elasticsearch
Shagun Rathore
 
PDF
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
Edureka!
 
PPTX
Elasticsearch
Divij Sehgal
 
PDF
Introduction to Kibana
Vineet .
 
PPTX
An Intro to Elasticsearch and Kibana
ObjectRocket
 
PDF
Kibana + timelion: time series with the elastic stack
Sylvain Wallez
 
PDF
Elasticsearch
Hermeto Romano
 
PPTX
Introduction to Elasticsearch with basics of Lucene
Rahul Jain
 
PDF
Log analysis with elastic stack
Bangladesh Network Operators Group
 
PPTX
Kibana overview
Rinat Tainov
 
Introduction to elasticsearch
hypto
 
Deep Dive Into Elasticsearch
Knoldus Inc.
 
ELK Stack
Eberhard Wolff
 
Log management with ELK
Geert Pante
 
ElasticSearch Basic Introduction
Mayur Rathod
 
The Elastic ELK Stack
enterprisesearchmeetup
 
Elk - An introduction
Hossein Shemshadi
 
Elastic search overview
ABC Talks
 
Elasticsearch for beginners
Neil Baker
 
Log analysis with the elk stack
Vikrant Chauhan
 
Elasticsearch
Shagun Rathore
 
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
Edureka!
 
Elasticsearch
Divij Sehgal
 
Introduction to Kibana
Vineet .
 
An Intro to Elasticsearch and Kibana
ObjectRocket
 
Kibana + timelion: time series with the elastic stack
Sylvain Wallez
 
Elasticsearch
Hermeto Romano
 
Introduction to Elasticsearch with basics of Lucene
Rahul Jain
 
Log analysis with elastic stack
Bangladesh Network Operators Group
 
Kibana overview
Rinat Tainov
 

Similar to Elastic Stack Introduction (20)

PDF
ELK stack introduction
abenyeung1
 
PPTX
ELK Solutions Enablement Session - 17th March'2020
Ashnikbiz
 
PPTX
Elastic Search Capability Presentation.pptx
Knoldus Inc.
 
PDF
Technology behind-real-time-log-analytics
Data Science Thailand
 
PPTX
Elasticsearch features and ecosystem
Pavel Alexeev
 
PDF
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...
ShapeBlue
 
PDF
Eko10 - Security Monitoring for Big Infrastructures without a Million Dollar ...
Hernan Costante
 
PPTX
Centralized Logging System Using ELK Stack
Rohit Sharma
 
PPTX
Intro elasticsearch taswarbhatti
Taswar Bhatti
 
PPTX
centralization of log systems pour suivis
Thierry Gayet
 
PDF
Elastic.co's ELK Stack - Platform Agnostic Immutable Infrastructure & Analys...
AWS Chicago
 
PDF
Security Monitoring for big Infrastructures without a Million Dollar budget
Juan Berner
 
PPTX
Devteach 2017 Store 2 million of audit a day into elasticsearch
Taswar Bhatti
 
PDF
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
NAVER D2
 
PDF
DIY Netflow Data Analytic with ELK Stack by CL Lee
MyNOG
 
PPTX
The Elastic Stack as a SIEM
John Hubbard
 
PDF
Logs aggregation and analysis
Divante
 
PDF
Présentation ELK/SIEM et démo Wazuh
Aurélie Henriot
 
PPTX
ELK Ruminating on Logs (Zendcon 2016)
Mathew Beane
 
PDF
Mulesoft ELK
Integration Assistance
 
ELK stack introduction
abenyeung1
 
ELK Solutions Enablement Session - 17th March'2020
Ashnikbiz
 
Elastic Search Capability Presentation.pptx
Knoldus Inc.
 
Technology behind-real-time-log-analytics
Data Science Thailand
 
Elasticsearch features and ecosystem
Pavel Alexeev
 
Centralized Logging Feature in CloudStack using ELK and Grafana - Kiran Chava...
ShapeBlue
 
Eko10 - Security Monitoring for Big Infrastructures without a Million Dollar ...
Hernan Costante
 
Centralized Logging System Using ELK Stack
Rohit Sharma
 
Intro elasticsearch taswarbhatti
Taswar Bhatti
 
centralization of log systems pour suivis
Thierry Gayet
 
Elastic.co's ELK Stack - Platform Agnostic Immutable Infrastructure & Analys...
AWS Chicago
 
Security Monitoring for big Infrastructures without a Million Dollar budget
Juan Berner
 
Devteach 2017 Store 2 million of audit a day into elasticsearch
Taswar Bhatti
 
Elastic v5.0.0 Update uptoalpha3 v0.2 - 김종민
NAVER D2
 
DIY Netflow Data Analytic with ELK Stack by CL Lee
MyNOG
 
The Elastic Stack as a SIEM
John Hubbard
 
Logs aggregation and analysis
Divante
 
Présentation ELK/SIEM et démo Wazuh
Aurélie Henriot
 
ELK Ruminating on Logs (Zendcon 2016)
Mathew Beane
 
Ad

Recently uploaded (20)

PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
PDF
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
PPT
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
PDF
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PDF
🚀 Let’s Build Our First Slack Workflow! 🔧.pdf
SanjeetMishra29
 
PPTX
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PPTX
Digital Circuits, important subject in CS
contactparinay1
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PDF
NLJUG Speaker academy 2025 - first session
Bert Jan Schrijver
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
🚀 Let’s Build Our First Slack Workflow! 🔧.pdf
SanjeetMishra29
 
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Digital Circuits, important subject in CS
contactparinay1
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
NLJUG Speaker academy 2025 - first session
Bert Jan Schrijver
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
Ad

Elastic Stack Introduction

  • 1. 1 Elastic (ELK) Stack By Vikram Shinde
  • 2. 2 Agenda • Introduction • Elastic Stack Overview • Components of Elastic Stack • Role of Elastic Stack in Big Data Analysis • Demo • ElasticSearch configurations • Logstash pipelines • Kibana Dashboards • Beats example • Twitter trend example • Q & A
  • 3. 3 Elastic (ELK) Stack Elastic Stack is a group of open source products from Elastic designed to help users take data from any type of source and in any format and search, analyze, and visualize that data in real time. It uses Logstash for log aggregation, Elasticsearch for searching, and Kibana for visualizing and analyzing data. • ElasticSearch: Store, Search, and Analyze • Logstash: Collect logs and events data, Parse and Transform • Kibana: Explore, Visualize, and Share • Beats: Data shipper.
  • 4. 4 Elastic (ELK) Stack Architecture
  • 5. 5 ElasticSearch Elasticsearch is a highly available and distributed search engine. • Built on top of Apache Lucene • NoSQL Datastore • Schema-free • JSON Document • RESTful APIs Relational Database ElasticSearch Database Index Table Type Row Document Column Field Schema Mapping • Node • Cluster
  • 6. 6 ElasticSearch Elasticsearch is distributed, which means that indices can be divided into shards and each shard can have zero or more replicas. By default, an index is created with 5 shards and 1 replica per shard (5/1). Rebalancing and routing of shards are done automatically. Features • Distributed • Scalable • Highly available • Near Real Time (NRT) search • Full Text Search • Java, .NET, PHP, Python, Curl, Perl, Ruby • HADOOP & SPARK -- Elasticsearch-Hadoop (ES-Hadoop)
  • 7. 7 ElasticSearch RESTful API HTTP Based CRUD Operations Operation CURL command Create curl –XPUT “http://localhost:9200/<index>/<type>/<id>” Read curl –XGET “http://localhost:9200/<index>/<type>/<id>” Update curl –XPOST “http://localhost:9200/<index>/<type>/<id>” Delete curl –XDELETE “http://localhost:9200/<index>/<type>/<id>”
  • 8. 8 GitHub Casestudy Challenge : How do you satisfy the search needs of GitHub's 4 million users while simultaneously providing tactical operational insights that help you iteratively improve customer service? Solution: GitHub uses Elasticsearch to continually index the data from an ever- growing store of over 8 million code repositories, comprising over 2 billion documents. GitHub uses Elasticsearch to index new code as soon as users push it to a repository on GitHub. "Search is at the core of GitHub" Other customers includes Facebook, Netflix, ebay, Wikimedia, etc. ebay : Searching across 800 million listings in subseconds
  • 9. 9 Logstash Logstash can collect logs from a variety of sources (using input plugins), process the data into a common format using filters, and stream data to a variety of sources (using output plugins). Multiple filters can be chained to parse the data into a common format. Together, they build a Logstash Processing Pipeline.
  • 10. 10 Logstash Plug-ins Input Plugins • Beats • Elasticsearch • File • Graphite • Heartbeat • Tttp • Jdbc • Kafka • Log4j • Redis • Stdin • TCP • Twitter Output Plugins • CSV • Elasticsearch • Email • File • Graphite • Http • Jira • Kafka • Nagios • Redis • Stdout • S3 • Tcp • Udp Filter Plugins • Aggregate • csv • Date • geoip • Grok • Json • sleep • urlencode • UUID • xml Logstash has a rich collections of input, filter and output plugins. You can now create your own Logstash plugin and add it into community plugins.
  • 12. 12 Kibana • Discover • Visualise • Dashboards • Put Geo Data on Any Map • Insert dashboards into your internal wiki or webpage • Send your coworker a URL to a dashboard. Kibana gives you the freedom to select the way you give shape to your data.
  • 13. 13 Beats Lightweight Data Shippers. Beats is the platform for single-purpose data shippers. They install as lightweight agents and send data from hundreds or thousands of machines to Logstash or Elasticsearch.
  • 14. 14 Elastic Stack for Big Data Analysis Connect the massive data storage and deep processing power of Hadoop with the real-time search and analytics of Elasticsearch. ES-Hadoop lets you index Hadoop data into the Elastic Stack to take full advantage of the speedy Elasticsearch engine and beautiful Kibana visualizations. Elasticsearch for Apache Hadoop
  • 15. 15 Splunk VS ELKStack Popularity Trend A head to head comparison is always a tough call, especially when there’s no clear winner and the tool you choose can potentially have a huge impact on the business Splunk and the ELK stack are dominating the interest in the log management space with the most comprehensive and customizable solutions.
  • 20. 20 Summary • Elastic Stack • Components of Elastic Stack • Configurations • ES-Hadoop plugin for Big Data Analysis • ElasticSearch : Store, Search , Analysis • Logstash: ETL • Kibana: Visualisation • Beats: Data Shipper • Elastic Cloud
  • 21. 21 Thank You !! Contact me @vikshinde

Editor's Notes

  • #14: The Beats are open source data shippers that you install as agents on your servers to send different types of operational data to Elasticsearch. Beats can send data directly to Elasticsearch or send it to Elasticsearch via Logstash, which you can use to parse and transform the data. Packetbeat, Filebeat, Metricbeat, and Winlogbeat are a few examples of Beats. Packetbeat is a network packet analyzer that ships information about the transactions exchanged between your application servers. Filebeat ships log files from your servers. Metricbeat is a server monitoring agent that periodically collects metrics from the operating systems and services running on your servers. And Winlogbeat ships Windows event logs.
  • #16: https://www.google.com/trends/explore?date=all&q=elasticsearch%20%2B%20logstash%20%2B%20kibana,splunk&hl=en-US