Optimizing Log
Analytics from the Edge
April 2016
© Hortonworks Inc. 2011 – 2015. All Rights Reserved
About Hortonworks
Customer Momentum
~800 customers (as of Feb 10, 2016)
Publicly traded on NASDAQ: HDP
Hortonworks Data Platform
Completely open multi-tenant platform
for any app and any data
Consistent enterprise services for security,
operations, and governance
Partner for Customer Success
Leader in open-source community, focused
on innovation to meet enterprise needs
Unrivaled Hadoop support subscriptions
Founded in 2011
Original 24 architects, developers,
operators of Hadoop from Yahoo!
800+ EMPLOYEES
1500+ ECOSYSTEM PARTNERS
EMBRACE AN
OPEN APPROACH
MASTER THE
VALUE OF DATA
EVERY BUSINESS
IS A DATA BUSINESS
DATA
AT REST
DATA
IN MOTION
ACTIONABLE
INTELLIGENCE
MODERN DATA APPLICATIONS
Actionable
Intelligence from
Connected Data
Platforms
Capturing perishable
insights from data in motion
Ensuring rich, historical insights on
data at rest
Necessary for modern data
applications
Hortonworks
DataFlow
Hortonworks
Data Platform
Optimizing Log Ingest with
Hortonworks DataFlow
Why Hortonworks DataFlow?
Because even the best data scientists
and most powerful platforms need
the right data to analyze
Perception of DataFlows: Easy, Definitive
[Diagram labels: Acquire Data, Process and Analyze Data, Store Data, Dataflow]
Reality of Dataflows: Complex, Convoluted
[Diagram labels: several Acquire Data and Store Data nodes, Process and Analyze Data, Dataflow]
HDF has 130+ Processors - Multiple for Log Analytics
HTTP, Syslog, Email, HTML, Image, HL7, FTP, UDP, XML, SFTP, AMQP
Hash, Encrypt, Extract, Tail, Merge, Evaluate, Duplicate, Execute, Scan, GeoEnrich, Replace, Convert, Split, Translate
Route Content, Route Context, Route Text, Control Rate, Distribute Load
Log Analytics Systems Today
[Diagram labels: Network Device Logs, Log Analytics Platform]
• Not all data can be captured
• Not all captured data is valuable
• Transport all data
Cost Effectively Expand Storage Options of Log Data
[Diagram labels: Network Device Logs, HDF, HDP, Log Analytics Platform]
1. Integrate and enrich logs across data centers and security zones
2. Content-based routing based on dynamic evaluation of content, attributes, priority
3. Cost effectively expand collection and grow timescale of logs collected
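The content-based routing in item 2 above can be illustrated with a minimal Python sketch. This is not HDF or NiFi code; the `LogRecord` type, the `route` function, the destination names (`log_analytics`, `hdp_archive`), the priority scale and the `zone` attribute are all hypothetical, chosen only to show a routing decision driven by content, attributes and priority.

```python
from dataclasses import dataclass, field


@dataclass
class LogRecord:
    content: str
    attributes: dict = field(default_factory=dict)
    priority: int = 5  # hypothetical scale: lower number = more urgent


def route(record: LogRecord) -> str:
    """Return the name of a (hypothetical) destination for one record."""
    # Priority: urgent records go to the log analytics platform.
    if record.priority <= 2:
        return "log_analytics"
    # Content: records that mention errors are also worth indexing.
    if "ERROR" in record.content or "FAILED" in record.content:
        return "log_analytics"
    # Attributes: records from an assumed sensitive security zone.
    if record.attributes.get("zone") == "dmz":
        return "log_analytics"
    # Everything else goes to lower-cost long-term storage.
    return "hdp_archive"


samples = [
    LogRecord("ERROR disk full on sw-12"),
    LogRecord("link up", priority=1),
    LogRecord("GET /health 200", {"zone": "dmz"}),
    LogRecord("GET /health 200", {"zone": "internal"}),
]

for record in samples:
    print(route(record), "<-", record.content)
```

In a real flow the rules would be evaluated dynamically per record rather than hard-coded; the sketch only shows the shape of the decision.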
Efficiently Expand Log Ingestion from the Edge
[Diagram labels: Network Device Logs, HDF (four instances), HDP, Log Analytics Platform]
• Expand collection to new sources of machine data
• Edge analytics to transform, enrich and prioritize content based routing
• Capture and transport only valuable data
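As a rough illustration of the bullets above, the following Python sketch filters and enriches events at the edge before anything is transported. It is an assumption-laden example rather than HDF functionality: the event fields (`level`, `msg`), the rule that DEBUG events are not valuable, and the `site` tag are all hypothetical.

```python
def is_valuable(event: dict) -> bool:
    """Hypothetical rule: DEBUG-level chatter is not worth transporting."""
    return event.get("level") != "DEBUG"


def enrich(event: dict, site: str) -> dict:
    """Return a copy of the event tagged with the edge site it came from."""
    return {**event, "site": site}


def process_at_edge(events: list, site: str) -> list:
    """Keep only valuable events and enrich them before forwarding."""
    return [enrich(event, site) for event in events if is_valuable(event)]


sample = [
    {"level": "DEBUG", "msg": "heartbeat"},
    {"level": "WARN", "msg": "link flap on port 3"},
    {"level": "ERROR", "msg": "auth failure"},
]

forwarded = process_at_edge(sample, site="branch-01")
print(len(forwarded), "of", len(sample), "events forwarded")
```

Filtering before transport is what reduces the volume sent upstream; enrichment at the same point means downstream systems do not have to work out where an event originated.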
Expand Analytics and Reporting Options with HDP
[Diagram labels: Network Device Logs, HDF (four instances), HDP, Log Analytics Platform, ODBC interface, traditional BI tools]
Easy access to log analytics data through traditional BI tools
Give data scientists better tooling: Spark, Storm, etc.
Expand to small scale, remote systems
[Diagram labels: Network Device Logs, HDF (four instances), HDP, Log Analytics Platform]
Optimize Log Analytics with Content Based Routing
[Diagram labels: HDF (four instances), HDP, Log Analytics Platform]
Edge analytics for cost-effective and efficient movement of machine data
Intelligent, content based routing, transformation and enrichment
Send data to alternative systems based on value, content, priority
Splunk Optimization:
Using HDP as Data Refinery
Splunk Hadoop Connect
Reliable bi-directional integration: Import, Browse, Export
>2000 downloads
[Diagram labels: HA Indexes and Storage, Commodity Servers, Hadoop (MapReduce & HDFS)]
Report & analyze
Custom dashboards
Monitor and alert
Ad hoc search
Splunk, Hunk & Hortonworks
YARN Ready Partner
Certified on Hortonworks Data Platform
Existing Sandbox tutorial
Splunk, Part of the Modern Data Architecture
• Bi-directional data integration
between Splunk & HDP
• Collect data from across the
organization, deliver it to Hadoop
for refining data and batch
analytics
• Output of Hadoop jobs can be
imported into Splunk Enterprise
for rapid analysis and visualization
• Archiving from Splunk Enterprise
to Hadoop
Hunk + Hortonworks
FULL-FEATURED ANALYTICS
Explore, analyze and visualize data in HDP from one integrated platform
FAST TO DEPLOY AND DRIVE VALUE
Simply point Hunk at your HDP cluster(s) and start exploring data immediately
INTERACTIVE EXPLORATION
Search data, change perspectives and preview results as MapReduce jobs run
RICH DEVELOPER ENVIRONMENT
Build big data apps on data in HDP using standard web languages and frameworks
Augment Splunk Deployment with Hortonworks Data Platform
[Diagram labels: Universal Forwarders, Heavy Indexer, Splunk Storage, HDP]
HDP Enables
• Expansion to more data than previously feasible
• Archive data from Splunk into Hadoop
• Query archived Splunk data in Hadoop
• Focus Splunk infrastructure on what really matters
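One way to picture the archiving bullets above is a simple age-based split between data kept in the indexer and data moved to Hadoop. The Python sketch below is only an illustration under stated assumptions: the slides do not say how data is selected for archiving, so the age criterion, the 30-day `hot_window`, the `split_by_age` function and the event layout are all hypothetical, and no Splunk or Hadoop API is used.

```python
from datetime import datetime, timedelta


def split_by_age(events, now, hot_window=timedelta(days=30)):
    """Partition events into (hot, archive) by age.

    hot: recent events assumed to stay in the indexer.
    archive: older events assumed to be moved to Hadoop.
    """
    hot, archive = [], []
    for event in events:
        if now - event["time"] <= hot_window:
            hot.append(event)
        else:
            archive.append(event)
    return hot, archive


now = datetime(2016, 4, 1)
events = [
    {"time": datetime(2016, 3, 25), "msg": "recent login failure"},
    {"time": datetime(2015, 11, 2), "msg": "old firewall deny"},
]

hot, archive = split_by_age(events, now)
print(len(hot), "hot,", len(archive), "archived")
```

A real deployment might select on value or source rather than age; the point is only that the indexer keeps a bounded working set while the archive holds everything else.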
Find out how much you can optimize
your log analytics infrastructure today.
Contact sales@hortonworks.com


Editor's Notes

  • #9: In reality, dataflows move all over. Data is moved and stored in multiple places, sometimes interim, sometimes long term. Data is processed in different places, and then moved again. Complicated, convoluted, messy.
  • #20: Interactively search without fixed schemas or moving data. Preview results and accelerate reports for fast search and improved cluster performance. Provide self-service analytics for business and IT stakeholders with data models and pivot. Rapidly build big data apps with a rich developer environment.
  • #21: Interactively search without fixed schemas or moving data. Preview results and accelerate reports for fast search and improved cluster performance. Provide self-service analytics for business and IT stakeholders with data models and pivot. Rapidly build big data apps with a rich developer environment.