Smart Enterprise Big Data Bus
For the Modern Responsive Enterprise
Anand Venugopal
June 10, 2015
THE JOURNEY SO FAR…
• Impetus Technologies – Leading Big Data Solutions Provider
• StreamAnalytix – Enterprise Class Streaming Analytics platform
– Early Access Program: Aug 2014
– Pilot Projects: October 2014 – February 2015
– GA Launch: February 2015
– Today – Sharing one of the key insights and use-case patterns from pilot
SMART ENTERPRISE BIG DATA BUS - AGENDA
Why ?
• Case Study
• Emerging Big Data Landscape and its Challenges
What ?
• Solution Characteristics
• Components Required
How ?
• Implementation Using StreamAnalytix
• Technology Stack
Q&A
• Summary
• Follow up
CASE STUDY - HOW IT BEGAN
Healthcare giant is speeding up critical business processes by using a streaming analytics
platform for real-time data synchronization between their Hadoop platform and their
enterprise NoSQL database
App-1 App-2 App-3
Enterprise NoSQL
Enterprise Data
Hub/Lake
Hadoop
EMERGING ENTERPRISE BIG DATA LANDSCAPE
Hadoop Stack –
Becoming the Center of
the Enterprise Data
Universe
Business Critical
Applications Are Still
"Silos"
Real-time Streaming
Analytics Adoption
Rapidly Increasing
HADOOP – IT MAKES COFFEE TOO!
Active
Archive
ETL
Single
Source of
Truth
BI +
Analytics
Hadoop
Streams/
Transactions
HADOOP ARCHITECTURE VIEW: CLOUDERA
HADOOP ARCHITECTURE VIEW: HORTONWORKS
HADOOP ARCHITECTURE VIEW: MAPR
WHAT’S MISSING FROM THESE PICTURES?
HOW HADOOP CONNECTS TO THE REST OF IT ALL!
Active
Archive
ETL
Single
Source of
Truth
BI +
Analytics
Hadoop
Streams/
Transactions
Enterprise Data
Hub/Lake
Hadoop
ERP CRM
Current
EDW/ ADW
App Server 1
App Server 2
Hub and Spoke
Architecture
The Integration Challenge – Developing,
Maintaining, Managing the Hub-spoke System
SILOED ENTERPRISE APPLICATIONS
MAY TAKE MANY YEARS TO INTEGRATE WITH HADOOP
Billing
Customer
Service
Transaction
Processing
Provisioning
THE DATA INTEGRATION CHALLENGE IS HERE TO STAY
Enterprise Data Hub/Lake
Hadoop
ERP CRM
Current
EDW/ADW
App Server 1
App Server 2
Hub and Spoke
Architecture
The Integration Challenge – Developing, Maintaining,
Managing the Hub-spoke System
The Modern Enterprise
Expected To Be "Real-time" Or "Near Real-time"
Sales
Web Site
Billing
Customer
Service
Growth of Internet of Things (IoT) and
Sensor/Machine Data Sources
Context-sensitive Customer Service
The Modern Enterprise
Expected To Be "Real-time" Or "Near Real-time"
Sales
Web Site
Billing
Customer
Service
Mobile Location Based Offers
Internet Advertisements
Call-center Agent Interactions
CONTEXT AWARE POSITIVE CUSTOMER EXPERIENCE
Multi-channel
engagement in
real-time
Context
Sensitive service
Happy customers,
Loyalty, Revenue,
Profits, Growth
Real-time Responses Need Hi-speed Any-to-Any
Data Synchronization & Analytics Over Streaming Data
Provisioning
Machine Data
Processing
Billing
Customer Service
Enterprise Data
Hub/Lake
Hadoop
Developing and managing all the peer-to-peer data interfaces and the hub-spoke model together would be very hard
A HI-SPEED BIG DATA BUS ARCHITECTURE WOULD BE AN
EFFICIENT ANY-TO-ANY DATA SYNCHRONIZATION
MECHANISM
Provisioning
Machine Data
Processing
Billing
Customer
Service
Enterprise Data
Hub/Lake
Hadoop
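A quick count suggests why a shared bus is attractive for any-to-any synchronization. The sketch below is illustrative only (the function names are assumptions, not part of the talk): it compares the number of dedicated interfaces needed when every pair of systems is wired directly against one read-write adapter per system on a common bus.

```python
def point_to_point_interfaces(n_systems: int) -> int:
    """Dedicated bidirectional links needed if every pair of systems is wired directly."""
    return n_systems * (n_systems - 1) // 2


def bus_adapters(n_systems: int) -> int:
    """Adapters needed if each system connects once to a shared bus."""
    return n_systems


if __name__ == "__main__":
    # Print the two counts side by side for a few system counts.
    for n in (5, 10, 20):
        print(n, point_to_point_interfaces(n), bus_adapters(n))
```

The pairwise count grows quadratically with the number of systems, while the adapter count grows linearly.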
THE HI-SPEED BIG DATA BUS WOULD ALSO NEED TO SUPPORT
"ON THE WIRE" COMPUTATION FOR DATA ANALYTICS, TRANSFORMS,
CEP
Provisioning
Machine Data
Processing
Billing
Customer
Service
Enterprise Data
Hub/Lake
Hadoop
Transformation
Analytics, Alerting
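As a rough illustration of "on the wire" computation, the sketch below transforms events and raises alerts while they are in transit, before they reach any destination. It is a plain-Python stand-in, not StreamAnalytix or Storm code; the field names (`temp_f`, `temp_c`), the window size, and the threshold are all assumptions made for the example.

```python
from collections import deque
from typing import Iterable, Iterator


def transform(events: Iterable[dict]) -> Iterator[dict]:
    """Normalize each event in flight: convert a hypothetical 'temp_f' field to Celsius."""
    for e in events:
        yield {"source": e["source"], "temp_c": round((e["temp_f"] - 32) * 5 / 9, 1)}


def alert_on_window_avg(events: Iterable[dict], window: int = 3,
                        threshold: float = 40.0) -> Iterator[dict]:
    """Pass every event through, flagging it when the moving average of the
    last `window` readings exceeds `threshold` (a very simple CEP-style rule)."""
    recent = deque(maxlen=window)
    for e in events:
        recent.append(e["temp_c"])
        avg = sum(recent) / len(recent)
        # Only alert once the window is full, to avoid firing on sparse data.
        yield {**e, "alert": len(recent) == window and avg > threshold}


if __name__ == "__main__":
    readings = [{"source": "sensor-1", "temp_f": f} for f in (100, 104, 110, 113)]
    for event in alert_on_window_avg(transform(readings)):
        print(event)
```

Because both stages are generators, each event flows through transformation and alerting one at a time, without being stored first.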
CAPABILITY LIST FOR THE "SMART ENTERPRISE BIG DATA BUS"
• Ingest
• Parse
• Filter
• Transform
• Move
• Store
AT SCALE, AND FAST !
• Read
• Synchronize
• Analyse
• Predict
• Alert
• Visualise
Provisioning
Machine Data
Processing
Billing
Enterprise Data
Hub/Lake
Hadoop
Transformation
Analytics, Alerting
Customer
Service
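The first capabilities in the list (ingest, parse, filter, transform, store) can be pictured as chained stages. The following is a minimal single-process sketch under stated assumptions: the JSON record format, the field names, and the in-memory list used as a "store" are hypothetical, and a real deployment would distribute each stage to run at scale.

```python
import json
from typing import Callable, Iterable, Iterator, List


def ingest(lines: Iterable[str]) -> Iterator[str]:
    """Ingest: read raw lines from any source, dropping blanks."""
    for line in lines:
        if line.strip():
            yield line.strip()


def parse(raw: Iterable[str]) -> Iterator[dict]:
    """Parse: decode JSON records, skipping lines that are not valid JSON."""
    for line in raw:
        try:
            yield json.loads(line)
        except ValueError:
            continue


def filter_records(records: Iterable[dict],
                   predicate: Callable[[dict], bool]) -> Iterator[dict]:
    """Filter: keep only records matching the predicate."""
    return (r for r in records if predicate(r))


def transform(records: Iterable[dict],
              fn: Callable[[dict], dict]) -> Iterator[dict]:
    """Transform: apply a per-record function."""
    return (fn(r) for r in records)


def store(records: Iterable[dict], sink: List[dict]) -> int:
    """Store: append records to a sink (a list standing in for a data store)."""
    count = 0
    for r in records:
        sink.append(r)
        count += 1
    return count


def run_pipeline(lines: Iterable[str], sink: List[dict]) -> int:
    """Wire the stages together for hypothetical billing records."""
    records = parse(ingest(lines))
    billing = filter_records(records, lambda r: r.get("app") == "billing")
    enriched = transform(billing, lambda r: {**r, "amount_cents": r["amount"] * 100})
    return store(enriched, sink)
```

Each stage is independent of the others, which mirrors the idea that stages in a workflow can be controlled and scaled separately.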
THE "SMART ENTERPRISE BIG DATA BUS"
"Stations" in the Data Transit System
Read-Write Adapters
Reliable, Fault-tolerant,
Elastic Scalable Distributed Stream
Processing and Transport Fabric
Stream Processing Services provided
by the "Smart Enterprise Big Data Bus"
include UI for Work-flow Orchestration,
Management and Monitoring
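To make the "stations" idea concrete, here is a toy in-memory bus in which each system attaches a write adapter to a topic and any system can publish to it. This is only a sketch: the class and method names are invented for illustration, and it has none of the reliability, fault tolerance, or elastic scaling that the transport fabric described above would provide.

```python
from collections import defaultdict
from typing import Callable, DefaultDict, List


class InMemoryBus:
    """Toy topic-based bus: publishers write to a topic, adapters receive every record."""

    def __init__(self) -> None:
        self._subscribers: DefaultDict[str, List[Callable[[dict], None]]] = defaultdict(list)

    def subscribe(self, topic: str, write_adapter: Callable[[dict], None]) -> None:
        """Attach a station: a callable that writes a record into the target system."""
        self._subscribers[topic].append(write_adapter)

    def publish(self, topic: str, record: dict) -> int:
        """Deliver a record to every station on the topic; return the delivery count."""
        adapters = self._subscribers.get(topic, [])
        for write in adapters:
            write(record)
        return len(adapters)


if __name__ == "__main__":
    hadoop_store: List[dict] = []   # stand-in for the data lake
    nosql_store: List[dict] = []    # stand-in for the enterprise NoSQL database
    bus = InMemoryBus()
    bus.subscribe("billing", hadoop_store.append)
    bus.subscribe("billing", nosql_store.append)
    bus.publish("billing", {"id": 1, "amount": 120})
    print(hadoop_store, nosql_store)
```

Adding a new target system means registering one more adapter on the topic; existing publishers do not change.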
THE SMART ENTERPRISE BIG DATA BUS
ESB (VS. SMART ENTERPRISE BIG DATA BUS)
• ESBs were architected for a different workload in a different era
• Designed for lightweight remote service invocations – not as a high-throughput, full peer-to-peer
data transfer mechanism
• No compute / analytics capability on the wire
• Expensive vertical scaling vs. distributed elastic scale-out with commodity hardware
• Monolithic workflows vs. independent control and elastic scalability of each stage in a workflow
based on compute needs
SMART ENTERPRISE BIG DATA BUS IMPLEMENTATION
• Kafka
• RabbitMQ
• Apache Storm
• Operators
• RT Dashboards
• WebSockets
• CEP
• Filter
• Indexer
• NoSQL
• HDFS, HBase
• PMML
AT SCALE, FAST, EASY !
STREAMANALYTIX ARCHITECTURE BLOCK DIAGRAM
STREAMANALYTIX – ENTERPRISE BIG DATA BUS – CREATION
VISUAL CONFIGURATION – NO CODE – NO COMPLEXITY
MONITORING SCREEN
REAL-TIME DASHBOARD
SAMPLE DEPLOYMENT/HARDWARE SPECIFICATION
External systems
Data Store
Integration layer
Processing and Analytics layer
Storm
RHEL 4 cores, 16GB RAM
vm06dev222
x 1
ElasticSearch
RHEL 8 cores, 32GB RAM
vm06dev218
x 1
PostgreSQL
RHEL 4 cores, 8GB RAM
vm06dev222
x 1
Meta Information Store
Zookeeper
x 1
Kafka+ZK
RHEL 4 cores, and 8GB RAM
vm06dev220
x 1
Free Switch
x 2
Graphite
x 1
Web UI Layer
RHEL 4 cores, and 2GB RAM
vm07eng29,
vm07eng31
IR(Platform)
x 1
RHEL 4 cores, and 2GB RAM
vm07dev97
RHEL 4 cores, 16GB RAM
vm06dev222
RHEL 4 cores, 4GB RAM
vm06dev224
Tomcat(Query Server)
RHEL 4 cores, 8GB RAM
vm06dev218
x 1
Tomcat(Admin+Log UI)
RHEL 4 cores, 16GB RAM
vm06dev222
x 1
Couchbase
RHEL 8 cores, 32GB RAM
vm06dev224
x 1
RabbitMQ
RHEL 4 cores, and 4GB RAM
vm06dev224
x 1
Any real
time source
Filter
Transform
Analyze
Transport Store
Pull Push
Web Front End
WWW
App
Server 1
App
Server 2
P.O.S.
Business
Transaction
ADW
Analyst
Machine
Learning
Pattern
Recognition
Enterprise Data
Hub/Lake
Hadoop
Real-time
Dashboards
CEOs
Office
ERP ERP
Current
EDW
Rules
Alerts
Downstream
Apps
The Smart Enterprise Big Data Bus - Big Data River ?
www.streamanalytix.com
Email: inquiry@streamanalytix.com
Thank you.
Questions??
We are
Hiring!!
Free
Storm
Book
Demo
Booth
S9
Akmal Chaudhri - How to Build Streaming Data Applications: Evaluating the Top...
NoSQLmatters
 
Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It! Boost Performance with Scala – Learn From Those Who’ve Done It!
Boost Performance with Scala – Learn From Those Who’ve Done It!
Hortonworks
 
Ad

Smart Enterprise Big Data Bus for the Modern Responsive Enterprise

  • 1. Smart Enterprise Big Data Bus For The Modern Responsive Enterprise Anand Venugopal June 10, 2015
  • 2. THE JOURNEY SO FAR… • Impetus Technologies – Leading Big Data Solutions Provider • StreamAnalytix – Enterprise Class Streaming Analytics platform – Early Access Program: Aug 2014 – Pilot Projects: October 2014 – February 2015 – GA Launch: February 2015 – Today – Sharing one of the key insights and use-case patterns from pilot
  • 3. SMART ENTERPRISE BIG DATA BUS - AGENDA Why ? • Case Study • Emerging Big Data Landscape and its Challenges What ? • Solution Characteristics • Components Required How ? • Implementation Using StreamAnalytix • Technology Stack Q&A • Summary • Follow up
  • 4. CASE STUDY - HOW IT BEGAN Healthcare giant is speeding up critical business processes by using a streaming analytics platform for real-time data synchronization between their Hadoop platform and their enterprise NoSQL database App-1 App-2 App-3 Enterprise NoSQL Enterprise Data Hub/Lake Hadoop
  • 5. EMERGING ENTERPRISE BIG DATA LANDSCAPE Hadoop Stack – Becoming the Center of the Enterprise Data Universe Business Critical Applications Are Still ''Silos'' Real-time Streaming Analytics Adoption Rapidly Increasing
  • 6. HADOOP – IT MAKES COFFEE TOO! Active Archive ETL Single Source of Truth BI + Analytics Hadoop Streams/ Transactions
  • 11. HOW HADOOP CONNECTS TO THE REST OF IT ALL ! Active Archive ETL Single Source of Truth BI + Analytics Hadoop Streams/ Transactions Enterprise Data Hub/Lake Hadoop ERP CRM Current EDW/ ADW App Server 1 App Server 2 Hub and Spoke Architecture The Integration Challenge – Developing, Maintaining, Managing the Hub-spoke System
  • 12. EMERGING ENTERPRISE BIG DATA LANDSCAPE Hadoop Stack – Becoming the Center of the Enterprise Data Universe Business Critical Applications Are Still ''Silos'' Real-time Streaming Analytics Adoption Rapidly Increasing
  • 13. SILOED ENTERPRISE APPLICATIONS MAY TAKE MANY YEARS TO INTEGRATE WITH HADOOP Billing Customer Service Transaction Processing Provisioning
  • 14. THE DATA INTEGRATION CHALLENGE IS HERE TO STAY Enterprise Data Hub/Lake Hadoop ERP CRM Current EDW/ADW App Server 1 App Server 2 Hub and Spoke Architecture The Integration Challenge – Developing, Maintaining, Managing the Hub-spoke System
  • 15. EMERGING ENTERPRISE BIG DATA LANDSCAPE Hadoop Stack – Becoming the Center of the Enterprise Data Universe Real-time Streaming Analytics Adoption Rapidly Increasing Business Critical Applications Are Still ''Silos''
  • 16. The Modern Enterprise Expected To Be ''Real-time'' Or ''Near Real-time'' Sales Web Site Billing Customer Service Growth of Internet of Things (IoT) and Sensor/ Machine Data Sources Context-sensitive Customer Service
  • 17. The Modern Enterprise Expected To Be ''Real-time'' Or ''Near Real-time'' Sales Web Site Billing Customer Service Mobile Location Based Offers Internet Advertisements Call-center Agent Interactions
  • 18. CONTEXT AWARE  POSITIVE CUSTOMER EXPERIENCE Multi-channel engagement in real-time Context Sensitive service Happy customers, Loyalty, Revenue, Profits, Growth
  • 19. Real-time Responses Need Hi-speed Any-to-Any Data Synchronization & Analytics Over Streaming Data Provisioning Machine Data Processing Billing Customer Service Enterprise Data Hub/Lake Hadoop Developing and managing all the peer-to-peer data interfaces and the hub-spoke model together would be very hard
  • 20. EMERGING ENTERPRISE BIG DATA LANDSCAPE Hadoop Stack – Becoming the Center of the Enterprise Data Universe Business Critical Applications Are Still ''Silos'' Real-time Streaming Analytics Adoption Rapidly Increasing
  • 21. SMART ENTERPRISE BIG DATA BUS - AGENDA Why ? • Case Study • Emerging Big Data Landscape and its Challenges What ? • Solution Characteristics • Components Required How ? • Implementation Using StreamAnalytix • Technology Stack Q&A • Summary • Follow up
  • 22. A HI-SPEED BIG DATA BUS ARCHITECTURE WOULD BE AN EFFICIENT ANY-TO-ANY DATA SYNCHRONIZATION MECHANISM Provisioning Machine Data Processing Billing Customer Service Enterprise Data Hub/Lake Hadoop
  • 23. A HI-SPEED BIG DATA BUS ARCHITECTURE WOULD BE AN EFFICIENT ANY-TO-ANY DATA SYNCHRONIZATION MECHANISM Provisioning Machine Data Processing Billing Customer Service Enterprise Data Hub/Lake Hadoop
  • 24. A HI-SPEED BIG DATA BUS ARCHITECTURE WOULD BE AN EFFICIENT ANY-TO-ANY DATA SYNCHRONIZATION MECHANISM Provisioning Machine Data Processing Billing Customer Service Enterprise Data Hub/Lake Hadoop
  • 25. THE HI-SPEED BIG DATA BUS WOULD ALSO NEED TO SUPPORT ''ON THE WIRE'' COMPUTATION FOR DATA ANALYTICS, TRANSFORMS, CEP Provisioning Machine Data Processing Billing Customer Service Enterprise Data Hub/Lake Hadoop Transformation Analytics, Alerting
  • 26. CAPABILITY LIST FOR THE ''SMART ENTERPRISE BIG DATA BUS'' • Ingest • Parse • Filter • Transform • Move • Store AT SCALE, AND FAST ! • Read • Synchronize • Analyse • Predict • Alert • Visualise Provisioning Machine Data Processing Billing Enterprise Data Hub/Lake Hadoop Transformation Analytics, Alerting Customer Service
  • 27. THE ''SMART ENTERPRISE BIG DATA BUS''
  • 28. ''Stations'' in the Data Transit System Read-Write Adapters Reliable, Fault-tolerant, Elastic Scalable Distributed Stream Processing and Transport Fabric Stream Processing Services provided by the ''Smart Enterprise Big Data Bus'' include UI for Work-flow Orchestration, Management and Monitoring THE SMART ENTERPRISE BIG DATA BUS
  • 29. ESB (VS. SMART ENTERPRISE BIG DATA BUS) • Were architected for a different workload in a different era • Designed for lightweight remote service invocations – not as a heavy throughput full peer-to-peer data transfer mechanism • No compute / analytics capability on the wire • Expensive vertical scaling vs. distributed elastic scale-out with commodity hardware • Monolithic workflows vs. independent control and elastic scalability of each stage in a workflow based on compute needs
  • 30. SMART ENTERPRISE BIG DATA BUS - AGENDA Why ? • Case Study • Emerging Big Data Landscape and its Challenges What ? • Solution Characteristics • Components Required How ? • Implementation Using StreamAnalytix • Technology Stack Q&A • Summary • Follow up
  • 31. SMART ENTERPRISE BIG DATA BUS IMPLEMENTATION • Kafka • RabbitMQ • Apache Storm • Operators • RT Dashboards • Websockets • CEP • Filter • Indexer • NoSQL • HDFS, HBase • PMML AT SCALE, FAST, EASY !
  • 33. STREAMANALYTIX – ENTERPRISE BIG DATA BUS – CREATION
  • 34. VISUAL CONFIGURATION – NO CODE – NO COMPLEXITY
  • 37. SAMPLE DEPLOYMENT/HARDWARE SPECIFICATION External systems Data Store Integration layer Processing and Analytics layer Storm RHEL 4 cores, 16GB RAM vm06dev222 x 1 ElasticSearch RHEL 8 cores, 32GB RAM vm06dev218 x 1 PostgreSQL RHEL 4 cores, 8GB RAM vm06dev222 x 1 Meta Information Store Zookeeper x 1 Kafka+ZK RHEL 4 cores, and 8GB RAM vm06dev220 x 1 Free Switch x 2 Graphite x 1 Web UI Layer RHEL 4 cores, and 2GB RAM vm07eng29, vm07eng31 IR(Platform) x 1 RHEL 4 cores, and 2GB RAM vm07dev97 RHEL 4 cores, 16GB RAM vm06dev222 RHEL 4 cores, 4GB RAM vm06dev224 Tomcat(Query Server) RHEL 4 cores, 8GB RAM vm06dev218 x 1 Tomcat(Admin+Log UI) RHEL 4 cores, 16GB RAM vm06dev222 x 1 Couchbase RHEL 8 cores, 32GB RAM vm06dev224 x 1 RabbitMQ RHEL 4 cores, and 4GB RAM vm06dev224 x 1
  • 38. Any real time source Filter Transform Analyze Transport Store Pull Push Web Front End WWW App Server 1 App Server 2 P.O.S. Business Transaction ADW Analyst Machine Learning Pattern Recognition Enterprise Data Hub/Lake Hadoop Real-time Dashboards CEOs Office ERP ERP Current EDW Rules Alerts Downstream Apps The Smart Enterprise Big Data Bus - Big Data River ?
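
The capability list on slide 26 (ingest, parse, filter, transform, store, alert) and the ''on the wire'' computation on slide 25 can be illustrated with a small in-memory sketch. This is only an assumption-laden toy, not how StreamAnalytix, Kafka, or Storm are implemented: the `Bus` class, the event fields (`source`, `type`, `customer`), and the two sinks are hypothetical names chosen for illustration, and plain Python lists stand in for Hadoop and an alerting channel.

```python
import json
from typing import Callable, Iterable, Iterator

Event = dict


def parse(raw_records: Iterable[str]) -> Iterator[Event]:
    """Parse: decode each JSON record, dropping malformed ones."""
    for raw in raw_records:
        try:
            yield json.loads(raw)
        except json.JSONDecodeError:
            continue


def transform(events: Iterable[Event]) -> Iterator[Event]:
    """Transform: normalize the source name while the event is in transit."""
    for event in events:
        yield {**event, "source": str(event.get("source", "unknown")).lower()}


class Bus:
    """Hypothetical any-to-any bus: every subscriber sees every event it asks for."""

    def __init__(self) -> None:
        self._sinks = []  # list of (filter predicate, sink callable) pairs

    def subscribe(self, wants: Callable[[Event], bool],
                  sink: Callable[[Event], None]) -> None:
        self._sinks.append((wants, sink))

    def run(self, raw_records: Iterable[str]) -> int:
        """Ingest raw records, process them, and fan out; return deliveries made."""
        delivered = 0
        for event in transform(parse(raw_records)):
            for wants, sink in self._sinks:
                if wants(event):  # Filter: per-subscriber predicate
                    sink(event)   # Move/Store/Alert: hand the event to the sink
                    delivered += 1
        return delivered


hadoop_store = []  # stand-in for the Enterprise Data Hub/Lake
alerts = []        # stand-in for an alerting channel

bus = Bus()
bus.subscribe(lambda e: True, hadoop_store.append)
bus.subscribe(
    lambda e: e.get("type") == "call_drop",
    lambda e: alerts.append(f"call drop for customer {e['customer']}"),
)

records = [
    '{"source": "Billing", "type": "invoice", "customer": 7}',
    "not json",
    '{"source": "Provisioning", "type": "call_drop", "customer": 9}',
]
delivered = bus.run(records)
```

Adding a new consumer here is one `subscribe` call rather than a new point-to-point interface, which is the contrast with the hub-and-spoke integration effort that the slides draw. A real deployment would replace the lists with durable, distributed transports and stores.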

Editor's Notes

  • #2: One intro slide of DSP 10 mins for DSP neustar use case (focus on algorithms) Not techy but more business 10 mins for other customer complaints use cases and demo 30 Mins for Impetus Corp 15 mins for BDP
  • #19: Data source – listener for ActiveMQ; secure data streaming from remote servers; alert for call drop events in the main pipeline