SlideShare a Scribd company logo
How to integrate
Qlik with Cloudera
10 de Novembro de 2018
Luciano Assad
Solution Architect – Pre-Sales
2
Agenda
1 Big Data Landscape
2 Hadoop Ecosystem
3 Cloudera
4 15 Points of Integration with Cloudera
5 Qlik Big Data Methodologies
3
Agenda
1 Big Data Landscape
2 Hadoop Ecosystem
3 Cloudera
4 15 Points of Integration with Cloudera
5 Qlik Big Data Methodologies
4
http://mattturck.com/bigdata2017/
5
6
Challenge
Pokemon
or
Big Data?
7
8
9
10
11
12
13
https://pixelastic.github.io/pokemonorbigdata/
- Pokemon or Big Data
14
Agenda
1 Big Data Landscape
2 Hadoop Ecosystem
3 Cloudera
4 15 Points of Integration with Cloudera
5 Qlik Big Data Methodologies
15
What is Apache Hadoop?
• Hadoop is a software framework for storing, processing, and analyzing
“big data”
- Open source
- Distributed
- Scalable
- Fault-tolerant
• Hadoop - Blocks diagram:
HDFS MapReduceYARN
A file system to
manage the storage
of data
A framework to
define a data
processing task
A framework to run
the data processing
task
16
Large Ecosystem
In-Memory,
Data Flow
Engine
Analytical
SQL-on-
Hadoop
NoSQL
Database
Machine
Learning
Search Scripting Integration
&
Streaming
Management
&
Coordinantion
Resource
Management
Storage
Reference: https://www.dotnettricks.com/learn/hadoop/apache-hadoop-ecosystem-and-components
17
Large Ecosystem
In-Memory,
Data Flow
Engine
Analytical
SQL-on-
Hadoop
NoSQL
Database
Machine
Learning
Search Scripting Integration
&
Streaming
Management
&
Coordinantion
Resource
Management
Storage
Reference: https://www.dotnettricks.com/learn/hadoop/apache-hadoop-ecosystem-and-components
Most used for BI
18
Agenda
1 Big Data Landscape
2 Hadoop Ecosystem
3 Cloudera
4 15 Points of Integration with Cloudera
5 Qlik Big Data Methodologies
19
CDH
Cloudera’s Distribution including Apache Hadoop - Components
Hadoop Distributed File System
YARN and MapReduce
Spark
HBase Flume
Sqoop Hive
Impala
Solr
...
Hadoop
Ecosystem
Hadoop Core
Components
CDH
20
CDH - Important Components
Component Definition
Project What does itdo?
Spark In-memory execution framework
HBase NoSQL database built onHDFS
Hive SQL processing engine designed for batch workloads
Impala SQL query engine designed for BI workloads
Parquet Very efficient columnar data storage format
Sqoop Data movement to/from RDBMSs
Flume, Kafka Streaming dataingestion
Solr Enables users to find the data they need
Hue Web-based user interface for Hadoop
Oozie Workflow scheduler used to managejobs
Sentry Authorization tool, providing security for Hadoop
21
How do I create a Lab Environment?
Cloudera QuickStats
https://goo.gl/zwwDRg
22
How do I create a Lab Environment?
Cloudera QuickStats
https://goo.gl/zwwDRg
Recommended requirements:
4 cores - 12 GB RAM
23
Agenda
1 Big Data Landscape
2 Hadoop Ecosystem
3 Cloudera
4 15 Points of Integration with Cloudera
5 Qlik Big Data Methodologies
24
Qlik + Cloudera
15 Points of Integration with Cloudera
Go Beyond SQL
Fast & Flexible
BI & Analytics
Enterprise Ready
Data Lake Browser
Cloudera Data Explorer
Writeback with Kudu
Interactive analytics
IoT and Kafka Integration
Event driven / Streaming analytics
App on Demand w/ Impala
In memory user generated slices
Direct Query w/ Impala
Data stored in Parquet or Kudu
Complex Data Types with Impala
Maps, arrays, and structures
Data Science Workbench
Powered by Qlik Associative Engine
Advanced Analytics
Integration with Spark/Python/R
Solr Integration
In-memory apps built on Solr Data
Qlik Solr-API App on Demand
Search + QAP + D3js
Cloudera Altus
Analytic DB Integration
Cloudera Metadata Miner
Impala, Cloudera Manager, Navigator
SAP Offload with Attunity
SAP S&D Module into HDFS/Impala
Security – SSO Support
Kerberos delegation/SSO pass-thru
Cloudera Metrics Dashboard
REST API based management
console for Cloudera Manager
25
DEMO
Cloudera Data Lake Explorer
&
Cloudera Metadata Miner
26
Qlik + Cloudera
Cloudera Data Lake Explorer
https://goo.gl/g7PywC
27
Qlik + Cloudera
Cloudera Data Lake Explorer
https://goo.gl/g7PywC
28
Qlik + Cloudera
Cloudera Data Lake Explorer
https://goo.gl/g7PywC
29
Qlik + Cloudera
Cloudera Data Lake Explorer
https://goo.gl/g7PywC
30
Qlik + Cloudera
Cloudera Metadata Catalog
https://goo.gl/g7PywC
31
Qlik + Cloudera
http://cloudera.qlik.com/
Where to find more information?
32
Agenda
1 Big Data Landscape
2 Hadoop Ecosystem
3 Cloudera
4 15 Points of Integration with Cloudera
5 Qlik Big Data Methodologies
33
Qlik Big Data Methodologies
Different data volumes and complexities are best met using different methods
Different methods ensure an optimized
experience for the user for every situation
Methods can be combined to meet different
use cases
Methods vary in deployment complexity
Data Volume
• Size (rows)
• Dimensions
(columns)
• Cardinality
(uniqueness)
App Complexity
• Computational
complexity such
as set analysis
• Object density
Segmentation
Chaining
In-Memory
On-Demand
App Generation
On-Demand App
Generation (API’s)
34
On Demand App Generation
1. User views summary data in
Selection App and selects a slice of
data
2. User requests the Analysis App to
be built
3. Source data is extracted and
Analysis App is created
4. Repeat steps 1-3 as many times as
needed
Big Data Repository
Selection
App
Summary
Data
1
Analysis App
Request
Request
2
2
3
35
ODAG – Selection App
Aggregated Data Dictionary
36
ODAG – Analysis App
2. Where-Statement Generation
1. Binding of Selections
37
DEMO
On Demand App Generation
38
On Demand App Generation
Too hard? Ask a wizard to help you !
https://goo.gl/dNkdB7
39
DEMO
ODAG Wizard
40
Will you share this presentation?
https://goo.gl/ZMcGP9
Obrigado !
Luciano Assad
Solution Architect – Pre-Sales Brasil
luciano.assad@qlik.com
Ad

More Related Content

What's hot (20)

Big Data and ML on Google Cloud
Big Data and ML on Google CloudBig Data and ML on Google Cloud
Big Data and ML on Google Cloud
Wlodek Bielski
 
#DataUnlimited - Google Big Data Unlimited
#DataUnlimited - Google Big Data Unlimited#DataUnlimited - Google Big Data Unlimited
#DataUnlimited - Google Big Data Unlimited
Audrey Huvet
 
Containerizing the Cloud with Kubernetes and Docker
Containerizing the Cloud with Kubernetes and DockerContainerizing the Cloud with Kubernetes and Docker
Containerizing the Cloud with Kubernetes and Docker
James Chittenden
 
Introduction to Google Cloud Platform for Big Data - Trusted Conf
Introduction to Google Cloud Platform for Big Data - Trusted ConfIntroduction to Google Cloud Platform for Big Data - Trusted Conf
Introduction to Google Cloud Platform for Big Data - Trusted Conf
In Marketing We Trust
 
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Imam Raza
 
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)Spark and Hadoop at Production Scale-(Anil Gadre, MapR)
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)
Spark Summit
 
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Chris Jang
 
AWS Cost Reduction and Management Plan
AWS Cost Reduction and Management PlanAWS Cost Reduction and Management Plan
AWS Cost Reduction and Management Plan
Michael J Geiser
 
Google Cloud Dataflow
Google Cloud DataflowGoogle Cloud Dataflow
Google Cloud Dataflow
Alex Van Boxel
 
SEC302 Twitter's GCP Architecture for its petabyte scale data storage in gcs...
SEC302  Twitter's GCP Architecture for its petabyte scale data storage in gcs...SEC302  Twitter's GCP Architecture for its petabyte scale data storage in gcs...
SEC302 Twitter's GCP Architecture for its petabyte scale data storage in gcs...
Vrushali Channapattan
 
Solving Hybrid Cloud Data Replication with Apache Cassandra
Solving Hybrid Cloud Data Replication with Apache CassandraSolving Hybrid Cloud Data Replication with Apache Cassandra
Solving Hybrid Cloud Data Replication with Apache Cassandra
Aaron Ploetz
 
IoT at Google Scale
IoT at Google ScaleIoT at Google Scale
IoT at Google Scale
James Chittenden
 
StackEngine Demo - Docker Austin
StackEngine Demo - Docker AustinStackEngine Demo - Docker Austin
StackEngine Demo - Docker Austin
Boyd Hemphill
 
GCP Gaming 2016 Seoul, Korea Gaming Analytics
GCP Gaming 2016 Seoul, Korea Gaming AnalyticsGCP Gaming 2016 Seoul, Korea Gaming Analytics
GCP Gaming 2016 Seoul, Korea Gaming Analytics
Chris Jang
 
A Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
A Microservices approach with Cassandra and Quarkus | DevNation Tech TalkA Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
A Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
Red Hat Developers
 
Google Cloud Platform at Vente-Exclusive.com
Google Cloud Platform at Vente-Exclusive.comGoogle Cloud Platform at Vente-Exclusive.com
Google Cloud Platform at Vente-Exclusive.com
Alex Van Boxel
 
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Alluxio, Inc.
 
Google Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your ProductGoogle Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your Product
Sergey Smetanin
 
Google Bigtable
Google BigtableGoogle Bigtable
Google Bigtable
GirdhareeSaran
 
Critical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsCritical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and Analytics
Data Driven Innovation
 
Big Data and ML on Google Cloud
Big Data and ML on Google CloudBig Data and ML on Google Cloud
Big Data and ML on Google Cloud
Wlodek Bielski
 
#DataUnlimited - Google Big Data Unlimited
#DataUnlimited - Google Big Data Unlimited#DataUnlimited - Google Big Data Unlimited
#DataUnlimited - Google Big Data Unlimited
Audrey Huvet
 
Containerizing the Cloud with Kubernetes and Docker
Containerizing the Cloud with Kubernetes and DockerContainerizing the Cloud with Kubernetes and Docker
Containerizing the Cloud with Kubernetes and Docker
James Chittenden
 
Introduction to Google Cloud Platform for Big Data - Trusted Conf
Introduction to Google Cloud Platform for Big Data - Trusted ConfIntroduction to Google Cloud Platform for Big Data - Trusted Conf
Introduction to Google Cloud Platform for Big Data - Trusted Conf
In Marketing We Trust
 
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Imam Raza
 
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)Spark and Hadoop at Production Scale-(Anil Gadre, MapR)
Spark and Hadoop at Production Scale-(Anil Gadre, MapR)
Spark Summit
 
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Chris Jang
 
AWS Cost Reduction and Management Plan
AWS Cost Reduction and Management PlanAWS Cost Reduction and Management Plan
AWS Cost Reduction and Management Plan
Michael J Geiser
 
SEC302 Twitter's GCP Architecture for its petabyte scale data storage in gcs...
SEC302  Twitter's GCP Architecture for its petabyte scale data storage in gcs...SEC302  Twitter's GCP Architecture for its petabyte scale data storage in gcs...
SEC302 Twitter's GCP Architecture for its petabyte scale data storage in gcs...
Vrushali Channapattan
 
Solving Hybrid Cloud Data Replication with Apache Cassandra
Solving Hybrid Cloud Data Replication with Apache CassandraSolving Hybrid Cloud Data Replication with Apache Cassandra
Solving Hybrid Cloud Data Replication with Apache Cassandra
Aaron Ploetz
 
StackEngine Demo - Docker Austin
StackEngine Demo - Docker AustinStackEngine Demo - Docker Austin
StackEngine Demo - Docker Austin
Boyd Hemphill
 
GCP Gaming 2016 Seoul, Korea Gaming Analytics
GCP Gaming 2016 Seoul, Korea Gaming AnalyticsGCP Gaming 2016 Seoul, Korea Gaming Analytics
GCP Gaming 2016 Seoul, Korea Gaming Analytics
Chris Jang
 
A Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
A Microservices approach with Cassandra and Quarkus | DevNation Tech TalkA Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
A Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
Red Hat Developers
 
Google Cloud Platform at Vente-Exclusive.com
Google Cloud Platform at Vente-Exclusive.comGoogle Cloud Platform at Vente-Exclusive.com
Google Cloud Platform at Vente-Exclusive.com
Alex Van Boxel
 
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Alluxio, Inc.
 
Google Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your ProductGoogle Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your Product
Sergey Smetanin
 
Critical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsCritical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and Analytics
Data Driven Innovation
 

Similar to QMeeting 2018 - Como integrar qlik e cloudera (20)

Hybrid data lake on google cloud with alluxio and dataproc
Hybrid data lake on google cloud  with alluxio and dataprocHybrid data lake on google cloud  with alluxio and dataproc
Hybrid data lake on google cloud with alluxio and dataproc
Alluxio, Inc.
 
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and HadoopGoogle Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
huguk
 
Eric Andersen Keynote
Eric Andersen KeynoteEric Andersen Keynote
Eric Andersen Keynote
Data Con LA
 
Getting started with GCP ( Google Cloud Platform)
Getting started with GCP ( Google  Cloud Platform)Getting started with GCP ( Google  Cloud Platform)
Getting started with GCP ( Google Cloud Platform)
bigdata trunk
 
Data platform modernization with Databricks.pptx
Data platform modernization with Databricks.pptxData platform modernization with Databricks.pptx
Data platform modernization with Databricks.pptx
CalvinSim10
 
Workshop on Google Cloud Data Platform
Workshop on Google Cloud Data PlatformWorkshop on Google Cloud Data Platform
Workshop on Google Cloud Data Platform
GoDataDriven
 
Slides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-CloudSlides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-Cloud
DATAVERSITY
 
Session 8 - Creating Data Processing Services | Train the Trainers Program
Session 8 - Creating Data Processing Services | Train the Trainers ProgramSession 8 - Creating Data Processing Services | Train the Trainers Program
Session 8 - Creating Data Processing Services | Train the Trainers Program
FIWARE
 
Data Platform on GCP
Data Platform on GCPData Platform on GCP
Data Platform on GCP
Patrick Alexander
 
Accelerating workloads and bursting data with Google Dataproc & Alluxio
Accelerating workloads and bursting data with Google Dataproc & AlluxioAccelerating workloads and bursting data with Google Dataproc & Alluxio
Accelerating workloads and bursting data with Google Dataproc & Alluxio
Big Data Aplications Meetup
 
Day 13 - Creating Data Processing Services | Train the Trainers Program
Day 13 - Creating Data Processing Services | Train the Trainers ProgramDay 13 - Creating Data Processing Services | Train the Trainers Program
Day 13 - Creating Data Processing Services | Train the Trainers Program
FIWARE
 
High-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache ImpalaHigh-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache Impala
Cloudera, Inc.
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop Solution
Hitachi Vantara
 
zData Inc. Big Data Consulting and Services - Overview and Summary
zData Inc. Big Data Consulting and Services - Overview and SummaryzData Inc. Big Data Consulting and Services - Overview and Summary
zData Inc. Big Data Consulting and Services - Overview and Summary
zData Inc.
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
DATAVERSITY
 
How @TwitterHadoop Chose Google Cloud, Joep Rottinghuis, Lohit VijayaRenu
How @TwitterHadoop Chose Google Cloud, Joep Rottinghuis, Lohit VijayaRenuHow @TwitterHadoop Chose Google Cloud, Joep Rottinghuis, Lohit VijayaRenu
How @TwitterHadoop Chose Google Cloud, Joep Rottinghuis, Lohit VijayaRenu
Yahoo Developer Network
 
Rapids: Data Science on GPUs
Rapids: Data Science on GPUsRapids: Data Science on GPUs
Rapids: Data Science on GPUs
inside-BigData.com
 
NVIDIA Rapids presentation
NVIDIA Rapids presentationNVIDIA Rapids presentation
NVIDIA Rapids presentation
testSri1
 
BigQuery - Snowflake compete deck _ Sales _ Y21.pptx
BigQuery - Snowflake compete deck _ Sales _ Y21.pptxBigQuery - Snowflake compete deck _ Sales _ Y21.pptx
BigQuery - Snowflake compete deck _ Sales _ Y21.pptx
Erkan Çiftçi
 
Oct 2011 CHADNUG Presentation on Hadoop
Oct 2011 CHADNUG Presentation on HadoopOct 2011 CHADNUG Presentation on Hadoop
Oct 2011 CHADNUG Presentation on Hadoop
Josh Patterson
 
Hybrid data lake on google cloud with alluxio and dataproc
Hybrid data lake on google cloud  with alluxio and dataprocHybrid data lake on google cloud  with alluxio and dataproc
Hybrid data lake on google cloud with alluxio and dataproc
Alluxio, Inc.
 
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and HadoopGoogle Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
huguk
 
Eric Andersen Keynote
Eric Andersen KeynoteEric Andersen Keynote
Eric Andersen Keynote
Data Con LA
 
Getting started with GCP ( Google Cloud Platform)
Getting started with GCP ( Google  Cloud Platform)Getting started with GCP ( Google  Cloud Platform)
Getting started with GCP ( Google Cloud Platform)
bigdata trunk
 
Data platform modernization with Databricks.pptx
Data platform modernization with Databricks.pptxData platform modernization with Databricks.pptx
Data platform modernization with Databricks.pptx
CalvinSim10
 
Workshop on Google Cloud Data Platform
Workshop on Google Cloud Data PlatformWorkshop on Google Cloud Data Platform
Workshop on Google Cloud Data Platform
GoDataDriven
 
Slides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-CloudSlides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-Cloud
DATAVERSITY
 
Session 8 - Creating Data Processing Services | Train the Trainers Program
Session 8 - Creating Data Processing Services | Train the Trainers ProgramSession 8 - Creating Data Processing Services | Train the Trainers Program
Session 8 - Creating Data Processing Services | Train the Trainers Program
FIWARE
 
Accelerating workloads and bursting data with Google Dataproc & Alluxio
Accelerating workloads and bursting data with Google Dataproc & AlluxioAccelerating workloads and bursting data with Google Dataproc & Alluxio
Accelerating workloads and bursting data with Google Dataproc & Alluxio
Big Data Aplications Meetup
 
Day 13 - Creating Data Processing Services | Train the Trainers Program
Day 13 - Creating Data Processing Services | Train the Trainers ProgramDay 13 - Creating Data Processing Services | Train the Trainers Program
Day 13 - Creating Data Processing Services | Train the Trainers Program
FIWARE
 
High-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache ImpalaHigh-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache Impala
Cloudera, Inc.
 
Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop Solution
Hitachi Vantara
 
zData Inc. Big Data Consulting and Services - Overview and Summary
zData Inc. Big Data Consulting and Services - Overview and SummaryzData Inc. Big Data Consulting and Services - Overview and Summary
zData Inc. Big Data Consulting and Services - Overview and Summary
zData Inc.
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
DATAVERSITY
 
How @TwitterHadoop Chose Google Cloud, Joep Rottinghuis, Lohit VijayaRenu
How @TwitterHadoop Chose Google Cloud, Joep Rottinghuis, Lohit VijayaRenuHow @TwitterHadoop Chose Google Cloud, Joep Rottinghuis, Lohit VijayaRenu
How @TwitterHadoop Chose Google Cloud, Joep Rottinghuis, Lohit VijayaRenu
Yahoo Developer Network
 
NVIDIA Rapids presentation
NVIDIA Rapids presentationNVIDIA Rapids presentation
NVIDIA Rapids presentation
testSri1
 
BigQuery - Snowflake compete deck _ Sales _ Y21.pptx
BigQuery - Snowflake compete deck _ Sales _ Y21.pptxBigQuery - Snowflake compete deck _ Sales _ Y21.pptx
BigQuery - Snowflake compete deck _ Sales _ Y21.pptx
Erkan Çiftçi
 
Oct 2011 CHADNUG Presentation on Hadoop
Oct 2011 CHADNUG Presentation on HadoopOct 2011 CHADNUG Presentation on Hadoop
Oct 2011 CHADNUG Presentation on Hadoop
Josh Patterson
 
Ad

More from Roberto Oliveira (20)

10 mandamentos do projeto de BI
10 mandamentos do projeto de BI10 mandamentos do projeto de BI
10 mandamentos do projeto de BI
Roberto Oliveira
 
Qmeeting 2018 - Utilizando Mashups no Qlik Sense
Qmeeting 2018 - Utilizando Mashups no Qlik SenseQmeeting 2018 - Utilizando Mashups no Qlik Sense
Qmeeting 2018 - Utilizando Mashups no Qlik Sense
Roberto Oliveira
 
Qmeeting 2018 - Web Connectors para Qlik
Qmeeting 2018 -  Web Connectors para QlikQmeeting 2018 -  Web Connectors para Qlik
Qmeeting 2018 - Web Connectors para Qlik
Roberto Oliveira
 
QMeeting 2018 - Utilizando o Qlik core
QMeeting 2018 - Utilizando o Qlik coreQMeeting 2018 - Utilizando o Qlik core
QMeeting 2018 - Utilizando o Qlik core
Roberto Oliveira
 
Datalakers 2018 Qmeeting
Datalakers 2018 QmeetingDatalakers 2018 Qmeeting
Datalakers 2018 Qmeeting
Roberto Oliveira
 
Qmeeting Experts Hands on
Qmeeting Experts Hands onQmeeting Experts Hands on
Qmeeting Experts Hands on
Roberto Oliveira
 
Tuning de performance_qmeeting2018
Tuning de performance_qmeeting2018Tuning de performance_qmeeting2018
Tuning de performance_qmeeting2018
Roberto Oliveira
 
Design para Analise de Dados - Thiago Pessato
Design para Analise de Dados - Thiago PessatoDesign para Analise de Dados - Thiago Pessato
Design para Analise de Dados - Thiago Pessato
Roberto Oliveira
 
Carreiras em analise de dados
Carreiras em analise de dadosCarreiras em analise de dados
Carreiras em analise de dados
Roberto Oliveira
 
Data Science Qmeeting 2018
Data Science Qmeeting 2018Data Science Qmeeting 2018
Data Science Qmeeting 2018
Roberto Oliveira
 
Modelagem de dados para Qlik Qmeeting 2018
Modelagem de dados para Qlik Qmeeting 2018Modelagem de dados para Qlik Qmeeting 2018
Modelagem de dados para Qlik Qmeeting 2018
Roberto Oliveira
 
Mapas com qlik qmeeting 2018
Mapas com qlik qmeeting 2018Mapas com qlik qmeeting 2018
Mapas com qlik qmeeting 2018
Roberto Oliveira
 
Machine learning qmeeting 2018
Machine learning qmeeting 2018Machine learning qmeeting 2018
Machine learning qmeeting 2018
Roberto Oliveira
 
Business Analytics com Tableau Qmeeting 2018
Business Analytics com Tableau Qmeeting 2018Business Analytics com Tableau Qmeeting 2018
Business Analytics com Tableau Qmeeting 2018
Roberto Oliveira
 
Abertura Qmeeting 2018
Abertura Qmeeting 2018Abertura Qmeeting 2018
Abertura Qmeeting 2018
Roberto Oliveira
 
CONHEÇA O POWER BI - QMEETING 2018
CONHEÇA O POWER BI - QMEETING 2018CONHEÇA O POWER BI - QMEETING 2018
CONHEÇA O POWER BI - QMEETING 2018
Roberto Oliveira
 
Sistema licenciamento qliksense
Sistema licenciamento qliksenseSistema licenciamento qliksense
Sistema licenciamento qliksense
Roberto Oliveira
 
Qmeeting 2015 Big Data
Qmeeting 2015 Big DataQmeeting 2015 Big Data
Qmeeting 2015 Big Data
Roberto Oliveira
 
Qmeeting2015 Boas_vindas
Qmeeting2015 Boas_vindasQmeeting2015 Boas_vindas
Qmeeting2015 Boas_vindas
Roberto Oliveira
 
Qmeeting Pequenos_erros_grandes_problemas_Yuri
Qmeeting Pequenos_erros_grandes_problemas_YuriQmeeting Pequenos_erros_grandes_problemas_Yuri
Qmeeting Pequenos_erros_grandes_problemas_Yuri
Roberto Oliveira
 
10 mandamentos do projeto de BI
10 mandamentos do projeto de BI10 mandamentos do projeto de BI
10 mandamentos do projeto de BI
Roberto Oliveira
 
Qmeeting 2018 - Utilizando Mashups no Qlik Sense
Qmeeting 2018 - Utilizando Mashups no Qlik SenseQmeeting 2018 - Utilizando Mashups no Qlik Sense
Qmeeting 2018 - Utilizando Mashups no Qlik Sense
Roberto Oliveira
 
Qmeeting 2018 - Web Connectors para Qlik
Qmeeting 2018 -  Web Connectors para QlikQmeeting 2018 -  Web Connectors para Qlik
Qmeeting 2018 - Web Connectors para Qlik
Roberto Oliveira
 
QMeeting 2018 - Utilizando o Qlik core
QMeeting 2018 - Utilizando o Qlik coreQMeeting 2018 - Utilizando o Qlik core
QMeeting 2018 - Utilizando o Qlik core
Roberto Oliveira
 
Tuning de performance_qmeeting2018
Tuning de performance_qmeeting2018Tuning de performance_qmeeting2018
Tuning de performance_qmeeting2018
Roberto Oliveira
 
Design para Analise de Dados - Thiago Pessato
Design para Analise de Dados - Thiago PessatoDesign para Analise de Dados - Thiago Pessato
Design para Analise de Dados - Thiago Pessato
Roberto Oliveira
 
Carreiras em analise de dados
Carreiras em analise de dadosCarreiras em analise de dados
Carreiras em analise de dados
Roberto Oliveira
 
Data Science Qmeeting 2018
Data Science Qmeeting 2018Data Science Qmeeting 2018
Data Science Qmeeting 2018
Roberto Oliveira
 
Modelagem de dados para Qlik Qmeeting 2018
Modelagem de dados para Qlik Qmeeting 2018Modelagem de dados para Qlik Qmeeting 2018
Modelagem de dados para Qlik Qmeeting 2018
Roberto Oliveira
 
Mapas com qlik qmeeting 2018
Mapas com qlik qmeeting 2018Mapas com qlik qmeeting 2018
Mapas com qlik qmeeting 2018
Roberto Oliveira
 
Machine learning qmeeting 2018
Machine learning qmeeting 2018Machine learning qmeeting 2018
Machine learning qmeeting 2018
Roberto Oliveira
 
Business Analytics com Tableau Qmeeting 2018
Business Analytics com Tableau Qmeeting 2018Business Analytics com Tableau Qmeeting 2018
Business Analytics com Tableau Qmeeting 2018
Roberto Oliveira
 
CONHEÇA O POWER BI - QMEETING 2018
CONHEÇA O POWER BI - QMEETING 2018CONHEÇA O POWER BI - QMEETING 2018
CONHEÇA O POWER BI - QMEETING 2018
Roberto Oliveira
 
Sistema licenciamento qliksense
Sistema licenciamento qliksenseSistema licenciamento qliksense
Sistema licenciamento qliksense
Roberto Oliveira
 
Qmeeting Pequenos_erros_grandes_problemas_Yuri
Qmeeting Pequenos_erros_grandes_problemas_YuriQmeeting Pequenos_erros_grandes_problemas_Yuri
Qmeeting Pequenos_erros_grandes_problemas_Yuri
Roberto Oliveira
 
Ad

Recently uploaded (20)

Medical Dataset including visualizations
Medical Dataset including visualizationsMedical Dataset including visualizations
Medical Dataset including visualizations
vishrut8750588758
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia
Alexander Romero Arosquipa
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
Simran112433
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 
FPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptxFPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptx
ssuser4ef83d
 
VKS-Python Basics for Beginners and advance.pptx
VKS-Python Basics for Beginners and advance.pptxVKS-Python Basics for Beginners and advance.pptx
VKS-Python Basics for Beginners and advance.pptx
Vinod Srivastava
 
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPerencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
PareaRusan
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
How iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost FundsHow iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost Funds
ireneschmid345
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
Data Analytics Overview and its applications
Data Analytics Overview and its applicationsData Analytics Overview and its applications
Data Analytics Overview and its applications
JanmejayaMishra7
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag
fardin123rahman07
 
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjksPpt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
panchariyasahil
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
LLM finetuning for multiple choice google bert
LLM finetuning for multiple choice google bertLLM finetuning for multiple choice google bert
LLM finetuning for multiple choice google bert
ChadapornK
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 
Medical Dataset including visualizations
Medical Dataset including visualizationsMedical Dataset including visualizations
Medical Dataset including visualizations
vishrut8750588758
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..Secure_File_Storage_Hybrid_Cryptography.pptx..
Secure_File_Storage_Hybrid_Cryptography.pptx..
yuvarajreddy2002
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
Simran112433
 
Digilocker under workingProcess Flow.pptx
Digilocker  under workingProcess Flow.pptxDigilocker  under workingProcess Flow.pptx
Digilocker under workingProcess Flow.pptx
satnamsadguru491
 
FPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptxFPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptx
ssuser4ef83d
 
VKS-Python Basics for Beginners and advance.pptx
VKS-Python Basics for Beginners and advance.pptxVKS-Python Basics for Beginners and advance.pptx
VKS-Python Basics for Beginners and advance.pptx
Vinod Srivastava
 
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPerencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
PareaRusan
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
How iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost FundsHow iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost Funds
ireneschmid345
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
Data Analytics Overview and its applications
Data Analytics Overview and its applicationsData Analytics Overview and its applications
Data Analytics Overview and its applications
JanmejayaMishra7
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag
fardin123rahman07
 
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjksPpt. Nikhil.pptxnshwuudgcudisisshvehsjks
Ppt. Nikhil.pptxnshwuudgcudisisshvehsjks
panchariyasahil
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
LLM finetuning for multiple choice google bert
LLM finetuning for multiple choice google bertLLM finetuning for multiple choice google bert
LLM finetuning for multiple choice google bert
ChadapornK
 
Stack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptxStack_and_Queue_Presentation_Final (1).pptx
Stack_and_Queue_Presentation_Final (1).pptx
binduraniha86
 

QMeeting 2018 - Como integrar qlik e cloudera

  • 1. How to integrate Qlik with Cloudera 10 de Novembro de 2018 Luciano Assad Solution Architect – Pre-Sales
  • 2. 2 Agenda 1 Big Data Landscape 2 Hadoop Ecosystem 3 Cloudera 4 15 Points of Integration with Cloudera 5 Qlik Big Data Methodologies
  • 3. 3 Agenda 1 Big Data Landscape 2 Hadoop Ecosystem 3 Cloudera 4 15 Points of Integration with Cloudera 5 Qlik Big Data Methodologies
  • 5. 5
  • 7. 7
  • 8. 8
  • 9. 9
  • 10. 10
  • 11. 11
  • 12. 12
  • 14. 14 Agenda 1 Big Data Landscape 2 Hadoop Ecosystem 3 Cloudera 4 15 Points of Integration with Cloudera 5 Qlik Big Data Methodologies
  • 15. 15 What is Apache Hadoop? • Hadoop is a software framework for storing, processing, and analyzing “big data” - Open source - Distributed - Scalable - Fault-tolerant • Hadoop - Blocks diagram: HDFS MapReduceYARN A file system to manage the storage of data A framework to define a data processing task A framework to run the data processing task
  • 16. 16 Large Ecosystem In-Memory, Data Flow Engine Analytical SQL-on- Hadoop NoSQL Database Machine Learning Search Scripting Integration & Streaming Management & Coordinantion Resource Management Storage Reference: https://www.dotnettricks.com/learn/hadoop/apache-hadoop-ecosystem-and-components
  • 17. 17 Large Ecosystem In-Memory, Data Flow Engine Analytical SQL-on- Hadoop NoSQL Database Machine Learning Search Scripting Integration & Streaming Management & Coordinantion Resource Management Storage Reference: https://www.dotnettricks.com/learn/hadoop/apache-hadoop-ecosystem-and-components Most used for BI
  • 18. 18 Agenda 1 Big Data Landscape 2 Hadoop Ecosystem 3 Cloudera 4 15 Points of Integration with Cloudera 5 Qlik Big Data Methodologies
  • 19. 19 CDH Cloudera’s Distribution including Apache Hadoop - Components Hadoop Distributed File System YARN and MapReduce Spark HBase Flume Sqoop Hive Impala Solr ... Hadoop Ecosystem Hadoop Core Components CDH
  • 20. 20 CDH - Important Components Component Definition Project What does itdo? Spark In-memory execution framework HBase NoSQL database built onHDFS Hive SQL processing engine designed for batch workloads Impala SQL query engine designed for BI workloads Parquet Very efficient columnar data storage format Sqoop Data movement to/from RDBMSs Flume, Kafka Streaming dataingestion Solr Enables users to find the data they need Hue Web-based user interface for Hadoop Oozie Workflow scheduler used to managejobs Sentry Authorization tool, providing security for Hadoop
  • 21. 21 How do I create a Lab Environment? Cloudera QuickStats https://goo.gl/zwwDRg
  • 22. 22 How do I create a Lab Environment? Cloudera QuickStats https://goo.gl/zwwDRg Recommended requirements: 4 cores - 12 GB RAM
  • 23. 23 Agenda 1 Big Data Landscape 2 Hadoop Ecosystem 3 Cloudera 4 15 Points of Integration with Cloudera 5 Qlik Big Data Methodologies
  • 24. 24 Qlik + Cloudera 15 Points of Integration with Cloudera Go Beyond SQL Fast & Flexible BI & Analytics Enterprise Ready Data Lake Browser Cloudera Data Explorer Writeback with Kudu Interactive analytics IoT and Kafka Integration Event driven / Streaming analytics App on Demand w/ Impala In memory user generated slices Direct Query w/ Impala Data stored in Parquet or Kudu Complex Data Types with Impala Maps, arrays, and structures Data Science Workbench Powered by Qlik Associative Engine Advanced Analytics Integration with Spark/Python/R Solr Integration In-memory apps built on Solr Data Qlik Solr-API App on Demand Search + QAP + D3js Cloudera Altus Analytic DB Integration Cloudera Metadata Miner Impala, Cloudera Manager, Navigator SAP Offload with Attunity SAP S&D Module into HDFS/Impala Security – SSO Support Kerberos delegation/SSO pass-thru Cloudera Metrics Dashboard REST API based management console for Cloudera Manager
  • 25. 25 DEMO Cloudera Data Lake Explorer & Cloudera Metadata Miner
  • 26. 26 Qlik + Cloudera Cloudera Data Lake Explorer https://goo.gl/g7PywC
  • 27. 27 Qlik + Cloudera Cloudera Data Lake Explorer https://goo.gl/g7PywC
  • 28. 28 Qlik + Cloudera Cloudera Data Lake Explorer https://goo.gl/g7PywC
  • 29. 29 Qlik + Cloudera Cloudera Data Lake Explorer https://goo.gl/g7PywC
  • 30. 30 Qlik + Cloudera Cloudera Metadata Catalog https://goo.gl/g7PywC
  • 32. 32 Agenda 1 Big Data Landscape 2 Hadoop Ecosystem 3 Cloudera 4 15 Points of Integration with Cloudera 5 Qlik Big Data Methodologies
  • 33. 33 Qlik Big Data Methodologies Different data volumes and complexities are best met using different methods Different methods ensure an optimized experience for the user for every situation Methods can be combined to meet different use cases Methods vary in deployment complexity Data Volume • Size (rows) • Dimensions (columns) • Cardinality (uniqueness) App Complexity • Computational complexity such as set analysis • Object density Segmentation Chaining In-Memory On-Demand App Generation On-Demand App Generation (API’s)
  • 34. 34 On Demand App Generation 1. User views summary data in Selection App and selects a slice of data 2. User requests the Analysis App to be built 3. Source data is extracted and Analysis App is created 4. Repeat steps 1-3 as many times as needed Big Data Repository Selection App Summary Data 1 Analysis App Request Request 2 2 3
  • 35. 35 ODAG – Selection App Aggregated Data Dictionary
  • 36. 36 ODAG – Analysis App 2. Where-Statement Generation 1. Binding of Selections
  • 37. 37 DEMO On Demand App Generation
  • 38. 38 On Demand App Generation Too hard? Ask a wizard to help you ! https://goo.gl/dNkdB7
  • 40. 40 Will you share this presentation? https://goo.gl/ZMcGP9
  • 41. Obrigado ! Luciano Assad Solution Architect – Pre-Sales Brasil [email protected]