SlideShare a Scribd company logo
Self Service Analytics
November 2020
© 2019 g2o LLC; proprietary and confidential
Uttam Channegowda
Practice Director
Big Data and Data Engineering
g2o
Speakers
Click icon to
add picture
Paul Moxon
SVP Data Architectures
Denodo Technologies
about g2o
3
400+
analysts, designers,
developers,
engineers,
researchers, and
strategists
Largest digital
experience and
technology shop
based in Ohio technology,
data, experience
design
We stay close to
our clients
for speed, efficiency,
and deep
collaboration
25+ YEARS
EXPERIENCE
© 2020 g2o, LLC; proprietary and confidential
strategically organizing and architecting your data
4© 2020 g2o, LLC; proprietary and confidential
Data s trategy
Accelerated by best-practice templates, automation, and a deep partner
network, our experts help you prioritize data efforts to capture
opportunities and support your business goals.
Modern data
platforms
We assess, reimagine, and re-platform your data environments, so you can
economically and sustainably leverage advanced capabilities to turn data
from a cost center to a growth engine.
Mas ter data management
We help you ensure the integrity of your essential data assets – especially
your customer and product data, to support guided selling, dynamic
pricing, and personalization.
S ingle view of the
cus tomer
We help you achieve a singular and complete representation of your
customer data, and analyze customer behavior, so you can better target
and personalize customer interactions.
1 2 3 4 5
modern
data
platform
analytics and
actionable
insights
seamless
customer
experience
automation and
personalization
tools
optimization with
on-going
measures and
KPIs
5 components of a data-driven organization
5
4 common gaps
• customer data lacks quality that is needed for analysis or
personalization
• customer data is not organized or accessible to support analytics or
drive experiences
• customer data is siloed across multiple systems and the customer
view is incomplete
• customer data is not integrated into other systems that can
personalize the customer experience
where organizations struggle
6
self-service users are always waiting on their data
© 2019 g2o LLC; proprietary and confidential
data warehouses MDM platforms data lakes
Plagued by gaps in
data governance
limited to a small subset of core data,
not easily accessible to business
analysts
the up-front effort of developing a
schema pushed to the data consumption
team
© 2019 g2o LLC; proprietary and confidential
80% Data Preparation
drivers of self-service analytics
data democratization an integral part of being a data-driven organization
disruptive events COVID-19, Japan earthquake, Asian tsunami
data-driven innovation according to Forrester, between 60% and 73% of all data within an
enterprise goes unused for analytics
© 2019 g2o LLC; proprietary and confidential
1. Not all data needs to be
integrated
2. Data quality is in the eye of the
beholder
3. Combining datasets does not
always need to be an IT project
thinking differently about
data
© 2019 g2o LLC; proprietary and confidential
© 2019 g2o LLC; proprietary and confidential
Sometimes, being
directionally
correct is good
enough
Find the right balance where IT’s charter to govern and
secure data can peacefully coexist with the business’
need for speed to market
The reality is that shadow IT will continue to exist and
truly does serve a purpose for specific analytics use
cases
controlled chaos
© 2019 g2o LLC; proprietary and confidential
1. Data abstraction
2. Zero replication, zero relocation
3. Real-time information
4. Self-service data services
5. Centralized metadata, security &
governance
6. Location-agnostic architecture for
multi-cloud, hybrid acceleration
data virtualization as a self-
service architecture
© 2019 g2o LLC; proprietary and confidential
© 2019 g2o LLC; proprietary and confidential
data virtualization is not just for self-service, it’s
also a first-class citizen when it comes to modern
data platform architectures
15
Gartner – The Evolution of Data Architectures
This is a Second Major Cycle of Analytical Consolidation
Operational Application
Operational Application
Operational Application
IoT Data
Other NewData
Operational
Application
Operational
Application
Cube
Operational
Application
Cube
? Operational Application
Operational Application
Operational Application
IoT Data
Other NewData
1980s
Pre EDW
1990s
EDW
2010s2000s
Post EDW
Time
LDW
Operational
Application
Operational
Application
Operational
Application
Data
Warehouse
Data
Warehouse
Data
Lake
?
LDW
Data Warehouse
Data Lake
Marts
ODS
Staging/Ingest
Unified analysis
› Consolidated data
› "Collect the data"
› Single server, multiple nodes
› More analysis than any
one server can provide
©2018 Gartner, Inc.
Unified analysis
› Logically consolidated view of all data
› "Connect and collect"
› Multiple servers, of multiple nodes
› More analysis than any one system can provide
ID: 342254
Fragmented/
nonexistent analysis
› Multiple sources
› Multiple structured sources
Fragmented analysis
› "Collect the data" (Into
› different repositories)
› New data types,
› processing, requirements
› Uncoordinated views
16
Gartner – Logical Data Architecture
“Adopt the Logical Data Warehouse Architecture to Meet Your Modern Analytical Needs”. Henry Cook, Gartner April 2018
DATA VIRTUALIZATION
17
Data Virtualization – A Data Fabric Layer
Consume
in business applications
Combine
related data into views
Connect
to disparate data sources
2
3
1
DATA CONSUMERS
DISPARATE DATA SOURCES
Enterprise Applications, Reporting, BI, Portals, ESB, Mobile, Web, Users
Databases & Warehouses, Cloud/Saas Applications, Big Data, NoSQL, Web, XML, Excel, PDF, Word...
Analytical Operational
Less StructuredMore Structured
CONNECT COMBINE PUBLISH
Multiple Protocols,
Formats
Query, Search,
Browse
Request/Reply,
Event Driven
Secure
Delivery
SQL,
MDX
Web
Services
Big Data
APIs
Web Automation
and Indexing
CONNECT COMBINE CONSUME
Share, Deliver,
Publish, Govern,
Collaborate
Discover, Transform,
Prepare, Improve
Quality, Integrate
Normalized views of
disparate data
“Data virtualization
integrates disparate
data sources in real
time or near-real
time to meet
demands for
analytics and
transactional data.”
– Create a Road Map For A
Real-time, Agile, Self-
Service Data Platform,
Forrester Research, Dec 16,
2015
18
How Does It Work?
Development
Lifecycle Mgmt
Monitoring &
Audit
Governance
Security
Development
Tools and SDK
Scheduled Tasks
Data Caching
Query Optimizer
JDBC/ODBC/ADO.Net SOAP / REST WS
U
Customer 360
View
Virtual
Data Mart
View
J
Application
Layer
Business
Layer
Unified
View
Unified
View
Unified
View
Unified
View
A
J
J
Derived
View
Derived
View
J
JS
Transformation
& Cleansing
Data
Source
Layer
Base
View
Base
View
Base
View
Base
View
Base
View
Base
View
Base
View
Abstraction
19
Data Virtualization Connects the Users to the Data That They Need
1. Data Virtualization allows you to connect to (almost) any data source
2. You can combine and transform that data into the format needed by the consumer
3. The data can be exposed to the consumers in a format and interface that is usable
by them
• Typically consumers use the tools that they already use – they don’t have to learn new tools
and skills to access the data
4. All of this can be done without copying or moving the data
• The data stays in the original sources (databases, applications, files, etc.) and is retrieved, in
real-time, on demand
Cliffs Notes version (TL;DR)
20
Data Source Connectivity
Relational Databases
• MS SQL*Server (JDBC, ODBC): 2000, 2005, 2008, 2008R2, 2012, 2014,
2016, 2017
• Oracle (JDBC): 8i, 9i, 10g, 11g, 12c, 18c, 19c
• Oracle E-Business Suite (JDBC): 12
• IBM DB2 (JDBC): 8, 9, 10, 11, 12 for LUW; 9,10 for z/OS, AS400
• Informix (JDBC): 7, 12
• Sybase Adaptive Server Enterprise (JDBC): 12, 15
• MySQL (JDBC): 4, 5
• PostgreSQL (JDBC): 8, 9, 10, 11
• Denodo Platform (JDBC): 5.5, 6.0, 7.0, 8.0
- For multi-location architecture deployments
• MS Access (ODBC)
• Apache Derby (JDBC): 10
• Generic (JDBC)
In-Memory Databases
• SAP HANA (JDBC): 1
• Oracle TimesTen (JDBC): 11g
• Oracle 12c In-Memory
• Redis In-memory Cache
Parallel databases and appliances
• GreenPlum (JDBC): 4.2
• HP Vertica (JDBC): 7, 8
• Oracle Exadata (JDBC): X5-2
• ParAccel 8.0.2 (using ParAccel 2.5.0.0 JDBC3g/SSL driver)
• Netezza (JDBC): 4.6, 5.0, 6.0, 7.0
• SybaseIQ (JDBC) 12.x, 15.x
• Teradata (JDBC): 12, 13, 14, 15
• Yellowbrick
Multi-Dimensional Sources
• SAP BW (BAPI/XMLA): 3.x
• SAP BI 7.x (BAPI): 7.x
• Mondrian (XMLA): 3.x
• IBM Cognos TM1
• MS SQL Server Analysis Services 200x
• Essbase (XMLA): 9, 11
Cloud Databases and Data Warehouses
• Amazon Redshift (JDBC)
• Amazon Athena (JDBC)
• Amazon Aurora (JDBC)
• Amazon DynamoDB
• Amazon RDS (JDBC)
• Azure Cosmos DB
• Azure SQL Database
• Azure Synapse Analytics (fka SQL Data Warehouse)
• Databricks Delta Lake
• Google Cloud SQL
• Google BigQuery (JDBC)
• MongoDB Atlas
• Snowflake (JDBC)
Data Lake Storage
• Amazon S3
• Azure Data Lake Storage
• Azure Data Lake Storage Gen 2
• Azure Blob Storage
• Google Cloud Storage
• Parquet (Distributed File System Connector)
• Avro
Big Data
• Apache Hive (JDBC): 0.12, 1.1.0, 1.1.0 for Cloudera 1.2.1
and for Hortonworks 2.0.0
• MapR-XD, MapR-DB, MapR-ES, Hive, and Drill for MapR 6.1
• Amazon Elastic Map-Reduce (EMR)
• Apache HBase (using DenodoConnect connector)
• Impala (JDBC): 2.3
• Google BigTable
• Spark SQL (JDBC): 1.5, 1.6
• Presto (JDBC)
• Databricks 2.x
NoSQL
• MongoDB
• Cassandra
Web Services
• SOAP
• REST (XML, RSS, ATOM, JSON)
• OData v2 and v4
Packaged Applications
• SAP ERP/ECC (BAPIs and RFC tables)
• Oracle E-Business Suite 12
• Siebel
• SAS (SAS JDBC Driver): 7 and higher
Flat and Binary Files
• CSV, pipe-delimited, Regular expression-parsed
• MS Excel xls 97-2003
• MS Excel xlsx 2007 or later
• MS Access
• XML
• JSON
• SAS Files (SAS7BDAT)
All files can be locally accessible or in remote filesystems,
through FTP/ SFTP/FTPS, and in clear, zipped and/or
encrypted format.
Active Directory as source or leveraging security
• LDAP v3
• Microsoft Active Directory 2003, 2008
Cloud, SaaS, Web Sources with Simplified OAuth Security
• Amazon
• Google
• Google Sheets
• Facebook
• LinkedIn
• MS SharePoint (by using the OData connector)
• MS Dynamics 365 Business Central/Customer
Engagement
• Marketo
• ServiceNow
• Salesforce (SOQL)
• NetSuite
• Twitter via APIs with simplified OAuth integration (1.0,
1.0a and 2.0)
• Workday
Indexes and unstructured content
• CMS, file systems, pdf, word, text, email servers,
knowledge bases, indexes
• Elastic Search 6.4, 6.7
Streaming/Messaging Systems
• MQSeries
• SonicMQ
• ActiveMQ
• TIBCO EMS
• Kafka Messaging
• Spark Streams
• IBM Streams
Semantic Repositories
• Semantic repositories in Triple Stores/RDF accessed
through SPARQL endpoints.
• Neo4j Graph Database
Denodo SDK for Custom Connectors
• CouchDB
• Lotus Domino
Web Automation
• Denodo’s ITPilot automates extraction from web
pages
Mainframe
• IMS
• IBM IMS native drivers: 8, 9
• IMS Universal Drivers: 11
Hierarchical databases
• Adabas (SOA Gateway and Denodo’s SOAP
connector): 5, 6
Legacy
• Microsoft FoxPro (ODBC)
The following data sources have been successfully tested
with Denodo using JDBC and ODBC drivers, WS/SOAP
and WS/REST, and DenodoConnect adapters (not
exhaustive list):
• Apache Solr
• IBM BigInsights
• Pivotal HAWQ
21
Protocols and Formats
• SQL Based access via JDBC, ODBC and ADO.NET
• Web Services
• SOAP (XML/JSON)
• REST (JSON/XML)
• OData 2 & 4
• GraphQL
• Open API (a.k.a Swagger)
• Web Parts (for SharePoint), Portlets
• Kafka and JMS listeners for message queues
• Denodo Scheduler for batch process and ‘ETL lite’
Security Options
• Authentication using LDAP or Active Directory
• Kerberos for Single Sign-On (SSO)
• OAuth, OAuth 2.0 (JWT)
• SAML
• SSL/TLS
• WS-Security, X.509 certificates
• Two-Factor Authentication – via identity providers Okta, Duo, etc.
BI/Reporting tools
• Microstrategy, Cognos, Business Objects, Oracle OBIEE
• Tableau, Qlikview, Spotfire, Microsoft PowerBI
• Excel
Analytical Tools/Languages
• SAS, Statistica, SPSS, MatLab
• R, Python, Java, Scala, etc.
• Azure ML Studio, Apache Zeppelin and Jupyter analytics
notebooks
Portals
• SharePoint, Enterprise portals, Web/mobile apps
Enterprise Service Bus
• Oracle Service Bus, Azure Service Bus, TIBCO Active Matrix
Bus
ETL tools
• SAP Data Services, Informatica Powercenter, IBM Data Stage,
Talend ETL
API Management tools
• CA (Layer 7), TIBCO Mashery, Apigee
Publishing Options
22
Decoupling Business and IT
IT: Flexible Source Architecture
Business: Flexible
Tool Choice
IT can now
move at
slower speed
without
affecting the
business
Business can now
make faster and
more
sophisticated
decisions as all
data accessible
by any tool of
choice
23
Multi-cloud future is a reality:
• Risk mitigation
• Mix and match of best of breed tools and
technologies
• Multi-cloud architectures include a mix of
on-premise databases as well
• Organizations won’t be moving to the
cloud overnight and need a layer that
eases the transition
Data Virtualization Enables Cloud Modernization
24
• Data Virtualization has reached the
‘Plateau of Productivity’
• Alternatives are still not mature
enough for mainstream
• Data Lakes still rely on ETL and security
remains a challenge
• ‘No code’ data tools for self-service
(e.g. data Prep tools) have governance
and security issues also.
Data Virtualization is Mainstream…
25
Gartner and Forrester Research Evaluations
Why Denodo?
Forrester Wave: Enterprise Data Virtualization, Q4 2017Forrester Wave: Enterprise Data Fabric, Q2 20202020 Gartner Magic Quadrant for Data Integration Tools
26
Publication Date – 25th August 2020
Gartner Critical Capabilities for Data Integration Tools
Denodo is the only
product with 5.0 score in
Data Virtualization
category
Q&A
What’s your Next? Request a Discovery Session
Learn how to put
Data Virtualization to work
in your organization!
pages.denodo.com/g2orequest.html
REGISTER NOW
Thank you for joining us!
© 2020 g2o LLC; proprietary and confidential

More Related Content

What's hot (20)

PPTX
Graph databases
Vinoth Kannan
 
PDF
Data Modeling & Metadata Management
DATAVERSITY
 
PDF
Introduction to column oriented databases
ArangoDB Database
 
PDF
Modern Data architecture Design
Kujambu Murugesan
 
PDF
The ABCs of Treating Data as Product
DATAVERSITY
 
PPTX
Data quality problem and solution
Punk Milton
 
PPTX
Building an Effective Data Warehouse Architecture
James Serra
 
PDF
Data Architecture Best Practices for Advanced Analytics
DATAVERSITY
 
PDF
Data Mesh
Piethein Strengholt
 
PDF
DAS Slides: Best Practices in Metadata Management
DATAVERSITY
 
PPTX
Data Virtualization: An Introduction
Denodo
 
PPTX
Capability Model_Data Governance
Steve Novak
 
PPTX
Oracle to Azure PostgreSQL database migration webinar
Minnie Seungmin Cho
 
PDF
Why an AI-Powered Data Catalog Tool is Critical to Business Success
Informatica
 
PDF
Architect’s Open-Source Guide for a Data Mesh Architecture
Databricks
 
PPTX
Data Lakehouse Symposium | Day 4
Databricks
 
PDF
Building an Effective Data & Analytics Operating Model A Data Modernization G...
Mark Hewitt
 
PDF
DataOps for the Modern Data Warehouse on Microsoft Azure @ NDCOslo 2020 - Lac...
Lace Lofranco
 
PDF
Enabling a Data Mesh Architecture with Data Virtualization
Denodo
 
PDF
201905 Azure Databricks for Machine Learning
Mark Tabladillo
 
Graph databases
Vinoth Kannan
 
Data Modeling & Metadata Management
DATAVERSITY
 
Introduction to column oriented databases
ArangoDB Database
 
Modern Data architecture Design
Kujambu Murugesan
 
The ABCs of Treating Data as Product
DATAVERSITY
 
Data quality problem and solution
Punk Milton
 
Building an Effective Data Warehouse Architecture
James Serra
 
Data Architecture Best Practices for Advanced Analytics
DATAVERSITY
 
DAS Slides: Best Practices in Metadata Management
DATAVERSITY
 
Data Virtualization: An Introduction
Denodo
 
Capability Model_Data Governance
Steve Novak
 
Oracle to Azure PostgreSQL database migration webinar
Minnie Seungmin Cho
 
Why an AI-Powered Data Catalog Tool is Critical to Business Success
Informatica
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Databricks
 
Data Lakehouse Symposium | Day 4
Databricks
 
Building an Effective Data & Analytics Operating Model A Data Modernization G...
Mark Hewitt
 
DataOps for the Modern Data Warehouse on Microsoft Azure @ NDCOslo 2020 - Lac...
Lace Lofranco
 
Enabling a Data Mesh Architecture with Data Virtualization
Denodo
 
201905 Azure Databricks for Machine Learning
Mark Tabladillo
 

Similar to Self Service Analytics and a Modern Data Architecture with Data Virtualization (US) (20)

PDF
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
Denodo
 
PDF
Self-Service Analytics with Guard Rails
Denodo
 
PDF
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Matt Stubbs
 
PDF
Future of Data Strategy (ASEAN)
Denodo
 
PDF
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
Denodo
 
PDF
Connecting Silos in Real Time with Data Virtualization
Denodo
 
PDF
Bridging the Last Mile: Getting Data to the People Who Need It
Denodo
 
PPTX
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
Denodo
 
PDF
6 Solution Patterns for Accelerating Self-Service BI, Cloud, Big Data, and Ot...
Denodo
 
PPTX
Deutsche Telekom on Big Data
DataWorks Summit
 
PDF
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
DATAVERSITY
 
PDF
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Denodo
 
PDF
Data Virtualization - Enabling Next Generation Analytics
Denodo
 
PDF
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
Denodo
 
PDF
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
CCG
 
PPTX
Finding business value in Big Data
James Serra
 
PDF
Unlock Your Data for ML & AI using Data Virtualization
Denodo
 
PDF
Hadoop 2.0: YARN to Further Optimize Data Processing
Hortonworks
 
PDF
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
Denodo
 
PDF
Analytical Innovation: How to Build the Next Generation Data Platform
VMware Tanzu
 
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
Denodo
 
Self-Service Analytics with Guard Rails
Denodo
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Matt Stubbs
 
Future of Data Strategy (ASEAN)
Denodo
 
¿Cómo modernizar una arquitectura de TI con la virtualización de datos?
Denodo
 
Connecting Silos in Real Time with Data Virtualization
Denodo
 
Bridging the Last Mile: Getting Data to the People Who Need It
Denodo
 
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
Denodo
 
6 Solution Patterns for Accelerating Self-Service BI, Cloud, Big Data, and Ot...
Denodo
 
Deutsche Telekom on Big Data
DataWorks Summit
 
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
DATAVERSITY
 
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Denodo
 
Data Virtualization - Enabling Next Generation Analytics
Denodo
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
Denodo
 
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
CCG
 
Finding business value in Big Data
James Serra
 
Unlock Your Data for ML & AI using Data Virtualization
Denodo
 
Hadoop 2.0: YARN to Further Optimize Data Processing
Hortonworks
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
Denodo
 
Analytical Innovation: How to Build the Next Generation Data Platform
VMware Tanzu
 
Ad

More from Denodo (20)

PDF
Enterprise Monitoring and Auditing in Denodo
Denodo
 
PDF
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Denodo
 
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
Denodo
 
PDF
What you need to know about Generative AI and Data Management?
Denodo
 
PDF
Mastering Data Compliance in a Dynamic Business Landscape
Denodo
 
PDF
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Denodo
 
PDF
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Denodo
 
PDF
Drive Data Privacy Regulatory Compliance
Denodo
 
PDF
Знакомство с виртуализацией данных для профессионалов в области данных
Denodo
 
PDF
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo
 
PDF
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Denodo
 
PDF
Lunch and Learn ANZ: Key Takeaways for 2023!
Denodo
 
PDF
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Denodo
 
PDF
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Denodo
 
PDF
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
Denodo
 
PDF
How to Build Your Data Marketplace with Data Virtualization?
Denodo
 
PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Denodo
 
PDF
Enabling Data Catalog users with advanced usability
Denodo
 
PDF
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
Denodo
 
PDF
GenAI y el futuro de la gestión de datos: mitos y realidades
Denodo
 
Enterprise Monitoring and Auditing in Denodo
Denodo
 
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Denodo
 
Achieving Self-Service Analytics with a Governed Data Services Layer
Denodo
 
What you need to know about Generative AI and Data Management?
Denodo
 
Mastering Data Compliance in a Dynamic Business Landscape
Denodo
 
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Denodo
 
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Denodo
 
Drive Data Privacy Regulatory Compliance
Denodo
 
Знакомство с виртуализацией данных для профессионалов в области данных
Denodo
 
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo
 
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Denodo
 
Lunch and Learn ANZ: Key Takeaways for 2023!
Denodo
 
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Denodo
 
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Denodo
 
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
Denodo
 
How to Build Your Data Marketplace with Data Virtualization?
Denodo
 
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Denodo
 
Enabling Data Catalog users with advanced usability
Denodo
 
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
Denodo
 
GenAI y el futuro de la gestión de datos: mitos y realidades
Denodo
 
Ad

Recently uploaded (20)

PPTX
01_Nico Vincent_Sailpeak.pptx_AI_Barometer_2025
FinTech Belgium
 
PPTX
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
PDF
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
PPTX
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
PDF
NIS2 Compliance for MSPs: Roadmap, Benefits & Cybersecurity Trends (2025 Guide)
GRC Kompas
 
PDF
The Best NVIDIA GPUs for LLM Inference in 2025.pdf
Tamanna36
 
PDF
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
PPTX
04_Tamás Marton_Intuitech .pptx_AI_Barometer_2025
FinTech Belgium
 
PPTX
BinarySearchTree in datastructures in detail
kichokuttu
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
PDF
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
PDF
Data Science Course Certificate by Sigma Software University
Stepan Kalika
 
PPTX
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
PPTX
big data eco system fundamentals of data science
arivukarasi
 
PPTX
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
PDF
Using AI/ML for Space Biology Research
VICTOR MAESTRE RAMIREZ
 
PDF
apidays Singapore 2025 - Surviving an interconnected world with API governanc...
apidays
 
PDF
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
PDF
Optimizing Large Language Models with vLLM and Related Tools.pdf
Tamanna36
 
01_Nico Vincent_Sailpeak.pptx_AI_Barometer_2025
FinTech Belgium
 
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
NIS2 Compliance for MSPs: Roadmap, Benefits & Cybersecurity Trends (2025 Guide)
GRC Kompas
 
The Best NVIDIA GPUs for LLM Inference in 2025.pdf
Tamanna36
 
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
04_Tamás Marton_Intuitech .pptx_AI_Barometer_2025
FinTech Belgium
 
BinarySearchTree in datastructures in detail
kichokuttu
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
Data Science Course Certificate by Sigma Software University
Stepan Kalika
 
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
big data eco system fundamentals of data science
arivukarasi
 
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
Using AI/ML for Space Biology Research
VICTOR MAESTRE RAMIREZ
 
apidays Singapore 2025 - Surviving an interconnected world with API governanc...
apidays
 
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
Optimizing Large Language Models with vLLM and Related Tools.pdf
Tamanna36
 

Self Service Analytics and a Modern Data Architecture with Data Virtualization (US)

  • 2. © 2019 g2o LLC; proprietary and confidential Uttam Channegowda Practice Director Big Data and Data Engineering g2o Speakers Click icon to add picture Paul Moxon SVP Data Architectures Denodo Technologies
  • 3. about g2o 3 400+ analysts, designers, developers, engineers, researchers, and strategists Largest digital experience and technology shop based in Ohio technology, data, experience design We stay close to our clients for speed, efficiency, and deep collaboration 25+ YEARS EXPERIENCE © 2020 g2o, LLC; proprietary and confidential
  • 4. strategically organizing and architecting your data 4© 2020 g2o, LLC; proprietary and confidential Data s trategy Accelerated by best-practice templates, automation, and a deep partner network, our experts help you prioritize data efforts to capture opportunities and support your business goals. Modern data platforms We assess, reimagine, and re-platform your data environments, so you can economically and sustainably leverage advanced capabilities to turn data from a cost center to a growth engine. Mas ter data management We help you ensure the integrity of your essential data assets – especially your customer and product data, to support guided selling, dynamic pricing, and personalization. S ingle view of the cus tomer We help you achieve a singular and complete representation of your customer data, and analyze customer behavior, so you can better target and personalize customer interactions.
  • 5. 1 2 3 4 5 modern data platform analytics and actionable insights seamless customer experience automation and personalization tools optimization with on-going measures and KPIs 5 components of a data-driven organization 5
  • 6. 4 common gaps • customer data lacks quality that is needed for analysis or personalization • customer data is not organized or accessible to support analytics or drive experiences • customer data is siloed across multiple systems and the customer view is incomplete • customer data is not integrated into other systems that can personalize the customer experience where organizations struggle 6
  • 7. self-service users are always waiting on their data © 2019 g2o LLC; proprietary and confidential data warehouses MDM platforms data lakes Plagued by gaps in data governance limited to a small subset of core data, not easily accessible to business analysts the up-front effort of developing a schema pushed to the data consumption team
  • 8. © 2019 g2o LLC; proprietary and confidential 80% Data Preparation
  • 9. drivers of self-service analytics data democratization an integral part of being a data-driven organization disruptive events COVID-19, Japan earthquake, Asian tsunami data-driven innovation according to Forrester, between 60% and 73% of all data within an enterprise goes unused for analytics © 2019 g2o LLC; proprietary and confidential
  • 10. 1. Not all data needs to be integrated 2. Data quality is in the eye of the beholder 3. Combining datasets does not always need to be an IT project thinking differently about data © 2019 g2o LLC; proprietary and confidential
  • 11. © 2019 g2o LLC; proprietary and confidential Sometimes, being directionally correct is good enough
  • 12. Find the right balance where IT’s charter to govern and secure data can peacefully coexist with the business’ need for speed to market The reality is that shadow IT will continue to exist and truly does serve a purpose for specific analytics use cases controlled chaos © 2019 g2o LLC; proprietary and confidential
  • 13. 1. Data abstraction 2. Zero replication, zero relocation 3. Real-time information 4. Self-service data services 5. Centralized metadata, security & governance 6. Location-agnostic architecture for multi-cloud, hybrid acceleration data virtualization as a self- service architecture © 2019 g2o LLC; proprietary and confidential
  • 14. © 2019 g2o LLC; proprietary and confidential data virtualization is not just for self-service, it’s also a first-class citizen when it comes to modern data platform architectures
  • 15. 15 Gartner – The Evolution of Data Architectures This is a Second Major Cycle of Analytical Consolidation Operational Application Operational Application Operational Application IoT Data Other NewData Operational Application Operational Application Cube Operational Application Cube ? Operational Application Operational Application Operational Application IoT Data Other NewData 1980s Pre EDW 1990s EDW 2010s2000s Post EDW Time LDW Operational Application Operational Application Operational Application Data Warehouse Data Warehouse Data Lake ? LDW Data Warehouse Data Lake Marts ODS Staging/Ingest Unified analysis › Consolidated data › "Collect the data" › Single server, multiple nodes › More analysis than any one server can provide ©2018 Gartner, Inc. Unified analysis › Logically consolidated view of all data › "Connect and collect" › Multiple servers, of multiple nodes › More analysis than any one system can provide ID: 342254 Fragmented/ nonexistent analysis › Multiple sources › Multiple structured sources Fragmented analysis › "Collect the data" (Into › different repositories) › New data types, › processing, requirements › Uncoordinated views
  • 16. 16 Gartner – Logical Data Architecture “Adopt the Logical Data Warehouse Architecture to Meet Your Modern Analytical Needs”. Henry Cook, Gartner April 2018 DATA VIRTUALIZATION
  • 17. 17 Data Virtualization – A Data Fabric Layer Consume in business applications Combine related data into views Connect to disparate data sources 2 3 1 DATA CONSUMERS DISPARATE DATA SOURCES Enterprise Applications, Reporting, BI, Portals, ESB, Mobile, Web, Users Databases & Warehouses, Cloud/Saas Applications, Big Data, NoSQL, Web, XML, Excel, PDF, Word... Analytical Operational Less StructuredMore Structured CONNECT COMBINE PUBLISH Multiple Protocols, Formats Query, Search, Browse Request/Reply, Event Driven Secure Delivery SQL, MDX Web Services Big Data APIs Web Automation and Indexing CONNECT COMBINE CONSUME Share, Deliver, Publish, Govern, Collaborate Discover, Transform, Prepare, Improve Quality, Integrate Normalized views of disparate data “Data virtualization integrates disparate data sources in real time or near-real time to meet demands for analytics and transactional data.” – Create a Road Map For A Real-time, Agile, Self- Service Data Platform, Forrester Research, Dec 16, 2015
  • 18. 18 How Does It Work? Development Lifecycle Mgmt Monitoring & Audit Governance Security Development Tools and SDK Scheduled Tasks Data Caching Query Optimizer JDBC/ODBC/ADO.Net SOAP / REST WS U Customer 360 View Virtual Data Mart View J Application Layer Business Layer Unified View Unified View Unified View Unified View A J J Derived View Derived View J JS Transformation & Cleansing Data Source Layer Base View Base View Base View Base View Base View Base View Base View Abstraction
  • 19. 19 Data Virtualization Connects the Users to the Data That They Need 1. Data Virtualization allows you to connect to (almost) any data source 2. You can combine and transform that data into the format needed by the consumer 3. The data can be exposed to the consumers in a format and interface that is usable by them • Typically consumers use the tools that they already use – they don’t have to learn new tools and skills to access the data 4. All of this can be done without copying or moving the data • The data stays in the original sources (databases, applications, files, etc.) and is retrieved, in real-time, on demand Cliffs Notes version (TL;DR)
  • 20. 20 Data Source Connectivity Relational Databases • MS SQL*Server (JDBC, ODBC): 2000, 2005, 2008, 2008R2, 2012, 2014, 2016, 2017 • Oracle (JDBC): 8i, 9i, 10g, 11g, 12c, 18c, 19c • Oracle E-Business Suite (JDBC): 12 • IBM DB2 (JDBC): 8, 9, 10, 11, 12 for LUW; 9,10 for z/OS, AS400 • Informix (JDBC): 7, 12 • Sybase Adaptive Server Enterprise (JDBC): 12, 15 • MySQL (JDBC): 4, 5 • PostgreSQL (JDBC): 8, 9, 10, 11 • Denodo Platform (JDBC): 5.5, 6.0, 7.0, 8.0 - For multi-location architecture deployments • MS Access (ODBC) • Apache Derby (JDBC): 10 • Generic (JDBC) In-Memory Databases • SAP HANA (JDBC): 1 • Oracle TimesTen (JDBC): 11g • Oracle 12c In-Memory • Redis In-memory Cache Parallel databases and appliances • GreenPlum (JDBC): 4.2 • HP Vertica (JDBC): 7, 8 • Oracle Exadata (JDBC): X5-2 • ParAccel 8.0.2 (using ParAccel 2.5.0.0 JDBC3g/SSL driver) • Netezza (JDBC): 4.6, 5.0, 6.0, 7.0 • SybaseIQ (JDBC) 12.x, 15.x • Teradata (JDBC): 12, 13, 14, 15 • Yellowbrick Multi-Dimensional Sources • SAP BW (BAPI/XMLA): 3.x • SAP BI 7.x (BAPI): 7.x • Mondrian (XMLA): 3.x • IBM Cognos TM1 • MS SQL Server Analysis Services 200x • Essbase (XMLA): 9, 11 Cloud Databases and Data Warehouses • Amazon Redshift (JDBC) • Amazon Athena (JDBC) • Amazon Aurora (JDBC) • Amazon DynamoDB • Amazon RDS (JDBC) • Azure Cosmos DB • Azure SQL Database • Azure Synapse Analytics (fka SQL Data Warehouse) • Databricks Delta Lake • Google Cloud SQL • Google BigQuery (JDBC) • MongoDB Atlas • Snowflake (JDBC) Data Lake Storage • Amazon S3 • Azure Data Lake Storage • Azure Data Lake Storage Gen 2 • Azure Blob Storage • Google Cloud Storage • Parquet (Distributed File System Connector) • Avro Big Data • Apache Hive (JDBC): 0.12, 1.1.0, 1.1.0 for Cloudera 1.2.1 and for Hortonworks 2.0.0 • MapR-XD, MapR-DB, MapR-ES, Hive, and Drill for MapR 6.1 • Amazon Elastic Map-Reduce (EMR) • Apache HBase (using DenodoConnect connector) • Impala (JDBC): 2.3 • Google BigTable • Spark SQL (JDBC): 1.5, 1.6 • Presto (JDBC) • Databricks 2.x NoSQL • MongoDB • Cassandra Web Services • SOAP • REST (XML, RSS, ATOM, JSON) • OData v2 and v4 Packaged Applications • SAP ERP/ECC (BAPIs and RFC tables) • Oracle E-Business Suite 12 • Siebel • SAS (SAS JDBC Driver): 7 and higher Flat and Binary Files • CSV, pipe-delimited, Regular expression-parsed • MS Excel xls 97-2003 • MS Excel xlsx 2007 or later • MS Access • XML • JSON • SAS Files (SAS7BDAT) All files can be locally accessible or in remote filesystems, through FTP/ SFTP/FTPS, and in clear, zipped and/or encrypted format. Active Directory as source or leveraging security • LDAP v3 • Microsoft Active Directory 2003, 2008 Cloud, SaaS, Web Sources with Simplified OAuth Security • Amazon • Google • Google Sheets • Facebook • LinkedIn • MS SharePoint (by using the OData connector) • MS Dynamics 365 Business Central/Customer Engagement • Marketo • ServiceNow • Salesforce (SOQL) • NetSuite • Twitter via APIs with simplified OAuth integration (1.0, 1.0a and 2.0) • Workday Indexes and unstructured content • CMS, file systems, pdf, word, text, email servers, knowledge bases, indexes • Elastic Search 6.4, 6.7 Streaming/Messaging Systems • MQSeries • SonicMQ • ActiveMQ • TIBCO EMS • Kafka Messaging • Spark Streams • IBM Streams Semantic Repositories • Semantic repositories in Triple Stores/RDF accessed through SPARQL endpoints. • Neo4j Graph Database Denodo SDK for Custom Connectors • CouchDB • Lotus Domino Web Automation • Denodo’s ITPilot automates extraction from web pages Mainframe • IMS • IBM IMS native drivers: 8, 9 • IMS Universal Drivers: 11 Hierarchical databases • Adabas (SOA Gateway and Denodo’s SOAP connector): 5, 6 Legacy • Microsoft FoxPro (ODBC) The following data sources have been successfully tested with Denodo using JDBC and ODBC drivers, WS/SOAP and WS/REST, and DenodoConnect adapters (not exhaustive list): • Apache Solr • IBM BigInsights • Pivotal HAWQ
  • 21. 21 Protocols and Formats • SQL Based access via JDBC, ODBC and ADO.NET • Web Services • SOAP (XML/JSON) • REST (JSON/XML) • OData 2 & 4 • GraphQL • Open API (a.k.a Swagger) • Web Parts (for SharePoint), Portlets • Kafka and JMS listeners for message queues • Denodo Scheduler for batch process and ‘ETL lite’ Security Options • Authentication using LDAP or Active Directory • Kerberos for Single Sign-On (SSO) • OAuth, OAuth 2.0 (JWT) • SAML • SSL/TLS • WS-Security, X.509 certificates • Two-Factor Authentication – via identity providers Okta, Duo, etc. BI/Reporting tools • Microstrategy, Cognos, Business Objects, Oracle OBIEE • Tableau, Qlikview, Spotfire, Microsoft PowerBI • Excel Analytical Tools/Languages • SAS, Statistica, SPSS, MatLab • R, Python, Java, Scala, etc. • Azure ML Studio, Apache Zeppelin and Jupyter analytics notebooks Portals • SharePoint, Enterprise portals, Web/mobile apps Enterprise Service Bus • Oracle Service Bus, Azure Service Bus, TIBCO Active Matrix Bus ETL tools • SAP Data Services, Informatica Powercenter, IBM Data Stage, Talend ETL API Management tools • CA (Layer 7), TIBCO Mashery, Apigee Publishing Options
  • 22. 22 Decoupling Business and IT IT: Flexible Source Architecture Business: Flexible Tool Choice IT can now move at slower speed without affecting the business Business can now make faster and more sophisticated decisions as all data accessible by any tool of choice
  • 23. 23 Multi-cloud future is a reality: • Risk mitigation • Mix and match of best of breed tools and technologies • Multi-cloud architectures include a mix of on-premise databases as well • Organizations won’t be moving to the cloud overnight and need a layer that eases the transition Data Virtualization Enables Cloud Modernization
  • 24. 24 • Data Virtualization has reached the ‘Plateau of Productivity’ • Alternatives are still not mature enough for mainstream • Data Lakes still rely on ETL and security remains a challenge • ‘No code’ data tools for self-service (e.g. data Prep tools) have governance and security issues also. Data Virtualization is Mainstream…
  • 25. 25 Gartner and Forrester Research Evaluations Why Denodo? Forrester Wave: Enterprise Data Virtualization, Q4 2017Forrester Wave: Enterprise Data Fabric, Q2 20202020 Gartner Magic Quadrant for Data Integration Tools
  • 26. 26 Publication Date – 25th August 2020 Gartner Critical Capabilities for Data Integration Tools Denodo is the only product with 5.0 score in Data Virtualization category
  • 27. Q&A
  • 28. What’s your Next? Request a Discovery Session Learn how to put Data Virtualization to work in your organization! pages.denodo.com/g2orequest.html REGISTER NOW
  • 29. Thank you for joining us! © 2020 g2o LLC; proprietary and confidential