SlideShare a Scribd company logo
DENODO LUNCH & LEARN
23 NOVEMBER
BUILDING A LOGICAL DATA FABRIC
USING DATA VIRTUALIZATION
Presenters for this Session
Chris Day
Director, APAC Sales Engineering, Denodo
Regional Vice President, Sales, ASEAN & Korea, Denodo
Elaine Chan
Agenda
1. Data Lakes and Data Warehouses
2. Observations From the Recent TDWI Report
3. What is a Logical Data Fabric ? (Deep Dive)
4. How Does This Apply to a Data Warehouse/Data Lake?
5. Customer Case Study - Autodesk
6. Conclusions
7. Product Demo
8. Q & A and Next Steps
Building A Logical Data Fabric Using
Data Virtualization
Regional Vice President, Sales, ASEAN & Korea, Denodo
Elaine Chan
5
A Brief History
6
What is a Data Warehouse ?
In computing, a data warehouse (DW or DWH), also known as an enterprise
data warehouse (EDW), is a system used for reporting and data analysis and
is considered a core component of business intelligence.[1] DWs are central
repositories of integrated data from one or more disparate sources. They
store current and historical data in one single place[2] that are used for
creating analytical reports for workers throughout the enterprise.[3]
The data stored in the warehouse is uploaded from the operational
systems (such as marketing or sales). The data may pass through
an operational data store and may require data cleansing[2] for additional
operations to ensure data quality before it is used in the DW for reporting.
Source: https://en.wikipedia.org/wiki/Data_warehouse
7
Etymology of “Data Lake”
Pentaho’s CTO James Dixon is credited with coining the term "data lake".
He described it in his blog in 2010:
“If you think of a data mart as a store of bottled water –
cleansed and packaged and structured for easy consumption
– the data lake is a large body of water in a more natural
state. The contents of the data lake stream in from a source
to fill the lake, and various users of the lake can come to
examine, dive in, or take samples.”
Source: https://jamesdixon.wordpress.com/2010/10/14/pentaho-hadoop-and-data-lakes/
8
Can We Place All Data Into a Single Cloud System?
Taking advantage of the current shift to the cloud, couldn't we
simply consolidate all data in a single system, like a data lake or a
“lakehouse”?
§ They are (relatively) cheap and scale out well for large data
volumes
§ Is that realistic?
§ Is that possible?
9
Do I Need Both a Data Lake and a Data Warehouse?
Data Warehouses:
Typically contain structured data
and more often than not are on
premise
• Top use case BI and Analytics
Data Lakes:
Often contain unstructured data
and typically cloud based.
• Top use case Data Science
2021 TDWI report Building the Unified Data Warehouse and Data Lake BEST PRACTICES REPORT By Fern Halper, Ph.D., and James Kobielus
10
Do I Really Need Both a Data Lake and a Data Warehouse?
2021 TDWI report Building the Unified Data Warehouse and Data Lake BEST PRACTICES REPORT By Fern Halper, Ph.D., and James Kobielus
11
Why?
2021 TDWI report Building the Unified Data Warehouse and Data Lake BEST PRACTICES REPORT By Fern Halper, Ph.D., and James Kobielus
12
Data lakes were born to efficiently address
the challenge of cost reduction:
Data lakes allow for cheap, efficient storage
of very large amounts of data.
Cloud implementation simplified the
complexity of managing a large data lake.
13
…Data lakes lack semantic consistency and governed
metadata. Meeting the needs of wider audiences require
curated repositories with governance, semantic
consistency and access controls.”
14
So How Are Organizations Going About It?
2021 TDWI report Building the Unified Data Warehouse and Data Lake BEST PRACTICES REPORT By Fern Halper, Ph.D., and James Kobielus
15
Final Observations From the Report
Architects own the overall design. It is no
surprise that architects are the top
contributors to the design of the data
warehousing environment. This includes data
warehouse architects (49%), enterprise
architects (43%), and IT architects (21%).
Data scientists (56%) are the top
contributor of various
components to the unified environment.
2021 TDWI report Building the Unified Data Warehouse and Data Lake BEST PRACTICES REPORT By Fern Halper, Ph.D., and James Kobielus
What is a Logical Data Fabric?
17
A data fabric is an architecture pattern that informs and automates the design, integration
and deployment of data objects regardless of deployment platforms and architectural
approaches.
It utilizes continuous analytics and AI/ML over all metadata assets to provide actionable
insights and recommendations on data management and integration design and
deployment patterns.
This results in faster, informed and, in some cases, completely automated data access and
sharing.
Data Fabric Definition
18
Data Fabric
A data fabric is an architecture pattern that informs and automates the design, integration and deployment
of data objects regardless of deployment platforms and architectural approaches
§ It utilizes continuous analytics and AI/ML over all metadata assets to provide actionable insights
and recommendations on data management and integration design and deployment patterns.
§ This results in faster, informed and, in some cases, completely automated data access and sharing
§ Strongly supported by both Gartner and Forrester
Data Fabric Net
Compounds Customers Products Claims
RDBMS/OLTP Traditional Analytics/BI Data Lakes Cloud Data Stores Apps and Document
Repositories
Flat Files
Third Party
Legacy
Mart
Data Warehouse
Mart
ETL ETL
XML • JSON • PDF
DOC • WEB
19
Logical Data Fabric
Demystifying the Data Fabric,
September 2020
The core of the matter is being
able to consolidate many diverse
data sources in an efficient
manner by allowing trusted data
to be delivered from all relevant
data sources to all relevant data
consumers through one
common layer.
20
Logical Data Fabric
• Data Abstraction: decoupling
applications/data usage from data
sources
• Data Integration without replication
or relocation of physical data
• Easy Access to Any Data, high
performant and real-time/ right-
time
• Data Catalog for self-service data
services and easy discovery
• Unified metadata, security &
governance across all data assets
• Data Delivery in any format with
intelligent query optimization that
leverages new and existing
physical data platforms
A logical data layer – a “logical data fabric” – that provides high-performant, real-time, and secure
access to integrated business views of disparate data across the enterprise
How Does This Apply to a
Data Warehouse/Data Lake ?
22
Logical Data Fabric Reference Architecture
LOGICAL DATA FABRIC
23
Cloud
Applications
Salesforce.com
AWS Ecosystem
AWS Data Sources
Amazon Aurora Amazon Dynamo DB
Amazon Redshift
Amazon Athena Amazon S3
Data Consumers
Amazon QuickSight
OnPrem Data
Sources
MS Excel
Other Applications
SAP Data Sources
Example: Denodo in Multiple Locations
Azure Ecosystem
Denodo Data
Virtualization
Data Catalog
SQL
API
Denodo Data
Virtualization
SQL
API
Azure Data Sources
Azure Synapse Azure Delta Lake
Customer Case Studies
25
v
Autodesk Overview
• Founded 1982 (NASDAQ: ASDK)
• Annual revenues (FY 2018) $2.06B
§ Over 8,800 employees
• 3D modeling and animation software
§ Flagship product is AutoCAD
• Market sectors:
§ Architecture, Engineering, and Construction
§ Manufacturing
§ Media and Entertainment
§ Recently started 3D Printing offerings
26
v
Business Drivers for Change
• Software consumption model is changing
§ Perpetual licenses to subscriptions
§ User want more flexibility in how they use software
• Autodesk needed to transition to subscription
pricing
§ 2016 – some products will be subscription only
• Lifetime revenue higher with subscriptions
§ Over 3-5 years, subscriptions = more revenue
• Changing a licensing model is disruptive
27
v
Technology Challenges
• Current ‘traditional’ BI/EDW architecture not
designed for data streams from online apps
§ Weblogs, Clickstreams, Cloud/Desktop apps, etc.
• Existing infrastructure can’t simply ‘go away’
§ Regulatory reporting (e.g. SEC)
§ Existing ‘perpetual’ customers
• ‘Subscription’ infrastructure work in parallel
§ Extend and enhance existing systems
§ With single access point to all data
• Solution – ‘Logical Data Warehouse/Fabric’
28
Logical Data Warehouse
29
Logical Data Warehouse
30
Logical Data Warehouse
31
Logical Data Warehouse
32
v
Autodesk Successfully Changes Their Revenue Model and
Transforms Business
§ Autodesk was changing their business
revenue model from a conventional
perpetual license model to subscription-
based license model.
§ Inability to deliver high quality data in a
timely manner to business stakeholders.
§ Evolution from traditional operational
data warehouse to contemporary logical
data warehouse deemed necessary for
faster speed.
§ Successfully transitioned to subscription-
based licensing.
§ For the first time, Autodesk can do single
point security enforcement and have
uniform data environment for access.
§ General purpose platform to deliver data
through logical data warehouse.
§ Denodo Abstraction Layer helps live
invoicing with SAP.
§ Data virtualization enabled a culture of
“see before you build”.
32
Autodesk, Inc. is an American multinational software corporation that makes software for the
architecture, engineering, construction, manufacturing, media, and entertainment industries.
Case Study
Conclusions
34
A Logical Data Fabric
§ Pillar 1 — Integrates data across multi-cloud environments
§ Pillar 2 - Automates manual tasks using augmented intelligence
§ Pillar 3 - Boosts performance of analytics with rapid data delivery
§ Pillar 4 - Supports data discovery and data science initiatives
§ Pillar 5 - Analyzes across data at rest and data in motion
§ Pillar 6 - Catalogs all data for discovery, lineage, and associations
TDWI Checklist Report: Six Critical Capabilities of a Logical Data Fabric (May 2020) By Fern Halper and David Loshin - denodo.link/tdwi5
Product Demonstration
Director, APAC Sales Engineering, Denodo
Chris Day
Q&A
Next Steps
38
denodo.link/TD2111
Featuring Leading Industry Experts
Angel Vina
Founder & CEO
Alberto Pan
Executive VP & CTO
Ravi Shankar
Senior VP & CMO
David Loshin
President of Knowledge Integrity
Terry Moon
Enterprise Information Architect
Logical Data Fabric: The Future of
Data Management and Analytics
Michele Goetz
VP & Principal Analyst
denodo.link/DF2111
AVAILABLE ON DEMAND
Thanks!
www.denodo.com info@denodo.com
© Copyright Denodo Technologies. All rights reserved
Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical,
including photocopying and microfilm, without prior the written authorization from Denodo Technologies.

More Related Content

What's hot (20)

PDF
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
PDF
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
PDF
The Oracle RAC Family of Solutions - Presentation
Markus Michalewicz
 
PPTX
Databricks Fundamentals
Dalibor Wijas
 
PDF
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
 
PDF
Databricks Delta Lake and Its Benefits
Databricks
 
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r1)
James Serra
 
PDF
Achieving Lakehouse Models with Spark 3.0
Databricks
 
PPTX
Azure Data Factory
HARIHARAN R
 
PPTX
DW Migration Webinar-March 2022.pptx
Databricks
 
PDF
Making Apache Spark Better with Delta Lake
Databricks
 
PDF
Enabling a Data Mesh Architecture with Data Virtualization
Denodo
 
PDF
Getting Started with Delta Lake on Databricks
Knoldus Inc.
 
PPTX
Data Vault Overview
Empowered Holdings, LLC
 
PDF
Azure Data Factory V2; The Data Flows
Thomas Sykes
 
PPTX
Data Lake Overview
James Serra
 
PPTX
Building a modern data warehouse
James Serra
 
PPTX
Building an Effective Data Warehouse Architecture
James Serra
 
PPTX
Modern Data Warehousing with the Microsoft Analytics Platform System
James Serra
 
PPTX
Data Warehousing in the Cloud: Practical Migration Strategies
SnapLogic
 
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
The Oracle RAC Family of Solutions - Presentation
Markus Michalewicz
 
Databricks Fundamentals
Dalibor Wijas
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
 
Databricks Delta Lake and Its Benefits
Databricks
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
James Serra
 
Achieving Lakehouse Models with Spark 3.0
Databricks
 
Azure Data Factory
HARIHARAN R
 
DW Migration Webinar-March 2022.pptx
Databricks
 
Making Apache Spark Better with Delta Lake
Databricks
 
Enabling a Data Mesh Architecture with Data Virtualization
Denodo
 
Getting Started with Delta Lake on Databricks
Knoldus Inc.
 
Data Vault Overview
Empowered Holdings, LLC
 
Azure Data Factory V2; The Data Flows
Thomas Sykes
 
Data Lake Overview
James Serra
 
Building a modern data warehouse
James Serra
 
Building an Effective Data Warehouse Architecture
James Serra
 
Modern Data Warehousing with the Microsoft Analytics Platform System
James Serra
 
Data Warehousing in the Cloud: Practical Migration Strategies
SnapLogic
 

Similar to Building a Logical Data Fabric using Data Virtualization (ASEAN) (20)

PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r2)
James Serra
 
PDF
Logical Data Fabric and Data Mesh – Driving Business Outcomes
Denodo
 
PDF
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
Denodo
 
PDF
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Denodo
 
PDF
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Denodo
 
PPTX
Chap3-Data Warehousing and OLAP operations..pptx
stuti8985
 
PPTX
Business Intelligence Module 3_Datawarehousing.pptx
AmbikaVenkatesh4
 
PDF
A Logical Architecture is Always a Flexible Architecture (ASEAN)
Denodo
 
PDF
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Denodo
 
PPT
Data Warehouse Introduction to Data Warehouse
MSridhar18
 
PDF
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Denodo
 
PDF
Whitepaper-The-Data-Lake-3_0
Jane Roberts
 
PDF
Data Lakes: A Logical Approach for Faster Unified Insights
Denodo
 
PDF
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Denodo
 
PPTX
Ppt
bullsrockr666
 
DOCX
Microsoft Fabric data warehouse by dataplatr
ajaykumar405166
 
PDF
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
Denodo
 
PPTX
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
shruthisweety4
 
PPTX
Speak to Your Data
Amer Radwan , PMP , CSM
 
PDF
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Denodo
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
James Serra
 
Logical Data Fabric and Data Mesh – Driving Business Outcomes
Denodo
 
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
Denodo
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Denodo
 
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Denodo
 
Chap3-Data Warehousing and OLAP operations..pptx
stuti8985
 
Business Intelligence Module 3_Datawarehousing.pptx
AmbikaVenkatesh4
 
A Logical Architecture is Always a Flexible Architecture (ASEAN)
Denodo
 
Maximizing Oil and Gas (Data) Asset Utilization with a Logical Data Fabric (A...
Denodo
 
Data Warehouse Introduction to Data Warehouse
MSridhar18
 
Logical Data Fabric: Maturing Implementation from Small to Big (APAC)
Denodo
 
Whitepaper-The-Data-Lake-3_0
Jane Roberts
 
Data Lakes: A Logical Approach for Faster Unified Insights
Denodo
 
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Denodo
 
Microsoft Fabric data warehouse by dataplatr
ajaykumar405166
 
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
Denodo
 
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
shruthisweety4
 
Speak to Your Data
Amer Radwan , PMP , CSM
 
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Denodo
 
Ad

More from Denodo (20)

PDF
Enterprise Monitoring and Auditing in Denodo
Denodo
 
PDF
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Denodo
 
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
Denodo
 
PDF
What you need to know about Generative AI and Data Management?
Denodo
 
PDF
Mastering Data Compliance in a Dynamic Business Landscape
Denodo
 
PDF
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Denodo
 
PDF
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Denodo
 
PDF
Drive Data Privacy Regulatory Compliance
Denodo
 
PDF
Знакомство с виртуализацией данных для профессионалов в области данных
Denodo
 
PDF
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo
 
PDF
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Denodo
 
PDF
Lunch and Learn ANZ: Key Takeaways for 2023!
Denodo
 
PDF
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Denodo
 
PDF
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Denodo
 
PDF
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
Denodo
 
PDF
How to Build Your Data Marketplace with Data Virtualization?
Denodo
 
PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Denodo
 
PDF
Enabling Data Catalog users with advanced usability
Denodo
 
PDF
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
Denodo
 
PDF
GenAI y el futuro de la gestión de datos: mitos y realidades
Denodo
 
Enterprise Monitoring and Auditing in Denodo
Denodo
 
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Denodo
 
Achieving Self-Service Analytics with a Governed Data Services Layer
Denodo
 
What you need to know about Generative AI and Data Management?
Denodo
 
Mastering Data Compliance in a Dynamic Business Landscape
Denodo
 
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Denodo
 
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Denodo
 
Drive Data Privacy Regulatory Compliance
Denodo
 
Знакомство с виртуализацией данных для профессионалов в области данных
Denodo
 
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo
 
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Denodo
 
Lunch and Learn ANZ: Key Takeaways for 2023!
Denodo
 
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Denodo
 
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Denodo
 
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
Denodo
 
How to Build Your Data Marketplace with Data Virtualization?
Denodo
 
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Denodo
 
Enabling Data Catalog users with advanced usability
Denodo
 
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
Denodo
 
GenAI y el futuro de la gestión de datos: mitos y realidades
Denodo
 
Ad

Recently uploaded (20)

PDF
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
PDF
Driving Employee Engagement in a Hybrid World.pdf
Mia scott
 
PPTX
在线购买英国本科毕业证苏格兰皇家音乐学院水印成绩单RSAMD学费发票
Taqyea
 
PDF
A GraphRAG approach for Energy Efficiency Q&A
Marco Brambilla
 
PPTX
05_Jelle Baats_Tekst.pptx_AI_Barometer_Release_Event
FinTech Belgium
 
PDF
SQL for Accountants and Finance Managers
ysmaelreyes
 
PDF
The Best NVIDIA GPUs for LLM Inference in 2025.pdf
Tamanna36
 
PPTX
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
PPTX
BinarySearchTree in datastructures in detail
kichokuttu
 
PDF
apidays Singapore 2025 - From API Intelligence to API Governance by Harsha Ch...
apidays
 
PPTX
Aict presentation on dpplppp sjdhfh.pptx
vabaso5932
 
PPT
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
PDF
1750162332_Snapshot-of-Indias-oil-Gas-data-May-2025.pdf
sandeep718278
 
PPTX
How to Add Columns and Rows in an R Data Frame
subhashenia
 
PPTX
apidays Singapore 2025 - Generative AI Landscape Building a Modern Data Strat...
apidays
 
PPTX
04_Tamás Marton_Intuitech .pptx_AI_Barometer_2025
FinTech Belgium
 
PPTX
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
PDF
Unlocking Insights: Introducing i-Metrics Asia-Pacific Corporation and Strate...
Janette Toral
 
PDF
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
Driving Employee Engagement in a Hybrid World.pdf
Mia scott
 
在线购买英国本科毕业证苏格兰皇家音乐学院水印成绩单RSAMD学费发票
Taqyea
 
A GraphRAG approach for Energy Efficiency Q&A
Marco Brambilla
 
05_Jelle Baats_Tekst.pptx_AI_Barometer_Release_Event
FinTech Belgium
 
SQL for Accountants and Finance Managers
ysmaelreyes
 
The Best NVIDIA GPUs for LLM Inference in 2025.pdf
Tamanna36
 
Listify-Intelligent-Voice-to-Catalog-Agent.pptx
nareshkottees
 
BinarySearchTree in datastructures in detail
kichokuttu
 
apidays Singapore 2025 - From API Intelligence to API Governance by Harsha Ch...
apidays
 
Aict presentation on dpplppp sjdhfh.pptx
vabaso5932
 
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
1750162332_Snapshot-of-Indias-oil-Gas-data-May-2025.pdf
sandeep718278
 
How to Add Columns and Rows in an R Data Frame
subhashenia
 
apidays Singapore 2025 - Generative AI Landscape Building a Modern Data Strat...
apidays
 
04_Tamás Marton_Intuitech .pptx_AI_Barometer_2025
FinTech Belgium
 
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
Unlocking Insights: Introducing i-Metrics Asia-Pacific Corporation and Strate...
Janette Toral
 
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 

Building a Logical Data Fabric using Data Virtualization (ASEAN)

  • 1. DENODO LUNCH & LEARN 23 NOVEMBER BUILDING A LOGICAL DATA FABRIC USING DATA VIRTUALIZATION
  • 2. Presenters for this Session Chris Day Director, APAC Sales Engineering, Denodo Regional Vice President, Sales, ASEAN & Korea, Denodo Elaine Chan
  • 3. Agenda 1. Data Lakes and Data Warehouses 2. Observations From the Recent TDWI Report 3. What is a Logical Data Fabric ? (Deep Dive) 4. How Does This Apply to a Data Warehouse/Data Lake? 5. Customer Case Study - Autodesk 6. Conclusions 7. Product Demo 8. Q & A and Next Steps
  • 4. Building A Logical Data Fabric Using Data Virtualization Regional Vice President, Sales, ASEAN & Korea, Denodo Elaine Chan
  • 6. 6 What is a Data Warehouse ? In computing, a data warehouse (DW or DWH), also known as an enterprise data warehouse (EDW), is a system used for reporting and data analysis and is considered a core component of business intelligence.[1] DWs are central repositories of integrated data from one or more disparate sources. They store current and historical data in one single place[2] that are used for creating analytical reports for workers throughout the enterprise.[3] The data stored in the warehouse is uploaded from the operational systems (such as marketing or sales). The data may pass through an operational data store and may require data cleansing[2] for additional operations to ensure data quality before it is used in the DW for reporting. Source: https://en.wikipedia.org/wiki/Data_warehouse
  • 7. 7 Etymology of “Data Lake” Pentaho’s CTO James Dixon is credited with coining the term "data lake". He described it in his blog in 2010: “If you think of a data mart as a store of bottled water – cleansed and packaged and structured for easy consumption – the data lake is a large body of water in a more natural state. The contents of the data lake stream in from a source to fill the lake, and various users of the lake can come to examine, dive in, or take samples.” Source: https://jamesdixon.wordpress.com/2010/10/14/pentaho-hadoop-and-data-lakes/
  • 8. 8 Can We Place All Data Into a Single Cloud System? Taking advantage of the current shift to the cloud, couldn't we simply consolidate all data in a single system, like a data lake or a “lakehouse”? § They are (relatively) cheap and scale out well for large data volumes § Is that realistic? § Is that possible?
  • 9. 9 Do I Need Both a Data Lake and a Data Warehouse? Data Warehouses: Typically contain structured data and more often than not are on premise • Top use case BI and Analytics Data Lakes: Often contain unstructured data and typically cloud based. • Top use case Data Science 2021 TDWI report Building the Unified Data Warehouse and Data Lake BEST PRACTICES REPORT By Fern Halper, Ph.D., and James Kobielus
  • 10. 10 Do I Really Need Both a Data Lake and a Data Warehouse? 2021 TDWI report Building the Unified Data Warehouse and Data Lake BEST PRACTICES REPORT By Fern Halper, Ph.D., and James Kobielus
  • 11. 11 Why? 2021 TDWI report Building the Unified Data Warehouse and Data Lake BEST PRACTICES REPORT By Fern Halper, Ph.D., and James Kobielus
  • 12. 12 Data lakes were born to efficiently address the challenge of cost reduction: Data lakes allow for cheap, efficient storage of very large amounts of data. Cloud implementation simplified the complexity of managing a large data lake.
  • 13. 13 …Data lakes lack semantic consistency and governed metadata. Meeting the needs of wider audiences require curated repositories with governance, semantic consistency and access controls.”
  • 14. 14 So How Are Organizations Going About It? 2021 TDWI report Building the Unified Data Warehouse and Data Lake BEST PRACTICES REPORT By Fern Halper, Ph.D., and James Kobielus
  • 15. 15 Final Observations From the Report Architects own the overall design. It is no surprise that architects are the top contributors to the design of the data warehousing environment. This includes data warehouse architects (49%), enterprise architects (43%), and IT architects (21%). Data scientists (56%) are the top contributor of various components to the unified environment. 2021 TDWI report Building the Unified Data Warehouse and Data Lake BEST PRACTICES REPORT By Fern Halper, Ph.D., and James Kobielus
  • 16. What is a Logical Data Fabric?
  • 17. 17 A data fabric is an architecture pattern that informs and automates the design, integration and deployment of data objects regardless of deployment platforms and architectural approaches. It utilizes continuous analytics and AI/ML over all metadata assets to provide actionable insights and recommendations on data management and integration design and deployment patterns. This results in faster, informed and, in some cases, completely automated data access and sharing. Data Fabric Definition
  • 18. 18 Data Fabric A data fabric is an architecture pattern that informs and automates the design, integration and deployment of data objects regardless of deployment platforms and architectural approaches § It utilizes continuous analytics and AI/ML over all metadata assets to provide actionable insights and recommendations on data management and integration design and deployment patterns. § This results in faster, informed and, in some cases, completely automated data access and sharing § Strongly supported by both Gartner and Forrester Data Fabric Net Compounds Customers Products Claims RDBMS/OLTP Traditional Analytics/BI Data Lakes Cloud Data Stores Apps and Document Repositories Flat Files Third Party Legacy Mart Data Warehouse Mart ETL ETL XML • JSON • PDF DOC • WEB
  • 19. 19 Logical Data Fabric Demystifying the Data Fabric, September 2020 The core of the matter is being able to consolidate many diverse data sources in an efficient manner by allowing trusted data to be delivered from all relevant data sources to all relevant data consumers through one common layer.
  • 20. 20 Logical Data Fabric • Data Abstraction: decoupling applications/data usage from data sources • Data Integration without replication or relocation of physical data • Easy Access to Any Data, high performant and real-time/ right- time • Data Catalog for self-service data services and easy discovery • Unified metadata, security & governance across all data assets • Data Delivery in any format with intelligent query optimization that leverages new and existing physical data platforms A logical data layer – a “logical data fabric” – that provides high-performant, real-time, and secure access to integrated business views of disparate data across the enterprise
  • 21. How Does This Apply to a Data Warehouse/Data Lake ?
  • 22. 22 Logical Data Fabric Reference Architecture LOGICAL DATA FABRIC
  • 23. 23 Cloud Applications Salesforce.com AWS Ecosystem AWS Data Sources Amazon Aurora Amazon Dynamo DB Amazon Redshift Amazon Athena Amazon S3 Data Consumers Amazon QuickSight OnPrem Data Sources MS Excel Other Applications SAP Data Sources Example: Denodo in Multiple Locations Azure Ecosystem Denodo Data Virtualization Data Catalog SQL API Denodo Data Virtualization SQL API Azure Data Sources Azure Synapse Azure Delta Lake
  • 25. 25 v Autodesk Overview • Founded 1982 (NASDAQ: ASDK) • Annual revenues (FY 2018) $2.06B § Over 8,800 employees • 3D modeling and animation software § Flagship product is AutoCAD • Market sectors: § Architecture, Engineering, and Construction § Manufacturing § Media and Entertainment § Recently started 3D Printing offerings
  • 26. 26 v Business Drivers for Change • Software consumption model is changing § Perpetual licenses to subscriptions § User want more flexibility in how they use software • Autodesk needed to transition to subscription pricing § 2016 – some products will be subscription only • Lifetime revenue higher with subscriptions § Over 3-5 years, subscriptions = more revenue • Changing a licensing model is disruptive
  • 27. 27 v Technology Challenges • Current ‘traditional’ BI/EDW architecture not designed for data streams from online apps § Weblogs, Clickstreams, Cloud/Desktop apps, etc. • Existing infrastructure can’t simply ‘go away’ § Regulatory reporting (e.g. SEC) § Existing ‘perpetual’ customers • ‘Subscription’ infrastructure work in parallel § Extend and enhance existing systems § With single access point to all data • Solution – ‘Logical Data Warehouse/Fabric’
  • 32. 32 v Autodesk Successfully Changes Their Revenue Model and Transforms Business § Autodesk was changing their business revenue model from a conventional perpetual license model to subscription- based license model. § Inability to deliver high quality data in a timely manner to business stakeholders. § Evolution from traditional operational data warehouse to contemporary logical data warehouse deemed necessary for faster speed. § Successfully transitioned to subscription- based licensing. § For the first time, Autodesk can do single point security enforcement and have uniform data environment for access. § General purpose platform to deliver data through logical data warehouse. § Denodo Abstraction Layer helps live invoicing with SAP. § Data virtualization enabled a culture of “see before you build”. 32 Autodesk, Inc. is an American multinational software corporation that makes software for the architecture, engineering, construction, manufacturing, media, and entertainment industries. Case Study
  • 34. 34 A Logical Data Fabric § Pillar 1 — Integrates data across multi-cloud environments § Pillar 2 - Automates manual tasks using augmented intelligence § Pillar 3 - Boosts performance of analytics with rapid data delivery § Pillar 4 - Supports data discovery and data science initiatives § Pillar 5 - Analyzes across data at rest and data in motion § Pillar 6 - Catalogs all data for discovery, lineage, and associations TDWI Checklist Report: Six Critical Capabilities of a Logical Data Fabric (May 2020) By Fern Halper and David Loshin - denodo.link/tdwi5
  • 35. Product Demonstration Director, APAC Sales Engineering, Denodo Chris Day
  • 36. Q&A
  • 39. Featuring Leading Industry Experts Angel Vina Founder & CEO Alberto Pan Executive VP & CTO Ravi Shankar Senior VP & CMO David Loshin President of Knowledge Integrity Terry Moon Enterprise Information Architect Logical Data Fabric: The Future of Data Management and Analytics Michele Goetz VP & Principal Analyst denodo.link/DF2111 AVAILABLE ON DEMAND
  • 40. Thanks! www.denodo.com [email protected] © Copyright Denodo Technologies. All rights reserved Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm, without prior the written authorization from Denodo Technologies.