SlideShare a Scribd company logo
© 2019 Snowflake Inc. All Rights Reserved
HOW TO SAVE
PILES OF $$$
BY CREATING THE
BEST DATA MODEL
THE FIRST TIME
KENT GRAZIANO, CHIEF TECHNICAL EVANGELIST I @KentGraziano
© 2019 Snowflake Inc. All Rights Reserved 2
© 2019 Snowflake Inc. All Rights Reserved
AGENDA
3
Bio
Assumptions
My Best Practices
Standards
Checklist #1
Checklist #2
Checklist #3
Benefits
Considerations
Is it Agile?
Conclusion
© 2019 Snowflake Inc. All Rights Reserved
MY BIO
4
Chief Technical Evangelist, Snowflake Computing
Oracle ACE Director (DW/BI)
Blogger: The Data Warrior
Certified Data Vault Master and DV 2.0 Practitioner
Former Member: Boulder BI Brain Trust (#BBBT)
Member: DAMA Houston & DAMA International
Data Architecture and Data Warehouse Specialist
• 30+ years in IT
• 25+ years of Oracle-related work
• 20+ years of data warehousing experience
Author & Co-Author of a bunch of books (Amazon)
Past-President of ODTUG and Rocky Mountain Oracle User Group
© 2019 Snowflake Inc. All Rights Reserved
3 years in stealth, 3+ years GA
5
1200+ employees
Over 2000
customers today
Over $920M in venture
funding from leading
investors
First customers
2014, general
availability 2015
Founded 2012 by
industry veterans
with over 120
database patents
Queries processed in
Snowflake per day:
100 million
Largest single
table:
68 trillion rows
Largest number of
tables single DB:
200,000
Single customer
most data:
> 40PB
Single customer
most users:
> 10,000
Fun facts:
© 2019 Snowflake Inc. All Rights Reserved
PRESENTATION BASED ON:
6
http://www.amazon.com/Check-Doing-Design-Reviews-ebook/dp/B008RG9L5E/
© 2019 Snowflake Inc. All Rights Reserved
Not a data modeling
presentation
• You already understand ERD’s and
database design
Not a project management
presentation
7
ASSUMPTIONS
© 2019 Snowflake Inc. All Rights Reserved
Who are you?
• Data Modeler or Architect
• Project Managers
• IT Managers
• DBA
• Developer
Experience
• Data Modeling
– Less than 1yr?
– 1-5yrs?
– Over 5yrs?
8
SURVEY
© 2019 Snowflake Inc. All Rights Reserved
Design the Database BEFORE you start
programming!
• Based on the actual business
requirements, needs, and terminology
Have standards and FOLLOW them!
Develop a check list so you don’t
miss a step
• Really – it is faster and more efficient
The sooner you find a mistake the
cheaper it is to fix!
9
MY BEST PRACTICES
(NOSURPRISESHERE)
© 2019 Snowflake Inc. All Rights Reserved
Have them!
• Insure consistency
• Easier to review
Pick a tool to model in
• SQL Developer Data Modeler (SDDM)
• Erwin (if you have $$)
• Visio (really?)
Generate code whenever possible!
10
STANDARDS
© 2019 Snowflake Inc. All Rights Reserved
Object naming
• Entities
• Attributes
• Tables
• Columns
• Views
• Constraints (PK, UK, FK)
• Indexes
Use a tool that helps
support these!
11
STANDARDS
© 2019 Snowflake Inc. All Rights Reserved
SDDM NAMING TEMPLATES
12
© 2019 Snowflake Inc. All Rights Reserved
Diagrams
• Shapes
• Colors
– Yellow = lookup
– Red outline = shared
• Crows feet & direction
• Legend
– Diagram Name
– Author
– Last Updated
13
STANDARDS
© 2019 Snowflake Inc. All Rights Reserved
DIAGRAM LEGEND – SDDM
14
© 2019 Snowflake Inc. All Rights Reserved
CHECKLIST #1
© 2019 Snowflake Inc. All Rights Reserved
BASIC DESIGN PROCESS – STEPS
16
Develop the logical data model (i.e., entity relationship diagram)
• If doing data vault or data mart design, skip to build of physical model
Subject the model to one (or more) peer reviews
Make changes from the reviews
Get team “sign off” on the model
Convert the model to a physical database design
Subject the physical design to peer review
© 2019 Snowflake Inc. All Rights Reserved
BASIC DESIGN PROCESS – CONT’D
17
Make changes
Get team sign off
Generate the DDL to build the model
Review the DDL
Make changes
Execute the changed DDL
Hand off for development of ETL programs
© 2019 Snowflake Inc. All Rights Reserved
CHECKLIST #2
© 2019 Snowflake Inc. All Rights Reserved
DEVELOP THE LOGICAL MODEL (DETAILS)
19
Gather and review requirements
Review current naming standards and domains
Review existing models (if any)
Develop new data model objects
Modify existing objects if required/indicated
Review logical model check list
© 2019 Snowflake Inc. All Rights Reserved
DEVELOP THE LOGICAL MODEL
20
Schedule logical model design review session
Prepare for review session
• Print the ERD (or ERD Subview)
• Print entity and attribute detail reports
– Assuming your modeling tool can produce some
– Otherwise, print whatever meta-data you have
– Email to the reviewers
• BETTER: Save the diagrams and reports to a shared knowledge
management site like SharePoint or Confluence
• BEST: Use SDDM with SVN and share
© 2019 Snowflake Inc. All Rights Reserved
DEVELOP THE LOGICAL MODEL
21
Conduct the review session
Make changes resulting from review session.
Return to the develop/modify objects steps & repeat
When approved, proceed to the development of
physical model
© 2019 Snowflake Inc. All Rights Reserved
REVIEW SESSION
CHECKLIST (#2A)
© 2019 Snowflake Inc. All Rights Reserved
LOGICAL MODEL REVIEW CHECKLIST
23
If you have Oracle SDDM
• Have the standard Design Rules been run to validate the model
completeness?
– If no – then do it!
If not using SDDM
• Get a sanity check from another team member before
scheduling review
• Instead of pair-programming (from XP) do pair data modeling
Saves embarrassment from obvious mistakes
© 2019 Snowflake Inc. All Rights Reserved
SDDM DESIGN RULES
24
© 2019 Snowflake Inc. All Rights Reserved
LOGICAL MODEL REVIEW CHECKLIST
25
Entity Review Questions
• Is each entity in the model in 3rd Normal Form?
• Are any entities similar in nature to or a sub-type of, an entity already
in the corporate repository?
– If so, why was that entity not used?
• Are there detailed definitions for every entity and sub-entity?
– Review definitions.
– For data warehouse models, is the source of the data noted?
• Do all entities conform to existing naming standards and
abbreviations?
– Are any new standards or abbreviations indicated?
© 2019 Snowflake Inc. All Rights Reserved
LOGICAL MODEL REVIEW CHECKLIST
26
Relationship Review
• Is the cardinality correct?
• Is the optionality correct?
• Do the relationship names make sense to a business
analyst or end user?
• Do all arc’ed relationships validate?
© 2019 Snowflake Inc. All Rights Reserved
LOGICAL MODEL REVIEW CHECKLIST
27
Attribute review
• Do all attributes conform to existing naming standards
and abbreviations?
• Are any new standards or abbreviations indicated?
• Have domains been used appropriately?
• Are there any new domains that can be identified?
• Is there an “attribute comment” for every attribute?
© 2019 Snowflake Inc. All Rights Reserved
LOGICAL MODEL REVIEW CHECKLIST
28
Attribute review
• Optional - Is there a note for every attribute?
– For data warehouse models, is the source of the attribute noted?
• Are any of the attributes derived?
– If so, why are they included in the logical model, which should
be 3NF?
• Are all the appropriate audit columns included (created by, creation
date, etc.)?
– If not add them!
© 2019 Snowflake Inc. All Rights Reserved
AUTOMATICALLY ADD AUDIT COLUMNS IN SDDM
29
© 2019 Snowflake Inc. All Rights Reserved
LOGICAL MODEL REVIEW CHECKLIST
30
Unique Identifiers
• Does every entity have at least one unique identifier (UID)?
– Does this UID distinguish each instance as being unique?
• Verify with examples.
– Is this UID a true business key as opposed to a system generated
sequence?
– If it is not a business key, why not?
• If system generated, then it should be noted as an exception.
– Is the business key updatable?
• Are there any alternate unique identifiers?
– Verify with examples.
© 2019 Snowflake Inc. All Rights Reserved
LOGICAL MODEL REVIEW CHECKLIST
31
Other
• Does the model answer all the known requirements?
– Assumes you have more than air-specs
– Can it answer all the known questions from the users?
• Has the model been compared to other models for quality and
completeness?
– For example, “Universal Data Models” from The Data
Model Resource Book by Silverston, et al.
© 2019 Snowflake Inc. All Rights Reserved
SDDM REPORTS
32
© 2019 Snowflake Inc. All Rights Reserved
LOGICAL MODEL REVIEW
33
This process seems intensive
• An ounce of prevention…
Requires modelers to be thorough
Requires reviewers to actually read all the meta-data to verify:
• The descriptions are understandable to a business user
• The text is grammatically correct with proper punctuation
– It may appear in reports or as meta-data in a business
intelligence tool
© 2019 Snowflake Inc. All Rights Reserved
THANKS FOR PLAYING!
34
Pass or Fail?
• Reviewers decide
Pass – go on to physical model
Fail – revise the model and try again
• The “crumple” factor
Conditional Pass
• Clean up some meta data and domains,
then proceed w/o further review
© 2019 Snowflake Inc. All Rights Reserved
CHECKLIST #3
© 2019 Snowflake Inc. All Rights Reserved
DEVELOP THE PHYSICAL MODEL - STEPS
36
Review logical model with data modeler
• If the database designer is a separate person on the team
Review specific implementation requirements
Forward Engineer logical model to a Relational model in SDDM
• Diagram automatically created
© 2019 Snowflake Inc. All Rights Reserved
SDDM ENGINEER TO RELATIONAL
37
© 2019 Snowflake Inc. All Rights Reserved
DEVELOP THE PHYSICAL MODEL
38
Make manual modifications to tables and columns
• Try to AVOID this if possible
• As required by tuning and implementation requirements
• To comply with standards
• Add column and table check constraints
Coordinate with the data modeler to account for apparent
missing data elements
© 2019 Snowflake Inc. All Rights Reserved
DEVELOP THE PHYSICAL MODEL
39
Review physical model design check list
Schedule physical design review session
Prepare for review session
• Print the schema diagram and table and column detail reports
– Email to the reviewers
• BETTER: Save the diagrams and reports to a shared knowledge
management site like SharePoint or Confluence
• BEST: Use SDDM with SVN and share
Conduct the review session
© 2019 Snowflake Inc. All Rights Reserved
DEVELOP THE PHYSICAL MODEL
40
Make changes resulting from review session
• Return to modify the schema diagram
When approved, proceed to generate and deploy DDL to the
development database
• Hand off to development DBA if there is one
Hand-off to programmers for application or ETL development
© 2019 Snowflake Inc. All Rights Reserved
REVIEW SESSION
CHECKLIST (#3A)
© 2019 Snowflake Inc. All Rights Reserved
PHYSICAL MODEL REVIEW CHECK LIST
42
Almost identical to Logical Model list
General
• If you have Oracle SDDM
– Have the standard Design Rules been run to validate the model
completeness?
• If no – then do it!
• Otherwise get a sanity check from another team member before
scheduling review
• Is this an OLTP system or Data Warehouse/Data Mart?
– If the model is a Data Mart, is it in a Star Schema format?
• If not, why not?
© 2019 Snowflake Inc. All Rights Reserved
PHYSICAL MODEL REVIEW CHECK LIST
43
Table Review
• Are any tables similar in nature to a table already in the corporate
repository?
– If so, can the existing definition be used with modifications?
• Is there a table level comment for every table?
• Are there detailed definitions for every table?
• Are help or reference tables needed to support implementation?
– i.e., Does the normalization look correct?
© 2019 Snowflake Inc. All Rights Reserved
PHYSICAL MODEL REVIEW CHECK LIST
44
Table Review
• Do any of the tables require journaling (i.e., audit trail for changes)?
– If so will a table API be needed to support it?
– Note: See Table API generator in SQL Developer
• If a table is denormalized, validate the reasoning.
– Are there triggers to keep the denormalization in sync?
• Do all tables conform to existing naming standards and abbreviations?
– Are any new standards or abbreviations indicated?
© 2019 Snowflake Inc. All Rights Reserved
PHYSICAL MODEL REVIEW CHECK LIST
45
Column Review
• Are appropriate audit columns defined?
• Are there column comments for every column?
• If the column is derived from other columns, have the derivation rules
and formulas been recorded?
• If a column has been denormalized from another table, has the
denormalization information been recorded?
• NEW: Are there security considerations such as PII or PHI?
© 2019 Snowflake Inc. All Rights Reserved
COLUMN DERIVATIONS IN SDDM (I.E., VIRTUAL)
46
Virtual Column
Derivation
specifications
© 2019 Snowflake Inc. All Rights Reserved
SDDM DIAGRAM SHOWS DERIVATIONS
47
© 2019 Snowflake Inc. All Rights Reserved
SDDM COLUMN SECURITY SETTINGS
48
© 2019 Snowflake Inc. All Rights Reserved
SDDM COLUMN SECURITY DISPLAY (SENSITIVE)
49
© 2019 Snowflake Inc. All Rights Reserved
PHYSICAL MODEL REVIEW CHECK LIST
50
Column Review
• Do all columns conform to existing naming standards and
abbreviations?
– Are any new standards or abbreviations indicated?
• Have domains been used appropriately?
– Are there any new domains that can be identified?
© 2019 Snowflake Inc. All Rights Reserved
PHYSICAL MODEL REVIEW CHECK LIST
51
Constraints/Referential Integrity
• Do all tables have appropriate primary key constraints?
• Should any secondary unique constraints be defined?
• Are all the foreign keys correct and enabled?
• Has cascade delete been enabled on any foreign key?
– If so, why?
© 2019 Snowflake Inc. All Rights Reserved
PHYSICAL MODEL REVIEW CHECK LIST
52
Constraints/Referential Integrity
• Should any of the tables have table level check constraints?
– If so have they been defined correctly?
– Should they be enabled in the database?
• Are there any column level check constraints?
– Should they be enabled in the database?
• If any defined constraints will not be enabled, document the
justification and get consensus.
© 2019 Snowflake Inc. All Rights Reserved
PHYSICAL MODEL REVIEW CHECK LIST
53
Other
• If a corporate data model is available, have all tables and columns
been mapped to entities and attributes in that model?
• If this is a warehouse model, has the source data been identified?
– Is there a model for the source systems?
– Have the target columns been mapped to a known source?
• Does the model answer all the known requirements?
© 2019 Snowflake Inc. All Rights Reserved
BENEFITS &
CONSIDERATIONS
© 2019 Snowflake Inc. All Rights Reserved
BENEFITS OF THE REVIEWS
55
#1 – Higher quality models & databases
Standards enforcement
• The modelers and designers can’t ignore the approved standards if
the review team holds them accountable
Issues are discovered sooner in the lifecycle
• Saves $$$$ and results in a better product, faster
© 2019 Snowflake Inc. All Rights Reserved
BENEFITS OF THE REVIEWS
56
Knowledge transfer
• Everyone in the review learns the model and the business requirements.
Cross Training
• Jr. modelers and programmers get exposure to best practices and examples
of both good and bad data models
• This experience is often more effective than a data modeling class
A team of professionals that treat their work as business and not personal,
while deriving great personal satisfaction from a job well done
© 2019 Snowflake Inc. All Rights Reserved
New people may be intimidated by
the process
• Outsider to the team
• Feel like being picked on
• Must accept the process as
valid
• Mention the process in
interview
Strive to hold people accountable
Must have team buy-in to
the process
57
CAUTIONS TO CONSIDER
© 2019 Snowflake Inc. All Rights Reserved
In-experienced data modelers may
be intimidated
• Be clear up front that the critique
is being done to:
– Improve the quality of the models
– Improve their skill and knowledge
– Should not be taken as a personal
attack on their abilities
58
CAUTIONS TO CONSIDER
© 2019 Snowflake Inc. All Rights Reserved
Keep the units of work small
• A few entities/tables
• Requires scope control
• Think SCRUM!
• Review sessions less than
one hour
Takes time but reduces
overall effort
Reviewers must be readily
available on short notice
• Get at least 2 people if urgent
59
IS IT AGILE?
© 2019 Snowflake Inc. All Rights Reserved
CONCLUSION
60
Improving the quality of a model will save lots of $$$
over the course of a project
• And in the future!
Improving the quality of models can be done
• Requires discipline & structure
Suggested approaches
• Set standards
• Define a process
• Develop a check list
• Stick to it!
© 2019 Snowflake Inc. All Rights Reserved
Kent Graziano
Snowflake Computing
Kent.graziano@snowflake.com
On Twitter @KentGraziano
More info at
http://snowflake.com
Visit my blog at
http://kentgraziano.com
© 2019 Snowflake Inc. All Rights Reserved
QUESTIONS?
© 2019 Snowflake Inc. All Rights Reserved
Ad

More Related Content

What's hot (20)

Self-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
Self-serve analytics journey at Celtra: Snowflake, Spark, and DatabricksSelf-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
Self-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
Grega Kespret
 
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODSAgile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
Kent Graziano
 
Snowflake Automated Deployments / CI/CD Pipelines
Snowflake Automated Deployments / CI/CD PipelinesSnowflake Automated Deployments / CI/CD Pipelines
Snowflake Automated Deployments / CI/CD Pipelines
Drew Hansen
 
Introducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseIntroducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data Warehouse
Snowflake Computing
 
Launching a Data Platform on Snowflake
Launching a Data Platform on SnowflakeLaunching a Data Platform on Snowflake
Launching a Data Platform on Snowflake
KETL Limited
 
Analyzing Semi-Structured Data At Volume In The Cloud
Analyzing Semi-Structured Data At Volume In The CloudAnalyzing Semi-Structured Data At Volume In The Cloud
Analyzing Semi-Structured Data At Volume In The Cloud
Robert Dempsey
 
Altis AWS Snowflake Practice
Altis AWS Snowflake PracticeAltis AWS Snowflake Practice
Altis AWS Snowflake Practice
SamanthaSwain7
 
Dataiku & Snowflake Meetup Berlin 2020
Dataiku & Snowflake Meetup Berlin 2020Dataiku & Snowflake Meetup Berlin 2020
Dataiku & Snowflake Meetup Berlin 2020
Harald Erb
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a Service
Snowflake Computing
 
Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)
Kent Graziano
 
Zero to Snowflake Presentation
Zero to Snowflake Presentation Zero to Snowflake Presentation
Zero to Snowflake Presentation
Brett VanderPlaats
 
A 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with SnowflakeA 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with Snowflake
Snowflake Computing
 
Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)
Kent Graziano
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
Kent Graziano
 
Snowflake for Data Engineering
Snowflake for Data EngineeringSnowflake for Data Engineering
Snowflake for Data Engineering
Harald Erb
 
Exploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & FutureExploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & Future
Agilisium Consulting
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
Databricks
 
Elastic Data Warehousing
Elastic Data WarehousingElastic Data Warehousing
Elastic Data Warehousing
Snowflake Computing
 
Cheetah:Data Warehouse on Top of MapReduce
Cheetah:Data Warehouse on Top of MapReduceCheetah:Data Warehouse on Top of MapReduce
Cheetah:Data Warehouse on Top of MapReduce
Tilani Gunawardena PhD(UNIBAS), BSc(Pera), FHEA(UK), CEng, MIESL
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Kent Graziano
 
Self-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
Self-serve analytics journey at Celtra: Snowflake, Spark, and DatabricksSelf-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
Self-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
Grega Kespret
 
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODSAgile Data Warehousing: Using SDDM to Build a Virtualized ODS
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
Kent Graziano
 
Snowflake Automated Deployments / CI/CD Pipelines
Snowflake Automated Deployments / CI/CD PipelinesSnowflake Automated Deployments / CI/CD Pipelines
Snowflake Automated Deployments / CI/CD Pipelines
Drew Hansen
 
Introducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseIntroducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data Warehouse
Snowflake Computing
 
Launching a Data Platform on Snowflake
Launching a Data Platform on SnowflakeLaunching a Data Platform on Snowflake
Launching a Data Platform on Snowflake
KETL Limited
 
Analyzing Semi-Structured Data At Volume In The Cloud
Analyzing Semi-Structured Data At Volume In The CloudAnalyzing Semi-Structured Data At Volume In The Cloud
Analyzing Semi-Structured Data At Volume In The Cloud
Robert Dempsey
 
Altis AWS Snowflake Practice
Altis AWS Snowflake PracticeAltis AWS Snowflake Practice
Altis AWS Snowflake Practice
SamanthaSwain7
 
Dataiku & Snowflake Meetup Berlin 2020
Dataiku & Snowflake Meetup Berlin 2020Dataiku & Snowflake Meetup Berlin 2020
Dataiku & Snowflake Meetup Berlin 2020
Harald Erb
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a Service
Snowflake Computing
 
Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)Demystifying Data Warehousing as a Service (GLOC 2019)
Demystifying Data Warehousing as a Service (GLOC 2019)
Kent Graziano
 
Zero to Snowflake Presentation
Zero to Snowflake Presentation Zero to Snowflake Presentation
Zero to Snowflake Presentation
Brett VanderPlaats
 
A 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with SnowflakeA 30 day plan to start ending your data struggle with Snowflake
A 30 day plan to start ending your data struggle with Snowflake
Snowflake Computing
 
Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)Demystifying Data Warehouse as a Service (DWaaS)
Demystifying Data Warehouse as a Service (DWaaS)
Kent Graziano
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
Kent Graziano
 
Snowflake for Data Engineering
Snowflake for Data EngineeringSnowflake for Data Engineering
Snowflake for Data Engineering
Harald Erb
 
Exploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & FutureExploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & Future
Agilisium Consulting
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
Databricks
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Kent Graziano
 

Similar to HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Kscope 19) (20)

(Oracle) DBA and Other Skills Needed in 2020
(Oracle) DBA and Other Skills Needed in 2020(Oracle) DBA and Other Skills Needed in 2020
(Oracle) DBA and Other Skills Needed in 2020
Markus Michalewicz
 
Machine Learning Everywhere
Machine Learning EverywhereMachine Learning Everywhere
Machine Learning Everywhere
DataWorks Summit
 
SDL Tridion Next Generation GraphQL Integration Accelerators
SDL Tridion Next Generation GraphQL Integration AcceleratorsSDL Tridion Next Generation GraphQL Integration Accelerators
SDL Tridion Next Generation GraphQL Integration Accelerators
Ben Middleton
 
Power BI Advance Modeling
Power BI Advance ModelingPower BI Advance Modeling
Power BI Advance Modeling
CCG
 
Power BI Advanced Data Modeling Virtual Workshop
Power BI Advanced Data Modeling Virtual WorkshopPower BI Advanced Data Modeling Virtual Workshop
Power BI Advanced Data Modeling Virtual Workshop
CCG
 
Spark working with a Cloud IDE: Notebook/Shiny Apps
Spark working with a Cloud IDE: Notebook/Shiny AppsSpark working with a Cloud IDE: Notebook/Shiny Apps
Spark working with a Cloud IDE: Notebook/Shiny Apps
Data Con LA
 
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Emtec Inc.
 
The coding portion of Data Science
The coding portion of Data ScienceThe coding portion of Data Science
The coding portion of Data Science
Institute of Contemporary Sciences
 
M01_Introduction+to+Performance+Tuning_ed1.pdf
M01_Introduction+to+Performance+Tuning_ed1.pdfM01_Introduction+to+Performance+Tuning_ed1.pdf
M01_Introduction+to+Performance+Tuning_ed1.pdf
ssuserc34fbf
 
Strategic imperative the enterprise data model
Strategic imperative the enterprise data modelStrategic imperative the enterprise data model
Strategic imperative the enterprise data model
DATAVERSITY
 
The Changing Role of a DBA in an Autonomous World
The Changing Role of a DBA in an Autonomous WorldThe Changing Role of a DBA in an Autonomous World
The Changing Role of a DBA in an Autonomous World
Maria Colgan
 
DAS Slides: Enterprise Architecture vs. Data Architecture
DAS Slides: Enterprise Architecture vs. Data ArchitectureDAS Slides: Enterprise Architecture vs. Data Architecture
DAS Slides: Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Daniel Zivkovic
 
Data Architecture vs Data Modeling
Data Architecture vs Data ModelingData Architecture vs Data Modeling
Data Architecture vs Data Modeling
DATAVERSITY
 
Going the DITA way
Going the DITA wayGoing the DITA way
Going the DITA way
Abigail Samuel
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
Inside Analysis
 
How to reduce complexity by segregating your data with Virtual Workspaces
How to reduce complexity by segregating your data with Virtual WorkspacesHow to reduce complexity by segregating your data with Virtual Workspaces
How to reduce complexity by segregating your data with Virtual Workspaces
LeanIX GmbH
 
SPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDSSPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDS
Nicolas Georgeault
 
Achieve More with Less Resources | IDERA
Achieve More with Less Resources | IDERAAchieve More with Less Resources | IDERA
Achieve More with Less Resources | IDERA
IDERA Software
 
Benchmarking Hadoop - Which hadoop sql engine leads the herd
Benchmarking Hadoop - Which hadoop sql engine leads the herdBenchmarking Hadoop - Which hadoop sql engine leads the herd
Benchmarking Hadoop - Which hadoop sql engine leads the herd
Gord Sissons
 
(Oracle) DBA and Other Skills Needed in 2020
(Oracle) DBA and Other Skills Needed in 2020(Oracle) DBA and Other Skills Needed in 2020
(Oracle) DBA and Other Skills Needed in 2020
Markus Michalewicz
 
Machine Learning Everywhere
Machine Learning EverywhereMachine Learning Everywhere
Machine Learning Everywhere
DataWorks Summit
 
SDL Tridion Next Generation GraphQL Integration Accelerators
SDL Tridion Next Generation GraphQL Integration AcceleratorsSDL Tridion Next Generation GraphQL Integration Accelerators
SDL Tridion Next Generation GraphQL Integration Accelerators
Ben Middleton
 
Power BI Advance Modeling
Power BI Advance ModelingPower BI Advance Modeling
Power BI Advance Modeling
CCG
 
Power BI Advanced Data Modeling Virtual Workshop
Power BI Advanced Data Modeling Virtual WorkshopPower BI Advanced Data Modeling Virtual Workshop
Power BI Advanced Data Modeling Virtual Workshop
CCG
 
Spark working with a Cloud IDE: Notebook/Shiny Apps
Spark working with a Cloud IDE: Notebook/Shiny AppsSpark working with a Cloud IDE: Notebook/Shiny Apps
Spark working with a Cloud IDE: Notebook/Shiny Apps
Data Con LA
 
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Webinar: Ten Ways to Enhance Your Salesforce.com Application in 2013
Emtec Inc.
 
M01_Introduction+to+Performance+Tuning_ed1.pdf
M01_Introduction+to+Performance+Tuning_ed1.pdfM01_Introduction+to+Performance+Tuning_ed1.pdf
M01_Introduction+to+Performance+Tuning_ed1.pdf
ssuserc34fbf
 
Strategic imperative the enterprise data model
Strategic imperative the enterprise data modelStrategic imperative the enterprise data model
Strategic imperative the enterprise data model
DATAVERSITY
 
The Changing Role of a DBA in an Autonomous World
The Changing Role of a DBA in an Autonomous WorldThe Changing Role of a DBA in an Autonomous World
The Changing Role of a DBA in an Autonomous World
Maria Colgan
 
DAS Slides: Enterprise Architecture vs. Data Architecture
DAS Slides: Enterprise Architecture vs. Data ArchitectureDAS Slides: Enterprise Architecture vs. Data Architecture
DAS Slides: Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Canadian Experts Discuss Modern Data Stacks and Cloud Computing for 5 Years o...
Daniel Zivkovic
 
Data Architecture vs Data Modeling
Data Architecture vs Data ModelingData Architecture vs Data Modeling
Data Architecture vs Data Modeling
DATAVERSITY
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
Inside Analysis
 
How to reduce complexity by segregating your data with Virtual Workspaces
How to reduce complexity by segregating your data with Virtual WorkspacesHow to reduce complexity by segregating your data with Virtual Workspaces
How to reduce complexity by segregating your data with Virtual Workspaces
LeanIX GmbH
 
SPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDSSPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDS
Nicolas Georgeault
 
Achieve More with Less Resources | IDERA
Achieve More with Less Resources | IDERAAchieve More with Less Resources | IDERA
Achieve More with Less Resources | IDERA
IDERA Software
 
Benchmarking Hadoop - Which hadoop sql engine leads the herd
Benchmarking Hadoop - Which hadoop sql engine leads the herdBenchmarking Hadoop - Which hadoop sql engine leads the herd
Benchmarking Hadoop - Which hadoop sql engine leads the herd
Gord Sissons
 
Ad

More from Kent Graziano (14)

Balance agility and governance with #TrueDataOps and The Data Cloud
Balance agility and governance with #TrueDataOps and The Data CloudBalance agility and governance with #TrueDataOps and The Data Cloud
Balance agility and governance with #TrueDataOps and The Data Cloud
Kent Graziano
 
Rise of the Data Cloud
Rise of the Data CloudRise of the Data Cloud
Rise of the Data Cloud
Kent Graziano
 
Making Sense of Schema on Read
Making Sense of Schema on ReadMaking Sense of Schema on Read
Making Sense of Schema on Read
Kent Graziano
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Kent Graziano
 
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 DimensionsExtreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Kent Graziano
 
Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)
Kent Graziano
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
Kent Graziano
 
Worst Practices in Data Warehouse Design
Worst Practices in Data Warehouse DesignWorst Practices in Data Warehouse Design
Worst Practices in Data Warehouse Design
Kent Graziano
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Kent Graziano
 
Agile Methods and Data Warehousing
Agile Methods and Data WarehousingAgile Methods and Data Warehousing
Agile Methods and Data Warehousing
Kent Graziano
 
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data ModelingAgile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Kent Graziano
 
Top Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data ModelerTop Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data Modeler
Kent Graziano
 
Why Data Vault?
Why Data Vault? Why Data Vault?
Why Data Vault?
Kent Graziano
 
Introduction to Data Vault Modeling
Introduction to Data Vault ModelingIntroduction to Data Vault Modeling
Introduction to Data Vault Modeling
Kent Graziano
 
Balance agility and governance with #TrueDataOps and The Data Cloud
Balance agility and governance with #TrueDataOps and The Data CloudBalance agility and governance with #TrueDataOps and The Data Cloud
Balance agility and governance with #TrueDataOps and The Data Cloud
Kent Graziano
 
Rise of the Data Cloud
Rise of the Data CloudRise of the Data Cloud
Rise of the Data Cloud
Kent Graziano
 
Making Sense of Schema on Read
Making Sense of Schema on ReadMaking Sense of Schema on Read
Making Sense of Schema on Read
Kent Graziano
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Kent Graziano
 
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 DimensionsExtreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Kent Graziano
 
Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)
Kent Graziano
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
Kent Graziano
 
Worst Practices in Data Warehouse Design
Worst Practices in Data Warehouse DesignWorst Practices in Data Warehouse Design
Worst Practices in Data Warehouse Design
Kent Graziano
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Kent Graziano
 
Agile Methods and Data Warehousing
Agile Methods and Data WarehousingAgile Methods and Data Warehousing
Agile Methods and Data Warehousing
Kent Graziano
 
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data ModelingAgile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Kent Graziano
 
Top Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data ModelerTop Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data Modeler
Kent Graziano
 
Introduction to Data Vault Modeling
Introduction to Data Vault ModelingIntroduction to Data Vault Modeling
Introduction to Data Vault Modeling
Kent Graziano
 
Ad

Recently uploaded (20)

Simple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptxSimple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptx
ssuser2aa19f
 
03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia
Alexander Romero Arosquipa
 
Developing Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response ApplicationsDeveloping Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response Applications
VICTOR MAESTRE RAMIREZ
 
04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story
ccctableauusergroup
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
FPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptxFPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptx
ssuser4ef83d
 
Calories_Prediction_using_Linear_Regression.pptx
Calories_Prediction_using_Linear_Regression.pptxCalories_Prediction_using_Linear_Regression.pptx
Calories_Prediction_using_Linear_Regression.pptx
TijiLMAHESHWARI
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
Classification_in_Machinee_Learning.pptx
Classification_in_Machinee_Learning.pptxClassification_in_Machinee_Learning.pptx
Classification_in_Machinee_Learning.pptx
wencyjorda88
 
Data Analytics Overview and its applications
Data Analytics Overview and its applicationsData Analytics Overview and its applications
Data Analytics Overview and its applications
JanmejayaMishra7
 
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
ThanushsaranS
 
Defense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptxDefense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
How iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost FundsHow iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost Funds
ireneschmid345
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
Medical Dataset including visualizations
Medical Dataset including visualizationsMedical Dataset including visualizations
Medical Dataset including visualizations
vishrut8750588758
 
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
Simran112433
 
Simple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptxSimple_AI_Explanation_English somplr.pptx
Simple_AI_Explanation_English somplr.pptx
ssuser2aa19f
 
Developing Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response ApplicationsDeveloping Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response Applications
VICTOR MAESTRE RAMIREZ
 
04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story04302025_CCC TUG_DataVista: The Design Story
04302025_CCC TUG_DataVista: The Design Story
ccctableauusergroup
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
FPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptxFPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptx
ssuser4ef83d
 
Calories_Prediction_using_Linear_Regression.pptx
Calories_Prediction_using_Linear_Regression.pptxCalories_Prediction_using_Linear_Regression.pptx
Calories_Prediction_using_Linear_Regression.pptx
TijiLMAHESHWARI
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
Classification_in_Machinee_Learning.pptx
Classification_in_Machinee_Learning.pptxClassification_in_Machinee_Learning.pptx
Classification_in_Machinee_Learning.pptx
wencyjorda88
 
Data Analytics Overview and its applications
Data Analytics Overview and its applicationsData Analytics Overview and its applications
Data Analytics Overview and its applications
JanmejayaMishra7
 
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
CTS EXCEPTIONSPrediction of Aluminium wire rod physical properties through AI...
ThanushsaranS
 
Defense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptxDefense Against LLM Scheming 2025_04_28.pptx
Defense Against LLM Scheming 2025_04_28.pptx
Greg Makowski
 
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.pptJust-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
Just-In-Timeasdfffffffghhhhhhhhhhj Systems.ppt
ssuser5f8f49
 
Cleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdfCleaned_Lecture 6666666_Simulation_I.pdf
Cleaned_Lecture 6666666_Simulation_I.pdf
alcinialbob1234
 
Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...Thingyan is now a global treasure! See how people around the world are search...
Thingyan is now a global treasure! See how people around the world are search...
Pixellion
 
How iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost FundsHow iCode cybertech Helped Me Recover My Lost Funds
How iCode cybertech Helped Me Recover My Lost Funds
ireneschmid345
 
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
Molecular methods diagnostic and monitoring of infection  -  Repaired.pptxMolecular methods diagnostic and monitoring of infection  -  Repaired.pptx
Molecular methods diagnostic and monitoring of infection - Repaired.pptx
7tzn7x5kky
 
Medical Dataset including visualizations
Medical Dataset including visualizationsMedical Dataset including visualizations
Medical Dataset including visualizations
vishrut8750588758
 
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
1. Briefing Session_SEED with Hon. Governor Assam - 27.10.pdf
Simran112433
 

HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Kscope 19)

  • 1. © 2019 Snowflake Inc. All Rights Reserved HOW TO SAVE PILES OF $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME KENT GRAZIANO, CHIEF TECHNICAL EVANGELIST I @KentGraziano
  • 2. © 2019 Snowflake Inc. All Rights Reserved 2
  • 3. © 2019 Snowflake Inc. All Rights Reserved AGENDA 3 Bio Assumptions My Best Practices Standards Checklist #1 Checklist #2 Checklist #3 Benefits Considerations Is it Agile? Conclusion
  • 4. © 2019 Snowflake Inc. All Rights Reserved MY BIO 4 Chief Technical Evangelist, Snowflake Computing Oracle ACE Director (DW/BI) Blogger: The Data Warrior Certified Data Vault Master and DV 2.0 Practitioner Former Member: Boulder BI Brain Trust (#BBBT) Member: DAMA Houston & DAMA International Data Architecture and Data Warehouse Specialist • 30+ years in IT • 25+ years of Oracle-related work • 20+ years of data warehousing experience Author & Co-Author of a bunch of books (Amazon) Past-President of ODTUG and Rocky Mountain Oracle User Group
  • 5. © 2019 Snowflake Inc. All Rights Reserved 3 years in stealth, 3+ years GA 5 1200+ employees Over 2000 customers today Over $920M in venture funding from leading investors First customers 2014, general availability 2015 Founded 2012 by industry veterans with over 120 database patents Queries processed in Snowflake per day: 100 million Largest single table: 68 trillion rows Largest number of tables single DB: 200,000 Single customer most data: > 40PB Single customer most users: > 10,000 Fun facts:
  • 6. © 2019 Snowflake Inc. All Rights Reserved PRESENTATION BASED ON: 6 http://www.amazon.com/Check-Doing-Design-Reviews-ebook/dp/B008RG9L5E/
  • 7. © 2019 Snowflake Inc. All Rights Reserved Not a data modeling presentation • You already understand ERD’s and database design Not a project management presentation 7 ASSUMPTIONS
  • 8. © 2019 Snowflake Inc. All Rights Reserved Who are you? • Data Modeler or Architect • Project Managers • IT Managers • DBA • Developer Experience • Data Modeling – Less than 1yr? – 1-5yrs? – Over 5yrs? 8 SURVEY
  • 9. © 2019 Snowflake Inc. All Rights Reserved Design the Database BEFORE you start programming! • Based on the actual business requirements, needs, and terminology Have standards and FOLLOW them! Develop a check list so you don’t miss a step • Really – it is faster and more efficient The sooner you find a mistake the cheaper it is to fix! 9 MY BEST PRACTICES (NOSURPRISESHERE)
  • 10. © 2019 Snowflake Inc. All Rights Reserved Have them! • Insure consistency • Easier to review Pick a tool to model in • SQL Developer Data Modeler (SDDM) • Erwin (if you have $$) • Visio (really?) Generate code whenever possible! 10 STANDARDS
  • 11. © 2019 Snowflake Inc. All Rights Reserved Object naming • Entities • Attributes • Tables • Columns • Views • Constraints (PK, UK, FK) • Indexes Use a tool that helps support these! 11 STANDARDS
  • 12. © 2019 Snowflake Inc. All Rights Reserved SDDM NAMING TEMPLATES 12
  • 13. © 2019 Snowflake Inc. All Rights Reserved Diagrams • Shapes • Colors – Yellow = lookup – Red outline = shared • Crows feet & direction • Legend – Diagram Name – Author – Last Updated 13 STANDARDS
  • 14. © 2019 Snowflake Inc. All Rights Reserved DIAGRAM LEGEND – SDDM 14
  • 15. © 2019 Snowflake Inc. All Rights Reserved CHECKLIST #1
  • 16. © 2019 Snowflake Inc. All Rights Reserved BASIC DESIGN PROCESS – STEPS 16 Develop the logical data model (i.e., entity relationship diagram) • If doing data vault or data mart design, skip to build of physical model Subject the model to one (or more) peer reviews Make changes from the reviews Get team “sign off” on the model Convert the model to a physical database design Subject the physical design to peer review
  • 17. © 2019 Snowflake Inc. All Rights Reserved BASIC DESIGN PROCESS – CONT’D 17 Make changes Get team sign off Generate the DDL to build the model Review the DDL Make changes Execute the changed DDL Hand off for development of ETL programs
  • 18. © 2019 Snowflake Inc. All Rights Reserved CHECKLIST #2
  • 19. © 2019 Snowflake Inc. All Rights Reserved DEVELOP THE LOGICAL MODEL (DETAILS) 19 Gather and review requirements Review current naming standards and domains Review existing models (if any) Develop new data model objects Modify existing objects if required/indicated Review logical model check list
  • 20. © 2019 Snowflake Inc. All Rights Reserved DEVELOP THE LOGICAL MODEL 20 Schedule logical model design review session Prepare for review session • Print the ERD (or ERD Subview) • Print entity and attribute detail reports – Assuming your modeling tool can produce some – Otherwise, print whatever meta-data you have – Email to the reviewers • BETTER: Save the diagrams and reports to a shared knowledge management site like SharePoint or Confluence • BEST: Use SDDM with SVN and share
  • 21. © 2019 Snowflake Inc. All Rights Reserved DEVELOP THE LOGICAL MODEL 21 Conduct the review session Make changes resulting from review session. Return to the develop/modify objects steps & repeat When approved, proceed to the development of physical model
  • 22. © 2019 Snowflake Inc. All Rights Reserved REVIEW SESSION CHECKLIST (#2A)
  • 23. © 2019 Snowflake Inc. All Rights Reserved LOGICAL MODEL REVIEW CHECKLIST 23 If you have Oracle SDDM • Have the standard Design Rules been run to validate the model completeness? – If no – then do it! If not using SDDM • Get a sanity check from another team member before scheduling review • Instead of pair-programming (from XP) do pair data modeling Saves embarrassment from obvious mistakes
  • 24. © 2019 Snowflake Inc. All Rights Reserved SDDM DESIGN RULES 24
  • 25. © 2019 Snowflake Inc. All Rights Reserved LOGICAL MODEL REVIEW CHECKLIST 25 Entity Review Questions • Is each entity in the model in 3rd Normal Form? • Are any entities similar in nature to or a sub-type of, an entity already in the corporate repository? – If so, why was that entity not used? • Are there detailed definitions for every entity and sub-entity? – Review definitions. – For data warehouse models, is the source of the data noted? • Do all entities conform to existing naming standards and abbreviations? – Are any new standards or abbreviations indicated?
  • 26. © 2019 Snowflake Inc. All Rights Reserved LOGICAL MODEL REVIEW CHECKLIST 26 Relationship Review • Is the cardinality correct? • Is the optionality correct? • Do the relationship names make sense to a business analyst or end user? • Do all arc’ed relationships validate?
  • 27. © 2019 Snowflake Inc. All Rights Reserved LOGICAL MODEL REVIEW CHECKLIST 27 Attribute review • Do all attributes conform to existing naming standards and abbreviations? • Are any new standards or abbreviations indicated? • Have domains been used appropriately? • Are there any new domains that can be identified? • Is there an “attribute comment” for every attribute?
  • 28. © 2019 Snowflake Inc. All Rights Reserved LOGICAL MODEL REVIEW CHECKLIST 28 Attribute review • Optional - Is there a note for every attribute? – For data warehouse models, is the source of the attribute noted? • Are any of the attributes derived? – If so, why are they included in the logical model, which should be 3NF? • Are all the appropriate audit columns included (created by, creation date, etc.)? – If not add them!
  • 29. © 2019 Snowflake Inc. All Rights Reserved AUTOMATICALLY ADD AUDIT COLUMNS IN SDDM 29
  • 30. © 2019 Snowflake Inc. All Rights Reserved LOGICAL MODEL REVIEW CHECKLIST 30 Unique Identifiers • Does every entity have at least one unique identifier (UID)? – Does this UID distinguish each instance as being unique? • Verify with examples. – Is this UID a true business key as opposed to a system generated sequence? – If it is not a business key, why not? • If system generated, then it should be noted as an exception. – Is the business key updatable? • Are there any alternate unique identifiers? – Verify with examples.
  • 31. © 2019 Snowflake Inc. All Rights Reserved LOGICAL MODEL REVIEW CHECKLIST 31 Other • Does the model answer all the known requirements? – Assumes you have more than air-specs – Can it answer all the known questions from the users? • Has the model been compared to other models for quality and completeness? – For example, “Universal Data Models” from The Data Model Resource Book by Silverston, et al.
  • 32. © 2019 Snowflake Inc. All Rights Reserved SDDM REPORTS 32
  • 33. © 2019 Snowflake Inc. All Rights Reserved LOGICAL MODEL REVIEW 33 This process seems intensive • An ounce of prevention… Requires modelers to be thorough Requires reviewers to actually read all the meta-data to verify: • The descriptions are understandable to a business user • The text is grammatically correct with proper punctuation – It may appear in reports or as meta-data in a business intelligence tool
  • 34. © 2019 Snowflake Inc. All Rights Reserved THANKS FOR PLAYING! 34 Pass or Fail? • Reviewers decide Pass – go on to physical model Fail – revise the model and try again • The “crumple” factor Conditional Pass • Clean up some meta data and domains, then proceed w/o further review
  • 35. © 2019 Snowflake Inc. All Rights Reserved CHECKLIST #3
  • 36. © 2019 Snowflake Inc. All Rights Reserved DEVELOP THE PHYSICAL MODEL - STEPS 36 Review logical model with data modeler • If the database designer is a separate person on the team Review specific implementation requirements Forward Engineer logical model to a Relational model in SDDM • Diagram automatically created
  • 37. © 2019 Snowflake Inc. All Rights Reserved SDDM ENGINEER TO RELATIONAL 37
  • 38. © 2019 Snowflake Inc. All Rights Reserved DEVELOP THE PHYSICAL MODEL 38 Make manual modifications to tables and columns • Try to AVOID this if possible • As required by tuning and implementation requirements • To comply with standards • Add column and table check constraints Coordinate with the data modeler to account for apparent missing data elements
  • 39. © 2019 Snowflake Inc. All Rights Reserved DEVELOP THE PHYSICAL MODEL 39 Review physical model design check list Schedule physical design review session Prepare for review session • Print the schema diagram and table and column detail reports – Email to the reviewers • BETTER: Save the diagrams and reports to a shared knowledge management site like SharePoint or Confluence • BEST: Use SDDM with SVN and share Conduct the review session
  • 40. © 2019 Snowflake Inc. All Rights Reserved DEVELOP THE PHYSICAL MODEL 40 Make changes resulting from review session • Return to modify the schema diagram When approved, proceed to generate and deploy DDL to the development database • Hand off to development DBA if there is one Hand-off to programmers for application or ETL development
  • 41. © 2019 Snowflake Inc. All Rights Reserved REVIEW SESSION CHECKLIST (#3A)
  • 42. © 2019 Snowflake Inc. All Rights Reserved PHYSICAL MODEL REVIEW CHECK LIST 42 Almost identical to Logical Model list General • If you have Oracle SDDM – Have the standard Design Rules been run to validate the model completeness? • If no – then do it! • Otherwise get a sanity check from another team member before scheduling review • Is this an OLTP system or Data Warehouse/Data Mart? – If the model is a Data Mart, is it in a Star Schema format? • If not, why not?
  • 43. © 2019 Snowflake Inc. All Rights Reserved PHYSICAL MODEL REVIEW CHECK LIST 43 Table Review • Are any tables similar in nature to a table already in the corporate repository? – If so, can the existing definition be used with modifications? • Is there a table level comment for every table? • Are there detailed definitions for every table? • Are help or reference tables needed to support implementation? – i.e., Does the normalization look correct?
  • 44. © 2019 Snowflake Inc. All Rights Reserved PHYSICAL MODEL REVIEW CHECK LIST 44 Table Review • Do any of the tables require journaling (i.e., audit trail for changes)? – If so will a table API be needed to support it? – Note: See Table API generator in SQL Developer • If a table is denormalized, validate the reasoning. – Are there triggers to keep the denormalization in sync? • Do all tables conform to existing naming standards and abbreviations? – Are any new standards or abbreviations indicated?
  • 45. © 2019 Snowflake Inc. All Rights Reserved PHYSICAL MODEL REVIEW CHECK LIST 45 Column Review • Are appropriate audit columns defined? • Are there column comments for every column? • If the column is derived from other columns, have the derivation rules and formulas been recorded? • If a column has been denormalized from another table, has the denormalization information been recorded? • NEW: Are there security considerations such as PII or PHI?
  • 46. © 2019 Snowflake Inc. All Rights Reserved COLUMN DERIVATIONS IN SDDM (I.E., VIRTUAL) 46 Virtual Column Derivation specifications
  • 47. © 2019 Snowflake Inc. All Rights Reserved SDDM DIAGRAM SHOWS DERIVATIONS 47
  • 48. © 2019 Snowflake Inc. All Rights Reserved SDDM COLUMN SECURITY SETTINGS 48
  • 49. © 2019 Snowflake Inc. All Rights Reserved SDDM COLUMN SECURITY DISPLAY (SENSITIVE) 49
  • 50. © 2019 Snowflake Inc. All Rights Reserved PHYSICAL MODEL REVIEW CHECK LIST 50 Column Review • Do all columns conform to existing naming standards and abbreviations? – Are any new standards or abbreviations indicated? • Have domains been used appropriately? – Are there any new domains that can be identified?
  • 51. © 2019 Snowflake Inc. All Rights Reserved PHYSICAL MODEL REVIEW CHECK LIST 51 Constraints/Referential Integrity • Do all tables have appropriate primary key constraints? • Should any secondary unique constraints be defined? • Are all the foreign keys correct and enabled? • Has cascade delete been enabled on any foreign key? – If so, why?
  • 52. © 2019 Snowflake Inc. All Rights Reserved PHYSICAL MODEL REVIEW CHECK LIST 52 Constraints/Referential Integrity • Should any of the tables have table level check constraints? – If so have they been defined correctly? – Should they be enabled in the database? • Are there any column level check constraints? – Should they be enabled in the database? • If any defined constraints will not be enabled, document the justification and get consensus.
  • 53. © 2019 Snowflake Inc. All Rights Reserved PHYSICAL MODEL REVIEW CHECK LIST 53 Other • If a corporate data model is available, have all tables and columns been mapped to entities and attributes in that model? • If this is a warehouse model, has the source data been identified? – Is there a model for the source systems? – Have the target columns been mapped to a known source? • Does the model answer all the known requirements?
  • 54. © 2019 Snowflake Inc. All Rights Reserved BENEFITS & CONSIDERATIONS
  • 55. © 2019 Snowflake Inc. All Rights Reserved BENEFITS OF THE REVIEWS 55 #1 – Higher quality models & databases Standards enforcement • The modelers and designers can’t ignore the approved standards if the review team holds them accountable Issues are discovered sooner in the lifecycle • Saves $$$$ and results in a better product, faster
  • 56. © 2019 Snowflake Inc. All Rights Reserved BENEFITS OF THE REVIEWS 56 Knowledge transfer • Everyone in the review learns the model and the business requirements. Cross Training • Jr. modelers and programmers get exposure to best practices and examples of both good and bad data models • This experience is often more effective than a data modeling class A team of professionals that treat their work as business and not personal, while deriving great personal satisfaction from a job well done
  • 57. © 2019 Snowflake Inc. All Rights Reserved New people may be intimidated by the process • Outsider to the team • Feel like being picked on • Must accept the process as valid • Mention the process in interview Strive to hold people accountable Must have team buy-in to the process 57 CAUTIONS TO CONSIDER
  • 58. © 2019 Snowflake Inc. All Rights Reserved In-experienced data modelers may be intimidated • Be clear up front that the critique is being done to: – Improve the quality of the models – Improve their skill and knowledge – Should not be taken as a personal attack on their abilities 58 CAUTIONS TO CONSIDER
  • 59. © 2019 Snowflake Inc. All Rights Reserved Keep the units of work small • A few entities/tables • Requires scope control • Think SCRUM! • Review sessions less than one hour Takes time but reduces overall effort Reviewers must be readily available on short notice • Get at least 2 people if urgent 59 IS IT AGILE?
  • 60. © 2019 Snowflake Inc. All Rights Reserved CONCLUSION 60 Improving the quality of a model will save lots of $$$ over the course of a project • And in the future! Improving the quality of models can be done • Requires discipline & structure Suggested approaches • Set standards • Define a process • Develop a check list • Stick to it!
  • 61. © 2019 Snowflake Inc. All Rights Reserved Kent Graziano Snowflake Computing [email protected] On Twitter @KentGraziano More info at http://snowflake.com Visit my blog at http://kentgraziano.com
  • 62. © 2019 Snowflake Inc. All Rights Reserved QUESTIONS?
  • 63. © 2019 Snowflake Inc. All Rights Reserved

Editor's Notes

  • #4: Original material for this presentation is protected by copyrights from Data Warrior LLC and Kent Graziano. Snowflake Computing is herby granted a non-exclusive, revocable right to reuse this material without attribution on the individual slides so long as this notice remains in the note page for the agenda slide.
  • #6: Snowflake was founded in 2012 by a team of database experts and industry veterans from companies including Oracle, Teradata, Actian, and Cloudera. We have significant backing from established investors who have a track record of success, ensuring that we have the resources to continue to invest in our product and growth. We’ve been adopted by a rapidly growing number of customers since customers first started using our product in 2014, and over 1800 have signed up for Snowflake to date.