Snowflake Data Sharing For Dummies Guide
Snowflake Data Sharing For Dummies Guide
by Lawrence C. Miller
and David Baum
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Data Sharing For Dummies®, 2nd Snowflake Special Edition
Published by
John Wiley & Sons, Inc.
111 River St.
Hoboken, NJ 07030-5774
www.wiley.com
Copyright © 2020 by John Wiley & Sons, Inc., Hoboken, New Jersey
No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any
means, electronic, mechanical, photocopying, recording, scanning or otherwise, except as permitted under Sections
107 or 108 of the 1976 United States Copyright Act, without the prior written permission of the Publisher. Requests to
the Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River
Street, Hoboken, NJ 07030, (201) 748-6011, fax (201) 748-6008, or online at http://www.wiley.com/go/permissions.
Trademarks: Wiley, For Dummies, the Dummies Man logo, Dummies.com, and related trade dress are trademarks
or registered trademarks of John Wiley & Sons, Inc. and/or its affiliates in the United States and other countries,
and may not be used without written permission. Snowflake and the Snowflake logo are trademarks or registered
trademarks of Snowflake Inc. All other trademarks are the property of their respective owners. John Wiley & Sons,
Inc., is not associated with any product or vendor mentioned in this book.
LIMIT OF LIABILITY/DISCLAIMER OF WARRANTY: THE PUBLISHER AND THE AUTHOR MAKE NO REPRESENTATIONS
OR WARRANTIES WITH RESPECT TO THE ACCURACY OR COMPLETENESS OF THE CONTENTS OF THIS WORK AND
SPECIFICALLY DISCLAIM ALL WARRANTIES, INCLUDING WITHOUT LIMITATION WARRANTIES OF FITNESS FOR A
PARTICULAR PURPOSE. NO WARRANTY MAY BE CREATED OR EXTENDED BY SALES OR PROMOTIONAL MATERIALS. THE
ADVICE AND STRATEGIES CONTAINED HEREIN MAY NOT BE SUITABLE FOR EVERY SITUATION. THIS WORK IS SOLD
WITH THE UNDERSTANDING THAT THE PUBLISHER IS NOT ENGAGED IN RENDERING LEGAL, ACCOUNTING, OR
OTHER PROFESSIONAL SERVICES. IF PROFESSIONAL ASSISTANCE IS REQUIRED, THE SERVICES OF A COMPETENT
PROFESSIONAL PERSON SHOULD BE SOUGHT. NEITHER THE PUBLISHER NOR THE AUTHOR SHALL BE LIABLE FOR
DAMAGES ARISING HEREFROM. THE FACT THAT AN ORGANIZATION OR WEBSITE IS REFERRED TO IN THIS WORK AS
A CITATION AND/OR A POTENTIAL SOURCE OF FURTHER INFORMATION DOES NOT MEAN THAT THE AUTHOR OR THE
PUBLISHER ENDORSES THE INFORMATION THE ORGANIZATION OR WEBSITE MAY PROVIDE OR RECOMMENDATIONS
IT MAY MAKE. FURTHER, READERS SHOULD BE AWARE THAT INTERNET WEBSITES LISTED IN THIS WORK MAY HAVE
CHANGED OR DISAPPEARED BETWEEN WHEN THIS WORK WAS WRITTEN AND WHEN IT IS READ.
For general information on our other products and services, or how to create a custom For Dummies book for your
business or organization, please contact our Business Development Department in the U.S. at 877-409-4177,
contact [email protected], or visit www.wiley.com/go/custompub. For information about licensing the
For Dummies brand for products or services, contact BrandedRights&[email protected].
10 9 8 7 6 5 4 3 2 1
Publisher’s Acknowledgments
We’re proud of this book and of the people who worked on it. For details on how to
create a custom For Dummies book for your business or organization, contact info@
dummies.biz or visit www.wiley.com/go/custompub. For details on licensing the
For Dummies brand for products or services, contact BrandedRights&Licenses@
Wiley.com.
Some of the people who helped bring this book to market include the following:
Development Editor: Nicole Sholly Production Editor: Siddique Shaik
Project Editor: Martin V. Minner Snowflake Contributors Team:
Vincent Morello, Daniel Kuperman,
Executive Editor: Steve Hayes
Leslie Steere, Michael Nixon,
Editorial Manager: Rev Mengle Matthew Glickman
Business Development Representative:
Karen Hattan
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Table of Contents
INTRODUCTION................................................................................................ 1
About This Book.................................................................................... 1
Icons Used in This Book........................................................................ 2
Beyond the Book................................................................................... 2
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
CHAPTER 4: Enabling Live Data Sharing................................................. 23
Exploring Modern Data Sharing........................................................ 23
Winning with modern data sharing............................................. 24
Making data sharing easy............................................................. 25
Using Modern Cloud Data Sharing.................................................... 29
Learning How It All Works.................................................................. 29
Controlling Access to Shared Data by Using Secure Views............ 30
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Introduction
O
rganizations inside an enterprise acquire crucial insight
by analyzing data they share with each other. For exam-
ple, finance teams need sales data to forecast future
financial performance. Product management teams require mar-
keting data to determine future products and services. Executive
management needs up-to-the minute dashboards, fueled by data
from many parts of the enterprise, to make timely, data-driven
business decisions.
Introduction 1
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
sharing methods. Modern data sharing allows an organization to
easily and quickly forge one-to-one, one-to-many, and many-
to-many relationships to share data in new and imaginative ways
and reduce time to insight to a level never before possible.
This icon points out information you should commit to your non-
volatile memory — your gray matter.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Defining data sharing
Chapter 1
Getting Up to Speed on
Data Sharing Basics
E
very day, organizations everywhere use data to track busi-
ness results, make decisions, engage customers, define and
create products, forecast trends, and more. Data is also a
resource used and consumed between organizations, internal and
external to one another, to collaborate on business plans, mutual
initiatives, or joint opportunities.
In this chapter, you learn about data sharing — what it is, why it
matters, how and why organizations share data, and what busi-
ness opportunities data sharing can create.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
What Is Data Sharing?
Data can originate from the many software applications an enter-
prise uses to run its business, from the constant activity of visi-
tors engaging a website, from an Internet of Things (IoT) device
attached to the refrigerator in your home, or from a sensor built
into something as sophisticated as the jet engine of an airliner.
There are potentially endless data-creating scenarios in the mod-
ern world. Market intelligence firm IDC estimates the world’s total
digital data created will increase to 180 zettabytes by 2025 (one
zettabyte is equal to about 1 trillion gigabytes). Unfortunately,
traditional data sharing methods require moving data, which is
riddled with problems. Going forward it will be impractical, if not
impossible, to share vast amounts of data in meaningful ways.
But this process is slow, cumbersome, and costly and only allows
for moving limited amounts of shared data. Figure 1-2 shows how
modern data sharing happens without moving data. Instead, a
data provider makes available live, read-only copies of data to its
data consumers via modern cloud data sharing. In essence, data
doesn’t have to move.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
FIGURE 1-1: Traditional data sharing requires duplicating and moving data
from a data provider to data consumers.
FIGURE 1-2: Modern cloud data sharing enables fast, live, secure, and
governed data sharing without moving data.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
PORTAL FOR JOB SEEKERS
IMPROVES 300 PERCENT
Snagajob’s mobile sourcing and hiring tools connect 75 million
registered hourly workers to Snagajob’s business subscribers, which
represent 300,000 employer locations.
Snagajob had a need to share its data from its data platform with an
external marketing analytics firm. The firm then used the data to reach
out to Snagajob’s business clients to execute targeted re-engagement
campaigns on behalf of Snagajob. This relationship allowed Snagajob
to avoid in-house labor costs associated with this marketing function.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Data Sharing Examples
Here are just a few of the new business opportunities modern
cloud data sharing makes possible:
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Online file sharing services: These are similar to FTP, but
sharing and downloading data files takes place via Internet file
transfer only.
»» Cloud storage: The provider stores data in the cloud and
provides the consumer with credentials for accessing it.
»» Application programming interfaces (APIs): An API is used
to initiate and manage the data transfer.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Looking at today’s data sharing methods
Chapter 2
Understanding
Traditional Data
Sharing Challenges
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
approaches (shown in Table 2-1) that a company would encounter,
for example, when sharing data with a third-party service pro-
vider or another external organization such as a business partner.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Data Sharing
Approach Pros Cons
(continued)
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
TABLE 2-1 (continued)
Data Sharing
Approach Pros Cons
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
shared between the parties via a separate process, and the data
consumer must decrypt the shared data.
»» Changing file formats and schema: It may be necessary to
change the file format multiple times if additional database
attributes must be shared. When table attributes change on
the data provider’s end, a corresponding change must also
occur on the data consumer’s end.
Usually, the delays and difficulty don’t end with just the data
transfer effort. For example:
(continued)
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
(continued)
analytics, and other sources. Data was often exported and c opied into
other platforms, manipulated, and further analyzed. The process pro-
duced multiple copies of the data living in many places, thus increasing
costs and complexity and producing inconsistent results.
ETL eliminated
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
The goal should be effortless sharing of limitless amounts of data
with internal and external organizations, including your business
partners, for collaboration and business planning. If your busi-
ness model is focused on monetizing your data, you’ll want the
same level of effortless sharing to distribute data to as many data
consumers as possible, with individualized, self-service access
and security as needed.
If you think cloud storage is the answer, think again. Sharing data
using a basic cloud storage service is inefficient. It won’t provide
the ability for you or your data consumers to query the data in a
high-performance manner or ensure data consistency. A Hadoop
computing platform is not the answer either because of its inher-
ent complexities and complications.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Marketing teams monitor and analyze customer data to
predict behavior and align demand generation programs.
»» Different subsidiaries of an organization share data with
each other to better align their go-to-market plans and gain
more understanding of the separate areas of the business.
Monetizing data
An example of monetizing shared data is a data service company
that gathers mobile phone location information and usage data
and then shares the information with advertising agencies and
marketing groups so they can execute highly targeted campaigns
to specific consumers.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Tracing the history of data sharing
in business
Chapter 3
Recognizing the Business
Value of Sharing Data
I
n this chapter, you learn how data sharing methods have
evolved in business, why data sharing is critical to any busi-
ness, how businesses share data internally and externally, and
how the cloud and software-as-a-service (SaaS) change the data
sharing model.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Each of these applications would also have an associated data-
base. These databases were not optimized for analytics and did
not share data between applications. In order to analyze this data,
each business unit in charge of a database would have to extract,
transform, and load (ETL) the data into its own data mart, which
is a smaller, stand-alone version of a data warehouse. Then, to
develop business intelligence across an organization or to exe-
cute analytics against companywide data, data would have to be
sent through the ETL process from the individual data marts into
a central data warehouse. This data would then be prepared for
analytics. The entire process was slow and cumbersome. Data for-
mats varied across the applications, requiring further modeling
and transformation into a new data warehouse. But at least you
had access to the data because it was in your own data center.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Within an organization, however, data is often locked in silos.
Mergers or acquisitions, firewall restrictions, or other business
or technology barriers often restrict an organization from easily
sharing data across its business units. These physical or logical
separations of infrastructure can prevent two or more business
units from accessing all available data within an organization to
deliver all-inclusive, data-driven insights. These data silos emerge
when an organization relies on a traditional, on-premises data
warehouse or a traditional data warehouse ported to the cloud.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
STREAMLINING INTERNAL
OPERATIONS WITH DATA SHARING
Rakuten Rewards is a global conglomerate that since 1997 has helped
shape the way people shop online, offering cash-back deals and shop-
ping rewards on the world’s largest selection of products and services.
Each company division or subsidiary has specific legal requirements
and permissions related to which data can be shared, creating a com-
plicated data sharing infrastructure within the company.
Not only was the internal process of sharing data cumbersome, but it
also prevented business units from accessing each other’s data sets
for making more-informed business decisions.
From there, the analytics company would analyze the data for the
retailer. It would then provide the analysis back to the retailer in
the form of an inbound data share, as shown in Figure 3-2.
FIGURE 3-2: The organization is the data consumer, accessing the data from
its outside data analytics vendor, which is the data provider.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
In other scenarios, the organization contracts a service provider
to perform a function the organization chooses not to perform in-
house. In turn, the service provider generates data as a result of
that service — data that belongs to the organization, which is the
service provider’s customer. With inbound data sharing between
organizations, the data generated by the service provider is shared
with its customer. The customer then executes additional analyt-
ics to develop deeper insights and value from additional data gen-
erated outside its data center but within its business ecosystem.
Monetizing data
Data can also take on more significance today than just day-
to-day collaboration. Data is a business asset — a currency. As
such, data can offer different types of value depending on the
organization that wants to consume that data. Thus, as with any
asset, data has value. To monetize the value of its data, a provider
can sell data to consumers that can then use the data to advance
their own business objectives (see Figure 3-3).
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Data consumers can use shared data without having to capture
and collect it themselves. They can benefit directly from analyz-
ing that data or combining it with other data to enhance its value.
When gathering data, and sharing it with game studios, PlayFab regu-
larly encountered challenges such as capturing the right data the first
time, getting data from client devices and moving it into the data pipe-
line, managing constantly changing data schemas driven by new
game data events, and reducing soaring costs due to moving and
transferring high volumes of game data. To solve these challenges,
PlayFab adopted a modern, cloud-built data platform.
With modern data sharing, PlayFab sets up secure, governed, and live
views of the data with each game studio under a straightforward,
self-service business model. This method avoids ETL entirely. Secure
and governed views guarantee each game studio’s data is truly iso-
lated and game designers get direct access to a direct feed of live
game data, without any custom import required.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Exploring a modern data sharing
architecture
Chapter 4
Enabling Live Data
Sharing
I
n this chapter, you learn how a modern, built-for-the-cloud
data platform architecture helps data providers and data con-
sumers overcome traditional data sharing challenges. You also
learn how to use real-time data sharing from inside a modern
cloud data platform environment to quickly and easily enable
secure and governed views of live data for your data consumers.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
and governance, among other things, means traditional data
warehouses and data lake architectures cannot support unlimited
concurrent access by data consumers or real-time data changes
by data providers without cumbersome unloading and transfer-
ring of data, as shown in Figure 4-1. This puts data consumers at
risk of operating on stale (static) data.
FIGURE 4-2: Providers can improve business with modern cloud data sharing.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
With modern data sharing, ready-to-use data is immediately
available in real time. In a modern cloud data platform architec-
ture, query speeds on shared data are exponentially faster and
fortified with limitless storage and compute resources. Modern
data sharing extends the architecture and functionality of the
modern cloud data platform to share data. Enterprises can grant
read-only access to their live, ready-to-use data (structured and
semi-structured) in a secure and governed environment. Data
consumers can then choose to combine (JOIN) data from other
organizations to augment and deepen their data analytics.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Enables data sharing with unlimited data providers and
consumers: A modern cloud data platform can serve an
unlimited number of data providers and consumers, with full
transactional integrity and data consistency.
Modern cloud data sharing eliminates the delays, cost, and fric-
tion of existing methods, which provide only primitive mecha-
nisms for data publishing, access, and control. Modern data
sharing is built on three key architectural innovations, which we
discuss in the next few sections.
FIGURE 4-3: A modern data sharing architecture built for the cloud with
storage, compute, and services completely separate but logically integrated.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
move), and the data provider doesn’t pay for any of the compute
resources that a data consumer uses to analyze shared data.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
If even a single operation fails, the entire transaction is rolled
back and the database state is left unchanged.
»» Consistency: The completion of any transaction brings the
database from one valid state to another valid state.
»» Isolation: Concurrent transactions do not contend for
access to the data and are run as if each transaction
executed sequentially.
»» Durability: After a transaction is committed, it remains
committed.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Using Modern Cloud Data Sharing
Modern cloud data sharing allows access to database tables and
views for any user of a modern cloud data platform. When a data
provider shares data with a data consumer, the database object and
view are all from within the data provider’s environment.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
adding the data sharing consumer(s). The following example
grants privileges for the sales_db database, the aggregates_
eula schema, and the aggregate_1 table to the data-share
object:
grant usage on database sales_db to
share sales_s;
grant usage on schema sales_
db.aggregates_eula to share sales_s;
grant select on table sales_
db.aggregates_eula.aggregate_1 to
share sales_s;
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
FIGURE 4-6: Secure views with modern data sharing allow data providers to
protect access to sensitive data.
When you need to plan new inventory with your distributor, you
want to provide access to the unit sales data, but not the cus-
tomer data, which is sensitive. Therefore, a secure view named
distributor_sales_data is created from the unitsales table,
just for the distributor.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
This logic creates a secure view named, but without the
sensitive customerID data. The data included in the view are
sku, date, and qty.
2. In the sales_s share container, add privileges for the
secure view:
grant usage on database sales_db to
share sales_s;
grant usage on schema sales_db.public
to share sales_s;
grant select on view sales_db.public.distributor_
sales_data to
share sales_s;
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Collaborating on truly shared data
Chapter 5
Assessing the Impact of
Modern Data Sharing
I
n this chapter, you learn how modern data sharing enables
real-time collaboration. You also learn which technologies and
trends have enabled a modern data sharing architecture, as well
as how modern data sharing can enable organizations to quickly
create new business assets from data.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Critical business decisions are made based on outdated,
incomplete, or inaccurate data.
»» Electronic discovery costs escalate when multiple sources of
data within and outside an organization must be identified,
searched, and produced for litigation support.
»» The potential number of data breaches and accidental data
loss/disclosure risks multiply, along with their associated
costs, such as breach notifications, credit monitoring
services, damage to an organization’s brand, customer
churn, litigation, forensic analysis, and recovery.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
computing resources easily available at massive scale to any-
sized organization.
But collecting and analyzing all this data from multiple channels
is exactly what successful organizations must do. Outcomes from
insightful analytics can help them better target new products
and services. So, it’s understandable those that want to provide
data as a service, or as a value-added business asset, are just as
interested in delivering access to data quickly and easily, so other,
non-competing organizations can benefit.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
REDUCING COSTS AND
STREAMLINING OPERATIONS
Heap Analytics provides web and mobile analytics for companies
across a number of industries. To share raw data with customers, the
company would drop files into Amazon S3 buckets, or host files with a
cloud-based vendor for customers who weren’t sophisticated enough
to handle all the extract, transfer, load (ETL) activities required to then
analyze the data. Heap’s costs and efforts included:
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
regardless of its size, into a business asset by charging for
access to slices of its data repository. This low-cost, zero-
headache solution enables data companies to immediately
meet data consumers’ urgent demands for fresh data with
up-to-the-minute accuracy, which maintains the highest
value for data. Organizations have greater power and
platform capabilities to support, move forward, and
implement data monetization strategies.
»» Data sharing with business partners: Sharing data directly
with business partners is not new. But the effortless sharing
of live data is groundbreaking. Modern cloud data sharing
enables the following:
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
THOUSANDS OF DATA
CONSUMERS — ONE
SOURCE OF TRUTH
PlaceIQ aggregates, collects, and anonymizes data from thousands of
applications on mobile devices. It then makes the data available for
companies that want to target and reach out to mobile consumers
based on their location and behavior. PlaceIQ customers may include
marketing companies, advertising agencies, and product producers,
to name a few.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Creating data exchanges to extend data
to third parties
Chapter 6
Using Exchanges
to Share Data and
Monetize Insights
E
xchanging data within a controlled ecosystem can yield rich
analytics and deep insights, leading to more-informed
decision-making. In this chapter, you learn how your orga-
nization can participate in a data exchange to leverage third-party
data and create new data assets.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Thousands of online marketplaces today link buyers and sellers.
Typically, the data vendor handles data transformation, prepara-
tion, and loading, while the marketplace is in charge of discovery,
collaboration, licensing, and auditing. It’s a huge opportunity: In
its September 2018 report, “Value of Data: The Dawn of the Data
Marketplace,” Accenture estimates the data-as-a-service (DaaS)
market is poised to reach $10.4 billion by 2021.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Moving into the Modern Age with
Real-Time Data Sharing
Most companies derive value from a data exchange fairly quickly
as they leverage the deeper insights from additional data and data
services. Those data consumers acquire data from third-party pro-
viders in several ways, including via an API, using a self-service
interface for discovery and analysis, buying raw data via download
or file transfer, and using an app that reveals trends and insights.
After that, two common questions arise:
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Establishing the Right Architecture
for Exchanging Data
Developing and maintaining APIs levies a heavy toll on data
providers and data consumers. The providers must configure and
manage the APIs, while consumers have to establish custom links
to all marketplace vendors with which they want to do business.
Instead, the ideal data exchange provides the following mecha-
nisms for both providers and consumers:
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
CREATING THE IDEAL DATA
MARKETPLACE
Weather data is an important and vital asset for business intelligence
across industries. In the U.S. alone, weather accounts for more than
$600 billion a year in lost revenue. Weather strategies help companies
forecast sales, mitigate risk, adjust transportation routes, and confi-
dently make decisions.
With the exchange, Weather Source is now able to reduce costs and
effort involved in publishing its data sets. Data transformation, load-
ing, and reconstruction are no longer required. New objects become
immediately available to all consumers, providing real-time data
access across the entire ecosystem. This more efficient operation
offers customers better access to Weather Source data in a SQL-
friendly format that’s always up to date.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
GETTING HELP FROM YOUR DATA
EXCHANGE VENDOR
A modern data exchange should provide many unique benefits,
including:
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Discovering data monetization
opportunities
Chapter 7
Monetizing Your Data
D
ata monetization is a process by which a data provider
charges data consumers a fee to gain access to the provid-
er’s data or data services, so a data consumer can enrich its
existing data sets to benefit its business and its customers. Some
providers offer access to the data itself while others offer services,
such as data modeling, data enrichment, and data analytics.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Examining Industry Opportunities
Organizations can monetize data in countless ways, as shown in
Figure 7-1. For example, a telecommunications company can sell
location data to help retailers target consumers with ads. Con-
sumer packaged goods companies can share purchasing data
with online advertisers — or directly with customers. A logistics
company might sell data about transportation patterns and ship-
ping activity as an indication of economic trends. Transportation
agencies can collect data from tollbooths and bridges to optimize
navigation apps and minimize traffic congestion. From tracking
engine performance to monitoring human biometrics, data shar-
ing reveals previously unobtainable insights so organizations can
spot trends, make predictions, and take corrective action.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Getting in Front of Data
Monetization Trends
Modern data sharing is growing every year as the volume of data
that organizations collect expands exponentially. A Forrester
Research survey of global data and analytics decision makers
revealed that more than 75 percent of respondents are expanding
their use of external data and nearly 50 percent of these compa-
nies already commercialize their data.
Start by identifying the area of value for your customers. This will
help you determine what type of data and access to give them, and
how you might extend your current business practices to maxi-
mize future data sharing opportunities.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Observing Good Data Governance
For all types of data sharing scenarios, you must observe all
government and industry regulations regarding the security
and privacy of consumer data, as shown in Figure 7-2. Regard-
less what data you plan to share, setting conditions and limita-
tions is important. Appoint a data governance champion to pay
attention to these legislative mandates; set internal policies and
procedures; respond to customer requests; and control access to
data using secure views, secure functions, and secure joins — as
described in the next chapter.
FIGURE 7-2: Make sure your data sharing practice complies with all pertinent
regulations and doesn’t compromise customer privacy.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
»» Through a data broker: The advantage is ease of adoption
because the broker supplies a built-in clientele. The disad-
vantages include lack of control over the data and the
inability to forge direct relationships with users.
»» Via a data exchange. As explained in Chapter 6, a data
exchange is a scalable platform that supports flexible,
bidirectional data sharing. This is a great way to build your
brand and establish new types of direct relationships with
customers. The only possible disadvantage involves adop-
tion: If you’re starting with an early-stage ecosystem, it may
take time for the service to deliver on its full potential. In this
case, look for a data exchange that will help you generate
brand awareness and scale as part of its value proposition,
versus a traditional data marketplace where your data sits
there waiting for someone to find it.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
DATA SHARING SERVICE OPENS
NEW REVENUE STREAMS
Maintaining product quality in the fast-moving consumer goods
(FMCG) industry requires rapid decisions related to inventory, ware-
house management, shipping, and delivery. By helping grocery suppli-
ers visualize trends in their data and share insights with retailers,
Atheon Analytics keeps goods moving to the right place at the right
time. Atheon’s SKUtrak service is the leading flow-of-goods tracker for
FMCG suppliers throughout the United Kingdom.
Fresh groceries are a byproduct of fresh data, and Atheon’s new data
sharing service, based on its cloud-built data platform allows the com-
pany to easily share data to SKUtrak customers, so they always see
the latest insights but without complex data-copy or data-moving pro-
cedures. “We provide data directly from retailers and share it with
suppliers and customers to make sure their products get to the
supermarkets where they need to be,” Atheon’s head of service deliv-
ery Rose Ahearne said. “The data platform helped with scalability and
created new product opportunities.”
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Complying with government and
industry regulations
Chapter 8
Governing Your Data
M
any organizations need to securely link their own data
with data generated by their partners, suppliers, cus-
tomers, and industry peers, but they have concerns about
protecting personally identifiable information (PII), protected
health information (PHI), competitive data, and other forms of
fine-grained data.
For the GDPR, as an example, there are three key areas to consider:
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
GDPR requires organizations to rein in the runaway complexity
that multiple data sources create and to establish a modern data
governance strategy on a global scale. Not only do most organiza-
tions keep their data in multiple locations (disparate data ware-
houses, data marts, servers, and clouds), but they also suffer from
hidden layers of data. This phrase describes the countless copies of
data that exist within organizations as a result of siloed teams and
departments sharing information.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
relatively straightforward. First, the retailer would need to create
a share, a database object that grants permissions and data access
to the data consumers.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
FIGURE 8-2: How to share the different subsets of data with multiple
consumers.
Data providers can each create secure views of their data and share
access to those views with other users of the SaaS cloud data plat-
form, even if these users are in other organizations. This is also
a useful method for organizations that want to grant database
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
access directly to multiple end customers. Secure views allow
these customers to see only their specific rows of data from each
table, but not the rows that contain data about other customers.
When should you consider using secure views? The following are
some guidelines:
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
Data providers can grant rights to multiple consumers to use their
secure shared UDFs, even if those users are in different accounts
or different organizations. Secure UDFs have become a great way
to guarantee data security and privacy, giving data providers and
consumers the following benefits:
Secure joins allow you to jointly analyze only the records that are
in both of the data sets, and to prevent each data consumer from
seeing records about non-overlapping patients. For more details
regarding secure joins, see “Secure Joins: How to Join Data Between
Companies While Respecting PII,” at www.snowflake.com/blog/
secure-joins-how-to-join-data-between-companies-while-
respecting-pii.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
IN THIS CHAPTER
»» Evaluating opportunities and defining
your organization’s role
Chapter 9
Six Steps to Advance
Your Business with
Modern Data Sharing
N
ow that you understand the enormous potential of modern
data sharing and the challenges of traditional data sharing
methods, it’s time to consider the possible impact and
benefits of modern data sharing. This chapter outlines six key
steps to help you and your organization get started with modern
data sharing to advance your business:
CHAPTER 9 Six Steps to Advance Your Business with Modern Data Sharing 57
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
stage for easier and faster execution of your data sharing
business plans. Here’s what to look for:
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
required to share data. Eliminating data movement puts
you on a better path for limitless data sharing.
CHAPTER 9 Six Steps to Advance Your Business with Modern Data Sharing 59
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
5. Create an outreach plan. Outline the steps necessary to
engage your data consumers. Because modern data sharing
can facilitate one-to-one, one-to-many, and many-to-many
data sharing relationships, you must communicate the value
of your data and the benefits data consumers can expect to
receive.
6. Execute — demonstrate time-to-market and time-
to-value improvements to your business stakeholders.
After completing your PoC, demonstrate the benefits of
modern data sharing to your stakeholders. Estimate time
savings and related cost savings for your organization as the
data provider and/or the data consumer. Demonstrate the
improvements in productivity your organization will gain and,
if applicable, forecast the revenue potential for monetizing
your data. You should be able to develop a complete picture
of the ROI potential for modern data sharing and a modern
cloud-built data platform. You will then be well on your way
to taking data sharing to new levels of capabilities and
opportunities for your organization.
These materials are © 2020 John Wiley & Sons, Inc. Any dissemination, distribution, or unauthorized use is strictly prohibited.
WILEY END USER LICENSE AGREEMENT
Go to www.wiley.com/go/eula to access Wiley’s ebook EULA.