We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
You are on page 1/ 17
Data Fabric Level 1 2022 Version
Quiz sack
You must receive a score of 75% or higher on the quiz to complete
the course.
Started on Wednesday, January 34, 2024, 6:44 AM
State Finished
Completed on Wednesday, January 31, 2024, 7:21 AM
Time taken 36 mins 44 secs
Feedback Congratul:
's, you passed the quiz!
Question
Correct
Points out of 1.00
Data architecture complexities add a time-consuming burden
on data consumers as they try to get what they need ready for
high-value analytics and AI. What percentage of a data
consumer's time is typically spent finding, understanding, and
preparing data before they can actually use it for analytics?
70
60
50
ovQuestion 2
Correct
Points eut of .09
Back
There are many different kinds of technologies used to store
data. Each of them serve different purposes and! types of
workloads. Which one of the following systems is dedicated to
serving high-performance analytics queries against cleansed
business data (usually for production business reports and
applications)?
Data virtualization
Transactional database
Data warehouse v
Data lake
Question 3
Incorrect
Points out of 3.00
You're meeting with a client who is interested in IBM's data
fabric vision, but has no IBM software and is using the Amazon
Web Services (AWS) public cloud platform. They are not
interested in leaving AWS, and do not want to disrupt their
existing analytics applications by moving from their currently
used database and integration software. Which one of the
following Cloud Pak for Data deployment options do you
recommend to this client?
Cloud Pak for Data running on a managed Openshift
service on AWS, and configured to integrate with the
databases and integration services on AWS
Cloud Pak for Data running natively as a collection of as a
service (aS) data management services on the AWS
cloud
Cloud Pak for Data as a Service on IBM Cloud, but
cataloging data in the AWS cloud
Cloud Pak for Data running on the AWS Kubernetes
Service, and configured to integrate with the AWS
databasesQuestion &
Correct
Points eut of .09
Back
Which one of the IBM's data fabric solution services provides
the capability for people of varying skill levels to build and
train AI and machine learning models as well as prepare and
analyze data?
Planning Analytics
Watson Studio v
Watson Knowledge Catalog
Watson Query
Question 5
Correct
Points cut of
Traditional database management systems and architectures
have been established fixtures in data centers for decades.
With the introduction of a data fabric, what role do these
traditional database management systems (like data
warehouses, data lakes, data hubs, and relational databases)
play in a data fabric architecture?
They become active participants in a unified approach v
to consuming data
Their data is moved into a single unified data fabric data
store
They are considered legacy systems and don't need new
investment
They become obsolete and are no longer neededQuestion 6
Correct
Points eut of .09
Back
‘The data lakehouse architecture is becoming increasingly
popular, with 451 Research estimating in 2021 that two thirds
of surveyed companies will have piloted a data lakehouse
architecture within the year. Which one of the following
answers represents the main characteristics of a data
lakehouse architecture?
Cloud-based deployment of Apache Hadoop 3.0 on
Openshift containers.
Managed cloud services that are integrated to form an
end-to-end DataOps solution that includes cloud object
storage, data preparation, and a high performance
relational database.
A data governance facility that's designed specifically to
support data lake deployments, where the focus is on
experimental data pipelines and machine learning
Projects.
Inexpensive, flexible storage layer (cloud object v
storage), with a good performance native SQL data
processing engine,Question 7
Correct
Points eut of .09
Back
You have a client who built a data lake to support exploratory
analytics and data science projects. This data lake is seen to
have yielded a poor return on investment, because it is
difficult to use for business users. The client feels their data
lake has become a "data swamp", Which one of the following.
IBM data fabric capabilities should you propose as a means to
help your client get some value out of their data lake
investment?
18M Cloud Object Storage can extend the data lake with
an inexpensive highly scalable storage layer that can
handle any kind of data.
Db2 as a Service can augment the data lake by being an
elastically scalable cloud-based complement to the on-
premises data lake.
Watson Knowledge Catalog can enable automated ¥
metadata capture of data in the lake to generate
augmented knowledge (which will support self-
service),
Data Replication can better manage the various replicas
(ie. copies) of the data that needs to be stored in the data
lake,Question &
Correct
Points eut of .09
Back
You have a retail customer with many siloed data systems,
where customer data is spread across these systems and
spans multiple formats, This makes even simple analysis (for
example, "How many unique customers do we have?") nearly
impossible. You propose the Customer 360 use case from
IBM's data fabric solution. Which one of the following benefits
are most applicable to this client opportunity?
Analyze all the customer data sets and build a set of data
science notebooks with automatically generated queries
for common questions
Pull metadata for all the customer data sets and generate
a DataStage job to generate a single customer table
From all the customer data sets, build a single master v
set of unique customer records
Generate a virtual view of the customer tables, while
leaving the actual data in their own databasesQuestion 9
Correct
Points eut of .09
Back
At one of your clients, a number of their business leaders
returned from a data analytics conference excited about the
"data mesh" concept. Having been burned by centralized IT
initiatives, these business leaders like the idea of data and
data products belonging to specific lines of business. What
18M Data and Al capability do you propose to them that would
help them build their data mesh architecture?
Netezza Performance Server (NPS): because it can be
used as a scalable data warehouse for the analytic
workloads from the different line of business teams. NPS
enables high performance analytic queries.
Big SQL: because it is a high performance query engine
for data lakes, which are frequently used by line of
business teams for self-service data analytics work. Big
‘SQL supports lower cost data analytics, which is practical
for self-service data exploration
Watson Knowledge Catalog (WKC): because itcanbe
used as the common catalog that is needed by a data
mesh architecture. WKC enables automated discovery
of distributed data assets and self-service shopping
for data,
Planning Analytics: because line of business teams need a
continous integrated planning solution that can integrate
with their own spreadsheets and consume data from
across their enterprise.Question 10
Correct
Points eut of .09
Back
AChief Data Officer (CDO) from one of your clients has been
discussing data fabric solutions with analysts and IBM
competitors. The CDO has observed how different vendors
seem to describe a data fabricin ways that reflect the
strengths of their offerings. To counter the competitive
messaging that the CDO has heard, how do you present IBM's
definition of the data fabric concept as implemented in Cloud
Pak for Data?
A data integration platform that delivers data to data
warehouses and data lakes on the cloud of your choice,
where it's cataloged and ready for users to explore.
A data lake framework that centralizes all of an
organization's data and provides self-service access to
this data through a variety of multi-disciplinary tools.
An integrated data pipelining and orchestration platform
that feeds data into a fully automated AI model creation
tool.
A set of integrated self-service tools to provide v
everyone in your organization with the ability to find,
explore, and ask questions against all available data,Question 11.
Correct
Points eut of .00
Back
For mast organizations, data is stored in many places and
formats. This makes a broad self-service data exploration
strategy highly challenging. One example of increased
complexity is the number of different cloud platforms where
organizations store data. Analysts from Flexera polled
different organizations and found that in most enterprises,
people expect to have data spread between two cloud
vendors. On average, on how many clouds is data actually
stored according to this survey?
Bev
34
72
56
‘Question 12
correct
Points eut of 1.09
Aproperly deployed data fabric solution involves many
different user personas. Which persona in an organization is
responsible for the definition, association, and enforcement of
data protection rules (and is also the beneficiary of the
automation of these tasks within Cloud Pak for Data)?
Database administrator
Data quality analyst
Data engineer
Data steward vQuestion 13
Correct
Points eut of .09
Back
‘There are many business benefits that clients can gain by
adopting a data fabric architecture. What is a major benefit of
a data fabric (for many organizations, this is the main reason
to adopt a data fabric), which is different from other data
management approaches they might have implemented in the
past?
The protection and security of confidential, personally
identifiable, and sensitive information
Increased performance for production applications that
consume enterprise data
Enabling self-service data consumption and v
collaboration
Reducing costs for data storage and processing
Question 14.
Incorrect
Points out of 2.00
Akey characteristic of a data fabric architecture is the ability
for users to have integrated experiences as their analytics
projects go through various stages; for instance: searching for
data, preparing the data, and data product deployment. What
is the term used to describe the metadata documenting the
various states a data set goes through on its way to being used
in a data product? (A data product can include a dashboard, a
machine learning model, or even simply a refined data table.)
Data dictionary
Data lineage
Data schema
Data classificationQuestion 15
Correct
Points eut of .09
Back
You are engaging with a client who has started a data science
practice and you've been reviewing the MLOps and
‘Trustworthy AI use case from IBM's data fabric solution with
them, They are particularly interested in building trust in their
machine learning models as they're being deployed. Which
one of the following capabilities should you promote with your
client?
Model monitoring for quality metrics, bias, prediction wv
explainability, and drift detection
‘Automate model building, deployment, and monitoring
with a trusted model pipeline
Machine learning tooling for users of a variety of skill
levels and technical backgrounds
Deep learning model development wizardsQuestion 16
Correct
Points eut of .09
Back
Aclient is interested in IBM's data fabric vision, but is,
reluctant to commit resources to an enterprise-wide data
governance effort. This client has hundreds of databases,
many with sensitive data fields. The client's feeling is that the
big effort needed to identify every field with sensitive data and
lock it down is not worth the reward of self-service access to
data, What capability from IBM's data governance and privacy
se case should you showcase to this customer?
As data is added to the catalog, it reads the access
control settings from the database management system,
and copies and applies them to the catalog service.
‘Automatic data classification identifies the class for v
each field of a data set that is added to the catalog;
once cataloged, data access rules can automatically
restrict access to data classes identified as being
sensitive,
Automatic identification of sensitive data, and then
encrypting each column deemed to be sensitive.
‘Automatic analysis and recognition of personally
identifiable (PII) information. Tables with PII data are
then automatically excluded from all self-service
exploration.Question 17
Correct
Points eut of .09
Back
Line of business teams quickly realize that getting access to
raw data isn't enough for them to start their analytics projects.
This raw data needs to be cleansed and prepared so it's in a
form where it can be used to answer questions. This process.
involves building data pipelines, which take raw data and
refines it into a business-ready state. What is the name of this
practice of delivering high-quality data to analytics teams?
DataOps (Data Operations) v
SQLOps (Structure Query Language Operations)
DevOps (Development Operations)
MLOps (Machine Learning Operations)
Question 18
Correct
Points out of 2.00
You have a client with multiple regional data lakes to support
local analytics work. They have a need for cross-region
analytics, so they built an expensive and complex replication
scheme to move data between these regional data lakes. A
data fabric solution can simplify how business users can
analyze data from multiple regions. Which one of IBM's data
fabric capabilities leaves data in-place and enables users to
get a single view of data, and query data from data lakes in
different regions?
Data matching
Data replication
Data masking
Data virtualization vQuestion 19
Correct
Points eut of .09
Back
You have a client operating in an unregulated industry who
isn't overly concerned about data protection and data privacy
issues, so they dismiss the need for a data governance
solution. What is a good reason you can present to your client
on why a data governance solution can deliver value?
Automatic classification of data assets and tracking of v
data preparation lineage metadata enables self-
service exploration
For data science and data visualization use cases, IBM's
data catalog has a bias detection utility for data sets
As data is added to the catalog, it's indexed optimally to
enable higher performance queries
You can use IBM's data catalog solution to build virtual,
views, pulling in data stored on different database servers
Question 20
Correct
Pints out of 2.09
‘There has been a steady evolution in the best practices that
organizations use to manage and analyze their data. Why is
there a need for a data fabric in addition to data lakes, data
warehouses, and databases?
A data fabric efficiently structures data to answer specific
business questions
A data fabric creates more data silos
A data fabrie eliminates the need for other technologies
A data fabric connects the right people tothe right. v-
data, atthe right timeQuestion 21.
Correct
Points eut of .09
Back
Ina client call that was initiated based on a request to hear
about IBM's data science offerings, it became clear that the
client is facing challenges while building data pipelines for
their model training. Their source data is in multiple systems,
and significant transformations are needed to convert the data
into a form where it can be analyzed. Which one of the
following Multicloud Data Integration capabilities is most
applicable for this client need?
Self-service exploration with Watson Knowledge Catalog
Data integration with DataStage v
Probabalistic matching with Match 360,
Data replication with the Replication service
(Question 22
Correct
Points cut of
Which of the following competitors also refers to their solution
asa "data fabric"?
Alteryx
Collibra
Snowflake
Denodo vQuestion 23
Correct
Points eut of .00
Back
‘There are many definitions of the data fabric concept. What is
the core concept in I8M's definition of data fabric?
It provides inexpensive storage for large volumes of data
and flexible processing for self-service analytics
It connects data from multiple locations, sources, and v
formats, which provides a simple unified means for
users to find and consume it
It consolidates data from across an organization in a
single place to provide users with simple access
Itis built with the help of container-based technologies
(such as Kubernetes)
Question 24.
Correct
Points out of 1.00
‘There are some established vendors from the DataOps space
(like Microsoft, Informatica, and TIBCO), who offer many of
the capabilities needed for a data fabric solution. If you have
clients interested in a data fabric solution who you know are
considering another data integration vendor, what are the IBM
value propositions you should promote to them?
The range of services, security, and reliability of the
vendor's public cloud
Hybrid cloud deployments, scalability, and integration
of other software
Industry leadership in the data integration, quality, and
governance space
Annual operational expenditures and total cost of
ownershipQuestion 25
Correct
Points eut of .00
Back
‘There are a number of essential capabilities that must be
present in a proper data fabric architecture. Which of these
capabilities represents the core of a data fabric?
Augmented knowledge - an enterprise data catalog v
Data federation - a central query facility for enterprise
data
Data observability - understanding the health and state of
an organization's data
Data preparation - central tooling for the movement,
cleansing, and transformation of data