Data Fabric Level 1 2022 Version Quiz - Q23 - 2024

Data Fabric Level 1 2022 Version Quiz

You must receive a score of 75% or higher on the quiz to complete the course.

Started on: Wednesday, January 31, 2024, 6:44 AM
State: Finished
Completed on: Wednesday, January 31, 2024, 7:21 AM
Time taken: 36 mins 44 secs
Feedback: Congratulations, you passed the quiz!

Question 1 (Correct, points out of 1.00)
Data architecture complexities add a time-consuming burden on data consumers as they try to get what they need ready for high-value analytics and AI. What percentage of a data consumer's time is typically spent finding, understanding, and preparing data before they can actually use it for analytics?
- 70
- 60
- 50
- 80 ✓

Question 2 (Correct, points out of 1.00)
There are many different kinds of technologies used to store data. Each of them serves different purposes and types of workloads. Which one of the following systems is dedicated to serving high-performance analytics queries against cleansed business data (usually for production business reports and applications)?
- Data virtualization
- Transactional database
- Data warehouse ✓
- Data lake

Question 3 (Incorrect, points out of 3.00)
You're meeting with a client who is interested in IBM's data fabric vision, but has no IBM software and is using the Amazon Web Services (AWS) public cloud platform. They are not interested in leaving AWS, and do not want to disrupt their existing analytics applications by moving from their currently used database and integration software. Which one of the following Cloud Pak for Data deployment options do you recommend to this client?
- Cloud Pak for Data running on a managed OpenShift service on AWS, and configured to integrate with the databases and integration services on AWS
- Cloud Pak for Data running natively as a collection of as-a-service (aaS) data management services on the AWS cloud
- Cloud Pak for Data as a Service on IBM Cloud, but cataloging data in the AWS cloud
- Cloud Pak for Data running on the AWS Kubernetes Service, and configured to integrate with the AWS databases

Question 4 (Correct, points out of 1.00)
Which one of IBM's data fabric solution services provides the capability for people of varying skill levels to build and train AI and machine learning models as well as prepare and analyze data?
- Planning Analytics
- Watson Studio ✓
- Watson Knowledge Catalog
- Watson Query

Question 5 (Correct)
Traditional database management systems and architectures have been established fixtures in data centers for decades. With the introduction of a data fabric, what role do these traditional database management systems (like data warehouses, data lakes, data hubs, and relational databases) play in a data fabric architecture?
- They become active participants in a unified approach to consuming data ✓
- Their data is moved into a single unified data fabric data store
- They are considered legacy systems and don't need new investment
- They become obsolete and are no longer needed

Question 6 (Correct, points out of 1.00)
The data lakehouse architecture is becoming increasingly popular, with 451 Research estimating in 2021 that two thirds of surveyed companies will have piloted a data lakehouse architecture within the year. Which one of the following answers represents the main characteristics of a data lakehouse architecture?
- Cloud-based deployment of Apache Hadoop 3.0 on OpenShift containers.
- Managed cloud services that are integrated to form an end-to-end DataOps solution that includes cloud object storage, data preparation, and a high performance relational database.
- A data governance facility that's designed specifically to support data lake deployments, where the focus is on experimental data pipelines and machine learning projects.
- Inexpensive, flexible storage layer (cloud object storage), with a good performance native SQL data processing engine. ✓

Question 7 (Correct, points out of 1.00)
You have a client who built a data lake to support exploratory analytics and data science projects. This data lake is seen to have yielded a poor return on investment, because it is difficult for business users to use. The client feels their data lake has become a "data swamp". Which one of the following IBM data fabric capabilities should you propose as a means to help your client get some value out of their data lake investment?
- IBM Cloud Object Storage can extend the data lake with an inexpensive, highly scalable storage layer that can handle any kind of data.
- Db2 as a Service can augment the data lake by being an elastically scalable cloud-based complement to the on-premises data lake.
- Watson Knowledge Catalog can enable automated metadata capture of data in the lake to generate augmented knowledge (which will support self-service). ✓
- Data Replication can better manage the various replicas (i.e., copies) of the data that needs to be stored in the data lake.

Question 8 (Correct, points out of 1.00)
You have a retail customer with many siloed data systems, where customer data is spread across these systems and spans multiple formats. This makes even simple analysis (for example, "How many unique customers do we have?") nearly impossible. You propose the Customer 360 use case from IBM's data fabric solution. Which one of the following benefits is most applicable to this client opportunity?
- Analyze all the customer data sets and build a set of data science notebooks with automatically generated queries for common questions
- Pull metadata for all the customer data sets and generate a DataStage job to generate a single customer table
- From all the customer data sets, build a single master set of unique customer records ✓
- Generate a virtual view of the customer tables, while leaving the actual data in their own databases

Question 9 (Correct, points out of 1.00)
At one of your clients, a number of their business leaders returned from a data analytics conference excited about the "data mesh" concept. Having been burned by centralized IT initiatives, these business leaders like the idea of data and data products belonging to specific lines of business. What IBM Data and AI capability do you propose to them that would help them build their data mesh architecture?
- Netezza Performance Server (NPS): because it can be used as a scalable data warehouse for the analytic workloads from the different line of business teams. NPS enables high performance analytic queries.
- Big SQL: because it is a high performance query engine for data lakes, which are frequently used by line of business teams for self-service data analytics work. Big SQL supports lower cost data analytics, which is practical for self-service data exploration.
- Watson Knowledge Catalog (WKC): because it can be used as the common catalog that is needed by a data mesh architecture. WKC enables automated discovery of distributed data assets and self-service shopping for data.
- Planning Analytics: because line of business teams need a continuous integrated planning solution that can integrate with their own spreadsheets and consume data from across their enterprise.

Question 10 (Correct, points out of 1.00)
A Chief Data Officer (CDO) from one of your clients has been discussing data fabric solutions with analysts and IBM competitors. The CDO has observed how different vendors seem to describe a data fabric in ways that reflect the strengths of their offerings. To counter the competitive messaging that the CDO has heard, how do you present IBM's definition of the data fabric concept as implemented in Cloud Pak for Data?
- A data integration platform that delivers data to data warehouses and data lakes on the cloud of your choice, where it's cataloged and ready for users to explore.
- A data lake framework that centralizes all of an organization's data and provides self-service access to this data through a variety of multi-disciplinary tools.
- An integrated data pipelining and orchestration platform that feeds data into a fully automated AI model creation tool.
- A set of integrated self-service tools to provide everyone in your organization with the ability to find, explore, and ask questions against all available data. ✓

Question 11 (Correct, points out of 1.00)
For most organizations, data is stored in many places and formats. This makes a broad self-service data exploration strategy highly challenging. One example of increased complexity is the number of different cloud platforms where organizations store data. Analysts from Flexera polled different organizations and found that in most enterprises, people expect to have data spread between two cloud vendors. On average, on how many clouds is data actually stored according to this survey?
Bev 34 72 56

Question 12 (Correct, points out of 1.00)
A properly deployed data fabric solution involves many different user personas. Which persona in an organization is responsible for the definition, association, and enforcement of data protection rules (and is also the beneficiary of the automation of these tasks within Cloud Pak for Data)?
- Database administrator
- Data quality analyst
- Data engineer
- Data steward ✓

Question 13 (Correct, points out of 1.00)
There are many business benefits that clients can gain by adopting a data fabric architecture. What is a major benefit of a data fabric (for many organizations, this is the main reason to adopt a data fabric), which is different from other data management approaches they might have implemented in the past?
- The protection and security of confidential, personally identifiable, and sensitive information
- Increased performance for production applications that consume enterprise data
- Enabling self-service data consumption and collaboration ✓
- Reducing costs for data storage and processing

Question 14 (Incorrect, points out of 2.00)
A key characteristic of a data fabric architecture is the ability for users to have integrated experiences as their analytics projects go through various stages; for instance: searching for data, preparing the data, and data product deployment. What is the term used to describe the metadata documenting the various states a data set goes through on its way to being used in a data product? (A data product can include a dashboard, a machine learning model, or even simply a refined data table.)
- Data dictionary
- Data lineage
- Data schema
- Data classification

Question 15 (Correct, points out of 1.00)
You are engaging with a client who has started a data science practice and you've been reviewing the MLOps and Trustworthy AI use case from IBM's data fabric solution with them. They are particularly interested in building trust in their machine learning models as they're being deployed. Which one of the following capabilities should you promote with your client?
- Model monitoring for quality metrics, bias, prediction explainability, and drift detection ✓
- Automate model building, deployment, and monitoring with a trusted model pipeline
- Machine learning tooling for users of a variety of skill levels and technical backgrounds
- Deep learning model development wizards

Question 16 (Correct, points out of 1.00)
A client is interested in IBM's data fabric vision, but is reluctant to commit resources to an enterprise-wide data governance effort. This client has hundreds of databases, many with sensitive data fields. The client's feeling is that the big effort needed to identify every field with sensitive data and lock it down is not worth the reward of self-service access to data. What capability from IBM's data governance and privacy use case should you showcase to this customer?
- As data is added to the catalog, it reads the access control settings from the database management system, and copies and applies them to the catalog service.
- Automatic data classification identifies the class for each field of a data set that is added to the catalog; once cataloged, data access rules can automatically restrict access to data classes identified as being sensitive. ✓
- Automatic identification of sensitive data, and then encrypting each column deemed to be sensitive.
- Automatic analysis and recognition of personally identifiable information (PII). Tables with PII data are then automatically excluded from all self-service exploration.

Question 17 (Correct, points out of 1.00)
Line of business teams quickly realize that getting access to raw data isn't enough for them to start their analytics projects. This raw data needs to be cleansed and prepared so it's in a form where it can be used to answer questions. This process involves building data pipelines, which take raw data and refine it into a business-ready state. What is the name of this practice of delivering high-quality data to analytics teams?
- DataOps (Data Operations) ✓
- SQLOps (Structured Query Language Operations)
- DevOps (Development Operations)
- MLOps (Machine Learning Operations)

Question 18 (Correct, points out of 2.00)
You have a client with multiple regional data lakes to support local analytics work. They have a need for cross-region analytics, so they built an expensive and complex replication scheme to move data between these regional data lakes. A data fabric solution can simplify how business users can analyze data from multiple regions. Which one of IBM's data fabric capabilities leaves data in-place and enables users to get a single view of data, and query data from data lakes in different regions?
- Data matching
- Data replication
- Data masking
- Data virtualization ✓

Question 19 (Correct, points out of 1.00)
You have a client operating in an unregulated industry who isn't overly concerned about data protection and data privacy issues, so they dismiss the need for a data governance solution. What is a good reason you can present to your client on why a data governance solution can deliver value?
- Automatic classification of data assets and tracking of data preparation lineage metadata enables self-service exploration ✓
- For data science and data visualization use cases, IBM's data catalog has a bias detection utility for data sets
- As data is added to the catalog, it's indexed optimally to enable higher performance queries
- You can use IBM's data catalog solution to build virtual views, pulling in data stored on different database servers

Question 20 (Correct, points out of 2.00)
There has been a steady evolution in the best practices that organizations use to manage and analyze their data. Why is there a need for a data fabric in addition to data lakes, data warehouses, and databases?
- A data fabric efficiently structures data to answer specific business questions
- A data fabric creates more data silos
- A data fabric eliminates the need for other technologies
- A data fabric connects the right people to the right data, at the right time ✓

Question 21 (Correct, points out of 1.00)
In a client call that was initiated based on a request to hear about IBM's data science offerings, it became clear that the client is facing challenges while building data pipelines for their model training. Their source data is in multiple systems, and significant transformations are needed to convert the data into a form where it can be analyzed. Which one of the following Multicloud Data Integration capabilities is most applicable for this client need?
- Self-service exploration with Watson Knowledge Catalog
- Data integration with DataStage ✓
- Probabilistic matching with Match 360
- Data replication with the Replication service

Question 22 (Correct)
Which of the following competitors also refers to their solution as a "data fabric"?
- Alteryx
- Collibra
- Snowflake
- Denodo ✓

Question 23 (Correct, points out of 1.00)
There are many definitions of the data fabric concept. What is the core concept in IBM's definition of data fabric?
- It provides inexpensive storage for large volumes of data and flexible processing for self-service analytics
- It connects data from multiple locations, sources, and formats, which provides a simple unified means for users to find and consume it ✓
- It consolidates data from across an organization in a single place to provide users with simple access
- It is built with the help of container-based technologies (such as Kubernetes)

Question 24 (Correct, points out of 1.00)
There are some established vendors from the DataOps space (like Microsoft, Informatica, and TIBCO), who offer many of the capabilities needed for a data fabric solution. If you have clients interested in a data fabric solution who you know are considering another data integration vendor, what are the IBM value propositions you should promote to them?
- The range of services, security, and reliability of the vendor's public cloud
- Hybrid cloud deployments, scalability, and integration of other software
- Industry leadership in the data integration, quality, and governance space
- Annual operational expenditures and total cost of ownership

Question 25 (Correct, points out of 1.00)
There are a number of essential capabilities that must be present in a proper data fabric architecture. Which of these capabilities represents the core of a data fabric?
- Augmented knowledge - an enterprise data catalog ✓
- Data federation - a central query facility for enterprise data
- Data observability - understanding the health and state of an organization's data
- Data preparation - central tooling for the movement, cleansing, and transformation of data