DP 203T00A ENU PowerPoint - 01
DP 203T00A ENU PowerPoint - 01
engineering on Azure
SQL
Structured Integration
SELECT…
Python
Semi-structured Transformation
df=spark.read(…)
R Jav
Unstructured Consolidation .aNE Others
T
Scal
a
Analytical data stored in files Analytical data stored in a relational database Open-source engine for
distributed data processing
Distributed storage for massive scalability Typically modeled as a star schema to
optimize summary analysis
Azure Databricks
Azure Data Factory
Which of the following Azure services provides capabilities for running data pipelines
3 AND managing analytical data in a data lake or relational data warehouse?
⃣Azure Stream Analytics
⃣Azure Synapse Analytics
⃣Azure Databricks
Blobs can be organized in virtual directories, but File system includes directories and files, and is
each path is considered a single blob in a flat compatible with large scale data analytics systems
namespace – Folder level operations are not like Hadoop, Databricks, and Azure Synapse
supported Analytics
© Copyright Microsoft Corporation. All rights reserved.
Knowledge check
2 What option must you enable to use Azure Data Lake Storage
Gen2?
⃣Global replication
⃣Data encryption
⃣Hierarchical namespace
• Data integration
• Integrated analytics
Integrated notebook
experience
2 You want to create a data warehouse in Azure Synapse Analytics in which the data is stored and
queried in a relational data store. What kind of pool should you create?
⃣Serverless SQL pool
⃣Dedicated SQL pool
⃣Apache Spark pool
A data analyst wants to analyze data by using Python code combined with text descriptions of
3 the insights gained from the analysis. What should they use to perform the analysis?
⃣A notebook connected to an Apache Spark pool
⃣A SQL script connected to a serverless SQL pool
⃣A KQL script connected to a Data Explorer pool