0% found this document useful (0 votes)

53 views

DP 203T00A ENU PowerPoint - 01

Uploaded by

chief artificer

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views

DP 203T00A ENU PowerPoint - 01

Uploaded by

chief artificer

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

Get started with data

engineering on Azure

© Copyright Microsoft Corporation. All rights reserved.

Learning Objectives
After completing this module, you will be able to:

1 Introduction to data engineering on Azure

2 Introduction to Azure Data Lake Storage Gen2

3 Introduction to Azure Synapse Analytics

© Copyright Microsoft Corporation. All rights reserved.

Introduction to data
engineering on Azure

© Copyright Microsoft Corporation. All rights reserved.

What is data engineering?
Data engineers work with multiple types of data to perform a variety of data operations using a
range of tools and scripting languages

Types of data Data operations Languages

SQL
Structured Integration
SELECT…

Python
Semi-structured Transformation
df=spark.read(…)

R Jav
Unstructured Consolidation .aNE Others
T
Scal
a

© Copyright Microsoft Corporation. All rights reserved.

Important data engineering concepts
Operational and analytical Streaming data Data pipeline
data

Orchestrated activities to transfer and

Operational: Transactional data used by Perpetual, real-time data feeds transform data.
applications
Used to implement extract, transform, and
Analytical: Optimized for analysis and load (ETL) or extract, load, and transform (ELT)
reporting operations.
Data Lake Data Warehouse Apache Spark

Analytical data stored in files Analytical data stored in a relational database Open-source engine for
distributed data processing
Distributed storage for massive scalability Typically modeled as a star schema to
optimize summary analysis

© Copyright Microsoft Corporation. All rights reserved.

Data engineering in Azure

Operational Data ingestion/ETL Analytical data storage Data modeling and

data and processing visualization
Azure Synapse Analytics Microsoft Power BI

Azure Data Lake Storage Gen2

Azure Stream Analytics

Azure Databricks
Azure Data Factory

© Copyright Microsoft Corporation. All rights reserved.

Knowledge check
1 Data in a relational database table is…
⃣Structured
⃣Semi-structured
⃣Unstructured

2 In a data lake, data is stored in…

⃣Relational tables
⃣Files
⃣A single JSON document

Which of the following Azure services provides capabilities for running data pipelines
3 AND managing analytical data in a data lake or relational data warehouse?
⃣Azure Stream Analytics
⃣Azure Synapse Analytics
⃣Azure Databricks

© Copyright Microsoft Corporation. All rights reserved.

Introduction to Azure
Data Lake Storage Gen2

© Copyright Microsoft Corporation. All rights reserved.

Understand Azure Data Lake Storage
Gen2
Distributed cloud
storage for data lakes
• HDFS-compatibility –
Common file system for
Hadoop, Spark, and
others
• Flexible security through
folder and file level
permissions
• Built on Azure Storage:
– High performance and
scalability
– Data redundancy
through built-in
replication
© Copyright Microsoft Corporation. All rights reserved.
Azure Data Lake Storage Gen 2 vs Azure Blob
Storage
Enable Hierarchical Namespace in a blob container to use Azure Data Lake Storage
Gen2
Azure Blob Storage Azure Data Lake Storage Gen2

Azure Storage Account

Azure Storage Account
Blob Container
Blob Container Directory
blob1 File1
File2
folder1/blob2 Hierarchical
Namespace

Blobs can be organized in virtual directories, but File system includes directories and files, and is
each path is considered a single blob in a flat compatible with large scale data analytics systems
namespace – Folder level operations are not like Hadoop, Databricks, and Azure Synapse
supported Analytics
© Copyright Microsoft Corporation. All rights reserved.
Knowledge check

1 Azure Data Lake Storage Gen2 stores data in…

⃣A document database hosted in Azure Cosmos DB
⃣An HDFS-compatible file system hosted in Azure Storage
⃣A relational data warehouse hosted in Azure Synapse Analytics

2 What option must you enable to use Azure Data Lake Storage
Gen2?
⃣Global replication
⃣Data encryption
⃣Hierarchical namespace

© Copyright Microsoft Corporation. All rights reserved.

Introduction to Azure
Synapse Analytics

© Copyright Microsoft Corporation. All rights reserved.

What is Azure Synapse Analytics?

Cloud platform for

data analytics
• Large-scale data
warehousing
• Advanced analytics

• Data exploration and

discovery
• Real time analytics

• Data integration

• Integrated analytics

Work with files in a data lake
• Connect to data lake
storage using linked
services
• Every Azure Synapse
Analytics workspace has
a default data lake

Ingest and transform data with
pipelines
• Native pipeline
functionality built on Azure
Data Factory
• Orchestrate activities to
ingest, transform, and load
data
• Integrate with other data
services

Query and manipulate data with SQL

SQL Server based

pools for scalable
relational data
processing:
• Built-in serverless SQL
pool for data exploration
and analysis of files in the
data lake
• Custom dedicated SQL
pools to host large-scale
relational data
warehouses

Process and analyze data with Apache
Spark
Open-source Spark
technology
• Highly scalable,
distributed processing
• Common libraries and
multiple programming
languages

Integrated notebook
experience

Exercise: Explore Azure Synapse
Analytics
Use the hosted lab environment provided, or view the lab
instructions at the link below:
https://aka.ms/mslearn-explore-synapse

Knowledge check
Which feature of Azure Synapse Analytics enables you to transfer data from one store to another
1 and apply transformations to the data at scheduled intervals?
⃣Serverless SQL pool
⃣Apache Spark pool
⃣Pipelines

2 You want to create a data warehouse in Azure Synapse Analytics in which the data is stored and
queried in a relational data store. What kind of pool should you create?
⃣Serverless SQL pool
⃣Dedicated SQL pool
⃣Apache Spark pool

A data analyst wants to analyze data by using Python code combined with text descriptions of
3 the insights gained from the analysis. What should they use to perform the analysis?
⃣A notebook connected to an Apache Spark pool
⃣A SQL script connected to a serverless SQL pool
⃣A KQL script connected to a Data Explorer pool

Get started with data engineering on Azure

https://aka.ms/mslearn-data-engineer

09 - Azure Data Engineering Cheatsheet
No ratings yet
09 - Azure Data Engineering Cheatsheet
37 pages
NLP Practice Problems (2)
No ratings yet
NLP Practice Problems (2)
48 pages
DP-203T00 Microsoft Azure Data Engineering-02
No ratings yet
DP-203T00 Microsoft Azure Data Engineering-02
23 pages
Azure Synapse Analytics
100% (1)
Azure Synapse Analytics
7,794 pages
AZ 801T00A ENU TrainerPrepGuide
0% (2)
AZ 801T00A ENU TrainerPrepGuide
15 pages
Azure Synapse
No ratings yet
Azure Synapse
609 pages
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
DP-203T00 Data Engineering On Microsoft Azure
No ratings yet
DP-203T00 Data Engineering On Microsoft Azure
12 pages
Azure Data Engineer DP-203
No ratings yet
Azure Data Engineer DP-203
98 pages
SDC - Synapse Analytics
No ratings yet
SDC - Synapse Analytics
23 pages
DP-203 Agenda
No ratings yet
DP-203 Agenda
8 pages
DP 900t00a Enu Powerpoint 04
No ratings yet
DP 900t00a Enu Powerpoint 04
23 pages
Azure Synapse Course Presentation
100% (1)
Azure Synapse Course Presentation
261 pages
Diagrams
No ratings yet
Diagrams
69 pages
Data Engineering On Microsoft Azure (DP-203T00) H9P83S
No ratings yet
Data Engineering On Microsoft Azure (DP-203T00) H9P83S
5 pages
Data Analyst Azure PowerBI Syllabus (1)
No ratings yet
Data Analyst Azure PowerBI Syllabus (1)
35 pages
Azure synapse Analytics
No ratings yet
Azure synapse Analytics
29 pages
Azure DataEngineer Course Outline
No ratings yet
Azure DataEngineer Course Outline
4 pages
Modern Analytics Academy - Data Modeling
No ratings yet
Modern Analytics Academy - Data Modeling
12 pages
Azure Analytics Interview Answers Complete
No ratings yet
Azure Analytics Interview Answers Complete
5 pages
Synapse Project Deck
No ratings yet
Synapse Project Deck
196 pages
Warner DP 203 Slides
100% (1)
Warner DP 203 Slides
91 pages
Azure Data Engineer Learning Path
No ratings yet
Azure Data Engineer Learning Path
12 pages
Azure Synapse
No ratings yet
Azure Synapse
229 pages
Azure Data Factory
No ratings yet
Azure Data Factory
3,167 pages
What Is Azure Synapse Data Explorer (Preview) - Azure Synapse Analytics - Microsoft Docs
No ratings yet
What Is Azure Synapse Data Explorer (Preview) - Azure Synapse Analytics - Microsoft Docs
6 pages
Azure Synapse Analytics Overview
No ratings yet
Azure Synapse Analytics Overview
251 pages
Azure Data Solutions
No ratings yet
Azure Data Solutions
7 pages
James Serra Azure Synapse Analytics Overview Big Data Conference Europe
No ratings yet
James Serra Azure Synapse Analytics Overview Big Data Conference Europe
72 pages
Azure Synpse
No ratings yet
Azure Synpse
4 pages
Dkosjfnaf
No ratings yet
Dkosjfnaf
2 pages
Azure Data Platform End2End - 2day
100% (2)
Azure Data Platform End2End - 2day
108 pages
Azure Synapse Guidebook
100% (1)
Azure Synapse Guidebook
15 pages
Azure Data Platform End2End - 1day
No ratings yet
Azure Data Platform End2End - 1day
90 pages
Reference Guide - DP-203 Collection - v2
No ratings yet
Reference Guide - DP-203 Collection - v2
3 pages
Learning Azure Synapse Analytics (Third Early Release) Paul Andrew - The newest ebook version is ready, download now to explore
100% (1)
Learning Azure Synapse Analytics (Third Early Release) Paul Andrew - The newest ebook version is ready, download now to explore
66 pages
Data Engineering 101 - Azure Synapse Analytics
No ratings yet
Data Engineering 101 - Azure Synapse Analytics
45 pages
2023-IDA Custom Bootcamp Curriculum Day Wise Curriculum v0.1
No ratings yet
2023-IDA Custom Bootcamp Curriculum Day Wise Curriculum v0.1
122 pages
Azure Synapse Analytics
No ratings yet
Azure Synapse Analytics
102 pages
Download Full Learning Azure Synapse Analytics (Third Early Release) Paul Andrew PDF All Chapters
100% (3)
Download Full Learning Azure Synapse Analytics (Third Early Release) Paul Andrew PDF All Chapters
40 pages
Azure DATA Fatcory
No ratings yet
Azure DATA Fatcory
2,982 pages
Module 4
No ratings yet
Module 4
3 pages
Azure Data Engineering - Pragathi
No ratings yet
Azure Data Engineering - Pragathi
4 pages
ADE (2)
No ratings yet
ADE (2)
6 pages
Whiz-Cheat-Sheet-DP-203-v2
No ratings yet
Whiz-Cheat-Sheet-DP-203-v2
42 pages
Azure Data Engineer Interview QA
No ratings yet
Azure Data Engineer Interview QA
2 pages
Azure Synapse Analytics
No ratings yet
Azure Synapse Analytics
5 pages
Azure Analytics: Synapse
100% (4)
Azure Analytics: Synapse
251 pages
Document 2
No ratings yet
Document 2
11 pages
Presentation Deck Part 21612531397089
No ratings yet
Presentation Deck Part 21612531397089
59 pages
Azure DW
No ratings yet
Azure DW
2 pages
f4b7901ed5e5f9106a3a82eea2e2f003
No ratings yet
f4b7901ed5e5f9106a3a82eea2e2f003
3,614 pages
Azure Data Fundamentals
No ratings yet
Azure Data Fundamentals
4 pages
Azure Datalake
No ratings yet
Azure Datalake
8 pages
DP203-Certification Preparation
No ratings yet
DP203-Certification Preparation
9 pages
Study Guide For Exam DP-203 - Data Engineering On Microsoft Azure - Microsoft Learn
No ratings yet
Study Guide For Exam DP-203 - Data Engineering On Microsoft Azure - Microsoft Learn
4 pages
DP 203 Data Engineering Course Syllabus
No ratings yet
DP 203 Data Engineering Course Syllabus
4 pages
Data Factory, Data Integration
No ratings yet
Data Factory, Data Integration
2,034 pages
Azure Data Engineer
100% (4)
Azure Data Engineer
54 pages
Azure Synapse Analytics - Azure Synapse Analytics - Microsoft Docs
No ratings yet
Azure Synapse Analytics - Azure Synapse Analytics - Microsoft Docs
3 pages
MIE1628 Big Data Analytics Lecture7
No ratings yet
MIE1628 Big Data Analytics Lecture7
77 pages
Important DE Interview Questions
No ratings yet
Important DE Interview Questions
5 pages
My Weaknesses Have Magnified God’s Strength — Watchtower ONLINE LIBRARY
No ratings yet
My Weaknesses Have Magnified God’s Strength — Watchtower ONLINE LIBRARY
1 page
Husbands, Honor Your Wife | Watchtower Study
No ratings yet
Husbands, Honor Your Wife | Watchtower Study
1 page
Trust in the Merciful “Judge of All the Earth”! — Watchtower ONLINE LIBRARY
No ratings yet
Trust in the Merciful “Judge of All the Earth”! — Watchtower ONLINE LIBRARY
1 page
What Do We Know About Jehovah’s Future Judgments? — Watchtower ONLINE LIBRARY
No ratings yet
What Do We Know About Jehovah’s Future Judgments? — Watchtower ONLINE LIBRARY
1 page
Jehovah Has Tender Affection for You — Watchtower ONLINE LIBRARY
No ratings yet
Jehovah Has Tender Affection for You — Watchtower ONLINE LIBRARY
1 page
How to Have a Successful Courtship — Watchtower ONLINE LIBRARY
No ratings yet
How to Have a Successful Courtship — Watchtower ONLINE LIBRARY
1 page
Let Love Motivate You to Keep Preaching! — Watchtower ONLINE LIBRARY
No ratings yet
Let Love Motivate You to Keep Preaching! — Watchtower ONLINE LIBRARY
1 page
How to Have More Joy in the Ministry — Watchtower ONLINE LIBRARY
No ratings yet
How to Have More Joy in the Ministry — Watchtower ONLINE LIBRARY
1 page
How to Find a Potential Marriage Mate — Watchtower ONLINE LIBRARY
No ratings yet
How to Find a Potential Marriage Mate — Watchtower ONLINE LIBRARY
1 page
Find Comfort in Jehovah’s Approval — Watchtower ONLINE LIBRARY
No ratings yet
Find Comfort in Jehovah’s Approval — Watchtower ONLINE LIBRARY
1 page
Never Leave the Spiritual Paradise — Watchtower ONLINE LIBRARY
No ratings yet
Never Leave the Spiritual Paradise — Watchtower ONLINE LIBRARY
1 page
“Press On to Maturity” — Watchtower ONLINE LIBRARY
No ratings yet
“Press On to Maturity” — Watchtower ONLINE LIBRARY
1 page
Avoid the Darkness—Remain in the Light — Watchtower ONLINE LIBRARY
No ratings yet
Avoid the Darkness—Remain in the Light — Watchtower ONLINE LIBRARY
1 page
Strengthen Your Appreciation for Jehovah’s Organization — Watchtower ONLINE LIBRARY
No ratings yet
Strengthen Your Appreciation for Jehovah’s Organization — Watchtower ONLINE LIBRARY
1 page
You Can Persevere Despite Disappointments — Watchtower ONLINE LIBRARY
No ratings yet
You Can Persevere Despite Disappointments — Watchtower ONLINE LIBRARY
1 page
“Keep Following” Jesus After Baptism — Watchtower ONLINE LIBRARY
No ratings yet
“Keep Following” Jesus After Baptism — Watchtower ONLINE LIBRARY
1 page
Are You Ready for the Most Important Day of the Year? — Watchtower ONLINE LIBRARY
No ratings yet
Are You Ready for the Most Important Day of the Year? — Watchtower ONLINE LIBRARY
1 page
Using GPUs
No ratings yet
Using GPUs
18 pages
Are You Ready to Dedicate Yourself to Jehovah? — Watchtower ONLINE LIBRARY
No ratings yet
Are You Ready to Dedicate Yourself to Jehovah? — Watchtower ONLINE LIBRARY
1 page
Conquer Fear by Trusting in Jehovah — Watchtower ONLINE LIBRARY
No ratings yet
Conquer Fear by Trusting in Jehovah — Watchtower ONLINE LIBRARY
1 page
DP 203t00a Enu Powerpoint 02
No ratings yet
DP 203t00a Enu Powerpoint 02
24 pages
Do You Treat Women as Jehovah Does? — Watchtower ONLINE LIBRARY
No ratings yet
Do You Treat Women as Jehovah Does? — Watchtower ONLINE LIBRARY
1 page
Jehovah Will Help You During Difficult Times — Watchtower ONLINE LIBRARY
No ratings yet
Jehovah Will Help You During Difficult Times — Watchtower ONLINE LIBRARY
1 page
Screen, Root RPG
No ratings yet
Screen, Root RPG
5 pages
AZ 801T00A ENU ChangeLog
No ratings yet
AZ 801T00A ENU ChangeLog
3 pages
ML For The C64 and Other Commodore Computers
No ratings yet
ML For The C64 and Other Commodore Computers
350 pages
Coloringpage Beholder1 PDF
No ratings yet
Coloringpage Beholder1 PDF
1 page
2018 Summer Tutorial Intro To Linux
No ratings yet
2018 Summer Tutorial Intro To Linux
71 pages
Advanced Network Security: - Lecture# 4-1 - By: - Syed Irfan Ullah - Abasyn University Peshawar
No ratings yet
Advanced Network Security: - Lecture# 4-1 - By: - Syed Irfan Ullah - Abasyn University Peshawar
54 pages
Systems Analysis & Design: Analyzing Systems Using Data Dictionaries
No ratings yet
Systems Analysis & Design: Analyzing Systems Using Data Dictionaries
57 pages
10346NLP_Experiment_6
No ratings yet
10346NLP_Experiment_6
7 pages
4 - Compilation Process in C
No ratings yet
4 - Compilation Process in C
5 pages
AI_Algorithms_Summary_by_Djemoui_Badr
No ratings yet
AI_Algorithms_Summary_by_Djemoui_Badr
5 pages
Nursing Informatics
No ratings yet
Nursing Informatics
4 pages
Advances in Mobile Cloud Computing and Big Data in the 5G Era 1st Edition Constandinos X. Mavromoustakis pdf download
No ratings yet
Advances in Mobile Cloud Computing and Big Data in the 5G Era 1st Edition Constandinos X. Mavromoustakis pdf download
62 pages
AIT-3-Final-1
No ratings yet
AIT-3-Final-1
10 pages
Unit V Intelligence and Applications: Morphological Analysis/Lexical Analysis
No ratings yet
Unit V Intelligence and Applications: Morphological Analysis/Lexical Analysis
30 pages
DEVIKA.V ML
No ratings yet
DEVIKA.V ML
7 pages
CIT-3200-OPERATING-SYSTEMS-1
No ratings yet
CIT-3200-OPERATING-SYSTEMS-1
3 pages
Stock Market Prediction Using CNN and LSTM
No ratings yet
Stock Market Prediction Using CNN and LSTM
7 pages
A. Listening (10marks) : B. Writing (10marks) : C. Speaking: (10marks)
No ratings yet
A. Listening (10marks) : B. Writing (10marks) : C. Speaking: (10marks)
4 pages
Integrating Explainable Artificial Intelligence and Blockch 2023 Smart Agric
No ratings yet
Integrating Explainable Artificial Intelligence and Blockch 2023 Smart Agric
13 pages
SQL Course Content
No ratings yet
SQL Course Content
2 pages
Fully-incremental-public-key-encryption-with-adjustable-t_2025_Information-S
No ratings yet
Fully-incremental-public-key-encryption-with-adjustable-t_2025_Information-S
19 pages
HTML MCQ
No ratings yet
HTML MCQ
13 pages
Information Science: The Basics 1st Edition Judith Pintar - Discover the ebook with all chapters in just a few seconds
100% (1)
Information Science: The Basics 1st Edition Judith Pintar - Discover the ebook with all chapters in just a few seconds
64 pages
Using the CM Tool to Migrate a Standalone PME 9 to Distributed (1)
No ratings yet
Using the CM Tool to Migrate a Standalone PME 9 to Distributed (1)
2 pages
Korr 2 J Lqwahrg LHSK Q5 RFZ
No ratings yet
Korr 2 J Lqwahrg LHSK Q5 RFZ
1 page
Financial Analytics
No ratings yet
Financial Analytics
3 pages
Vacancy Sandhiguna
No ratings yet
Vacancy Sandhiguna
1 page
File Organization and Data Base Design
No ratings yet
File Organization and Data Base Design
17 pages
C. Holsapple Vita (Sep 2021)
No ratings yet
C. Holsapple Vita (Sep 2021)
50 pages
Artificial Intelligence A Study of Automation, and Its Impact On Data Science
No ratings yet
Artificial Intelligence A Study of Automation, and Its Impact On Data Science
10 pages
Khushboo Komal FullStackPythonDeveloper
No ratings yet
Khushboo Komal FullStackPythonDeveloper
3 pages
Btech Cs 7 Sem Artificial Intelligence ncs702 2020
No ratings yet
Btech Cs 7 Sem Artificial Intelligence ncs702 2020
2 pages
Cryptography and Its Types
No ratings yet
Cryptography and Its Types
2 pages
Big Data Mining Literature Review
100% (2)
Big Data Mining Literature Review
7 pages

DP 203T00A ENU PowerPoint - 01

Uploaded by

DP 203T00A ENU PowerPoint - 01

Uploaded by

Get started with data

© Copyright Microsoft Corporation. All rights reserved.

1 Introduction to data engineering on Azure

2 Introduction to Azure Data Lake Storage Gen2

3 Introduction to Azure Synapse Analytics

© Copyright Microsoft Corporation. All rights reserved.

© Copyright Microsoft Corporation. All rights reserved.

Types of data Data operations Languages

© Copyright Microsoft Corporation. All rights reserved.

Orchestrated activities to transfer and

© Copyright Microsoft Corporation. All rights reserved.

Operational Data ingestion/ETL Analytical data storage Data modeling and

Azure Data Lake Storage Gen2

© Copyright Microsoft Corporation. All rights reserved.

2 In a data lake, data is stored in…

© Copyright Microsoft Corporation. All rights reserved.

© Copyright Microsoft Corporation. All rights reserved.

Azure Storage Account

1 Azure Data Lake Storage Gen2 stores data in…

© Copyright Microsoft Corporation. All rights reserved.

© Copyright Microsoft Corporation. All rights reserved.

Cloud platform for

• Data exploration and

© Copyright Microsoft Corporation. All rights reserved.

© Copyright Microsoft Corporation. All rights reserved.

© Copyright Microsoft Corporation. All rights reserved.

SQL Server based

© Copyright Microsoft Corporation. All rights reserved.

© Copyright Microsoft Corporation. All rights reserved.

© Copyright Microsoft Corporation. All rights reserved.

© Copyright Microsoft Corporation. All rights reserved.

Get started with data engineering on Azure

© Copyright Microsoft Corporation. All rights reserved.

You might also like