ADE (2)
ADE (2)
Introduction to Azure
Introduction to Storage
Azure Storage
o Azure Blob
o Table
o Message
o Queue
Azure Data Lake Store Gen I & Gen II
o What is Data Lake
o Data Lake vs. Hadoop
o Blob Storage vs. Data Lake
o Hierarchical Namespace
o Ingestion through different tools i.e.; Azure Data Explorer, AzCopy, Azure CLI, Powershell
Introduction
Azure Synapse MPP Architecture
Storage and Sharding patterns
Data Distribution and Distributing Keys
Data Types and Table Types
Partitioning
Data Warehouse Concepts
Dimensions and Facts
Types of Dimensions and Facts
Different types of Schemas in Data Warehouse
Relationship types in Data Warehouse
Best Practices for Fact and Dimension tables
Demo - Analyze Data distribution before migration to Azure Synapse
Integrate SQL Server Integration Services Packages within Azure Data Factory
Activities
o Copy
o Data flow
o Stored Procedure
o Lookup
o ForEach
o Get Metadata
o Filter Activity
o Spark
o U-SQL
o Databricks Notebooks
o Web
o If Condition
o Delete
Data Flows
o Derived Column
o Join
o filter
o exists
o conditional split
o Lookup, Exists
o Select
o Aggregate
o Rank
o Filter
o Sort
o Alter Row
Dynamic Queries in ADF
Sending mails through Logic Apps
Few more Activities......
Dataset and Pipeline Parameterization
Monitor -- Azure and Visually
Setup Alerts from Azure Data Factory
Introduction
What is Azure Synapse Analytics
How Azure Synapse Analytics works
When to use Azure Synapse Analytics
th
Flat No: 403 & 404, 4 Floor, Annapurna Block,
Ameerpet, Hyderabad – 500038
+91-99850-1-4433
+91-99850-2-4433
Introduction
Azure Databricks
Spark Basics
Why Spark is difficult? Why Databricks Evolved?
Why Databricks in Cloud? Introduction to Azure Databricks
th
Flat No: 403 & 404, 4 Floor, Annapurna Block,
Ameerpet, Hyderabad – 500038
+91-99850-1-4433
+91-99850-2-4433
Demo
Provision Databricks, Clusters and workbook
Mount Data Lake to Databricks DBFS
Explore, Analyze, Clean, Transform and Load Data in Databricks
Azure Databricks Clusters
Azure Databricks other Important Components
Databricks - Monitoring
How to create Cluster
How to work with Databricks File System
How to create notebooks and Integrate with ADF
How to import and export the Notebooks
How to connect to blob, SQL DB from Databricks
How to read data files from Azure Blob and Azure Data Lake Store
o Using Scala, R, Python, Spark SQL Language
Creating Data Frames
Converting Data Frames into Temporary Table or Temporary View
Incremental and Full Load with Azure SQL Data Warehouse
Understand the architecture of Azure Databricks spark cluster
Understand the architecture of spark job
Read data in CSV format
Read data in JSON format
Read data in Parquet format
Read data stored in tables and views
Write data
Describe a DataFrame
Use common DataFrame methods
Use the display function
Exercise: Distinct articles
Describe the difference between eager and lazy execution
Describe the fundamentals of how the Catalyst Optimizer works
Define and identify actions and transformations
Describe the column class
Work with column expressions
Perform date and time manipulation
Use aggregate functions
Exercise: Deduplication of data
Describe the Azure Databricks platform architecture
Perform data protection
Describe Azure key vault and Databricks security scopes
th
Flat No: 403 & 404, 4 Floor, Annapurna Block,
Ameerpet, Hyderabad – 500038
+91-99850-1-4433
+91-99850-2-4433
We Offer: