0% found this document useful (0 votes)
437 views

Databricks Course Curriculum

The document outlines the curriculum for a Databricks course. It is divided into multiple sessions covering topics like Apache Spark architecture, Databricks workspace, reading file formats, narrow and wide transformations, Spark SQL, Delta tables, structured streaming, Unity Catalog, and Azure DevOps CI/CD for Databricks. Each session also includes interview questions related to the topics covered.

Uploaded by

karthikeyan
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
437 views

Databricks Course Curriculum

The document outlines the curriculum for a Databricks course. It is divided into multiple sessions covering topics like Apache Spark architecture, Databricks workspace, reading file formats, narrow and wide transformations, Spark SQL, Delta tables, structured streaming, Unity Catalog, and Azure DevOps CI/CD for Databricks. Each session also includes interview questions related to the topics covered.

Uploaded by

karthikeyan
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Databricks Course Curriculum

14-Oct-2023
 Apache Spark Architecture
 Spark's internal details (Driver, Executor, task, stages, jobs etc.)
 Spark Memory Allocation (Driver and Executor Memory Allocation)
 Cluster Deployment Modes (Client, Cluster)
 Narrow and Wide Transformations
 Spark different configurations
 Interview Questions based on Apache Spark Architecture
15-Oct-2023
 Introduction to Databricks
 Walkthrough of Databricks workspace
 Different types of clusters and their uses.
 Magic Commands
 DBUTILS
 Notebook Parametrization
 Secret Scopes
 Accessing Blob Storage/ ADLS Gen 2 using notebook
 Understanding of DBFS
 Interview Questions based on session.
21-Oct-2023
 Databricks Data Processing
 Read different file formats (CSV, JSON, Parquet.. etc.)
 Different reading options
 Interview Questions based on session.
22-Oct-2023
Narrow Transformations
 Select,filter,map, flatmap etc.

Wide Transformations

 groupBy, Aggregate,Window functions, Joins


 Interview Questions based on session.
28-Oct-2023
Spark SQL
 Databricks Database
 Global Temporary View
 Temporary View
 External Tables
 Managed Tables
 Views
 Joins
 Interview Questions based on session.
29-Oct-2023
Lakehouse Architecture

 Delta Tables
 Time Travel
 History
 Vacuum
 Incremental File Loading
 Interview Questions based on session.
04-Nov-2023 and 05-Nov-2023
 Unity Catalog
 Benefits of Unity Catalog
 Data Lineage
 Data Auditing
 Mini Project on Unity Catalog
 Interview Questions
11-Nov-2023 and 12-Nov-2023
Structured Streaming

 Different sources, event hub, kafka etc.


 Stateless/Stateful Transformations
 Output Modes (Complete, Update, Append)
 Tumbling and Sliding Window
 Watermarking
 Stream Joins
 Kafka Sink
 Cosmos Sink
 File Sink
18-Nov-2023 and 19-Nov-2023
 Delta LIVE Tables
 Quality Checks
 Data Quarantine
 Structured Streaming using Delta LIVE Tables
 Autoloader
 Interview Questions
25-Nov-2023 and 26-Nov-2023
 Azure Devops CI/CD for databricks

You might also like