Data Engineering Agenda

Uploaded by

thesantastor
Copyright
© © All Rights Reserved

FULL STACK DATA ENGINEER

CAREER TRANSITION PROGRAM AGENDA


ABOUT BEPEC
WHAT WE DO IN BEPEC

BEPEC Solutions is primarily a consulting firm. We apply the right
technology to solve business problems with the help of Analytics,
AI, Edge AI, Explainable AI, ML, DL, RPA, Blockchain, etc., across
multiple verticals such as BFSI, CPG, Retail, Healthcare, Manufacturing,
Telecom, and Insurance. We adopt new technologies incrementally so
that each step solves a business problem with clear value, and we
always look to get the most out of AI on problems that remain
unsolved.

As part of our journey, we upskill talent with the AI knowledge
that the industry needs. We believe in developing talent with a
business-, customer-, and solution-centric mindset rather than a
bookish one.

BEPEC has worked with 200+ clients across the globe in different
verticals, and we serve our clients with 100+ Data Scientists, Data
Engineers, Analysts, Consultants, BI Experts, Advisors, etc.

BEPEC strongly believes in delivering value-centric AI solutions or
Digital Transformation to our clients.
PROGRAM OVERVIEW:
KEY HIGHLIGHTS
DATA ENGINEERING CAREER
TRANSITION IS EASY WITH A
PROPER CAREER TRANSITION
ROADMAP AND EXECUTION PLAN.
Step 1: You will learn the Job-Ready Data Engineering Course Syllabus,
which includes Python, SQL, Hadoop, HDFS, MapReduce, Apache Spark,
Snowflake, Kafka, Airflow, MongoDB, Sqoop, AWS & Azure.

Step 2: You will work as an Intern/Freelancer under BEPEC to build the
portfolio most needed to be a Data Engineer.

Step 3: Interview Preparation, including Resume Building, Mock Interviews,
Previous Interview Clips from BEPEC Alumni, Interview Support &
Placement Support.
COURSE AGENDA
Introduction to Data Warehouse
What Is Data Warehouse?
Important Data Warehouse Concepts
Difference between OLAP vs OLTP
Data Ecosystem Mindmap
Overview of Data Warehouse
Data Warehouse Architecture

Mastering Python
Introduction to Python
Structure of Any Programming Language like Values, Variables,
Functions & Libraries
Different In-Built Data Structures like List, Tuple, Set & Dict
Mastering Conditional Flow
Loops like While & For
List Comprehension
Mastering User-Defined Functions
Regular Expressions and Lambda
Strings and Arrays
Procedural Programming vs OOPS
Mastering OOPS
Different Libraries like Pandas & NumPy
Instance Variable, Class Variable, Class Method
Association vs Composition & Aggregation
OOPS Concepts
Encapsulation, Inheritance
Polymorphism, Method Overloading, Method Overriding
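As a rough illustration of several topics in the Python list above (class vs. instance variables, class methods, method overriding, list comprehension, and lambda), here is a minimal sketch. The `Employee` and `DataEngineer` classes and the sample rows are hypothetical and are not taken from the course material.

```python
# Hypothetical classes used only to illustrate the listed OOPS topics.
class Employee:
    # Class variable: shared by every instance of the class.
    headcount = 0

    def __init__(self, name, skills):
        # Instance variables: each object keeps its own copy.
        self.name = name
        self.skills = skills
        Employee.headcount += 1

    @classmethod
    def from_csv(cls, row):
        # Class method: an alternative constructor that receives the class itself.
        name, *skills = [part.strip() for part in row.split(",")]
        return cls(name, skills)

    def role(self):
        return "Employee"


class DataEngineer(Employee):
    # Method overriding: the subclass replaces the parent's implementation.
    def role(self):
        return "Data Engineer"


staff = [Employee.from_csv("Asha, SQL"), DataEngineer.from_csv("Ravi, Python, Spark")]

# List comprehension: build a new list from an existing iterable.
roles = [person.role() for person in staff]

# Lambda: a small anonymous function, used here as a sort key.
by_skill_count = sorted(staff, key=lambda person: len(person.skills), reverse=True)
```

Calling `role()` on each object picks the right implementation at runtime, which is the polymorphism item in the list.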
Introduction to Snowflake
What is Snowflake?
Snowflake Architecture
Snowflake Virtual Warehouse
Snowflake Object Hierarchy

Mastering MySQL
Introduction to MySQL
Creating Database in MySQL
Star Schema and Snowflake Schema Design
Forward and Reverse Engineering
Creating ER diagrams
Deep Dive into MySQL
SELECT, UPDATE, DELETE Operations
Clauses in MySQL Part-1
Clauses in MySQL Part-2
Clauses in MySQL Part-3
MySQL Data Handling Part-1
MySQL Data Handling Part-2
Data Integrity & Referential Integrity
Data Normalisation
First & Second Normal Form
Functional Dependency, Transitive Dependency & 3rd Normal Form
Boyce-Codd Normal Form
Denormalization
Temporary Table, Common Table Expression, Recursive CTE
When to use Temporary Table, CTE & Recursive CTE
Subquery in MySQL
Views in MySQL
Stored Functions
Stored Procedures
Triggers in MySQL
Create Events in MySQL
Different Functions in MySQL
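To give a feel for the Recursive CTE topic listed above, here is a minimal sketch that walks a reporting hierarchy. It runs against Python's built-in `sqlite3` module as a stand-in so that it needs no database server; MySQL 8.0 and later accept the same `WITH RECURSIVE` form. The `employee` table and its rows are hypothetical.

```python
import sqlite3

# In-memory SQLite database used as a stand-in for MySQL (assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee (id INTEGER PRIMARY KEY, name TEXT, manager_id INTEGER)"
)
conn.executemany(
    "INSERT INTO employee VALUES (?, ?, ?)",
    [(1, "Head", None), (2, "Lead", 1), (3, "Engineer", 2), (4, "Analyst", 2)],
)

query = """
WITH RECURSIVE chain(id, name, depth) AS (
    -- Anchor member: start from the row with no manager.
    SELECT id, name, 0 FROM employee WHERE manager_id IS NULL
    UNION ALL
    -- Recursive member: add everyone who reports to a row already in the chain.
    SELECT e.id, e.name, c.depth + 1
    FROM employee AS e
    JOIN chain AS c ON e.manager_id = c.id
)
SELECT name, depth FROM chain ORDER BY depth, id
"""
rows = conn.execute(query).fetchall()
conn.close()
```

A plain (non-recursive) CTE would only cover a fixed number of levels; the recursive form keeps joining until no new rows appear, which is why it suits hierarchies of unknown depth.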
Big Data & Streaming Tools
Introduction to Big Data
Distributed Computing & Hadoop Architecture
Storing Data on Hadoop - HDFS
Processing - MapReduce
Working on YARN
HBase
Working on Sqoop
Working on Impala
Working with Flume
Working with ZooKeeper
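The MapReduce item in the list above can be pictured with a small single-process sketch of its three phases (map, shuffle, reduce) applied to a word count. This is plain Python written for illustration, not Hadoop code; the function names and input lines are hypothetical.

```python
from collections import defaultdict


def map_phase(line):
    # Map: emit a (key, value) pair for every word in one input line.
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    # Shuffle: group all values that share the same key.
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    # Reduce: collapse the grouped values for one key into a single result.
    return key, sum(values)


lines = ["big data on hadoop", "data on spark"]
mapped = [pair for line in lines for pair in map_phase(line)]
counts = dict(reduce_phase(key, values) for key, values in shuffle_phase(mapped).items())
```

On a real cluster the map and reduce calls run in parallel on different nodes and the shuffle moves data across the network; the sketch only shows the data flow.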

Big Data Scheduling
Big Data Scheduling with Airflow
Oozie

Mastering HIVE
Introduction to HIVE
HIVE Architecture
Data Storage in HIVE
Tables in HIVE
Loading Data into Partitions
Big Data on Cloud
Introduction to Cloud - Models, Service Categories, AWS security, IAM
AWS platform
EC2 and S3
Databases on AWS
AWS EMR

AWS Data Collection & Data Lake
Data Warehousing on AWS
DynamoDB, Lambda
Redshift
AWS Glue
AWS Data pipelines and data nodes
Activity, Pre-condition and Schedule
Introduction to DataLake
Learn about Data Lake vs. Data Warehouse
Key components of a Data Lake Architecture
Query files directly using SQL

Amazon Kinesis
Amazon Kinesis and Kinesis Stream
Kinesis data stream architecture and components
Data producer and Consumer
Kinesis stream emitting data to AWS services and Kinesis Connector
Library
Kinesis Firehose
Data Loading/Ingestion and Extraction
What is Data Ingestion?
Data Ingestion in Real-World
Create an Integration Object to Connect Snowflake with AWS S3
Ingesting CSV from S3 to Snowflake
Ingest JSON from S3 to Snowflake
How to do Continuous Data Ingestion in Snowflake
Snowpipe: Implementation of Snowpipe
Snowpipe: Billing Estimation and Key Considerations for Data
Ingestion
Unloading of Data from Snowflake to S3

Big Data on Azure
Introduction to Cloud - Models, Service Categories
Overview of Azure Synapse and Architecture
Azure SQL, Creating SQL Pool
Data Warehousing on Azure
Azure SQL Data Warehouse, HDInsight
Cosmos, Azure Functions
Azure VM, Blob Storage
Databricks and Data Factory
Azure Stream Analytics
Learn how to input data into Azure Stream Analytics (ASA)
Learn how to create queries and data analysis pipelines with ASA
Learn how to output the results of the data analysis pipeline to
various output formats
Learn how to work with ASA and Microsoft Power BI
Azure Data Factory
Microsoft Azure Data Lake
Different tools and scenarios to ingest data into Data Lake
Data Lifecycle and Architecture around Data Lake
Use Azure Databricks and HDInsight to process data in ADLS
Monitor the performance of your Data Lake
Azure Data Factory
Azure Functions

PySpark Mastering
Core PySpark architecture
Deployment modes
RDD and RDD Operations, DAG
PySpark SQL
DataFrames, Hive Integration, data manipulation with PySpark SQL
PySpark Streaming
Structured Streaming
Amazon EMR and Running all Hadoop jobs on AWS EMR
Kafka
What is Kafka?
Kafka Features
Kafka Concepts
Kafka Architecture
Kafka Components
ZooKeeper
Where is Kafka Used?
Kafka Installation
Kafka Cluster
Types of Kafka Clusters
Configuring Single Node Single Broker Cluster
Kafka Producer
Kafka Consumer
Kafka Internals & Load Balancing

Mastering MongoDB
Introduction to MongoDB
Installation of MongoDB
Basic Operations of MongoDB
Create Operations in MongoDB
Update Operations in MongoDB
Read Operations in MongoDB
Delete Operations in MongoDB
Query & Projection Operations
Python with MongoDB
Spark with MongoDB
Tables in Snowflake
Different Types of Tables in Snowflake
Creating Tables in Snowflake
Materialised Views, Views & Secure Views in Snowflake
Creating Views in Snowflake
Creating Secure Views in Snowflake
In-depth about Views in Snowflake

Tasks and Query Scheduling
Introduction to Tasks
Creating Standalone and Dependent Tree of Tasks
Billing and Query History for Tasks

Streams and Change Data Capture
Introduction to Streams
Implementation of Standard Streams
Implementation of Append-Only Streams
Streams in a Transaction
Data Retention and Staleness
Change Tracking Using "Changes"
End-to-End Stream Creation
End-to-End Pipeline Demonstration
Snowflake - External Functions
Introduction to External Functions
What are AWS Lambda Functions?
Write & Deploy AWS Lambda Function
What is IAM Role in AWS?
Creating IAM Role in AWS
Create API Gateway
Secure and Deploy API Gateway
Create External Function in Snowflake

Snowflake with Python, Spark, and Airflow on AWS
Connect Python with Snowflake in Your Local Machine
Introduction to AWS Glue
Deploy and Execute Python Script to AWS Glue
Parameterize Your Python Script on AWS Glue
Python Pandas with Snowflake on AWS Glue
What Is Pushdown in Spark 3.1?
What is Spark? Introduction to PySpark
Deploy a PySpark Script Using AWS Glue
What is Airflow?
Set Up Managed Airflow Cluster on AWS
Configure Snowflake Connectivity in Airflow
Deploy a PySpark Transformation Job in AWS Glue
Set Up Airflow DAG
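A DAG, as scheduled by Airflow in the topics above, is a set of tasks plus the dependencies between them. The sketch below shows only that ordering idea using Python's standard-library `graphlib` (Python 3.9+); it is not Airflow code, and the task names are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each task maps to the set of tasks that must finish first.
dependencies = {
    "extract_from_s3": set(),
    "transform_with_pyspark": {"extract_from_s3"},
    "load_into_snowflake": {"transform_with_pyspark"},
    "validate_row_counts": {"load_into_snowflake"},
    "refresh_report": {"load_into_snowflake"},
}

# static_order() yields the tasks so that every task comes after its prerequisites.
run_order = list(TopologicalSorter(dependencies).static_order())
```

The two tasks that depend only on `load_into_snowflake` have no ordering between them, so a scheduler is free to run them in parallel. A cycle in the dependencies would raise `graphlib.CycleError`, which is why the graph has to be acyclic.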
Real-Time Streaming with Kafka and Snowflake
What is Kafka?
Why Go with Real-Time Streaming?
Introduction to Kafka
Download the Necessary JAR Files
Set Up Kafka in Your Local System
Set Up Kafka Snowflake Connector
Set Up Encryption Keys for Kafka-Snowflake Connectivity
Streaming Data in Action

Snowflake - Data Protection and Governance
What Is Time Travel and Fail-safe in Snowflake?
Time Travel and Data Recovery
Column Level Dynamic Data Masking
What Is Row Level Security?
Create and Implement Row Level Access Policy

Project Work
End-to-End Project Work-1
End-to-End Project Work-2
BEPEC ADD-ONS

01 Experience the Data Scientist Job Role before you get into the
Industry.
02 Add Roles & Responsibilities with Real-Time Projects in your
Resume.
03 Live Class Access until you Crack the Interview & Life-Time
Dashboard Access with 380+ Hours of Content.
04 Learn every topic with Mindmaps and learn the [What, Why, When &
How] behind every topic.

Few More Amenities:
Pause & restart your live classes at any point and any number of
times.
We measure your progress and learning track. If needed, we redesign
your roadmap until you are comfortable with it.
Every project you work on under BEPEC is counted as an Internship.
Every live class is recorded, and you get access to the recording.
Successful Transitions!
Few Screenshots
We are dedicated to achieving your #careertransition with our
powerful training approach.

200+ Corporate Clients
35000+ Careers Upskilled
10000+ Careers Transformed

"We have different customised training options based on your needs."

Thank you!
All the best for your Career Transition Journey!
