SlideShare a Scribd company logo
Overview of AWS Data Services
Understanding S3, Redshift, Glue & More
Agenda
Introduction to AWS Data Services
Amazon S3
Amazon Redshift
AWS Glue
Other AWS Data Services
Use Cases & Architectures
Q&A
What Are AWS Data Services?
Suite of cloud-based tools for storing, processing, analyzing, and moving
data
Fully managed, scalable, pay-as-you-go
Commonly used in:
-Data lakes
-Analytics pipelines
-ETL workflows
-Real-time processing
Amazon S3 (Simple Storage Service)
Object storage service for virtually unlimited data
Durable (99.999999999%) and available
Use cases:
Backup and restore
Data lake storage
Static website hosting
Supports: versioning, lifecycle policies, encryption
Integrates with: Athena, Redshift Spectrum, Glue, etc.
Amazon Redshift
Fully managed data warehouse
Columnar storage, optimized for analytics
Supports SQL, connects with BI tools (Tableau, Power BI)
Features:
Redshift Spectrum: query data in S3
Concurrency Scaling
Materialized Views
Use Cases: BI, analytics dashboards, reporting
AWS Glue
Serverless data integration & ETL service
Automates discovery, cataloging, and transformation
Components:
Glue Data Catalog
Glue Crawlers
Glue Jobs (Python or Spark)
Use cases:
Data preparation for analytics
Building data pipelines
Schema inference & metadata management
AWS Athena
Interactive query service for S3 data
SQL-based, serverless
Pay-per-query model
Works well with S3, Glue Catalog
Use cases: Ad hoc analysis, logs analysis, quick reports
AWS Lake Formation
Simplifies setting up secure data lakes on S3
Manages:
Data ingestion
Access control
Schema definitions
Centralized governance of data lake
AWS Kinesis
Real-time data streaming service
Kinesis Data Streams, Kinesis Firehose, Kinesis Analytics
Use cases:
Real-time analytics
Log & clickstream processing
IoT telemetry data
Sample Architecture: Modern Data Lake
Scalable, flexible, and cost-efficient architecture for analytics and ML
When to Use What?
Service Primary Use Case
S3 Storage for raw/processed data
Redshift Complex analytics on structured data
Glue ETL workflows, data discovery
Athena Ad-hoc SQL on S3
Kinesis Real-time data processing
Lake Formation Data lake setup & security
Summary
AWS provides end-to-end data tools: storage, transformation, analytics
Choose services based on use case: real-time, batch, ad-hoc
Integration between services is seamless
Great for building scalable and secure data architectures
Questions & Discussion
Let’s dive deeper into anything you’re curious about!

More Related Content

Similar to Aws Data Engineer Course | Aws Data Engineer Training (20)

PPTX
Construindo data lakes e analytics com AWS
Amazon Web Services LATAM
 
PPTX
Big data journey to the cloud rohit pujari 5.30.18
Cloudera, Inc.
 
PDF
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
SasikumarPalanivel3
 
PDF
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
saidbilgen
 
PDF
AWS Big Data Landscape
Crishantha Nanayakkara
 
PDF
From ingest to insights with AWS
Paul Van Siclen
 
PDF
AWS Analytics Services - When to use what? | AWS Summit Tel Aviv 2019
AWS Summits
 
PDF
Data engineering
Suman Debnath
 
PDF
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
Provectus
 
PDF
The Beginner's Guide to Data Lakes in AWS
Guillermo A. Fisher
 
PDF
AWS Innovate: Build a Data Lake on AWS- Johnathon Meichtry
Amazon Web Services Korea
 
PDF
Builders' Day - Building Data Lakes for Analytics On AWS LC
Amazon Web Services LATAM
 
PDF
Building a Modern Data Platform in the Cloud. AWS Initiate Portugal
javier ramirez
 
PPTX
Unleashing the Power of Data Analytics with AWS Glue and Data Lakes.pptx
ShamnadShaffi3
 
PDF
2017 AWS DB Day | Amazon Athena 서비스 최신 기능 소개
Amazon Web Services Korea
 
PDF
Module 1 - CP Datalake on AWS
Lam Le
 
PDF
Serverless Big Data Architectures: Serverless Data Analytics
Kristana Kane
 
PDF
Immersion Day - Como construir seu Data Lake em dias na AWS
Amazon Web Services LATAM
 
PDF
Big Data, Ingeniería de datos, y Data Lakes en AWS
javier ramirez
 
PDF
Immersion Day - Como a AWS apoia a estratégia analítica de sua empresa
Amazon Web Services LATAM
 
Construindo data lakes e analytics com AWS
Amazon Web Services LATAM
 
Big data journey to the cloud rohit pujari 5.30.18
Cloudera, Inc.
 
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
SasikumarPalanivel3
 
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
saidbilgen
 
AWS Big Data Landscape
Crishantha Nanayakkara
 
From ingest to insights with AWS
Paul Van Siclen
 
AWS Analytics Services - When to use what? | AWS Summit Tel Aviv 2019
AWS Summits
 
Data engineering
Suman Debnath
 
"Building a Modern Data platform in the Cloud", Alex Casalboni, AWS Dev Day K...
Provectus
 
The Beginner's Guide to Data Lakes in AWS
Guillermo A. Fisher
 
AWS Innovate: Build a Data Lake on AWS- Johnathon Meichtry
Amazon Web Services Korea
 
Builders' Day - Building Data Lakes for Analytics On AWS LC
Amazon Web Services LATAM
 
Building a Modern Data Platform in the Cloud. AWS Initiate Portugal
javier ramirez
 
Unleashing the Power of Data Analytics with AWS Glue and Data Lakes.pptx
ShamnadShaffi3
 
2017 AWS DB Day | Amazon Athena 서비스 최신 기능 소개
Amazon Web Services Korea
 
Module 1 - CP Datalake on AWS
Lam Le
 
Serverless Big Data Architectures: Serverless Data Analytics
Kristana Kane
 
Immersion Day - Como construir seu Data Lake em dias na AWS
Amazon Web Services LATAM
 
Big Data, Ingeniería de datos, y Data Lakes en AWS
javier ramirez
 
Immersion Day - Como a AWS apoia a estratégia analítica de sua empresa
Amazon Web Services LATAM
 

More from Accentfuture (20)

PPTX
Understanding Databricks File System .
Accentfuture
 
PPTX
Databricks for Recommendation Systems.pptx
Accentfuture
 
PPTX
Spark Performance Tuning | Best PySpark & Databricks Online Training
Accentfuture
 
PPTX
Model Training & Hyperparameter Tuning.pptx
Accentfuture
 
PDF
Real-time Data Processing with Azure Stream Analytics.pdf
Accentfuture
 
PDF
Automating Data Pipelines with AWS Step Functions
Accentfuture
 
PPTX
Databricks_Intro_Presentation | Databricks Online Training
Accentfuture
 
PDF
Performance Optimization in Databricks .
Accentfuture
 
PPTX
Databricks Online Training | Databricks Online Course
Accentfuture
 
PPTX
Azure Data Engineer Training | Azure Data Engineer Course
Accentfuture
 
PPTX
Aws Data Engineer Training | Aws Data Engineer Course
Accentfuture
 
PPTX
Databricks Training | Databricks Course
Accentfuture
 
PPTX
databricks course | databricks online training
Accentfuture
 
PDF
AWS data engineer online course | AWS data engineer training
Accentfuture
 
PDF
Azure Data Engineer Training | Azure Data Engineer Course
Accentfuture
 
PDF
Databricks Online Training | Databricks Online Course
Accentfuture
 
PDF
Azure Data Engineer Training | Azure Data Engineer Course
Accentfuture
 
PDF
Aws Data Engineer Training | Aws Data Engineer Course
Accentfuture
 
DOCX
Databricks Online Training | Databricks Online Course
Accentfuture
 
PDF
Azure Data Engineer Training | Azure Data Engineer Course
Accentfuture
 
Understanding Databricks File System .
Accentfuture
 
Databricks for Recommendation Systems.pptx
Accentfuture
 
Spark Performance Tuning | Best PySpark & Databricks Online Training
Accentfuture
 
Model Training & Hyperparameter Tuning.pptx
Accentfuture
 
Real-time Data Processing with Azure Stream Analytics.pdf
Accentfuture
 
Automating Data Pipelines with AWS Step Functions
Accentfuture
 
Databricks_Intro_Presentation | Databricks Online Training
Accentfuture
 
Performance Optimization in Databricks .
Accentfuture
 
Databricks Online Training | Databricks Online Course
Accentfuture
 
Azure Data Engineer Training | Azure Data Engineer Course
Accentfuture
 
Aws Data Engineer Training | Aws Data Engineer Course
Accentfuture
 
Databricks Training | Databricks Course
Accentfuture
 
databricks course | databricks online training
Accentfuture
 
AWS data engineer online course | AWS data engineer training
Accentfuture
 
Azure Data Engineer Training | Azure Data Engineer Course
Accentfuture
 
Databricks Online Training | Databricks Online Course
Accentfuture
 
Azure Data Engineer Training | Azure Data Engineer Course
Accentfuture
 
Aws Data Engineer Training | Aws Data Engineer Course
Accentfuture
 
Databricks Online Training | Databricks Online Course
Accentfuture
 
Azure Data Engineer Training | Azure Data Engineer Course
Accentfuture
 
Ad

Recently uploaded (20)

PPTX
Orientation MOOCs on SWAYAM for Teachers
moocs1
 
PPTX
Top 10 AI Tools, Like ChatGPT. You Must Learn In 2025
Digilearnings
 
PPTX
Company - Meaning - Definition- Types of Company - Incorporation of Company
DevaRam6
 
PPTX
Gupta Art & Architecture Temple and Sculptures.pptx
Virag Sontakke
 
PDF
Comprehensive Guide to Writing Effective Literature Reviews for Academic Publ...
AJAYI SAMUEL
 
PPTX
Folding Off Hours in Gantt View in Odoo 18.2
Celine George
 
PPTX
Maternal and Child Tracking system & RCH portal
Ms Usha Vadhel
 
PDF
FULL DOCUMENT: Read the full Deloitte and Touche audit report on the National...
Kweku Zurek
 
PPTX
ARAL-Guidelines-Learning-Resources_v3.pdf.pptx
canetevenus07
 
PPTX
FAMILY HEALTH NURSING CARE - UNIT 5 - CHN 1 - GNM 1ST YEAR.pptx
Priyanshu Anand
 
PDF
Stepwise procedure (Manually Submitted & Un Attended) Medical Devices Cases
MUHAMMAD SOHAIL
 
PPTX
MALABSORPTION SYNDROME: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PDF
Living Systems Unveiled: Simplified Life Processes for Exam Success
omaiyairshad
 
PPTX
TOP 10 AI TOOLS YOU MUST LEARN TO SURVIVE IN 2025 AND ABOVE
digilearnings.com
 
PDF
Tips for Writing the Research Title with Examples
Thelma Villaflores
 
PPTX
ANORECTAL MALFORMATIONS: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
PDF
water conservation .pdf by Nandni Kumari XI C
Directorate of Education Delhi
 
PPTX
Auditing and Assurance Meaning - Objectives - Types - Advantages & Disadvanta...
DevaRam6
 
PPTX
How to Manage Resupply Subcontracting in Odoo 18
Celine George
 
PPTX
ARAL Program of Adia Elementary School--
FatimaAdessaPanaliga
 
Orientation MOOCs on SWAYAM for Teachers
moocs1
 
Top 10 AI Tools, Like ChatGPT. You Must Learn In 2025
Digilearnings
 
Company - Meaning - Definition- Types of Company - Incorporation of Company
DevaRam6
 
Gupta Art & Architecture Temple and Sculptures.pptx
Virag Sontakke
 
Comprehensive Guide to Writing Effective Literature Reviews for Academic Publ...
AJAYI SAMUEL
 
Folding Off Hours in Gantt View in Odoo 18.2
Celine George
 
Maternal and Child Tracking system & RCH portal
Ms Usha Vadhel
 
FULL DOCUMENT: Read the full Deloitte and Touche audit report on the National...
Kweku Zurek
 
ARAL-Guidelines-Learning-Resources_v3.pdf.pptx
canetevenus07
 
FAMILY HEALTH NURSING CARE - UNIT 5 - CHN 1 - GNM 1ST YEAR.pptx
Priyanshu Anand
 
Stepwise procedure (Manually Submitted & Un Attended) Medical Devices Cases
MUHAMMAD SOHAIL
 
MALABSORPTION SYNDROME: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
Living Systems Unveiled: Simplified Life Processes for Exam Success
omaiyairshad
 
TOP 10 AI TOOLS YOU MUST LEARN TO SURVIVE IN 2025 AND ABOVE
digilearnings.com
 
Tips for Writing the Research Title with Examples
Thelma Villaflores
 
ANORECTAL MALFORMATIONS: NURSING MANAGEMENT.pptx
PRADEEP ABOTHU
 
water conservation .pdf by Nandni Kumari XI C
Directorate of Education Delhi
 
Auditing and Assurance Meaning - Objectives - Types - Advantages & Disadvanta...
DevaRam6
 
How to Manage Resupply Subcontracting in Odoo 18
Celine George
 
ARAL Program of Adia Elementary School--
FatimaAdessaPanaliga
 
Ad

Aws Data Engineer Course | Aws Data Engineer Training

  • 1. Overview of AWS Data Services Understanding S3, Redshift, Glue & More
  • 2. Agenda Introduction to AWS Data Services Amazon S3 Amazon Redshift AWS Glue Other AWS Data Services Use Cases & Architectures Q&A
  • 3. What Are AWS Data Services? Suite of cloud-based tools for storing, processing, analyzing, and moving data Fully managed, scalable, pay-as-you-go Commonly used in: -Data lakes -Analytics pipelines -ETL workflows -Real-time processing
  • 4. Amazon S3 (Simple Storage Service) Object storage service for virtually unlimited data Durable (99.999999999%) and available Use cases: Backup and restore Data lake storage Static website hosting Supports: versioning, lifecycle policies, encryption Integrates with: Athena, Redshift Spectrum, Glue, etc.
  • 5. Amazon Redshift Fully managed data warehouse Columnar storage, optimized for analytics Supports SQL, connects with BI tools (Tableau, Power BI) Features: Redshift Spectrum: query data in S3 Concurrency Scaling Materialized Views Use Cases: BI, analytics dashboards, reporting
  • 6. AWS Glue Serverless data integration & ETL service Automates discovery, cataloging, and transformation Components: Glue Data Catalog Glue Crawlers Glue Jobs (Python or Spark) Use cases: Data preparation for analytics Building data pipelines Schema inference & metadata management
  • 7. AWS Athena Interactive query service for S3 data SQL-based, serverless Pay-per-query model Works well with S3, Glue Catalog Use cases: Ad hoc analysis, logs analysis, quick reports
  • 8. AWS Lake Formation Simplifies setting up secure data lakes on S3 Manages: Data ingestion Access control Schema definitions Centralized governance of data lake
  • 9. AWS Kinesis Real-time data streaming service Kinesis Data Streams, Kinesis Firehose, Kinesis Analytics Use cases: Real-time analytics Log & clickstream processing IoT telemetry data
  • 10. Sample Architecture: Modern Data Lake Scalable, flexible, and cost-efficient architecture for analytics and ML
  • 11. When to Use What? Service Primary Use Case S3 Storage for raw/processed data Redshift Complex analytics on structured data Glue ETL workflows, data discovery Athena Ad-hoc SQL on S3 Kinesis Real-time data processing Lake Formation Data lake setup & security
  • 12. Summary AWS provides end-to-end data tools: storage, transformation, analytics Choose services based on use case: real-time, batch, ad-hoc Integration between services is seamless Great for building scalable and secure data architectures Questions & Discussion Let’s dive deeper into anything you’re curious about!