Architecture For Data Ingestion Clean Processing and Visulizationyounesse

The document discusses the ingestion, processing, analysis, and visualization of IoT and other data sources using various AWS services. IoT sensor data will be ingested using Kinesis Data Streams, historical database records will be migrated to RDS using DMS with CDC, and third-party data will be fetched and processed using AWS Glue. Glue will also be used for data cleansing and structuring. EMR will perform complex analysis and QuickSight will enable interactive dashboards. CloudWatch will monitor the system and Lambda will automate processes.

Uploaded by

Yøű Ñęş

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views

Architecture For Data Ingestion Clean Processing and Visulizationyounesse

Uploaded by

Yøű Ñęş

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Data Ingestion:

IoT Sensors Data:

I'll begin by capturing real-time data from our IoT sensors. To achieve this, I'm utilizing Amazon Kinesis
Data Stream. It effortlessly handles streaming data, allowing us to ingest data as it's generated.

Historical Database Records:

Moving on to historical data, I've chosen to employ the AWS Database Migration Service (DMS). It helps
replicate data from our existing database to an Amazon RDS instance. With Change Data Capture (CDC)
enabled, we can stay updated with ongoing changes.

Third-Party Data:
When it comes to supplementing our internally generated data, I'll be utilizing AWS Glue, which operates
much like our familiar Hadoop-based tools. Glue fetches data from various sources, performs
transformations, and stores the processed data in our S3 storage.

Data Processing and Transformation:

Data cleanliness and structure are paramount. For this, I'll continue to use AWS Glue. It not only
transforms and cleanses data but also ensures it's stored in a logical manner in S3.
Data Analysis and Visualization:

Data Analysis:
To perform complex analysis, I've opted for Amazon EMR (Elastic MapReduce). This aligns with my current
approach by providing a managed Hadoop framework. I can run Apache Spark and Hive jobs here, just as
I do with our existing setup.

Dashboards and Visualization:

For the exciting part – visualization – I've integrated Amazon QuickSight. This cloud-native business
intelligence tool directly connects to our S3 data and EMR for analysis. It enables me to craft interactive
dashboards showcasing our insights.

Automation and Monitoring:

To ensure smooth operations and management:

 I've employed AWS CloudWatch to keep tabs on the health and performance of our system.
 AWS Lambda functions will trigger specific actions when events occur, streamlining processes.

Cheat Sheet AWS Data Engineer Associate
No ratings yet
Cheat Sheet AWS Data Engineer Associate
117 pages
AWS Cloud Practitioner (CLF C02)
100% (1)
AWS Cloud Practitioner (CLF C02)
102 pages
DataAnalytics AWS PDF
No ratings yet
DataAnalytics AWS PDF
133 pages
AWS Certified Cloud Practitioner 03-09-2021
100% (1)
AWS Certified Cloud Practitioner 03-09-2021
111 pages
Final project on data lakes with AWS
No ratings yet
Final project on data lakes with AWS
2 pages
Implementing Travel & Hospitality Data Mesh: AWS Reference Architecture
No ratings yet
Implementing Travel & Hospitality Data Mesh: AWS Reference Architecture
2 pages
Humair S. AWS Certified Data Engineer Study Guide. Associate (DEA-C01) Exam 2025
No ratings yet
Humair S. AWS Certified Data Engineer Study Guide. Associate (DEA-C01) Exam 2025
1,059 pages
Amazon Capstone Project
No ratings yet
Amazon Capstone Project
2 pages
Core Components of AWS
No ratings yet
Core Components of AWS
3 pages
Aws Data Service Notes
No ratings yet
Aws Data Service Notes
9 pages
Ppb1 Workshop Batch v2
No ratings yet
Ppb1 Workshop Batch v2
43 pages
AWS Data Engineering Services
No ratings yet
AWS Data Engineering Services
24 pages
WhizCard CLF C01 06 09 2022
No ratings yet
WhizCard CLF C01 06 09 2022
111 pages
Section 2
No ratings yet
Section 2
1 page
AWS Data Lake
No ratings yet
AWS Data Lake
13 pages
AWS Services
No ratings yet
AWS Services
34 pages
60 Day Data Lake Plan v2
No ratings yet
60 Day Data Lake Plan v2
4 pages
How To Build Data Pipelines On AWS - Reference Workflow
No ratings yet
How To Build Data Pipelines On AWS - Reference Workflow
26 pages
Building Data Lakes
No ratings yet
Building Data Lakes
40 pages
AWS ML Cheat Sheet Nov 2024
No ratings yet
AWS ML Cheat Sheet Nov 2024
100 pages
Architecture
No ratings yet
Architecture
6 pages
Subtitle
No ratings yet
Subtitle
2 pages
AWS Portfolio
No ratings yet
AWS Portfolio
76 pages
Cheat Sheet AWS Solutions Architect Professional
No ratings yet
Cheat Sheet AWS Solutions Architect Professional
177 pages
Data Lakes For Maximum Flexibility
No ratings yet
Data Lakes For Maximum Flexibility
29 pages
Document 1
No ratings yet
Document 1
15 pages
AWS White Paper
No ratings yet
AWS White Paper
6 pages
DocScanner 20 Oct 2024 2-19 PM
No ratings yet
DocScanner 20 Oct 2024 2-19 PM
16 pages
airline-ticket-shopping-ra
No ratings yet
airline-ticket-shopping-ra
1 page
Data Lake On Aws
No ratings yet
Data Lake On Aws
29 pages
Modernize Your Analyticsand Data Architecture
No ratings yet
Modernize Your Analyticsand Data Architecture
47 pages
AWS 05 DataLake
No ratings yet
AWS 05 DataLake
78 pages
data-platform-on-aws-and-snowflake-ra
No ratings yet
data-platform-on-aws-and-snowflake-ra
1 page
AWS Data Lake
No ratings yet
AWS Data Lake
87 pages
AWS Learning material
No ratings yet
AWS Learning material
13 pages
Devops Project
No ratings yet
Devops Project
6 pages
Challenges of data platform
No ratings yet
Challenges of data platform
4 pages
TCS Anl Presentation - VIL v2.3
No ratings yet
TCS Anl Presentation - VIL v2.3
45 pages
Data Architecture On Aws Slides
No ratings yet
Data Architecture On Aws Slides
33 pages
112115115 CC LAB7
No ratings yet
112115115 CC LAB7
7 pages
Enterprise Data Warehousing On Aws
No ratings yet
Enterprise Data Warehousing On Aws
26 pages
ANT205 R Achieving Your Modern Data Architecture
No ratings yet
ANT205 R Achieving Your Modern Data Architecture
71 pages
Modernserverlessdatalak
No ratings yet
Modernserverlessdatalak
45 pages
Project
No ratings yet
Project
3 pages
Notatki Na CLF
No ratings yet
Notatki Na CLF
3 pages
AWS Data Engineering Involves Using Amazon Web Services
No ratings yet
AWS Data Engineering Involves Using Amazon Web Services
2 pages
Analytics Services v2
No ratings yet
Analytics Services v2
59 pages
CC Assignment 2
No ratings yet
CC Assignment 2
4 pages
Real Time Analytics Spark Streaming PDF
No ratings yet
Real Time Analytics Spark Streaming PDF
20 pages
Data Lake Foundation WTH Zeppelin and Amazon Rds On The Aws Cloud
No ratings yet
Data Lake Foundation WTH Zeppelin and Amazon Rds On The Aws Cloud
23 pages
devops lead
No ratings yet
devops lead
10 pages
AWS Services - Analytics and ML
No ratings yet
AWS Services - Analytics and ML
2 pages
Alex Casalboni Advanced Serverless Architectural Patterns On AWS
No ratings yet
Alex Casalboni Advanced Serverless Architectural Patterns On AWS
48 pages
Cloud computing activity - unit 3 (1)
No ratings yet
Cloud computing activity - unit 3 (1)
18 pages
AWS Machine Learning Specialty
100% (1)
AWS Machine Learning Specialty
67 pages
Industrial Data Platform Ra
No ratings yet
Industrial Data Platform Ra
1 page
Data Engineering Strategy for ETL and AWS
No ratings yet
Data Engineering Strategy for ETL and AWS
3 pages
Effective Business Intelligence with QuickSight
From Everand
Effective Business Intelligence with QuickSight
Rajesh Nadipalli
No ratings yet
Mastering Amazon Redshift: Scalable Cloud Data Warehousing
From Everand
Mastering Amazon Redshift: Scalable Cloud Data Warehousing
Robert Johnson
No ratings yet
AWS SysOps Administrator Associate: From basic to advanced
From Everand
AWS SysOps Administrator Associate: From basic to advanced
Alex Carvalho
No ratings yet

Architecture For Data Ingestion Clean Processing and Visulizationyounesse

Uploaded by

Architecture For Data Ingestion Clean Processing and Visulizationyounesse

Uploaded by

Data Ingestion:

IoT Sensors Data:

Historical Database Records:

Data Processing and Transformation:

Dashboards and Visualization:

Automation and Monitoring:

To ensure smooth operations and management:

You might also like