WhizCard CLF C02 Cheat Sheet Nov 2024
WhizCard CLF C02 Cheat Sheet Nov 2024
The information provided in the Cheat Sheet is for educational purposes only;
created in our efforts to help aspirants prepare for the AWS Certified Cloud
Practitioner certification exam. Though references have been taken from AWS
documentation, it’s not intended as a substitute for the official docs. The document
can be reused, reproduced, and printed in any form; ensure that appropriate
sources are credited and required permissions are received.
Index
Analytics ....................... 4 Compute…………………26 Developer Tools………..48
Amazon Athena........................ 5 AWS EC2………………27 AWS CodeBuild……..49
Amazon Elasticsearch Service............. 6 AWS EC2 Auto Scaling…….28 AWS CodeDeploy…..50
Amazon EMR......... 7 AWS Batch………………30 AWS X-Ray…………..51
AWS Kinesis Data Streams....................... 8 AWS Elastic Beanstalk……..31 Amazon Workspaces……..52
Amazon Kinesis Data Firehose........... 9 AWS Lambda………………..32
Amazon Managed Streaming for Apache AWS Serverless Application Repository….33 Front-End Web and Mobile……..53
Kafka........................ 10 AWS Outpost……………………………34
Amazon API Gateway…….54
Amazon Redshift.......................... 11
AWS Glue......................12 Containers……………………………..35
Internet of Things…………55
AWS LakeFormation .......................13 Amazon Elastic Container Registry….36
AWS IoT Core……………….56
Amazon Elastic Container Service….37
Application Integration……………………14 AWS IoT Events…….57
Amazon Elastic Kubernetes Service…..38
AWS Step Functions…………15 AWS IoT Greengrass……58
AWS Fargate…..39
AWS EventBridge…………….16 FreeRTOS……59
AWS SNS…………………………17 Database…………………..40
AWS SQS………………………..18 Amazon Aurora……..41
AWS AppSync…………………19
Amazon DocumentDB………42
AWS Simple Workflow Service…..20
Amazon DynamoDB……….43
Amazon ElastiCache…….44
AWS Cost Management……….21
Amazon Keyspaces………45
AWS Cost Explorer…….22
Amazon Neptune……….46
AWS Budgets………………23
Amazon RDS…………47
AWS Cost and Usage Report………24
Reserved Instance Reporting………25
Index
Machine Learning…………..60 Networking and Content Delivery……..83
Amazon VPC…………..84 Storage…………………100
Amazon SageMaker…..61
Amazon CloudFront……..85 Amazon S3…………101
Amazon Polly…………62
Amazon Route 53……….86 Amazon Elastic Block Storage…..102
Amazon Transcribe……..63
AWS Direct Connect…….87 Amazon Elastic File System ……..104
Amazon Comprehend……64
AWS PrivateLink…………….88 Amazon FSx for Lustre……….105
Amazon Lex ………………65
AWS Transit Gateway…..89 Amazon FSx for Windows File Server……….106
Amazon Translate…………66
Elastic Load Balancing …….90 Amazon S3 Glacier…………107
Amazon Kendra……………67
Amazon Backup……………..108
Security, Identity, and Compliance……..91 AWS Snowball…………………..109
Management and Governance………68
Amazon IAM……….92 AWS Storage Gateway……………110
Amazon CloudWatch………69
AWS CloudFormation….70 Amazon Cognito………..93
AWS CloudTrail……...71 AWS Certificate Manager…….94
AWS Config…………….72 AWS Directory Service………….95
AWS License Manager……73 AWS Key Management Service.......96
AWS Management Console…..74 AWS Resource Access Manager………….97
AWS Organizations………………75 AWS Secrets Manager………….98
AWS Systems Manager…………….76 AWS Security Hub……………..99
● Charges are applied for each hour of use of EC2 instances and storage volumes attached to the instances.
Pricing Details:
● Amazon OpenSearch Service does not charge for data transfer between availability zones.
Amazon EMR
What is Amazon EMR? It offers basic functionalities for maintaining clusters such as
Amazon EMR (Elastic Map Reduce) is a Monitoring
service used to process and analyze large Replacing failed instances
amounts of data in the cloud using Apache Bug fixes
Hive, Hadoop, Apache Flink, Spark, etc.
✔ The main component of EMR is a cluster that collects Amazon EC2 instances
(also known as nodes in EMR).
✔ It offers basic functionalities for maintaining clusters such as monitoring, Amazon EMR storage layers
replacing failed instances, bug fixes, etc.
✔ It analyzes machine learning workloads using Apache Spark MLlib and Amazon EMR can be accessed in the following ways:
TensorFlow, clickstream workloads using Apache Spark and Apache Hive, and EMR Console
real-time streaming workloads from Amazon Kinesis using Apache Flink.
AWS Command Line Interface (AWS CLI)
✔ It provides more than one compute instances or containers to process the Software Development Kit (SDK)
workloads and can be executed on the following AWS services:
Web Service API
Amazon EC2 Amazon EKS AWS Outposts
Amazon Kinesis Data Streams
What are Amazon Kinesis Data Streams?
Amazon Kinesis Data Streams (KDS) is a
scalable real-time data streaming service. It Amazon Kinesis is a service used to collect, process,
captures gigabytes of data from sources like and analyze real-time streaming data. It can be an
website clickstreams, events streams alternative to Apache Kafka.
(database and location-tracking), and social
media feeds. Amazon Kinesis Data
Streams
Data Streams are divided into Shards / Partitions whose data Each shard provides a capacity of 1MB per second input data
retention is 1 day (by default) and can be extended to 7 days and 2MB per second output data.
Amazon Kinesis Data Firehose
What is Amazon Kinesis Data
Firehose? It delivers streaming data to the following services:
Amazon Kinesis Data Firehose is a serverless
service used to capture, transform, and load Amazon S3
streaming data into data stores and analytics
services. Amazon Redshift
❖ Maintenance of Apache Kafka clusters Amazon Kinesis Data Analytics: To execute Apache
Flink job on Amazon MSK cluster
❖ Maintenance of Apache ZooKeeper
Amazon MSK
Lambda Functions
❖ Multi-AZ replication for Apache Kafka clusters Integrates with:
Charged based on the number of bytes scanned by Redshift It has the capability to automatically copy snapshots (automated
Spectrum, rounded up to 10MB minimum per query. or manual) of a cluster to another AWS Region
AWS Glue
What is AWS Glue?
Properties of AWS Glue:
AWS Glue is a serverless ETL (extract,
transform, and load) service used to It supports data integration, preparing and combining data for
categorize data and move them between analytics, machine learning, and other applications’ development.
various data stores and streams.
Amazon CloudWatch
Amazon CloudTrail Lake Formation is pointed at the data sources, then crawls
Amazon Glue: Both use same Data Catalog the sources and moves the data into the new Amazon S3
Amazon Redshift Spectrum data lake.
Amazon EMR
It integrates with AWS Identity and Access Management
AWS Key Management Service (IAM) to provide fine-grained access to the data stored in
Amazon Athena: Athena's users can query those AWS Glue data lakes using a simple grant/revoke process
catalog which has Lake Formation permissions on them.
Pricing Details: ● Charges are applied based on the service integrations (AWS Glue, Amazon S3, Amazon EMR, Amazon
Redshift) at a standard rate
Application Integration
AWS Step Functions
What is AWS Step Functions?
AWS Step Functions resembles state machines and tasks. Each step in a
AWS Step Functions is a serverless orchestration service that workflow is a state. The output of one step signifies an input to the next
converts an application's workflow into a series of steps by results in functions orchestration.
combining AWS Lambda functions and other AWS services. It helps to execute each step in an order defined by the business logic of
the application.
It provides some built-in functionalities like sequencing, error handling,
timeout handling, and removing a significant operational overhead from
the team.
It can control other AWS services, like AWS Lambda (to perform tasks),
Standard Express
processing machine learning models, AWS Glue (to create an extract,
Workflows Workflows
transform, and load (ETL) workflows), and automated workflows that
● It executes once in a workflow ● It executes at-least-once in a workflow
execution for up to one year. execution for up to five minutes. require human approval.
● They are ideal for long-running and ● They are ideal for high-processing It provides multiple automation features like routine deployments,
auditable workflows. workloads, such as streaming data
upgrades, installations, migrations, patch management, infrastructure
processing and IoT data ingestion.
selection, and data synchronization
Executions are the instances where workflow runs to perform
tasks.
Amazon EventBridge is a serverless event bus An event bus is an entity that receives events, and rules get
service that connects applications with data attached to that event bus that matches the events received.
from multiple sources.
Example:
1 message, 1 topic, 10 subscribers so that a single message will be
notified to 10 different subscribers.
Amazon SNS
Amazon Simple Queue Service (SQS)
What are Amazon Simple There are two SQS Queue types:
Queue Service (SQS)? Standard Queue -
Amazon Simple Queue Service (SQS) is a ❖ The unlimited number of transactions per second.
serverless service used to decouple (loose ❖ Messages get delivered in any order.
couple) serverless applications and ❖ Messages can be sent twice or multiple times.
components. FIFO Queue -
❖ 300 messages per second.
❑ The queue represents a temporary repository between ❖ Support batches of 10 messages per operation, results in
the producer and consumer of messages. 3000 messages per second.
❑ It can scale up to 1-10000 messages per second. ❖ Messages get consumed only once.
❑ The default retention period of messages is four days
and can be extended to fourteen days. Dead-Letter Queue is a queue for those messages that are not
❑ SQS messages get automatically deleted after being consumed successfully. It is used to handle message failure.
consumed by the consumers.
❑ SQS messages have a fixed size of 256KB.
It improves performance by providing data caches, provides Queries: For fetching data from the API
subscriptions to support real-time updates, and provides
client-side data stores to keep off-line clients in sync.
Mutations: For changing data via API
It offers certain advantages over GraphQL, such as enhanced
coding style and seamless integration with modern tools and
frameworks like iOS and Android AWS AppSync Subscriptions: The connections for streaming
data from API
AppSync interface provides a live GraphQL API feature that
allows users to test and iterate on GraphQL schemas and
data sources quickly.
What is Amazon Simple Amazon SWF stores tasks and assigns them to workers
Workflow Service? during execution.
Tasks are performed by implementing workers and execute It is best suited for human-intervened workflows.
either on Amazon EC2 or on on-premise servers (which
means it is not a serverless service).
Amazon SWF is a less-used service, so AWS Step Functions
is the better option than SWF.
AWS Cost Management
AWS Cost Explorer
Reserved Instance
Cost and Usage Reports Reports
AWS Cost and Usage Reports functions: ✔ If the master account of an AWS Organization wants to
block access to the member accounts to set-up a Cost and
❑ It sends report files to your Amazon S3 bucket. Usage Report, Service Control Policy (SCP) can be used.
On-Demand Instances
✔ Useful for short-term needs, unpredictable workloads. It helps to login into the instances using key-pairs, in which AWS manages the
✔ No advance payment, no prior commitment. public key, and the user operates the private key.
Spot Instances
It also provides firewall-like security by specifying IP ranges, type, protocols (TCP),
✔ No advance payment, no prior commitment. port range (22, 25, 443) using security groups.
✔ Useful for cost-sensitive compute workloads.
It provides temporary storage volumes known as instance store volumes, which
Reserved Instances
are deleted if the instance gets stopped, hibernated, or terminated. It also offers
✔ Useful for long-running workloads and predictable usage. non-temporary or persistent volumes known as Amazon EBS volumes.
✔ Offer to choose from No upfront, Partial upfront, or All upfront.
It enables users to write scripts under the option ‘User data,’ used at the
Dedicated Instances instances’ launch.
✔ Instances run on hardware dedicated to a single user.
✔ Other customers can not share the hardware.
It offers to choose from three IP addresses, which are Public IP address (Changes
Dedicated Hosts when the instance is stopped or refreshed), Private IP address (retained even if the
model is stopped), Elastic IP address (static public IP address).
✔ A whole physical server with an EC2 instance allocates to an organization.
Amazon EC2 Auto Scaling
What is Amazon EC2 Auto Scaling?
Amazon EC2 Auto Scaling is a region-specific
Launch Configuration Launch Template
service used to maintain application
availability and enables users to A launch configuration is a A launch template is similar to
automatically add or remove EC2 instances configuration file used by an Auto launch configuration with extra
according to the compute workloads. Scaling group to launch EC2 features as below
instances
❖ The Auto Scaling group is a collection of the minimum number of
EC2 used for high availability. It launches any one of the Spot or It launches both Spot and
On-Demand instances On-Demand instances.
❖ It enables users to use Amazon EC2 Auto Scaling features such as It specifies single instance types. It specifies multiple instance types
fault tolerance, health check, scaling policies, and cost management.
It specifies one launch It specifies multiple launch
❖ The scaling of the Auto Scaling group depends on the size of the configuration at a time templates.
desired capacity. It is not necessary to keep DesiredCapacity and
MaxSize equal.
It is also defined as a developer-centric view of deploying an application It provides multiple deployment policies such as:
on AWS. The only responsibility of the developer is to write, and Elastic All at once, Rolling
Beanstalk handles code and the infrastructure
Rolling with an additional batch
An Elastic Beanstalk application comprises components, including Immutable
environments, versions, platforms, and environment configurations.
Traffic splitting
Scaling computing
Scaling computing resources
resources is
is ItIt has
has automated
automated scaling.
scaling.
manual
manual.
✔ The memory allocated to AWS ✔ The default execution time
It helps to run the code on Using AWS Lambda, one can buildd Lambda for computing is for AWS Lambda is 3
highly-available computing serverless applications composed of 128MB (minimum) to 3008MB seconds, and the
infrastructure and performs Lambda functions triggered by (maximum). Additional maximum is 15 minutes
administrative tasks like server events and can be automatically memory can be requested in (900 seconds).
maintenance, logging, capacity deployed using AWS CodePipeline an increment of 64MB
provisioning, and automatic scaling and AWS CodeBuild. between 128MB - 3008MB.
and code monitoring.
Docker CLI
The Amazon EKS control plane consists of nodes that run the Users can execute batch workloads on the EKS cluster using the
Kubernetes software, such as etcd and the Kubernetes API Kubernetes Jobs API across AWS compute services such as
server. Amazon EC2, Fargate, and Spot Instances.
To ensure high availability, Amazon EKS runs Kubernetes The two methods for creating a new Kubernetes cluster with
control plane instances across multiple Availability Zones. nodes in Amazon EKS:
It automatically replaces unhealthy control plane instances o eksctl - A command-line utility that consists of kubectl for
and provides automated upgrades and patches for the new creating/managing Kubernetes clusters on Amazon EKS.
control planes. o AWS Management Console and AWS CLI
Amazon Elastic Kubernetes Service is integrated with many AWS services for unique capabilities:
❖ Images - Amazon ECR for container images ❖ Authentication - AWS IAM
❖ Load distribution - AWS ELB (Elastic Load Balancing) ❖ Isolation - Amazon VPC
AWS Fargate
What is AWS Fargate?
AWS Fargate is a serverless compute
service used for containers by Amazon In the AWS Management Console, ECS
Elastic Container Service (ECS) and Amazon clusters containing Fargate and EC2 tasks Storage Types supported
Elastic Kubernetes Service (EKS). are displayed separately. for Fargate Tasks
MySQL on RDS
PostgreSQL on RDS
Amazon DocumentDB
Amazon DocumentDB is a fully managed It provides 99.99% availability by copying the cluster's data in
NoSQL database service that manages three different Availability Zones.
MongoDB databases in AWS.
It provides the following throughput capacity modes for reads It eliminates the developers’ operational burden such as scaling,
and writes: patching, updates, server maintenance, and provisioning.
It is best suited for structured data and Online Transaction ✔ If there is a need for unsupported RDS database engines, DB can
Processing (OLTP) types of database workloads such as InnoDB. be deployed on EC2 instances.
It supports the following database engines:
SQL Server The following tasks need to be taken care of manually.
PostgreSQL
Encryption and Security Updates and Backups Disaster Recovery
Amazon Aurora
MYSQL
MariaDB
Oracle
AWS KMS provides encryption at rest for RDS instances, DB snapshots, DB instance
storage, and Read Replicas. The existing database cannot be encrypted.
Amazon RDS only scales up for compute and storage, with no option for decreasing
allocated storage
It provides Multi-AZ and Read Replicas features for high availability, disaster
recovery, and scaling.
• Multi-AZ Deployments - Synchronous replication
Amazon RDS
• Read Replicas - Asynchronous replication.
Developer Tools
AWS CodeBuild
What is AWS CodeBuild?
❑ Charges are applied based on the amount of time taken by AWS
AWS CodeBuild is a continuous CodeBuild to complete the build.
integration service in the cloud used to
compile source code, run tests, and build ❑ The following ways are used to run CodeBuild:
packages for deployment.
AWS CodeBuild
AWS CodePipeline console
❑ AWS Code Services family consists of AWS CodeBuild, AWS AWS Command Line Interface (AWS CLI)
CodeCommit, AWS CodeDeploy, and AWS CodePipeline that AWS SDKs
provide complete and automated continuous integration and
delivery (CI/CD).
❑ It requires VPC ID, VPC subnet IDs, and VPC security group IDs to
access resources in a VPC to perform build or test.
AWS CodeBuild
AWS CodeDeploy
What is AWS CodeDeploy?
❑ Using Amazon EKS, Kubernetes clusters and applications
AWS CodeDeploy is a service that helps to
can be managed across hybrid environments without
automate application deployments to a variety
altering the code.
of compute services such as Amazon EC2, AWS ❑ It can fetch the content for deployment from Amazon S3
Fargate, AWS ECS, and on-premises instances. buckets, Bitbucket, or GitHub repositories.
AWS CodeDeploy ❑ It can deploy different types of application content such
It provides the following deployment type to choose from: as Code, Lambda functions, configuration files, scripts
and even Multimedia files.
❑ It can scale with the infrastructure to deploy on multiple
In-place deployment: instances across development, test, and production
● All the instances in the deployment group are stopped, updated with environments.
new revision and started again after the deployment is complete. ❑ It can integrate with existing continuous delivery
● Useful for EC2/On-premises compute platform. workflows such as AWS CodePipeline, GitHub, Jenkins.
Blue/green deployment:
● The instances in the deployment group of the original environment
are replaced by a new set of instances of the replacement
environment.
● Using Elastic Load Balancer, traffic gets rerouted from the original
environment to the replacement environment and instances of the
original environment get terminated after the deployment is
complete.
● Useful for EC2/On-Premises, AWS Lambda and Amazon ECS compute
platform AWS CodeDeploy
AWS X-Ray
What is AWS X-Ray? It works with the following AWS services:
AWS EC2 (Applications deployed on Instances)
AWS X-Ray is a service that allows visual AWS Elastic Load Balancer
analysis or allows to trace microservices
AWS Elastic BeanStalk
based applications.
AWS Lambda
Amazon ECS (Elastic Container Service)
✔ It provides end-to-end information about the request, Amazon API Gateway
response and calls made to other AWS resources by travelling
through the application's underlying components consisting The X-Ray SDKs are available for the following languages:
of multiple microservices. Go
Java
✔ It creates a service graph by using trace data from the AWS Node.js
resources. Python
Ruby
The graph shows the information about front-end and .Net
backend services calls to process requests and continue
the flow of data.
Amazon WorkSpaces is a managed service It helps to eliminate the management of on-premise VDIs
used to provision virtual Windows or Linux (Virtual Desktop Infrastructure).
desktops for users across the globe.
AWS IoT Core HTTPS protocol - publish messages ❑ It helps to create a persistent Device Shadow (a virtual version of
devices) so that other applications or devices can interact.
It detects events from IoT sensors It helps to create event logic using
such as temperature, motor voltage, conditional statements and trigger
motion detectors, humidity. alerts when an event occurs.
AWS IoT Events accepts data from many IoT sources like sensor devices,
AWS IoT Core, and AWS IoT Analytics.
AWS IoT Greengrass
What is AWS IoT Greengrass?
AWS IoT Greengrass is a cloud service that ❖ The Greengrass Core is a device that enables the
groups, deploys, and manages software for communication between AWS IoT Core and the AWS IoT
all devices at once and enables edge devices Greengrass.
to communicate securely.
It synchronizes data on the device using the following AWS services: ❖ It provides encryption and authentication for device data for
cloud communications.
Amazon Simple Storage Service (Amazon S3)
Amazon Kinesis
❖ It provides AWS Lambda functions and Docker containers as an
AWS IoT Core
environment for code execution.
AWS IoT Analytics
FreeRTOS
What is FreeRTOS?
FreeRTOS is an open-source operating It helps securely connect small
system for microcontrollers that devices to AWS IoT Core or the
enables devices to connect, manage, devices running AWS IoT Greengrass.
program, deploy and scale.
Amazon Lex, an AWS service, enables ● Understand Intent and Context: Effortlessly
developers to build chatbots with integrate AI that comprehends user intent,
natural conversation capabilities, maintains conversation flow, and automates
leveraging the technology behind routine tasks in multiple languages.
Alexa. With seamless integration and ● Omnichannel Deployment: Quickly design
advanced language understanding, and deploy conversational AI across platforms
Lex simplifies speech recognition and x with one-click setup, avoiding hardware or
facilitates the creation of engaging infrastructure management.
chatbots for intuitive user ● AWS Service Integration: Connect seamlessly
interactions. with other AWS services for data access,
business logic execution, performance
monitoring, and more.
● Cost-Efficiency: Pay only for usage based on
speech and text requests, with no upfront or
minimum fees.
Amazon Translate
Amazon CloudWatch
Collects and correlates monitoring data in logs, metrics, and Alarms can be created using CloudWatch Alarms that monitors
events from AWS resources, applications, and services that metrics and send notifications.
run on AWS and on-premises servers.
CloudWatch Agent or API can be used to monitor hybrid cloud
Offers dashboards and creates graphs to visualize cloud architectures.
resources.
CloudWatch Container Insights and Lambda Insights both
Visualizes logs to address issues and improve performance provide dashboards to summarize the performance and errors
by performing queries. for a selected time window.
❑ Nested Stacks are stacks created within another stack by using EC2Instance:
the ‘AWS::CloudFormation::Stack’ resource attribute. Type: AWS::EC2::Instance
Properties:
❑ The main stack is termed as parent stack, and other belonging ImageId: 1234xyz
stacks are termed as child stack, which can be implemented by KeyName: aws-keypair
InstanceType: t2.micro
using ref variable ‘! Ref’. SecurityGroups:
- !Ref EC2SecurityGroup
BlockDeviceMappings:
AWS does not charge for using AWS CloudFormation, and - DeviceName: /dev/sda1
charges are applied for the CloudFormation template services. Ebs:
VolumeSize: 50
AWS CloudTrail
What is AWS CloudTrail? Records": [{
AWS CloudTrail is a service that gets "eventVersion": "1.0",
"userIdentity": {
enabled when the AWS account is created
"type": "IAMUser",
and is used to enable compliance and "principalId": "PR_ID",
auditing of the AWS account. "arn":
"arn:aws:iam::210123456789:user/Rohit",
"accountId": "210123456789",
✔ It offers to view, analyze, and respond to activity across the AWS
"accessKeyId": "KEY_ID",
infrastructure. "userName": "Rohit"
},
✔ It records actions as an event by an IAM user, role, or an AWS service. "eventTime": "2021-01-24T21:18:50Z",
"eventSource": "iam.amazonaws.com",
✔ CloudTrail records can download Cloud Trial events in JSON or CSV file. "eventName": "CreateUser",
"awsRegion": "ap-south-2",
"sourceIPAddress": "176.1.0.1",
✔ CloudWatch monitors and manages the activity of AWS services and "userAgent": "aws-cli/1.3.2 Python/2.7.5
resources, reporting on their health and performance. Whereas Windows/7",
CloudTrail resembles logs of all actions performed inside the AWS "requestParameters": {"userName": "Nayan"},
environment. "responseElements": {"user": {
"createDate": "Jan 24, 2021 9:18:50 PM",
"userName": "Nayan",
✔ IAM log file -
"arn": "arn:aws:iam::128x:user/Nayan",
The below example shows that the IAM user Rohit used the AWS "path": "/",
Management Console to call the AddUserToGroup action to add Nayan to "userId": "12xyz"
the administrator group. }}
}]}
AWS Config
What is AWS Config?
Functions of AWS Config:
AWS Config is a service that allows users to
determine the quality of a resource's
configuration in the AWS account. It helps to monitor configuration changes performed over a specific
period using AWS Config console and AWS CLI and generates
notifications about changes.
A GUI Console is available as an app for Android and iOS for a better experience. AWS Regions
AWS Organizations
What are AWS Organizations?
It includes account management and combined billing
AWS Organizations is a service that allows
capabilities to meet the business’s budgetary and
users to manage multiple AWS accounts
security needs.
grouped into a single organization.
✔ It simplifies maintenance and identifies issues in the resources It helps to automate repetitive operations and management tasks
that may impact the applications. using predefined playbooks.
Amazon Database
Management Service
AWS DMS supports the following data sources and targets engines ❑ It performs all the management steps required during the
for migration: migration, such as monitoring, scaling, error handling, network
❑ Sources: Oracle, Microsoft SQL Server, PostgreSQL, Db2 LUW, connectivity, replicating during failure, and software patching.
SAP, MySQL, MariaDB, MongoDB, and Amazon Aurora.
❑ Targets: Oracle, Microsoft SQL Server, PostgreSQL, SAP ASE, ❑ AWS DMS with AWS Schema Conversion Tool (AWS SCT) helps to
MySQL, Amazon Redshift, Amazon S3, and Amazon DynamoDB. perform heterogeneous migration.
AWS DataSync
❖ Every EC2 instance is launched within a default VPC with equal security and control like normal Amazon VPC. Default VPC has no private
subnet.
❖ It uses Security Groups and NACL (Network Access Control Lists) for multi-layer security.
❖ Security Groups (stateful) provide instance-level security, whereas NACLs (stateless) provide subnet-level security.
❖ VPC sharing is a component that allows subnets to share with other AWS accounts within the same AWS Organization.
Amazon CloudFront
What is Amazon CloudFront? Amazon CloudFront Access Controls:
Signed URLs:
Amazon CloudFront is a content delivery network
● Use this to restrict access to individual files.
(CDN) service that securely delivers any kind of
Signed Cookies:
data to customers worldwide with low latency,
● Use this to provide access to multiple restricted files.
low network, and high transfer speeds.
● Use this if the user does not want to change current URLs.
Geo Restriction:
● Use this to restrict access to the data based on the geographic location of the
It makes use of Edge locations (worldwide network of data centers) to deliver website viewers.
the content faster. Origin Access Identity (OAI):
Without edge locations, it retrieves data from an origin such as an Amazon S3
● Outside access is restricted using signed URLs and signed cookies, but what if
bucket, a Media Package channel, or an HTTP server.
someone tries to access objects using Amazon S3 URL, bypassing CloudFront signed
URL and signed cookies. To restrict that, OAI is used.
CloudFront provides some security features such as:
● Use OAI as a special CloudFront user and associate it with your CloudFront
❖ Field-level encryption with HTTPS - Data remains encrypted
distribution to secure Amazon S3 content.
throughout starting from the upload of sensitive data.
CloudFront Signed URL: S3 Pre-Signed URL:
❖ AWS Shield Standard - Against DDoS attacks.
○ It allows access to a path, no matter what is o It issues a request as the
❖ AWS Shield Standard + AWS WAF + Amazon Route 53 - Against
the origin person who pre-signed the
more complex attacks than DDoS.
○ It can be filtered by IP, path, date, expiration URL.
CloudFront is integrated with AWS Services such as: ○ It leverages caching features
Amazon S3
Amazon EC2
Elastic Load Balancing
Amazon Route 53
AWS Essential Media Services
Public virtual
interface:
Private virtual It helps to connect
interface: AWS services located
It helps to connect an in any AWS region
Amazon VPC using (except China) from
private IP addresses. your on-premises data
center using public IP
addresses.
Pricing details:
Port hours - charges are determined by capacity and
connection type Amazon Direct Connect
Outbound data transfer
AWS PrivateLink
What is PrivateLink?
AWS PrivateLink is a network service used
to connect to AWS services hosted by other Types of VPC End Points
AWS accounts (referred to as endpoint
services) or AWS Marketplace.
So, service consumers use an interface endpoint to access It serves as an entry point It is a gateway in the
the services running in the service provider. for traffic destined to an route-table that routes
AWS service or a VPC traffic only to Amazon S3
It provides security by not allowing the public internet and endpoint service and DynamoDB
reducing the exposure to threats, such as brute force and
DDoS attacks.
AWS Transit Gateway Transit Gateway vs. VPC Peering
What is AWS Transit Gateway? Transit Gateway VPC Peering
AWS Transit Gateway is a network hub used It has an hourly charge per It does not charge for data transfer.
to interconnect multiple VPCs. It can be used attachment in addition to the data
to attach all hybrid connectivity by controlling transfer fees.
your organization's entire AWS routing Multicast traffic can be routed Multicast traffic cannot be routed to
configuration in one place between VPC attachments to a peering connections.
Transit Gateway.
It provides Maximum bandwidth
(burst) of 50 Gbps per Availability It provides no aggregate bandwidth.
It can be more than one per region but can not be Zone per VPC connection.
peered within a single region. It supports attaching Security groups feature does not Security groups feature works with
Amazon VPCs with IPv6 CIDRs. currently work with Transit intra-Region VPC peering.
Gateway
It helps to solve the problem of complex VPC peering
connections.
IAM Policies
IAM Users
Policies are documents written in JSON (key-value pairs) used to
User can be a person or service.
define permissions.
Amazon Cognito allows customers to sign in through social identity Amazon Cognito is capable enough to allow usage of user pools
providers such as Google, Facebook, and Amazon, and through and identity pools separately or together
enterprise identity providers such as Microsoft Active Directory via
SAML.
HTTPS transactions require server certificates X.509 that bind the Domain Validated Certificates (DV SSL)
public key in the certificate to provide authenticity. Provides minimal encryption
The certificates are signed by a certificate authority (CA) and Wildcard SSL Certificate
contain the server’s name, the validity period, the public key, the Secures base domain and subdomains.
signature algorithm, and more.
Multi-Domain SSL Certificate (MDC)
It centrally manages the certificate lifecycle and helps to automate Secure up to hundreds of domain and subdomains.
certificate renewals.
SSL/TLS certificates provide data-in-transit security and authorize Unified Communications Certificate (UCC)
the identity of sites and connections between browsers and Single certificate secures multiple domain names.
applications.
Ways to deploy managed X.509 certificates:
The certificates created by AWS Certificate Manager for using AWS Certificate Manager (ACM)
ACM-integrated services are free. Useful for customers who need a secure and public web
presence.
With AWS Certificate Manager Private Certificate Authority,
monthly charges are applied for the private CA operation and the ACM Private CA
private certificates issued. Useful for customers that are intended for private use within an
organization.
AWS Directory Service
What is AWS Directory Service?
Simple AD
AWS Directory Service, also known as AWS
● It is an inexpensive Active Directory-compatible service
Managed Microsoft Active Directory (AD),
driven by SAMBA 4.
enables multiple ways to use Microsoft Active
● It can be used when there is a need for less than 5000
Directory (AD) with other AWS services.
users.
● It does not support Multi-factor authentication (MFA).
Using AWS Managed Microsoft AD, it becomes easy to migrate
AD-dependent applications and Windows workloads to AWS.
A trust relationship can be created between AWS Managed
Amazon Cognito
Microsoft AD and existing on-premises Microsoft Active using single
● It is a user directory type that provides sign-up and sign-in for the
sign-on (SSO).
application using Amazon Cognito User Pools.
AWS Directory Service provides the following directory types
to choose from:
AD Connector
Simple AD
● It is like a gateway used for redirecting directory requests to the
on-premise Active Directory.
Amazon Cognito ● For this, there must be an existing AD, and VPC must be
connected to the on-premise network via VPN or Direct
Connect.
AWS Directory AD Connector ● It supports multi-factor authentication (MFA) via existing
Service RADIUS-based MFA infrastructure.
AWS Key Management Service
What is AWS Key Management Service? ❑ Provides data security at rest using encryption keys
AWS Key Management Service is a global and provides access control for encryption,
service that creates, stores, and manages decryption, and re-encryption.
encryption keys. ❑ Offers SDKs for different languages to add digital
signature capability in the application code.
AWS Key Management
Encryption using AWS KMS Service ❑ Allows rotation of master keys once a year using
previous versions of keys.
❑ AWS KMS produces new cryptographic data for the
KMS key once a year, when automatic key rotation is
turned on for a KMS key.
❑ AWS KMS preserves all previous iterations of the
cryptographic information so that you can decrypt
any data that has been encrypted using that KMS
key. Until the KMS key is deleted, AWS KMS does not
remove any rotated key material.
❑ It provides security and compliance facilities by rotating Secret rotation is supported with the below Databases:
secrets safely without the need for code deployment. ▪ MySQL, PostgreSQL, Oracle, MariaDB, Microsoft SQL Server, on
❑ It integrates with AWS CloudTrail and AWS CloudWatch to log Amazon RDS
and monitor services for centralized auditing. ▪ Amazon Aurora on Amazon RDS
Charges are applied only for the current Region, not for all Regions
in which Security Hub is enabled.
Storage
Amazon Simple Storage Service (S3)
What is Amazon Simple Storage Service? Amazon S3 uses the following ways for security:
Amazon S3 is a simple service used to provide key-based object User-based security
storage across multiple availability zones (AZs) in a specific region. ▪ IAM policies
❑ Objects (files) are stored in a region-specific container known as Bucket. Amazon S3 provides the following storage classes used to maintain the integrity
❑ Objects that are stored can range from 0 bytes - 5TB. of the objects:
▪ It provides ‘Multipart upload’ features that upload objects in parts, suitable ❑ S3 Standard - offers frequent data access.
for 100 MB or larger objects. ❑ S3 Intelligent-Tiering - automatically transfer data to other cost-effective
▪ It offers to choose ‘Versioning’ features to retain multiple versions of access tiers.
objects, must enable versioning at both source and destination. ❑ S3 Standard-IA - offers immediate and infrequent data access.
▪ Amazon S3 Transfer Acceleration allows fast and secure transfer of objects ❑ S3 One Zone-IA - infrequent data access.
over long distances with minimum latency using Amazon CloudFront’s Edge ❑ S3 Glacier - long-term archive data, cheap data retrieval.
Locations. ❑ S3 Glacier Deep Archive - used for long-term retention.
▪ Amazon S3 uses access control lists (ACL) to control access to the objects
Amazon S3 offers to choose from the following ways to replicate objects:
and buckets.
▪ Cross-Region Replication - used to replicate objects in different AWS Regions.
▪ Amazon S3 provides Cross-Account access to the objects and buckets by
▪ Same Region Replication - used to replicate objects in the same AWS Region.
assuming a role with specified privileges.
Amazon Elastic Block Store
What is Amazon Elastic
Block Store?
Amazon Elastic Block Store is a service that By default, the non-root EBS volume does not get affected when
provides the block-level storage drive to store the instance is terminated.
persistent data.
Cold HDD:
HDD (Hard disk drives)
Useful for infrequently accessed data and
Throughput Optimized HDD:
lowest cost workloads.
Useful for Big data and Log processing workloads.
Supports volume size from 125 GiB to 16 TiB.
Supports volume size from 125 GiB to 16 TiB.
Allows 250 as maximum IOPS per volume.
Allows 500 as maximum IOPS per volume.
Allows 250 MiB/s as maximum throughput per
Allows 500 MiB/s as maximum throughput per
volume.
volume.
Amazon Elastic File System (EFS)
What is Amazon Elastic File System?
It offers the following storage classes for file storage:
Amazon Elastic File System is a managed
▪ EFS Standard storage class
service used to create and scale file storage
▪ EFS Infrequent Access storage class - can store less
systems for AWS and on-premises resources.
frequently accessed files.
Amazon EFS
It offers the following modes to ease the file storage system:
❑ It spans multiple availability zones and regions.
❑ Performance modes -
❑ It uses EFS Mount Target to share a file system with multiple
General Purpose performance mode: Useful for
availability zones and VPCs.
low-latency workloads.
❑ It is best suited for Linux-based workloads and applications.
Max I/O mode: High throughput workloads.
❑ Multiple instances can access it at the same time leads to high
❑ Throughput modes -
throughput and low latency IOPS.
Bursting Throughput mode: Throughput increases based
❑ It automatically scales storage capacity up to petabyte.
on the file system storage.
❑ It supports file locking and strong data consistency.
Provisioned Throughput mode: Throughput changes are
❑ It offers data encryption at rest and in-transit using AWS KMS
independent of the file system storage.
and TLS, respectively.
❑ It provides EFS lifecycle management policies based on the
❑ It uses POSIX permissions to control access to files and
number of days ranges from 7-90 days to automatically move
directories.
files from Standard storage class to EFS IA storage class.
Amazon FSx for Lustre
What is Amazon FSx?
Amazon FSx for Lustre is an FSx solution that
offers scalable storage for the Lustre system
(parallel and high-performance file storage
system). Amazon FSx
❖ Supports fast processing workloads like custom electronic design automation (EDA) and
high-performance computing (HPC).
❖ Stores datasets in S3 as files instead of objects and automatically updates with the
latest data to run the workload.
❖ FSx can be used with existing Linux-based applications without any changes. Using Amazon FSx
❖ Offers network access control using POSIX permissions or Amazon VPC Security
Groups.
❖ FSx easily provides data-at-rest and in-transit encryption. AWS Backup can also be used to backup Lustre file systems.
Amazon FSx for Windows File Server
What is Amazon FSx for
Windows File Server??
Amazon FSx for Windows File Server is an
FSx solution that offers a scalable and
shared file storage system on the
Microsoft Windows server. Amazon FSx
Amazon FSxuses
Amazon FSx offers
SSD identity-based
storage, offers highauthentication
throughput and Amazon
AmazonFSx
FSx offers identity-based
offers identity-based authentication
authentication using
IOPS with sub-millisecond Microsoft Active
using Microsoft Directory
Active (AD). (AD).
Directory
using Microsoft latencies for Windows(AD).
Active Directory workloads.
Amazon S3 Glacier
It is the cheapest S3 storage class and offers 99.999999999% of data Amazon S3 Glacier jobs are the select queries that execute to
durability. retrieve archived data. It uses Amazon SNS to notify when the jobs
complete.
S3 Glacier provides the following data retrieval options:
Amazon S3 Glacier does not provide real-time data retrieval of the
Expedited retrievals - archives.
• It retrieves data in 1-5 minutes.
Standard retrievals - Amazon S3 Glacier uses ‘S3 Glacier Select’ to query archive objects
• It retrieves data between 3-5 hours. in uncompressed CSV format and store the output to the S3 bucket.
Bulk retrievals - Amazon S3 Glacier Select uses common SQL statements like
• It retrieves data between 5-12 hours. SELECT, FROM, and WHERE.
❖ Dashboard in the AWS Backup console to monitor backup and restore activities.
❖ Lifecycle policies configured to transition backups from Amazon EFS to cold storage automatically.
AWS Snowball
What is AWS Snowball?
AWS Snowball is a data transfer service that ❖ If data transfers involve large files and multiple jobs, you might
uses storage gadgets to transfer a huge separate the data into several smaller data segments. Parallelization
amount of data ranging from 50TB - 80TB helps to transfer data with Snowball at a faster rate.
between Amazon Simple Storage Service and E.g., ten segments of 7 TB each in a size of 80 TB Snowball.
onsite data storage location at high speed.
❖ AWS Snowball is integrated with other AWS services such as AWS
CloudTrail to capture all API calls as events and with Amazon Simple
It makes use of AWS Key Management Service to protect data in
Notification Service (Amazon SNS) to notify about data transfer.
transit securely.
Tape Gateway -
It uses Amazon Glacier to archive backup data.
It stores data on virtual tape cartridges using VTL (virtual tape library)
interface with an iSCSI connection.
It supports WORM based file system.
AWS Storage Gateway