Key Principles of Highly Resilient Systems

Uploaded by

demy2014


1. Fault Tolerance:
o Ensure that system components can fail without causing overall system failure.
o Use redundancy and failover mechanisms.
2. Scalability:
o Support both horizontal (adding more nodes) and vertical (increasing resource capacity of nodes) scaling.
o Use auto-scaling techniques based on demand.
3. Disaster Recovery:
o Implement multi-region deployments with backup and failover capabilities.
o Regularly test recovery processes.
4. High Availability (HA):
o Maintain uptime by distributing workloads across multiple zones or regions.
o Use load balancers and distributed systems.
5. Observability:
o Integrate monitoring, logging, and tracing for quick identification and resolution of issues.
o Use tools like Prometheus, Grafana, ELK Stack, or Amazon CloudWatch.
6. Automation:
o Automate deployments and infrastructure management using Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation.

Example Architecture for a Large-Scale Highly Resilient System
Architecture Overview:
• Cloud Platform: AWS
• Purpose: Global insurance claim management system
• Key Features: Resilience, scalability, and fault tolerance

1. Components
1. Frontend:
o Amazon CloudFront: Distribute static assets globally.
o Amazon S3: Host static content with versioning enabled.
2. Backend:
o Amazon ECS (Elastic Container Service): Run microservices using Fargate.
o API Gateway: Expose APIs securely.
3. Database:
o Amazon RDS (Aurora): Highly available relational database with multi-AZ replication.
o Amazon DynamoDB: For low-latency, high-throughput non-relational data.
4. Data Storage:
o Amazon S3: Store logs, backups, and insurance documents.
5. Resilience Enhancers:
o Route 53: For DNS routing and failover between regions.
o Elastic Load Balancer (ELB): For traffic distribution.
o Auto Scaling Groups (ASG): Scale EC2 instances for backend processing.
6. Monitoring:
o Amazon CloudWatch: Metrics and alerting.
o AWS X-Ray: Distributed tracing for debugging.
7. Disaster Recovery:
o Multi-Region Deployment: Primary region in us-east-1, failover to us-west-2.
o Regular snapshots and backups using AWS Backup.

2. Workflow
1. User Request:
o Users interact via a global insurance claims portal.
o Requests are routed through CloudFront and reach backend services via API Gateway.
2. Data Processing:
o Backend services deployed in ECS/Fargate process data.
o Structured data is stored in Aurora and non-relational data in DynamoDB.
3. Resilience Mechanisms:
o Failover configured in Route 53.
o ASGs handle load spikes automatically.
4. Disaster Recovery:
o In case of failure in us-east-1, traffic is routed to us-west-2.
5. Monitoring and Alerts:
o CloudWatch alarms notify teams about anomalies.
o S3 holds logs for long-term storage.

Infrastructure Engineering Example with Tools
Terraform Example for Resilient AWS Setup
provider "aws" {
  region = "us-east-1"
}

# Static site bucket. The inline acl and versioning arguments follow the
# AWS provider 3.x style; provider 4.x and later deprecate them in favor of
# the separate aws_s3_bucket_acl and aws_s3_bucket_versioning resources.
resource "aws_s3_bucket" "static_site" {
  bucket = "insurance-claims-static-site"
  acl    = "public-read"

  versioning {
    enabled = true
  }

  tags = {
    Environment = "production"
    Team        = "infrastructure"
  }
}

resource "aws_rds_cluster" "aurora_cluster" {
  cluster_identifier      = "insurance-claims-db"
  engine                  = "aurora-mysql"
  engine_version          = "5.7.mysql_aurora.2.10.0"
  master_username         = "admin"
  # Placeholder only; supply the real password from a sensitive variable or a secrets manager.
  master_password         = "securepassword"
  backup_retention_period = 7
  availability_zones      = ["us-east-1a", "us-east-1b", "us-east-1c"]

  # scaling_configuration is only valid when engine_mode = "serverless"
  # (Aurora Serverless v1); it does not apply to a provisioned cluster like this one.
  scaling_configuration {
    auto_pause   = false
    min_capacity = 2
    max_capacity = 8
  }
  tags = {
    Environment = "production"
    Team        = "infrastructure"
  }
}

Challenges and Solutions
Challenge 1: Load Spikes During Peak Claims Season
• Solution: Use auto-scaling in ECS and ASGs to dynamically adjust resources.

Challenge 2: Multi-Region Failover Latency
• Solution: Optimize Route 53 health checks for quicker failover.

Challenge 3: Debugging Distributed Failures
• Solution: Implement AWS X-Ray for full request lifecycle tracing.

Challenge 4: High Operational Costs
• Solution: Use AWS Savings Plans for ECS and RDS to reduce costs by up to 50%.

Architecture Overview
Purpose
A global insurance claims management system designed to handle high traffic, ensure data consistency, and provide disaster recovery with minimal downtime.
Core Principles
• High Availability: Services operate seamlessly across availability zones and regions.
• Scalability: Automatically handle traffic spikes during events like natural disasters or policy enrollments.
• Resilience: Protect against failures with redundancy and failover mechanisms.
• Security: End-to-end encryption, role-based access, and compliance with insurance regulations.

Components
Frontend
1. Azure Front Door:
o Global load balancer for low-latency routing and enhanced availability.
o Handles SSL termination and forwards traffic to the backend.
2. Azure App Service (Web Apps):
o Hosts the insurance portal frontend.
o Autoscaling and SLA of 99.95%.
Backend
1. Azure App Service (API Apps):
o Hosts backend APIs for claims processing and user data.
o Supports auto-scaling and seamless updates.
2. Azure Functions:
o For serverless execution of lightweight tasks like policy calculation and claim validation.

Data Layer
1. Azure SQL Database (Hyperscale):
o Fully managed relational database with auto-scaling and high availability.
o Geo-replication for disaster recovery.
2. Azure Cosmos DB:
o NoSQL database for storing unstructured data
like documents, logs, and user activities.
o Multi-region writes for resilience.
Data Storage
1. Azure Blob Storage:
o For storing large insurance documents, images, and claims reports.
o Redundant across availability zones (ZRS) or regions (GRS).
2. Azure Data Lake:
o For analytics and large-scale data processing.
Integration and Messaging
1. Azure Service Bus:
o Reliable message queue for communication between services.
o Guarantees message delivery for asynchronous processes like claims approval.
2. Azure Event Grid:
o Event-driven architecture to trigger workflows, such as notifications.
Monitoring and Observability
1. Azure Monitor:
o Tracks metrics, logs, and alerts.
o Centralized dashboard for application performance monitoring.
2. Application Insights:
o Provides detailed telemetry for frontend and backend services.
Security
1. Azure Active Directory (AAD):
o Secure identity and access management.
o Single sign-on (SSO) for users and employees.
2. Azure Key Vault:
o Securely stores API keys, database credentials, and certificates.
3. Network Security Groups (NSGs):
o Protect PaaS services by restricting inbound/outbound traffic.

Resilience and High Availability
1. Azure Availability Zones:
o Deploy App Services and databases across zones for fault tolerance.
2. Multi-Region Deployment:
o Primary region in East US, secondary in West US.
o Azure Traffic Manager handles failover and routing.
3. Disaster Recovery:
o Azure SQL Database geo-replication ensures RTO (Recovery Time Objective) of minutes.
o Regular backups using Azure Backup.

Workflow
1. User Interaction:
o Users access the insurance portal via Azure Front Door.
o Requests are routed to the nearest Azure App Service for low latency.
2. Claims Processing:
o Claim details are sent to backend API Apps and processed.
o Long-running tasks are offloaded to Azure Functions.
3. Data Storage and Retrieval:
o Customer data is stored in Azure SQL Database.
o Insurance documents are uploaded to Azure Blob Storage.
4. Notification:
o Status updates are sent to customers via Azure Service Bus and Event Grid.
5. Monitoring:
o Application Insights provides insights into response times and errors.
o Azure Monitor tracks overall system health.

Example Diagram (Simplified Flow)
1. Global Access:
o Azure Front Door routes traffic to the appropriate region.
2. Compute Layer:
o Azure App Services for web/API apps.
o Azure Functions for serverless operations.
3. Database Layer:
o Azure SQL Database (structured data).
o Azure Cosmos DB (unstructured data).
4. Storage:
o Azure Blob Storage for documents.
o Data Lake for analytics.
5. Messaging:
o Service Bus for asynchronous tasks.
6. Monitoring:
o Azure Monitor and Application Insights.

Terraform Configuration Example
provider "azurerm" {
  features {}
}

# Assumed resource group: the resources below reference
# azurerm_resource_group.main, which the original never declares.
# The name is a placeholder.
resource "azurerm_resource_group" "main" {
  name     = "insurance-rg"
  location = "East US"
}

# azurerm_app_service_plan and azurerm_app_service are legacy resources,
# deprecated in azurerm 3.x in favor of azurerm_service_plan and
# azurerm_windows_web_app.
resource "azurerm_app_service_plan" "insurance_plan" {
  name                = "insurance-app-plan"
  location            = "East US"
  resource_group_name = azurerm_resource_group.main.name
  kind                = "Windows"

  sku {
    tier = "Standard"
    size = "S1"
  }
}

resource "azurerm_app_service" "frontend" {
  name                = "insurance-frontend"
  location            = azurerm_resource_group.main.location
  resource_group_name = azurerm_resource_group.main.name
  app_service_plan_id = azurerm_app_service_plan.insurance_plan.id
}

# azurerm_mssql_server and azurerm_mssql_database are used here in place of
# the original azurerm_sql_server and azurerm_sql_database, because sku_name
# is an argument of the mssql database resource.
resource "azurerm_mssql_server" "insurance_db" {
  name                         = "insurance-db-server"
  location                     = azurerm_resource_group.main.location
  resource_group_name          = azurerm_resource_group.main.name
  version                      = "12.0"
  administrator_login          = "adminuser"
  # Placeholder only; supply real credentials from a sensitive variable or Azure Key Vault.
  administrator_login_password = "ComplexPassword123!"
}

resource "azurerm_mssql_database" "insurance_db" {
  name      = "insurance-db"
  server_id = azurerm_mssql_server.insurance_db.id
  # General Purpose Gen5, 2 vCores (not the Hyperscale tier named in the Data Layer section).
  sku_name  = "GP_Gen5_2"
}

resource "azurerm_storage_account" "insurance_docs" {
  name                     = "insurancedocs"
  resource_group_name      = azurerm_resource_group.main.name
  location                 = azurerm_resource_group.main.location
  account_tier             = "Standard"
  # LRS replicates within a single datacenter; "ZRS" or "GRS" would match the
  # zone or region redundancy listed under Data Storage.
  account_replication_type = "LRS"
}
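
Multi-Region Failover Sketch
The configuration above deploys to a single region, while the Resilience and High Availability section calls for a primary in East US, a secondary in West US, and Azure Traffic Manager handling failover. A minimal sketch of that piece follows, assuming azurerm 3.x and an already configured provider. The three variables, the profile name, the DNS name, and the probe settings are all hypothetical and do not come from the original; argument names should be checked against the provider version in use.

```hcl
# Hypothetical inputs: none of these names appear in the original configuration.
variable "resource_group_name" {
  type = string
}

variable "primary_app_id" {
  type        = string
  description = "Resource ID of the East US App Service"
}

variable "secondary_app_id" {
  type        = string
  description = "Resource ID of the West US App Service"
}

# Priority routing sends traffic to the healthy endpoint with the lowest
# priority number, so the secondary only serves when the primary is down.
resource "azurerm_traffic_manager_profile" "failover" {
  name                   = "insurance-portal-tm" # assumed name
  resource_group_name    = var.resource_group_name
  traffic_routing_method = "Priority"

  dns_config {
    relative_name = "insurance-portal" # assumed; must be unique under trafficmanager.net
    ttl           = 30                 # short TTL so clients pick up a failover sooner
  }

  # Health probe that decides when an endpoint counts as failed.
  monitor_config {
    protocol                     = "HTTPS"
    port                         = 443
    path                         = "/"
    interval_in_seconds          = 30
    timeout_in_seconds           = 10
    tolerated_number_of_failures = 3
  }
}

resource "azurerm_traffic_manager_azure_endpoint" "primary" {
  name               = "east-us"
  profile_id         = azurerm_traffic_manager_profile.failover.id
  target_resource_id = var.primary_app_id
  priority           = 1
  weight             = 1 # not used by Priority routing
}

resource "azurerm_traffic_manager_azure_endpoint" "secondary" {
  name               = "west-us"
  profile_id         = azurerm_traffic_manager_profile.failover.id
  target_resource_id = var.secondary_app_id
  priority           = 2
  weight             = 1 # not used by Priority routing
}
```

Failover time in this sketch is bounded by the probe interval, the tolerated failure count, and the DNS TTL, so those three values are the ones to tune against the recovery time target.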
