SREF Blueprint

The document discusses Site Reliability Engineering (SRE), which incorporates aspects of software engineering to create highly reliable distributed systems. SRE focuses on reducing toil through automation and tooling, managing service level objectives through metrics like availability and latency, and improving resilience using techniques like chaos engineering and security automation.

Uploaded by

daviziko

Site Reliability Engineering (SRE) Foundation SM
DevOps Institute Blueprint

Site Reliability Engineering (SRE) is a discipline and a role that incorporates aspects of software engineering and applies them to infrastructure and operations problems to create ultra-scalable and highly reliable distributed software systems.

Culture
Reliability @ Scale, Shift-Left “Wisdom of Production”, and Continuous Improvement

Toil Reduction
Reduce Non-Value Add Work using Tooling and Automation

SLAs/SLOs/SLIs
Metrics such as Availability, Latency, and Response Time with Error Budgets

Measurements
Observability, Monitoring, Telemetry, and Instrumentation

Anti-Fragility
Improve Resilience using Fire Drills, Chaos Monkey, Security and Automation
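The blueprint names error budgets without showing the arithmetic behind them. The sketch below is one common way to compute a request-based error budget, not something taken from the blueprint itself: the function name `error_budget`, its parameters, and the 99.9% target are all illustrative assumptions.

```python
def error_budget(slo_target: float, total_requests: int, failed_requests: int) -> dict:
    """Summarize an availability error budget for one measurement window.

    Assumption: availability is measured as the fraction of successful
    requests, and the budget is the number of failures the SLO tolerates.
    """
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be between 0 and 1")

    # The SLI: observed fraction of successful requests.
    availability = 1.0 - failed_requests / total_requests
    # The budget: failures permitted before the SLO is breached.
    allowed_failures = (1.0 - slo_target) * total_requests
    return {
        "availability": availability,
        "allowed_failures": allowed_failures,
        "remaining_failures": allowed_failures - failed_requests,
        "slo_met": availability >= slo_target,
    }


if __name__ == "__main__":
    # Hypothetical window: 1,000,000 requests, 250 failures, 99.9% SLO.
    print(error_budget(0.999, 1_000_000, 250))
```

With these hypothetical numbers the SLO tolerates roughly 1,000 failed requests, so about three quarters of the budget is still unspent.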

Continuous Integration (CI) Pipeline, Continuous Delivery / Deployment (CD)

Plan -> Backlog & Design -> Code & Test -> Commit & Merge -> Build & Test -> Artifacts -> SAT & UAT -> Approve Release -> Deploy to Prod -> Post-Prod Tests -> Operate

Work Sharing
Work Technical Debt in Small Increments, Manage Load % for Ops, Dev and On-Call Work

Deployments
Gradual Releases using Green/Blue, A/B, Canary Deployments, Automation Scripts, Testing and Monitoring

Performance Management
Monitoring, APM, Capacity Testing & Auto-Scaling

Incident Management
Emergency Response, 50% Ops/Dev Load, 25% On-Call Load, and Blameless Retrospectives
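The 50% Ops/Dev and 25% On-Call figures can be checked mechanically against a team's tracked hours. The sketch below assumes both figures are upper limits on the share of working time, which the blueprint does not state explicitly; the names `check_load`, `OPS_LOAD_CAP`, and `ON_CALL_LOAD_CAP` are hypothetical.

```python
from typing import List

# Assumed reading of the blueprint: both figures are caps on time share.
OPS_LOAD_CAP = 0.50      # share of time spent on operational work
ON_CALL_LOAD_CAP = 0.25  # share of time spent on call


def check_load(total_hours: float, ops_hours: float, on_call_hours: float) -> List[str]:
    """Return a warning for each load share that exceeds its cap."""
    if total_hours <= 0:
        raise ValueError("total_hours must be positive")

    warnings = []
    ops_share = ops_hours / total_hours
    on_call_share = on_call_hours / total_hours
    if ops_share > OPS_LOAD_CAP:
        warnings.append(f"ops load {ops_share:.0%} exceeds {OPS_LOAD_CAP:.0%}")
    if on_call_share > ON_CALL_LOAD_CAP:
        warnings.append(f"on-call load {on_call_share:.0%} exceeds {ON_CALL_LOAD_CAP:.0%}")
    return warnings


if __name__ == "__main__":
    # Hypothetical week: 40 hours total, 24 on ops work, 8 on call.
    for warning in check_load(40, 24, 8):
        print(warning)
```

A non-empty result is a signal to rebalance work between Ops and Dev, not a verdict on any individual.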
© DevOps Institute. All rights reserved. www.DevOpsInstitute.com
