DR SRM Slides 2019 03 19
DR SRM Slides 2019 03 19
March 2019
John A. Davis
Virtualization Architect, @johnnyadavis, vLoreBlog.com
Problems Addressed
Let’s focus on these issues today
Many organizations have components of a Disaster Recovery (DR) solution in place but do not necessarily
have confidence that they can successfully execute a failover in the event of an actual disaster.
2
Overview
What are we covering today?
• The need for DR and common DR challenges • Tips on designing a solid DR solution based on Site
Recovery Manager (SRM)
• Solution overview
• Example Design:
• Understanding of the solution components,
including SRM, storage based replication and
‣ key requirements vSphere Replication
3
Disaster Recovery
What is it? Why do we need it?
4
Disaster Recovery
What are the key challenges?
5
DR Solution Objectives
What are the short comings of your current solution?
It is Inadequate It Lacks
6
VMware Site Recovery Manager (SRM)
Solution Overview
7
SRM Solution Overview
Why SRM?
8
SRM Use Cases
DR is just one use case, here are some others
9
What’s New in SRM 8.1?
https://blogs.vmware.com/virtualblocks/2018/04/17/srm-vr-81-whats-new/
10
Terminology
Here is our vocabulary lesson for the day
• Recovery time objective (RTO): Targeted amount of time a business process should be restored after a disaster or
disruption in order to avoid unacceptable consequences associated with a break in business continuity.
• Recovery point objective (RPO): Maximum age of files recovered from backup storage for normal operations to
resume if a system goes offline as a result of a hardware, program, or communications failure.
• Consistency group: One or more LUNs or volumes that are replicated at the same time. When recovering items in a
consistency group, all items are restored to the same point in time.
• Datastore group: One or more datastores that are treated as a unit in Site Recovery Manager. A common example is a
consistency group in an array replication solution.
• Protected site: Site that contains protected virtual machines.
• Recovery site: Site where protected virtual machines are recovered in the event of a failover.
NOTE: It is possible for the same site to serve as a protected site and recovery site when replication is occurring in both
directions and Site Recovery Manager is protecting virtual machines at both sites.
11
SRM Solution Components
Management, data movers, and orchestration
12
vSphere Replication vs Storage Replication
https://blogs.vmware.com/vsphere/2015/04/srm-abrvsvr.html
Feature Array-Based Replication vSphere Replication
Vendor / Array / Storage types FC, iSCSI or NFS Supports any storage covered by the vSphere
HCL
Cost / Licence Replication and snapshot licensing is required Included in vSphere Essentials Plus 5.1 and
higher
Application consistency Depends on vendor, may require guest based Supports VSS & Linux file system application
agents consistency
Powered off VMs, Templates, Linked clones, Able to replicate Can only replicate powered on VMs.
ISO’s
RDM support Physical and Virtual mode RDMs can be Only Virtual mode RDMs can be replicated
replicated
Multiple Points in Time (MPIT) MPIT is supported by some storage vendors Supports up to 24 recovery points
13
SRM / Storage Compatibility
http://www.vmware.com/resources/compatibility/search.php?deviceCategory=sra
14 Footer
SRM with Storage-based Replication
SRM integrates with vendor specific SRA to manage replication
15
SRM with vSphere Replication
Software based virtual disk replication that integrated easily with SRM
16
vSphere Replication Data Flow
Hypervisor based replication
17
Network and Inventory Mapping
Map source networks, compute resources, VM folders between sites
18
Recovery Plan Orchestration
Predefine your recovery plans in SRM
19
SRM Licensing
Work with your VMware license provider to understand your unique options
NOTE: some SRM bundling options may exist that allow per processor instead of per VM
20
Multi vCenter Server Deployment
Multi-vCenter Server instances per site
21
Example: Key Requirements
DR Test Success Criteria
Example: Requirements included Test Plan with application specific steps and expected results.
22
Example: High Level Design
Mapping your Unique Requirements to potential solution components
Requirement Solution Component
SLA Tiers: RPO < 15 minutes, RPO =4 hours, RPO = 24 hours Storage based replication, vSphere Replication RPO setting
DR tests plans with application functionality NSX based networks, virtual desktops, required services (AD, DNS)
Backup and recovery of the DR solution Backup Exec – daily full and differential backups
23
Example: High Level Design
High-level design: SRM with vSphere Replication, NFS, and block storage
24
Example: Application / VM Details
VM worksheet identifying application, priority, target IP, dependencies, etc.
25
Example: Recovery Site Logical Design
Provide network infrastructure and services for non-disruptive DR testing
26
Example: Monitoring / Alerting
We configured email notifications on these specific vCenter Server alarms
27
Example: Multi-site Deployment
Shared Recovery or Protected Site Site A to B to C
28
Lessons Learned
A few lessons I learned the hard way
29
Call to Action
Lots of ways to get started
30