DRP Whitepaper
DRP Whitepaper
Preface
Disasters happen. They can be small and straightforward to deal with, or if
youre unlucky you can be faced with a full-blown catastrophe. We cant
always avoid disaster, but through diligent planning and preparation we can
minimize downtime and data loss. If your company relies on its IT systems to
stay in business, then this white paper and the rest of the series on Disaster
Recovery and Business Continuity Planning are a must-read for you! You can
find those white papers at http://www.Comp-Soln.com/whitepapers/.
2 Disaster Recovery Planning An Overview
This white paper describes, at a high level, some of the many areas to address
during the creation of a Disaster Recovery (DR) Plan. It is not all-inclusive, but
is intended to provide insight into the overall process and our approach.
Overview
Most businesses stay very busy working on projects that support the growth of
that business. Many companies do not have a Disaster Recovery Plan (DRP).
Often there is a desire for a DRP, but the level of effort and/or cost required to
create a DRP can cause this project to have a low priority relative to other more
immediate projects. A DRP is viewed as nice to have or just insurance that
will not be used, and not as a critical business component.
That is, until there is a failure that causes a significant outage or loss of data
(often at a significant cost to the business). It is our opinion that every company
could benefit from both a Disaster Recovery Plan and a Business Continuity
Plan (BCP). Companies purchase insurance to protect their business. Investing
in a DRP and BCP is just as important for most businesses in our opinion.
Are you able to quantify the cost of downtime for mission critical systems?
Does this figure include lost sales and/or lost customers? Does it include the
cost of wages for people unable to perform their primary job function while the
system is down, as well as the amount of time required to get caught up once the
system is available? Once a company quantifies the cost of unscheduled
downtime the business case justification for a DRP usually becomes easy.
So, now that you have decided to go forward with this type of project, what
next? Where do you start? What needs to be addressed? How will you know
that the plan really works? Do you need to find external expertise for this type
of project? If so, exactly what type of expertise is required?
Using experienced Consultants to help develop the process and then later audit
and refine the process is usually idea sound investment. They will often identify
gaps or ambiguities that might have been missed by someone who has more
familiarity with the systems, processes, and procedures in use. It is also
important to find a team that will build plans based on how your business and
systems work, and not try to make your business fit into a predefined template.
Every business is different, and every system being recovered has its own
nuances. Being able to capture that knowledge is critical to success.
An overview of the Disaster Recovery Planning Process - From Start to Finish 3
The next step is to define business goals. The goals should address items such
as what functional areas need to be recovered (lines of business, locations,
functionality, etc.), what length of time is acceptable for recovery, and what
amount of data loss is acceptable (some data loss is almost inevitable unless you
are very lucky or are using a SAN / real-time replication to a remote site). This
often involves prioritization and a cost-benefit analysis (often initially based on
assumptions, not fact) to determine the worth of recovery something that may
be premature at this phase of the project.
Once the systems have been identified, attempt to quantify their impact relative
to the overall business goals. This will help be essential with the prioritization
efforts for system recovery. If there are good estimates on the cost of downtime,
this information can be used to complete an accurate cost benefit analysis as
well.
It is important that issues such as these be defined in the overall scope of the
effort. Everyone involved with this effort (including Upper Management within
a company) needs to have a single vision of what success looks like. Without
this you risk wasting time and money on a plan that may be viewed as a failure.
An overview of the Disaster Recovery Planning Process - From Start to Finish 5
This information is too valuable to be without, and it also too valuable to let out
to unauthorized resources. The issues of availability and confidentiality are
important to consider when defining the document management plan. It is also
important to provided updated copies of the documentation at a periodic basis
(usually quarterly or semi-annually).
6 Disaster Recovery Planning An Overview
Execution of the plan will be stressful and people may forget simple, everyday
things. Also, resources/staffing may change and the people assigned to execute
the DRP may not be familiar with it. The use of checklists is very desirable.
These lists should have sections for a timestamp, initials of the person doing the
work, and room for comments. This information will be critical if a problem is
found downstream. A single person should be identified as a DR Coordinator,
with a backup person identified to fill-in if necessary. That person will be
responsible for monitoring each phase of the DRP, coordinating with the various
groups involved with executing the DRP, and providing status information to the
"outside world" during DRP execution. Resources should be identified as being
responsible for each and every task and procedure, and they should know
exactly what is expected of them. Again, nothing should be left to chance!
Detailed test plans should be developed prior to execution and should address all
critical functional areas of the DRP. Data should be gathered during testing
(e.g., reports, screen prints, transaction logs, etc.) and saved for future review.
In the event of problems that data may help the team make a root cause
determination regarding the problem so that it can be corrected. If everything
goes right it provides the necessary documentation to support an external
validation effort of the DRP exercise. The only way to really know if
"everything worked" is to know what "everything" is, and then to be able to
demonstrate that the necessary tasks were completed successfully!
A common problem that we see is that plans are developed, but they are never
tested, or are tested once and forgotten. A plan that is not continuously refined
and validated is almost worthless. In order to maximize the chance for success
in the event of a real disaster it is essential that the DRP be executed on a regular
basis (semi-annually is recommended). Specific recovery procedures can
generally be tested in-house on a more frequent basis. Staff should be rotated as
much as possible, thus providing a more comprehensive test of the process and
the plan, and providing trained resources in the event that they are ever needed.
An overview of the Disaster Recovery Planning Process - From Start to Finish 9
A good Project Manager will also document important events, such as detailed
meeting minutes that include information about issues, positions and opinions,
decisions made, etc. This type of supporting and historical information can
prove invaluable when recovering from a disaster, especially if key members of
the team are no longer available to provide their insight and knowledge.
Summary
It is important to define the true purpose of the DRP & BCP, define the specific
requirements (as opposed to goals) and what constitutes success during
execution (many sites consider anything less than 100% restoration and
continuation to be a failure), and then develop a plan that addresses all of those
requirements.
Remember, the DRP is a "living" document that is refined over several iterations
and updated over time. No matter how good it is it probably will fail during the
first execution. The key is to continue to improve the plan so that it will work if
and when it is ever needed. Please see our other white papers on Disaster
Recovery at http://www.comp-soln.com/whitepapers.
Comprehensive Solutions
4040 N. Calhoun Road
Suite 105
Brookfield, WI 53005
U.S.A.
No part of this document may be copied without the express written permission
of Comprehensive Consulting Solutions, Inc., 4040 N. Calhoun Rd., Suite 105,
Brookfield, WI 53005.
This document is provided for informational purposes only, and the information
herein is subject to change without notice. Please report any errors herein to
Comprehensive Consulting Solutions. Comprehensive Consulting Solutions,
Inc. does not provide any warranties covering and specifically disclaims any
liability in connection with this document.