IT BC DR Policy
IT BC DR Policy
This document covers the overall approach to Business Continuity in the IT department. It is
complemented by the IT Business Continuity Plan and IT Disaster Recovery Procedures
which provide the details of how to prepare, respond and recover. The policy focuses on what
should be done compared to the plans documents which relate to how to execute.
Changes to this document are summarized in the following table in reverse chronological
order (the latest version first).
Table of Contents
Introduction....................................................................................................................... 2
Definitions ................................................................................................................................. 2
Scope ......................................................................................................................................... 2
Policy ................................................................................................................................. 2
Risk Assessment ........................................................................................................................ 3
Critical Resources and Processes Identification .......................................................................... 3
Service Design............................................................................................................................ 4
Business Continuity Plan ............................................................................................................ 4
Plan Maintenance and Testing ................................................................................................... 5
Training ..................................................................................................................................... 5
Standards .................................................................................................................................. 5
Open Points ............................................................................................................................... 6
Compliance ................................................................................................................................ 6
Appendix #1: Definitions .................................................................................................... 6
Background Material ......................................................................................................... 7
Introduction
CERN requires that business continuity plans be created such that operations can continue
(potentially at a reduced capacity) and quickly recover following a loss of service or
resources due to a major interruption. Our users, employees, contractors and other
stakeholders depend on the IT department (http://cern.ch/it) to provide computing services as
promised. This policy defines who is responsible for protecting the organisation’s assets by
planning for a disaster and what needs to be done to ensure that CERN can continue
operations during and after a major outage.
This policy is maintained by the Department Business Continuity Coordinator (DBCC) and
will be reviewed annually. Significant changes will be submitted to the IT department head
office for review and endorsement. The latest version is available at https://disaster-
recovery.web.cern.ch/documents/cern-it-business-continuity-policy.
This policy was approved by IT management in October 2023. The IT department Business
Continuity Plan (BCP) includes the roadmaps for implementation.
Definitions
Scope
This policy encompasses all IT and business processes used in the day-to-day operation of the
IT department, i.e. the role as a provider. Every employee of the department is responsible for
understanding their activity in following this policy.
Throughout this document, IT services will be used to describe the processes and IT
applications, frameworks and systems used to deliver them.
Policy
The IT department is required to create, maintain, and test a business continuity plan for
services under their responsibility.
The following sections list the activities that must be performed.
Risk Assessment
The IT department will maintain an IT department risk register including details of the impact
and likelihood of risks to IT services.
This risk register is maintained by the IT Department Risk Manager (DRM) following these
guidelines and is updated annually. Following a review with the IT Department Head, the
results are submitted to the CERN Enterprise Risk Management team.
Following the ERM process, the criteria for department risk registers is defined as the
following sections.
A Business Impact Analysis (BIA) must be performed for each key process and system used
in the department’s areas of responsibility. Each critical system and source of data for these
systems must be identified and the importance to the Organisation documented. The BIA and
risk assessments must include:
Service Design
All IT services should consider recoverability as a central part of their design and
implementation.
Each system should clearly identify the backup and recovery processes including frequency,
retention, resilience, and verification. This information should be made easily accessible to
users of the service as part of the service level description. Any significant change in the
design should be reviewed by the IT Architecture Review Board (ARB) according to the Data
Recoverability concepts and the ARB checklists. The IT Change and Release Management
Board (CRMB) checklist should include the need to produce evidence of compliance
according to the CIS Critical Security Controls - control 11.
Each IT service is responsible to ensure that the recovery objectives of any IT service it
depends on is compatible with its service levels.
The IT Business Continuity Plan (BCP) covers how an incident would be handled. This is
based on a “Prepare, Respond and Recover” approach.
The DBCC is responsible for validating that the BCP is kept up to date and is tested on a
regular basis.
The IT Service Catalog as defined in Service Now will be used to structure the analysis and
planning. Where this process suggests improvements to the catalog, these will be reviewed in
conjunction with the IT Service Management team.
a) The BCP should be reviewed regularly for new or changed risks and updated to
reflect any organizational changes. Frequency to be defined in the BCP, but at least
annually.
b) Each service should execute a test of its individual DRP regularly (such as to restore
an environment and validate functionality) and recovery times from the disaster
scenario logged. Frequency to be defined in the DRP, but at least annually. This is
part of the standard responsibilities of all IT services.
c) Where there are differences between the desired recovery objectives from the
business and the actual results from the testing, these should be addressed via new
initiatives with the engagement channels or modification of the service levels if the
risk of delayed recovery is accepted
d) The test of the IT BCP and the DRP for each IT service must be performed at least
during every accelerator long shutdown to ensure compliance with RTO/RPO
defined in the IT service level. An example scenario would be a disconnect test of
one of the CERN data centres.
e) Any exemptions should be submitted including justification to the DBCC. If these are
accepted, they will be documented in the corresponding plan documentation.
Training
The BCP for the IT department should define a training plan for those IT department
members who may be involved in designing, testing and recovery of IT services.
Standards
There are a number of industry standards in this area such as ISO22301 (Business Continuity)
and ISO 27001 (Disaster Recovery). While these can provide inspiration for some of the
approaches, it is not proposed to become certified due to the significant effort to fully comply
(even if not relevant to CERN IT service delivery) and handling divergence between industry
and CERN governance.
For assessments, checklists such as CIS Critical Security Controls can be used to provide
an industry framework.
Open Points
Any open issues with the BC/DR policy are tracked in https://its.cern.ch/jira/browse/BCDR-
33.
Compliance
An as-is assessment on the status of the BCP will be performed by the DBCC annually with a
report submitted to the IT Department Head Office (DHO).
Background Material
Description Description
Data Recoverability Concepts and Strategy How backup and restore should be implemented
and validated in CERN IT