Bispro Transport & RAN
Bispro Transport & RAN
Transport Network
Business Process
Version
1.1
Incident Management
Business Process
Page
2/21
Transport Network
Document Approval
Prepared By
No
Name
Title
Thomas Heriyanto
Mohamad Ichsan
Agus Wijaya
Ronald Renaldi
Hendri M. Tarigan
Bany Nugroho
Adhe Pahlevi
Andon B. Bandono
Signature
Reviewed by
No
Name
Charles Mankin
Moelky Furqan
Title
General Manager NOC
Management and
Service Helpdesk
Remarks
Signature
Remarks
Signature
General Manager
Transport Network
Operation
Approved by
No
1
Name
Juanita Erawati
Title
Vice President Network
Operation Management
Approved by
No
Name
Title
Director of Network
Remarks
Date
23/04/14
Signature
Version
1.1
Incident Management
Business Process
Page
3/21
Transport Network
Version History
No
1
2
3
Date
July, 24 2012
August 6,2012
February 14,2014
Version
0.1
1.0
1.1
Description
First Draft
Initial Release
Update Release
Date
23/04/14
Author
Suseno AW
Suseno AW
Ronald Renaldi
Incident Management
Business Process
Version
1.1
Date
23/04/14
Transport Network
Table Of Contents
Executive Summary..............................................................................................................5
2.
Purpose ................................................................................................................................6
3.
Scope ....................................................................................................................................6
4.
4.2.
Sub Process Incident Management for Transmission Non Leased Line .......................8
4.3.
4.4.
4.5.
5.
6.
7.
Page
4/21
Incident Management
Business Process
Transport Network
Version
1.1
Date
23/04/14
Page
5/21
1. Executive Summary
Telkomsel as one of the biggest Telco operator in the world, have the vision of Best, Leading and
Trusted Mobile Lifestyle and Solutions Provider in the Region and mission of Deliver mobile
lifestyle-services & solution in excellent way that exceed customer expectation, create value for
all stakeholders, and the economic development of the nation.
In an effort to realize the Telkomsel Vision and Mission as well as in order to face the second
curve of the Telkomsel business dynamics, comprehensive transformation program implemented
in all function unit in Telkomsel, Not least in the Network Operation.
Network operation transformations including People & Organization, Process & Procedures,
Tools Application & Infrastructure domain, while the main objective is to realize Operational
Excellence.
Version
1.1
Incident Management
Business Process
Date
23/04/14
Page
6/21
Transport Network
2. Purpose
The purpose of Incident Management business process is to be an operational guidance of
incident handling in network operation that can be conduct more effectively, and follow
administrative procedures and documented.
3. Scope
Incident Management focus on managing the lifecycle of incident that arise within Transport
Network NE, comprise but not limited to :
- Transmission Non Leased Line
SDH & DWDM FO, Microwave
- Data Communication
IPBB Router,CS Router, PS Router, OCS Router,IN Router,VAS Router, CLNS Router (OAM BSC),
MSC Router(OAM MSC), XOT Router(OAM BSC X25), CE Router(consentrator BSC router), BSC
Router(OAM BSC IP),Hi-Cap Router
- Transmission Leased Line
Terrestrial, Satellite
Event Arise
Incident
Detection &
Preliminary
Impact
Analisys
Incident
Assignment &
Notification
Broadcast
Investigation,
Resolution &
Recovery
Closure
Event Arise
Event arriving as the input of Incident Management Process. It can be NE Alarm, performance
degradation alert.
Incident Detection & Preliminary Impact Analysis
This step is related to incident detection, including NE Alarm detection from system & Alarm
Management on how to manage alarm arriving from NE including, filtering/suppress,
normalization and correlation to have preliminary root cause analysis.
Preliminary impact analysis also conduct in this step in order to classify the priority of
Event/Alarm to be handled by assigned workgroup.
Incident Assignment & Notification Broadcast
In this activity, Incident is assigned to specific workgroup by Ticketing System. For many specific
Event/Alarm, according to user agreement, Incident assignment can be done either by Manually
or Automatic in the ticketing system.
Fault / Incident notification also broadcasted via defined media (SMS & Email) followed by phone
call to Network Operation and Business User.
Incident Management
Business Process
Transport Network
Version
1.1
Date
23/04/14
Page
7/21
Version
1.0
Incident Management
Business Process
Date
23/04/14
Page
8/21
Transport Network
4.2. Sub Process Incident Management for Transmission Non Leased Line
Business Process
Network Element
ICT Region
NS (1st Layer)
NOM-TNO
(3rd Layer)
Remarks
Investigation &
Diagnose
4
8
5
Resolution (Workaround
or permanent)
12
Investigation &
Diagnose
Investigation &
Diagnose
9
Change Management
Network (If Needed)
7
No
Incident Update
Service
Recovered?
Yes
Monitoring
Stability
Resolution (Workaround
or permanent)
Resolution, (Workaround
or permanent)
Change Management
Network (if needed)
Change Management
Network (if needed)
Escalate
11
Service
Recovered?
rd
No
party TSA
covered?
No
Service
Recovered ?
No
Yes
Yes
Yes
10
Monitoring
Stability
14
Closed
15
Monitoring
Stability
Yes
:
:
:
:
1.0
06 August 2012
Suseno Ari Wibowo, Liberty S
1 of 4
Version
1.0
Incident Management
Business Process
Date
23/04/14
Page
9/21
Transport Network
Business Process
Network Element
: IPBB Router,CS Router, PS Router, OCS Router,IN Router,VAS Router, CLNS Router (OAM BSC), MSC Router(OAM MSC),
XOT Router(OAM BSC X25), CE Router(consentrator BSC router), BSC Router(OAM BSC IP),Hi-Cap Router
ICT Region
NS (1st Layer)
NOM-TNO
DCCO & DCSO (3rd Layer)
Investigation &
Diagnose
4
8
5
Resolution (Workaround
or permanent)
12
Investigation &
Diagnose
Investigation &
Diagnose
9
Change Management
Network (If Needed)
7
No
Incident Update
Service
Recovered?
Yes
Monitoring
Stability
Resolution (Workaround
or permanent)
Resolution, (Workaround
or permanent)
Change Management
Network (if needed)
Change Management
Network (if needed)
Escalate
11
Service
Recovered?
rd
No
party TSA
covered?
No
Service
Recovered ?
No
Yes
Yes
Yes
10
Monitoring
Stability
14
Closed
15
Monitoring
Stability
Yes
Remarks
1.Event arriving as the input of Incident Management
Process
2. This step is related to incident detection, including NE
Alarm detection from system & Alarm Management on how to
manage alarm arriving from NE including, filtering/suppress,
normalization and correlation to have preliminary root cause
analysis.
Preliminary impact analysis also conduct in this step in order
to classify the priority of Event/Alarm to be handled by
assigned workgroup.
3. In this activity, Incident is assigned to specific workgroup
by Ticketing System. For many specific Event/Alarm,
according to user agreement, Auto fault rules is defined and
running to have an automatic ticket assignment to specific
workgroup.
Incident notification also broadcasted via defined media
(SMS & Email) to Network Operation and Business User.
Incident Mgt Perform tracking and update the info to the
Incident are being handled, to the service back to normal
(until the ticket closed), as well as collecting evidence from
the incident happened
4. NS Conduct investigations and diagnosis of the root
Caused Incident happened
5. NS Conduct Incident Resolution either by workaround or
permanent solution. In this step, change mgt process might
be involved.
6. If Service recovered, monitoring stability is performed &
NS will do change ticket status into Resolved. .
7. If Service not recovered, and need 2nd level support
escalation, NS will Reassign ticket to RPA/TPA
8. RPA/TPA parallel with DCCO/DCSO will do root caused
Investigation & diagnosis.
9. RPA/TPA parallel with DCCO/DCSO Conduct Incident
Resolution either by workaround or permanent solution. In
this step, change mgt process might be involved.
10. If Service recovered, monitoring stability is performed &
RPA/TPA will change ticket status into Resolved
11. If Service not recovered, and include for 3rd party TSA.
RPA/TPA will escalate to 3rd party vendor. 3rd Party Vendor
will do incident resolution.
Otherwise need for HQ 3rd level support escalation, RPA/TPA
will Reassign ticket to DCCO/DCSO.
12. DCCO/DCSO will do Investigation, Diagnose, Resolution
& Recovery
13. Incident Mgt will do ticket closing & Closed Notification
Broadcast. Incident Inventarization & Documentation
performed. Problem Management Process Is Performed if
necessary
Document Information
Version
Date
Author
Page
:
:
:
:
1.0
06 August 2012
Suseno Ari Wibowo, Liberty S
2 of 4
Version
1.0
Incident Management
Business Process
Date
23/04/14
Page
10/21
Transport Network
Business Process
Network Element
External Network
Provider
NOM-TNO
(3rd Layer)
Start
Receive
Notification
Incident
Transmission
Leased Line
Acknowledge
Event
Transmission
Leased Line From
NMS
Escalate Incident
to Helpdesk
Network Provider
Broadcast
Notification
Incident Leased
Line
Receive Notification
Incident Transmission
.Leased Line
Helpdesk Provider
Create Ticket
Incident
Investigate &
Diagnose
Escalate Incident
to Network
Provider
5
Escalate Incident to
Network Provider
Resolution &
Recovery
Remarks
Incident Update
6
Broadcast
Notification
Incident Closed
Recapitulation
Data of Incident
Leased Line
Report Incident
(Closed)
7
Progress Update
Incident Closed
Recapitulation Data of
Incident Leased Line
Document Information
End
Version
Date
Author
Page
:
:
:
:
1.0
06 August 2012
Suseno Ari Wibowo, Hirwandi, Liberty S
3 of 4
Version
1.0
Incident Management
Business Process
Date
23/04/14
Page
11/21
Transport Network
: Problem Management
Customer Complain
Management Process
ICT Region
ROC
SOC
Incident
Management
Process
Remarks
NOM-TNO
1. Problem Identification
Identification Problem to that came from
Incident, Customer Complain.
Problem
Identification
Categorization &
Prioritization
Problem
Type?
Infrastructure
Reason
Others
(Process, etc)
Problem Mgt
Meeting
Diagnosis (Root
Cause Analysis)
3
Problem Mgt
Meeting
Diagnosis (Root
Cause Analysis)
Recommendation of
Permanent Solutions
4
Recommendation of
Permanent Solutions
Apply Fix?
Problem
Tracking &
Monitor
YES
Change & Release
Management
Process
Record Know
Error Database
6
Implementation
Fix
Problem Closing
Review
NO
Effective?
Implementation
Fix
5
NO
3, Diagnosis
Conduct Diagnosis to Identify the underlying
root cause of a problem.
Problem Management Meeting with related
party including 3rd party vendor will be
conducted if necessary.
4. Recommendation of permanent solution
Document Deliver to TNO
5. Problem Resolution, initiate the most
appropriate & economical problem
resolution. Change Management Process
will be conducted if necessary.
6. Implementation & Evaluation Resolution
To ensure that after a successful problem
solution, the problem record contains a full
historical description and that related Known
Error Record are updated.
7. Problem Closing Review
To Review the resolution of a problem in
order to prevent recurrence and learn any
lesson for the future.
If the resolution is not effective to solve
problem, then have to initiate a further
diagnosis.
Document Information
YES
Closed
Version
Date
Author
Page
:
:
:
:
1.0
20 Juli 2012
Suseno Ari Wibowo, Liberty S
4 of 4
Incident Management
Business Process
Version
1.1
Page
12/21
Transport Network
Role
Event Management
Incident Management
Problem Management
Description
Event Management including
- Alarm Monitoring & Detection
Manage lifecycle of incident
- Incident Escalation/assignment
- Incident Tracking
- Incident Notification Broadcast
- Incident closing
First Layer Incident Resolution
- Investigation
- Resolution
- Recovery
Second Layer Incident Resolution
- Investigation
- Resolution
- Recovery
Third Layer Incident Resolution
- Investigation
- Resolution
- Recovery
Problem Identification,
Administration & Inventory
Problem Diagnose (root cause
analysis),
Resolution/Implementation Fix
Date
23/04/14
Role Mapping
Resource Operation
Center Department
Resource Operation
Center Department
Resource Operation
Center Department
Transport Network
Operation Division
Version
1.1
Incident Management
Business Process
Page
13/21
Transport Network
NOC
Ticket Started
Assigned
InProgress
Assign to Prev
Assignee
Resolved
Yes
Re Open ?
Canceled
No
Yes
Solved?
No
ICT Region - NS
Received
Assignment
Working On
Ticket
Yes
Canceled ?
Ticket
Cancelled
No
Yes
Solved ?
No
Ticket
Resolved
No
Escalate ?
Yes
Received
Assignment
Working On
Ticket
Ticket
Cancelled
Canceled ?
No
Yes
Solved ?
No
Ticket
Resolved
No
Escalate ?
Yes
NOM - TNO
Received
Assignment
Working On
Ticket
No
Canceled ?
Yes
No
Yes
Solved ?
Ticket
Resolved
Date
23/04/14
Ticket
Cancelled
Closed
Ticket Closed
Version
1.1
Incident Management
Business Process
Date
23/04/14
Page
14/21
Transport Network
Detail :
1. Ticket creation (Status = New).
2. Incident ticket will be assigned to her/himself in order to perform 1st level troubleshooting. If
the incident still remains, the ticket will be assigned to 1st level support , ICT Region Network
Service (NS) workgroup (Status = Assigned).
3. A member of assigned group will take the assignment to her/himself (Status = Assigned).
4. Assignee will work on ticket (Status = In Progress).
5. Assignee can update ticket to pending (Status = Pending) and then resumes it again after
some time (Status = In Progress).
6. If NS cannot solve ticket, it can be escalated to 2nd support, ICT Region Resource Performance
Assurance (RPA) workgroup. (Status = Assigned).
7. If 2nd Support RPA cannot solve ticket, it can be escalated to 3rd support NOM HQ group, TNO
(Status = Assigned).
8. Once incident ticket is solved (Status = Resolved), it will need validation conducted by NOCROC Department
9. ROC-Incident Management can close CC ticket (Status = Closed), or reopen it if the solution is
not solving incident ticket (Status = Assigned).
10. If there is no validation until 3 days after ticket resolved, then it will be closed automatically
by system (Status = Closed).
Incident Management Status & Lifecycle
The status represents the state of the incident ticket handling. The following is the table of status
Status
New
Assigned
In Progress
Pending
Resolved
Closed
Description
The Incident Ticket is unassigned.
The Incident ticket has been assigned to workgroup
but has not acknowledge
The assignment has been accepted and the assignee
is working toward resolution
Work on the incident ticket has been temporary
suspended
A resolution or work around to restore the service
has been determine, need further validation
Restoration of the service has been validated and no
longer required.
The following is the flow diagram of status changes during the life cycle of incident ticket.
Cancelled
New
Assigned
In Progress
Pending
Resolved
Closed
Version
1.1
Incident Management
Business Process
Date
23/04/14
Page
15/21
Transport Network
Description
Incident that has criteria of :
- NE Down
- No Service Available/Totally
Outage
- Totally signaling down
- CDR Billing Interruption
Major
Minor
Sample of Case
- NE Down due to power module problem
- Router Core Down.
- FO Cut for main & protection route for
backbone link
- Cross connect Main Module fail and cant
switch to protection for main ring of
backbone link
- Optical Transceiver main module problem
fail and cant switch to protection for
backbone link.
- Aggregate module problem for STM-16
above.
- Etc
Note : Correlate with critical severity at Core &
RAN
- Cross connect Main Module fail and cant
switch to protection for satellite ring.
- Microwave link stm1 propagation problem.
- Etc.
Note : Correlate with major severity at Core &
RAN
- Crossconnect Main Module fail and able to
switch to protection
- Optical Transceiver main module problem
fail and can switch to protection
- NE Module problem, but Service still UP.
- Etc
Note : Correlate with minor severity at Core &
RAN
Incident Assignment
Incident Resolution
Critical
10 Minutes
4 Hours
Major
30 Minutes
8 Hours
Minor
30 Minutes
24 Hours
Incident Management
Business Process
Version
1.1
Date
23/04/14
Transport Network
Page
16/21
Term Definition
In ITSM (IT Service Management) terms, the use of the word 'agreement' rather
than contract signifies less the legal differences between the two and more a
difference in approach and style.
'Agreement' is used exclusively for an understanding, normally written, between
internal parties (though it may be
appended to and therefore form part of an external contract). An agreement is
likely to register an aspiration for a particular service level whereas a contract will
usually record the minimum service level permissible. The wording in a contract
must represent its legally binding nature but the wording of an ITSM agreement
reflects much more the nature of the (aimed for) relationship between the parties
involved.
Closure
Customer
Diagnose
Escalation
Event
Function Unit
Help Center
Impact
Impact Analysis
The identification of critical business processes and the potential damage or loss
Incident Management
Business Process
Version
1.1
Date
23/04/14
Transport Network
Page
17/21
An event which is not part of the standard operation of a service and which
causes or may cause disruption to, or a reduction in, the quality of services and
Customer productivity. An Incident might give rise to the identification and
investigation of a Problem. In the Remedy Help Center application a incident can
be promoted to a problem and have additional cases tied to it. Problem
Management might, however, manage the resolution of the Incident and Problem
in tandem, for instance if the Incident can only be closed by resolution of the
Problem.
Job Description
Key Performance
Indicator (KPI)
Knowledge
Knowledge
Base
Objective
Occurrence
The first stage in an Incident life-cycle when the loss of service quality occurs.
Occurrence precedes Detection
(OLA) An internal document, owned by both parties, that defines the working
relationship between different functional areas within IS&T. The OLA sets out the
responsibilities for the support and delivery of IT services to Customers. Between
the Help Center and other support/software maintenance/network management it
may be mainly concerned with the activities that must take place should a service
fail. In other circumstances, for example in support of Change Management, it is
likely to describe various executive responsibilities and activities of the parties
involved. The terms of any OLA must support the qualitative and quantitative
statements contained in the SLAs. There is a strong relationship between OLAs
and procedures.
All of the individuals employed by the organization including full time, part time,
temporary and contract employees. The term may also include Customers, Users
Operating Level
Agreement
People
Incident Management
Business Process
Version
1.1
Date
23/04/14
Transport Network
Page
18/21
and contractors.
Problem
The root cause of one or more existing or potential Incidents. Problems may
sometimes be identified because of multiple incidents that exhibit common
symptoms. Problems can also be identified from a single significant Incident,
indicative of a single error, for which the cause is unknown and all standard
troubleshooting and diagnostics have failed to produce a verifiable solution or the
case is unable to define a bounded time for a solution. A problem may also have a
known solution, but implementing the solution is not possible due to resource
limitations or application version. Occasionally Problems will be identified well
before any related Incidents occur.
Priority
Procedure
A set of specific steps that describe how an activity should be carried out, and by
whom. For example, the procedure dealing with carrying out a postimplementation
review of a Change would be likely to describe the scope of the
procedure (to what Changes does this procedure apply), its purpose and how the
success of the Change will be measured, the individual procedural steps and the
responsibilities for carrying out or being involved in each of those steps.
Procedures may be supported by more detailed Work Instructions.
Process Owner
A Process Owner is a senior manager with overall responsibility for ensuring the
sustainability of a process. The Process Owner's responsibilities include those of
sponsorship, design (including relevant metrics for process) and operation, mainly
quality assurance of continuing process suitability
A party who provides a service. May be an internal service department (e.g.
engineering, computer department, building services), or an external outsourcing
company or third party supplier
The totality of features and characteristics of a product or service which bear on
its ability to satisfy stated and implied needs
A request for a change, usually both common and straightforward, to be made to
a service. A Service Request is characterized by the fact that the Change can be
made under strict, well-defined procedural control and is therefore (virtually) risk
free. Providing access to services for a new member of staff and relocating PCs
are two typical examples.
Finding the real cause of the problem and dealing with it rather than simply
continuing to deal with the symptoms.
Provider
Quality
Request
Root Cause
Analysis
Service
Service Hours
Severity Code
Single Point of
Contact
Incident Management
Business Process
Version
1.1
Date
23/04/14
Transport Network
Page
19/21
able to concentrate on their work without interruption and work can be coordinated
and matters dealt with once.
Service Level
Agreement (SLA)
System
Trouble Ticket
A term used in a number of Service Support tools, analogous but not normally
directly equivalent to the more precise ITIL terms Incident and Problem
A measure of business criticality of an Incident, Problem or Change where there is
an effect upon business deadlines. The urgency reflects the time available for
repair or avoidance before the impact is felt by the business. Together with
impact, and perhaps technical severity, it is the major means of assigning priority
for dealing with Incidents, Problems or Changes. It is set by the customer based
on the assessment of the business impact.
Urgency
Work in
Progress
Tasks formally identified but not yet completed. WIP reports will normally
comment on the extent to which the WIP is complete and on any aspect of the
WIP that changes previous assumptions about time, cost or quality.
Version
1.1
Incident Management
Business Process
Date
23/04/14
Page
20/21
Transport Network
Core NE Domain
OLA NSA
(Hours)
OLA
RPA/NRA
(Hours)
10 Minutes
10 Minutes
OLA NIM
TOTAL
(Hours) RESOLUTION TIME
(Hours)
30 Minutes
30 Minutes
30 Minutes
12
12
24
30 Minutes
12
12
24
Incident Management
Business Process
Transport Network
Version
1.1
Date
23/04/14
Page
21/21