ZTE - UMTS - KPI Optimization Analysis Guide V1 (1) .1
ZTE - UMTS - KPI Optimization Analysis Guide V1 (1) .1
V1.1
LEGAL INFORMATION
By accepting this certain document of ZTE CORPORATIN you agree to the following terms. If you do not agree to the following terms, please notice that you are not allowed to use this document. Copyright 2009 ZTE CORPORATION. Any rights not expressly granted herein are reserved. This document contains proprietary information of ZTE CORPORATION. Any reproduction, transfer, distribution, use or disclosure of this document or any portion of this document, in any form by any means, without the prior written consent of ZTE CORPORATION is prohibited. and are registered trademarks of ZTE CORPORATION. ZTEs company name, logo and product names referenced herein are either trademarks or registered trademarks of ZTE CORPORATION. Other product and company names mentioned herein may be trademarks or trade names of their respective owners. Without the prior written consent of ZTE CORPORATION or the third party owner thereof, anyones access to this document should not be construed as granting, by implication, estopped or otherwise, any license or right to use any marks appearing in the document. The design of this product complies with requirements of environmental protection and personal security. This product shall be stored, used or discarded in accordance with product manual, relevant contract or laws and regulations in relevant country (countries). This document is provided as is and as available. Information contained in this document is subject to continuous update without further notice due to improvement and update of ZTE CORPORATIONs products and technologies.
ZTE CORPORATION
Address : NO. 55 Hi-tech Road South ShenZhen P.R.China 518057 http://support.zte.com.cn [email protected]
Website : Email:
Revision History
Product Version Document Version V1.0 V1.1 Serial Number Reason for Revision First published Content modification
Author
Date 2007-12-28 2010-07-12 Document Version V1.0 V1.1 Prepared by Qin Jianhan Wang Cun Reviewed by Wang Zhenhai, and Jin Zhengtuan Wang Zhenhai, and Jin Zhengtuan Approved by Jin Zhengtuan Jin Zhengtuan
II
Key Words:
KPI (key performance indicator), indicator definition, formula, KPI monitoring flow, KPI optimization, KPI classification
Abstract:
This guide mainly describes the formulae, KPI classification, KPI monitoring methods and flows, and KPI optimization methods.
Abbreviations
Abbreviation ATM CDR CE CN CPICH CQI CQT DT E-DCH HSDPA HS-DSCH HS-SCCH HSUPA ICMP IP IPoA KPI LAN MAC MBMS NodeB OMC PDP PI PPP PS QoS RAB Full name Asynchronous Transfer Mode Call Drop Rate Channel Element Core Network Common Pilot Channel Channel Quality Indicator Call Quality Test Drive Test Enhanced uplink Dedicated Channel High Speed Downlink Packet Access High Speed Downlink Shared Channel High Speed Shared Control Channel High Speed Uplink Packet Access Internet Control Message Protocol Internet Protocols Internet Protocols Over ATM Key Performance Index Local Area Network Media Access Control Multimedia Broadcast and Multicast Service Node B Operation & maintenance Centre Packet data protocol Performance Index Point to Point Protocol Packet-Switched domain Quality of Service Radio Access Bearer
III
Abbreviation RF RNC RRC RRU RSCP RTWP SAAL SCCP SNR TB TCP UDP UE VIP VP WAN
Full name Radio Frequency Radio Network Controller Radio Resource Control Radio Remote Unit Received Signal Code Power Received Total Wideband Power Signaling ATM Adaptation Layer Signaling Connection Control Part Signal to Noise Ratio Transport Block Transfer Control Protocol User Datagram Protocol User Equipment Very Important People Video Phone Wide Area Network
IV
TABLE OF CONTENTS
1 Overview 1 2 KPI Monitoring Process................................................................................................2 2.1 KPI Monitoring Process....................................................................................................2 2.2 Routine KPI Monitoring Process......................................................................................2 2.3 KPI Monitoring Process During Parameter Modification.................................................4 2.4 KPI Monitoring During RNC or NodeB Version Upgrade................................................5 2.5 KPI Monitoring During Cutover........................................................................................6 3 KPI Analysis Methods...................................................................................................6 3.1 KPI Analysis Methods......................................................................................................6 3.1.1 TOP N Worst Cells Method.....................................................................................7 3.2 Basic KPI Analysis Skills................................................................................................10 3.2.1 KPI Monitoring Tools.............................................................................................11 3.2.2 KPI Analysis Tools.................................................................................................11 3.3 General Process of KPI Optimization Analysis..............................................................12 4 KPI Optimization Analysis ..........................................................................................16 4.1 CS Call Drop Optimization ............................................................................................16 4.1.1 Definition of Call Drop............................................................................................16 4.1.2 CS Call Drop Analysis Flowchart..........................................................................17 4.2 PS Call Drop Optimization..............................................................................................19 4.2.1 Optimization Flowchart..........................................................................................19 4.3 Optimization of Accessibility Indicators..........................................................................21 4.3.1 Definition of Access Failure...................................................................................21 4.3.2 Analysis on RRC Connection Failures..................................................................22 4.3.3 Analysis on RAB/RB Setup Failures.....................................................................27 4.4 Optimization of Mobility Indicators.................................................................................33 4.4.1 Optimization of Soft Handovers............................................................................33 4.4.2 Optimization of Hard Handovers...........................................................................39 4.4.3 Optimization of Inter-RAT Handovers...................................................................44 4.5 Optimization of Resource Indicators..............................................................................48 4.5.1 Resource Indicator Optimization Flowchart..........................................................49 4.5.2 Code Resource Optimization................................................................................51 4.5.3 Monitoring CE Resource.......................................................................................51 4.5.4 Optimization of Power Control..............................................................................52 4.5.5 Speeding up Rate Downgrade..............................................................................52 4.5.6 Monitoring and Optimizing Uplink Capacity..........................................................52 4.5.7 Optimization of Uplink Capacity at the Whole Network Level..............................53 4.5.8 Optimization of Uplink Capacity for a Single Cell.................................................53
VI
FIGURES
Figure 1-1 Joint KPI analysis...........................................................................................1 Figure 2-2 Routine KPI monitoring process...................................................................2 Figure 2-3 KPI monitoring process during parameter modification.............................4 Figure 2-4 KPI monitoring workflow during RNC or NodeB version upgrade.............5 Figure 3-5 RF configuration at HIC site...........................................................................8 Figure 3-6 Antenna energy distribution at HIC site........................................................8 Figure 3-7 KPI optimization analysis process .............................................................15 Figure 4-8 CS call drop analysis flowchart...................................................................17 Figure 4-9 PS call drop optimization flowchart............................................................19 Figure 4-10 Analysis flowchart of RRC connection setup failures.............................23 Figure 4-11 Analysis flowchart of RAB setup failures.................................................28 Figure 4-12 Soft handover optimization flowchart.......................................................34 Figure 4-13 Hard handover optimization flowchart......................................................41 Figure 4-14 Inter-RAT handover optimization flowchart..............................................46 Figure 4-15 Resource indicator optimization flowchart...............................................49
TABLES
VII
Table 3-1 List of CS TOP N Worst Cells..........................................................................7 Table 3-2 Indicators Related with CS Call Drop..............................................................9 Table 4-3 Parameters That 2G Shall Provide to 3G......................................................44 Table 4-4 Parameters That 3G Shall Provide to 2G......................................................44 Table 4-5 Resource KPIs and Alarm Thresholds..........................................................48 Table 4-6 Adjusting Code Resource Allocation............................................................51 Table 4-7 Example of Parameter Modification for Rate Downgrade...........................52 Table 4-8 Example of Power Control Parameter Modification.....................................53 Table 4-9 Example of Power Control Parameter Modification for Heavy-Traffic Cell 53
VIII
1 Overview
The radio network KPIs directly reflect the network quality, and KPI monitoring is an important means to locate the faults. KPI monitoring and optimization are mostly performed during the network operation and maintenance stage. Abnormal events are supposed to be detected as early as possible and handled with proper solutions so that sound voice and data services can be ensured for the subscribers. At the beginning of the network construction, the optimization team should put more emphasis on the RF adjustment rather than the optimization of KPIs except for CS call drop rate, the PS call drop rate, and the RTWP indicator. During the network operation and maintenance stage, KPI optimization (also called parameter optimization) plays the main role, that is, the optimization team should optimize a certain indicator through integrated parameter adjustment so as to meet the customers requirements. KPI data comes from NetNumenT31, the network management system in the operation and maintenance center (OMC). Based on the analysis on KPIs, the current states of those indicators are learned and they are important reference for assessing the network performance. The KPIs include the network service retaining capacity, accessibility, mobility, system capacity, and so on. According to the current values of these indicators, for example, some site has congestion, some site has a call drop rate of 10%, or some RNC has a certain worst cell proportion, busy cell proportion, cell code resource availability, access success rate, call delay and handover success rate, the optimization team should judge and locate the area, scope and severity of the fault. KPIs are divided into service KPIs and network KPIs by the statistic sources. Service KPIs are collected through field drive tests (DTs) while network KPIs are collected from the unified network management system. This article mainly discusses the analysis on network KPIs. Usually, the final solution is made based on the joint analysis on the OMC KPI data, alarms, subscribers complaints, and DT results.
Problem handling team classifies, collects and locates the worst cells
Yes
No
Locate the worst cells, and check if they are caused by the parameter modification.
No
Whether the KPIs at the RNC level are normal. Yes Keep on monitoring (15minute granularity)
Output a report in Word (hourly granularity KPIs before and after the parameter modification)
End
Locate the worst cells. Determine whether they are related with the version update. No
Yes
End
Table 3-1 List of CS TOP N Worst Cells Index 1 2 3 Cell ZBL1U-AI-1 ZBL1U-AI-3 HKE1U-5H1 RNS subnet ID (201) (201) (203) Cell ID 12911 12913 30461 Call drop rate, CS AMR 41.58% 39.55% 15.56% Number of call drop, voice 553 545 370
Index 4 5 6 7 8 9 10
Call drop rate, CS AMR 15.81% 3.39% 2.26% 2.49% 2.30% 3.92% 3.41%
Number of call drop, voice 360 282 216 215 205 169 167
Step 2: Check the transmission and hardware of the TOP N cells and check whether they are caused by external abrupt incidents, such as terrible whether, gatherings, or holidays when traffic is usually heavy. And then, we conducted a health check for each cell and paid attention to routine alarms and BPC board problems. We found there were broken associations in some HKE sites. Step 3: Check the radio parameters configuration of these cells, the radius of these cells and their neighboring cells, and compare them with the normal cells. (1)Problem with the cell radius: After the check, we found the cell radius of the LAK site was 2.5 km. Because the LAK site was situated by the sea and the antenna was placed very high, the radius of 2.5 km was far from enough. So we changed the cell radius to 10 km, and the problem of high call drop rate was thus solved. (2)Problem with configuration: HIC site is an indoor POI site. The RRU RxTx port and the RRU Rx port were configured reversely, which is the cause of high call drop rate. After modifying HIC, we found that signals of the second RRU were received by the Rx port. So we changed the configuration of the RxTx port and the Rx port, the problem of high call drop rate was thus solved. Figure 3-5 RF configuration at HIC site
Step 4: Export the indicator relevant most closely with the indicators you care about and analyze it to find the problem indirectly.
Table 3-2 Indicators Related with CS Call Drop RAB release number for Iu connection release request by UTRAN for CS domain in cell, radio connection with UE lost 482 473 346 330 69 100 64 RAB release number for Iu connection release request by UTRAN for CS domain in cell, failure in the radio Interface procedure 43 40 16 18 196 100 131 RAB release number for Iu connection release request by UTRAN for CS domain in cell, release due to overload control 0 0 0 0 0 0 0 RAB release number for Iu connection release request by UTRAN for CS domain in cell, unspecified failure 29 33 8 13 18 16 20
Index
Cell
1 2 3 4 5 6 7
Index
Cell
RAB release number for Iu connection release request by UTRAN for CS domain in cell, radio connection with UE lost
RAB release number for Iu connection release request by UTRAN for CS domain in cell, failure in the radio Interface procedure
RAB release number for Iu connection release request by UTRAN for CS domain in cell, release due to overload control
RAB release number for Iu connection release request by UTRAN for CS domain in cell, unspecified failure
In the process of abnormity location, keep a clear aim in mind, and be able to apply the process and basic principle to check the other relevant indicators rapidly to facilitate the analysis. Be familiar with the process and basic principle and be able to make logical association between abnormal KPI problems and network problems (such as the coverage problem and the interference problem). Be able to determine the problem nature according to the abnormal KPI, and then choose the appropriate tool to analyze the problem in depth. Performance analysis requires engineers to understand basic signaling process, be familiar with the protocol stacks of standard interfaces, and know relevant algorithms to realize the product functions. Engineers should at least have a concept about the various algorithms. If the analysis of a commercial network involves some algorithms, engineers should study these algorithms in depth.
10
11
abnormity probe analysis, you need to download abnormity probe files from different OMCB servers and then use the abnormity probe tool to make a comprehensive analysis. CTS Tool: CTS is a tool developed by the CN department, which can trace signaling in depth according to IMSI, and trace signaling across RNCs. So this is particularly suitable to trace VIP subscribers. In this case, CTS is easier to use than SignalTrace, which can only trace signaling of RNCs one by one. CTS can trace the interactive signaling between network elements (NEs) within the CN, as well as the signaling of the Iu interface and the Uu interface. This kind of signaling tracing is what we called in-depth tracing. The work principle of CTS is to set up an IMSI task on the CTS server and send it to the CN front side, which will then send this task to each CN module via the interfaces dedicated to the CN modules and the RNC, and then each module, after receiving the signaling related to the IMSI task, will send the signaling back to the CTS server via the CN front side. The interfaces mentioned above are private interfaces, so this tool can only support our own CN and RNC. CTS signaling can be checked and analyzed with an offline tool, but the offline tool does not work very well because of the lack of continuous optimization and perfection. UE log: DT test is also an important auxiliary way in analyzing KPI indicators. There are many problems that cannot be located by tracing signaling at the network side, and can only be located by the use of UE log. The commonly used drive test software includes: QXDM/APEX (QCAT), CNT/CAN and TEMS. CNT/CAN and TEMS are often used for network optimization. For the use of CNT/CAN, please refer to the corresponding help file and the instruction document publicly released by the Network Optimization Tool Department. QXDM and the analysis tool APEX (QCAT) provided by Qualcomm is very powerful, which have contributed a lot for the stability and maturity of our system for many years.
12
problem, just ignore them. Otherwise, try to locate the RNC NE that has the problem. Step 2: Analyze the indicators of the corresponding RNC to find out the RNC whose indicators have the problem. Step 3: Analyze the indicators of the cell under the problem RNC to find out the worst cells or TOP N cells. If the indicators of all the cells under the RNC are tend to be low, it is a common problem probably caused by parameter configuration. And then check whether the radio parameter configuration in the cells under this RNC is the same as that in the cells under the normal RNCs. Step 4: Make a comprehensive analysis on the KPIs, alarms, DT test data, and customer complains of the worst cells to find out a solution. Analysis method: After learning the KPI analysis ideas, we must know some common KPI analysis methods to rule out causes of problems from the obvious ones to the hidden ones. For example, we found that the TCP code words were strictly limited at eight sites near a park, and the call drop rate rose suddenly. How to solve this problem? Method one: First, we checked whether the alarms, transmission, and boards of these sites were normal. After they are proved all normal, we sent some engineers to the site to do test. And meanwhile, we traced the RNC signaling at the OMC. It turned out that the test result was normal, and the indicators of these sites of that day did not have any problem and code words were not limited. And later we knew from the news that there was a big gathering of about one million people at the park at that moment. Until then we came to know that the congestion was caused by too many users using the network at the same time. Method two: First, because the eight sites went worse all of a sudden, it was unlikely that the problem lied in the hardware. Then we checked whether the radio parameters had been modified the day before. The result is no worksheet had been issued to modify those parameters, and no alarm was found at those sites. Therefore, we excluded the possibility of hardware problem. Then we checked the traffic trend graph of the last few days (over seven days) and found that the high call drop rate might be caused by high traffic. The graph showed that traffic of each site rose suddenly on the day before. Thus we came to the conclusion that this was an abnormal abrupt event, which may have been caused by a gathering. And later we were told that there was a big gathering at the park. So we were assured the code words limitation and high call drop rate at the eight sites were caused by too many subscribers using the network at the same time. By comparing the two methods above, we can find that although the first one (sending engineers to the site, without the consideration of abnormal events) is commonly used, it is inefficient and costs more resource. The second method (analyzing the problem by the means of exclusion and association) is more efficient. From this case, we would like to emphasize that KPI analysis is a process of problem exclusion. Using the comprehensive
13
methods (like Method One) at the first brush may be making a detour. Exclusion method: Check the alarms on the OMC to learn about the state of the RNC, NodeB, BPC board, and the transmission. If there are obvious broken link in transmission or hardware problem, the cause of the problem is easy to locate. Incident association: If the problem is with a great number of sites, take abrupt incidents into account, such as large-scale gathering, terrible weather of incorrect operation. These incidents will put influence of different levels and ranges on the network indicators. Comparison of radio parameters: If some site goes wrong in a sudden, check whether the radio parameter configuration of this site is consistent with that of other normal sites. If not, change it as that of the normal sites, because the indicator decrease may be caused by an incorrect modification of radio parameters. Relevant indicators association: If a certain indicator is in poor condition, check its relevant indicators and find the common problem from these relevant indicators. Comprehensive problem location: When the above reasons are excluded, use DT data, KPI data, RNC signaling analysis data to locate the problem with indicators comprehensively.
14
Start
RNC index abnormal? Y Analyze and record causes Abrupt and self-curable abnormality? N Equipment alarms exist ? Y Deal with equipment alarms N Y RNC index recovers? N Show TOPN abnormal cells and their locations Transmission, software/hardware version, wireless parameter configuration
CN/RNC
Common problems analysis N Y Problem resolved? N Abnormal indexes analysis in one cell
Successful call N
Call drop
Soft handover
PS rate
If the problem is proved to be equipment bug or system problem after several attempts, feed it back to R&D dept.
15
16
17
1. Check NE alarms
2. Associate emergencies
4. Associate indicators
No No
No
No
Solved or not?
Call-droprelated counters
RTWP
Traffic volume
No
Call-droprelated counters C301230362 C301230363 C301230365 C301230315 C301230316 C301230318 C301230319 C301230322 C301230323
Yes
Solved or not?
Yes
End
18
19
1. Check NE alarms
2. Associate emergencies
4. Associate indicators
No No
No
No
Solved or not
RTWP
Traffic volume
No
Call -drop related counters C301230372 C301230373 C301230375 C301230330 C301230332 C301230333 C301230334 C301230337 C301230338 Yes
Solved or not?
Yes
End
20
21
CONNECTION SETUP COMPLETE, the UE does not receive the SETUP direct transfer message. Or the UE sends RELEASE COMPLETE. Or the UE receives DISCONNECT from CN. RAB assignment failure: The UE does not receive RB SETUP delivered by RNC after sending CALL CONFIRM. Or the UE replies with RB SETUP FAIL after receiving RB SETUP. Or the UE receives DISCONNECT with the cause value not being Normal Release after receiving RB SETUP. At this time, the UE has not reported RB SETUP CMP. Failure after RAB assignment: After the UE sends RB SETUP COMPLETE, the terminating UE receives DISCONNECT/RELEASE from CN.
22
23
1. Check NE alarms
2. Associate emergencies
4. Associate indicators
No No
No
No
Solved or not
RTWP
No
Counters related to RRC setup failures C301480485 C301480486 C301480487 C301480489 C301480490 C301480491 C301481288 C301481289 C301481337 C301481338 C301481339 C301481407 C301481408
Solved or not
Yes
Yes
End
24
4.3.2.2 UE sends RRC Connection Request, but RNC does not receive it
If the Ec/Io of downlink CPICH is relatively low, it is the problem of coverage. If the Ec/Io of downlink CPICH is not very low (for example, the value is larger than -14 dB). Usually, it is the problem of RACH, and the following issues may cause the problem: The power of Preamble does not rise to a required value, and the rising times of Preamble should be increased. The output power of UE is lower than the required value, which is caused by poor UE performance. In this case, the UE should be changed. The NodeB equipment has a standing wave and the engineer should check whether NodeB has any SWR alarm. The radius of the cell is set improperly. If the radius parameter of the cell is set too small, the NodeB can not synchronize the UE beyond the range of the radius, and the access fails. This problem often happens in the places with large coverage, such as the rural areas and the suburbs.
4.3.2.3 RNC delivers RRC Connection Reject after receiving RRC Setup Request.
When RRC Connection Reject appears, the engineer should check the specific reject cause value. Usually, there are two kinds of causes: The CPU load of RNC control plane board is too heavy and more boards should be added. DCH and FACH admission is rejected. However, this situation does not always happen.
25
the PCPICH Ec/Io coverage of the current network. For example, if all the pilot Ec/Io values are larger than -12 dB in the coverage area, the power proportion of the common channel should be configured on the basis of the situation that the Ec/Io value is larger than -12 dB. And so, the success rate of the idle UE assessment can be ensured. As for the access problem caused by cell selection and reselection, the engineer can speed up the cell selection and reselection by adjusting the cell selection and reselection parameters, and the problem of RRC connection setup failure caused by improper cell selection and reselection parameters can be solved.
Note:
The RRC Connection Setup message is borne by FACH. RRC Connection Request sent by the UE is received by UTRAN at the preamble of PRACH, and then it is sent from the RACH channel based on the current preamble power. And the transmit power of preamble can rise all the time until the response is received (There is a limitation for the maximum number of preamble retransmissions). Therefore, in the areas with poor coverage, the RACH coverage and FACH coverage may become unbalanced, and as a result, UTRAN can receive RRC Connection Request sent by the UE but the UE can not receive RRC Connection Setup sent by RNC.
4.3.2.5 UE receives RRC Connection Setup and does not send RRC Setup Complete
If the downlink signal quality is normal, this problem may be caused by the abnormal condition of the cell phone. Another reason of this problem may be the downlink synchronization failure caused by the low initial power of downlink dedicated channel. You can solve this problem by adjusting the service downlink Eb/No.
26
Note:
RRC Connection Setup Complete is sent through uplink DPCH, and the UE calculates the initial power of uplink DPCCH according to the received IEDPCCH_Power_offset and the measured CPICH_RSCP value. DPCCH_Initial_power = DPCCH_Power_offset - CPICH_RSCP DPCCH_Power_offset = Primary CPICH DL TX Power + UL Interference + Constant Value. The Constant Value can be configured in the OMC. If this value is set too small, the UE may not have enough power to send RRC Connection Setup Complete.
27
28
1. Check NE alarms
2. Associate emergencies
4. Associate indicators
No No
No
No
Solved or not
RTWP
No
Yes
Counters related to RAB setup failures CS domain PS domain C301230290 C301230302 C301230291 C301230303 C301230292 C301230304 C301230293 C301230305 C301230294 C301230306 C301230295 C301230307 C301230296 C301230308 C301230297 C301230309 C301230298 C301230310 C301230299 C301230311 C301230300 C301230312 C301230301 C301230313
Solved or not
Yes
End
29
4.3.3.2 RNC Directly Rejecting RAB Setup Request Because Of Wrong Parameter Configuration
The case that RNC responds with RAB Setup Failure directly is seldom caused by invalid parameter configuration in the business network. Usually, this case is caused by special operations of the special users. The main scenario is that the subscription information of the users PS service is beyond the capability of the UE, which leads to the direct refusal from RNC. For example, a special users subscription rates of uplink and downlink are 384 K, but the maximum uplink rate of the UE is only 64 K. The maximum uplink and downlink rates of the QoS message used for activating PDP set by the AT command or mobile terminal software used by the user are 384 K, so the RNC will find the maximum uplink rate is beyond the UEs capability, directly reply with RAB Setup Failure and will not launch the RB setup process, when it receives RAB Assignment Request. After the RAB setup fails because the parameter configuration is beyond the UEs capability, SGSN will negotiate again to launch the new RAB assignment until the UE has the capability to support the assignment, and the RAB assignment is finished. For the users, the PDP activation is still successful, and the actual maximum rate is the maximum rate the UE can support. However, if the minimum guaranteed bit rate required by the QoS setting in the UEs PDP activation request is beyond the UEs capability, though the network negotiates a lower rate to accept the UEs PDP activation request, the UE will launch the request of deactivating PDP when it finds that the rate negotiated by the network in PDP activation accept request is lower than the minimum guaranteed bit rate, and finally the PDP activation can not be completed.
30
The admission control of the NodeB Credit resource is similar to the power admission control. Whether the remaining Credit can support the currently requested service or not can be judged according to the spectrum spreading factor of the new access user. According to the condition of the RAB Downsizing Switch, RNC will deal with the issue in the corresponding way. For the HSDPA user, in the dynamic power allocation mode, besides the mentioned system resources such as the power, channel code, lub transmission resource and CE, the admission reject should take into consideration whether the number of H users supported by NodeB and the number of H users supported by the cell are over the regulated threshold or not into consideration. For the HSDPA user, when the bandwidth configuration on lub interface is insufficient, the admission reject will not happen, but the rate will be reduced. What is more, the AAL2PATHs of HSDPA and R99 are configured respectively, and the HSDPA AAL2PATH must be configured to the HSDPA_RT or HSDPA_NRT type. If the HSDPA AAL2PATH is configured to RT or NRT of R99 AAL2PATH type, the RAB assignment failure will not happen, but RNC will establish the HSDPA service as R99 384 Kbps. For the downlink power admission, Besides whether the R99 service load is over the non-HSDPA service threshold, DCH service should take into consideration whether non-HSDPA power and HSDPA GBP (the minimum power needed for the guaranteed bit rate) are over the general power threshold of the cell. For the HSDPA service, it is necessary to check whether the throughput rate provided by the cell is over the sum of all the users GBR thresholds, or whether the GBPs of the stream service and the background service are over the HSDPA power of the cell. At the same time, whether the non-HSDPA power and the HSDPA GBP (the minimum power needed for the guaranteed bit rate) are over the overall power threshold of the cell should be also taken into consideration. For the lub admission, For the DCH service, the admission is made according to the multiplication of the peak rate and the service activation factor. For HSDPA service, the admission is made according to the GBR. If the lub exceeds the congestion threshold, the DCCC rate reduction will be triggered. And if the RLC_AM retransmission rate is over a certain threshold, the Iub Overbooking switch can be opened to trigger the TF which limits R99 or to reduce the rate of HSDPA service by a certain factor.
31
32
Adjust the engineering parameters for antennas in areas with severe pilot pollution. And adjust the handover parameters, such as the values of 1A, 1B, CIO, TTT (time to trigger), Hysteresis and so on, to solve the problem that the handover is not prompt or there are pingpong handovers. This section tries to solve this kind of problems through OMC data analysis and parameter optimization.
33
34
1. Check NE alarms
2. Associate emergencies
4. Associate indicators
No No
No
No
Solved or not
RTWP
No
Counters related to intra-frequency handover failures C301391101 C301391102 C301391103 C301390586 C301390587 C301390588 C301390589 C301390590 C301390591 C301390592 Yes C301390593 C301390594 C301390595 C301390596 C301390597 C301390598 C301390599 C301390600 C301390601 C301390602
Solved or not
Yes
End
35
36
less than a threshold, it is regarded that there is no pilot signal strong enough to be the best server at this point. The formula is:
Th N .
37
Corner effect: Ec/Io of the source cell decreases drastically, and Ec/Io of the target cell increases sharply (very high when it appears). Fast fading: Ec/Io of the source cell decreases quickly for a while and then increases, and Ec/Io of the target cell increases for a short while.
2.
Pingpong handovers. The following phenomena may accompany this problem. The best server changes quickly: Two or more cells take turns to be the best server. But as the best server, none of the cells can last long though they has good RSCPs and Ec/Ios. There is no best server: There are multiple cells. Their RSCPs are normal and similar to each other. But Ec/Io of every cell is very bad.
From the perspective of the signaling flow, Event 1A is reported immediately after one cell is deleted. Because the UE cannot receive Active Set Update from the RNC, the handover fails.
4.4.1.6 Solutions
Corresponding adjustments should be taken for the confirmed problems. Handover failures caused by pilot pollution: Adjust the engineering parameters of a certain antenna to set this antenna as the best server in this interfered location. If the power of one of its sectors is reduced, then Io of the pilot pollution area will decrease; even if the powers of other pilots are not adjusted, Ec/Io will also increase. Thereby the Ec/Io differences with other scrambling codes in the active set will become larger and pilot pollution will be eliminated. Through a lot of research, ZTE has proved that the reduction in the pilot transmit power will not change cell capacity greatly. If condition allows, new base stations can be added to cover this area. Equipment malfunctions: Consult the customer service engineers, and ask them to help check whether there are alarms and whether the transport layer is abnormal. If there are alarms, coordinate with the customer service engineers and the engineering personnel to solve the problems. Call drops because the handovers are not prompt:
38
Adjust the antenna to expand the handover zone. Configure the Event 1A handover parameters to make the handover easier to happen. Increase CIO to make the handover happen earlier in the target cell. The sum of CIO and the actually measured value is used for judging the UE events, including the UE intra-frequency handover. CIO helps shift the cell border in the handover algorithm. If CIO is configured with a larger value, the handover will be easier to happen and there will be more UE in the soft-handover status, but more resources will be occupied. If CIO is configured with a smaller value, the soft handover will be more difficult to happen and the receiving quality may be impaired. A CIO of about 5dB is quite good for eliminating the fast fading and the corner effect, but this configuration has some side effects, such as the increase of handover proportion.
Call drops caused by pingpong handovers: Adjust the antenna to form a best server in its coverage zone or set the Event 1B handover parameters (increase the threshold of Event 1B, the Event 1B hysteresis or the time to trigger Event 1B) to increase the difficulty in deleting the active set.
The intra-frequency soft/softer handovers can not be performed (intra-frequency hard handover should be triggered) in the following scenarios. When the intra-frequency handover happens, the UE is using the transmit diversity in the active set cell but the target cell does not support the transmit diversity. The intra-frequency measurement report does not contain OFF and TM of the
39
target cell. When the intra-frequency handover happens between RNCs, the Iur interface is unavailable. The UE performs the multiuser detection in the active set cell, but the target cell does not support the multiuser detection. The target cell and the original cell belong to different classifications (The cells of R99, R5+R99, and R6+R5+R99 belong to the same classification while the cells of R5 and R6+R5 belong to another classification).
Inter-frequency hard handovers: The inter-frequency hard handover means a UE connection is handed over from a cell of a UTRAN frequency to a cell of another frequency. Many factors, including the radio quality, the load, and the speed of the moving UE, may trigger inter-frequency hard handovers. For example, the radio quality triggers the inter-frequency hard handover in the following way: When the quality of the UE radio frequency becomes lower, the inter-frequency measurement will be triggered, and according to the measurement result, the UE connection will be handed over to a better frequency.
40
41
1. Check NE alarms
2. Associate emergencies
4. Associate indicators
No No
No
No
Solved or not
Solved or not
Yes
RTWP
No
Yes
Solved or not
Yes
End
42
Solutions: Increase the threshold of activating the compressing mode. The compressing mode is usually activated before the inter-frequency handover or the inter-RAT handover, and it is used to measure the quality of the inter-frequency or inter-RAT cell. You can set a threshold of the CPICH RSCP or Ec/Io to activate the compressing mode. And the RSCP is widely used. Requirements on setting the threshold of activating the compressing mode: Before the cell quality becomes low enough to cause a call drop, the signal of the target cell should be measured and reported, and the handover should be performed. Requirement on setting the threshold of deactivating the compression mode: Frequent activation/deactivation of the compression mode should be avoided.
Increase CIOs of the inter-frequency cell pair. Reduce the threshold of triggering the target frequency handover under the interfrequency coverage.
2.
Solution: Increase the hard handover hysteresis and the time to trigger the event.
43
Table 4-3 Parameters That 2G Shall Provide to 3G MCC 460 MNC 2 LAC 202 ID CI 193 NCC 0 BCC 0 Frequency band indicator 900 BCCH 102
Complete GSM BSC parameter configuration for the WCDMA neighboring cell: The 3G system shall provide the 2G system with the correct radio parameters based on negotiation MCC, MNC, LAC, RNC ID, cell ID (C_ID), downlink frequency, scrambling code, and RAC.
Table 4-4 Parameters That 3G Shall Provide to 2G MCC 460 MNC 3 LAC 20 RNC ID 18 Cell ID C_ID 51 Downlink frequency 10787 Scrambling code 51 RAC 20
According to the current strategy of one-way inter-RAT handovers, if the parameter configuration is complete, one probable cause of the inter-RAT handover failure is that the handover is not prompt. The common parameter adjustment is to increase CIO, the threshold to activate/deactivate the compressing mode, and the threshold to trigger the WCDMA-to-GSM handover at the same time. Call drops during the inter-RAT handovers between WCDMA and GSM may be caused by: Inconsistent data configuration at the GSM side and the WCDMA side after GSM modifies the configuration data but does not inform WCDMA. Missed configuration of neighboring cells, which can be solved by the correct configuration of neighboring cells. Too fast signal changes.
44
Pingpong reselection. Faults with the UE, for example, the UE fails to respond to the handover or report the inter-RAT measurement report. Changes of the best server during the physical channel reallocation. Wrong LAC configuration, which can be located through data configuration check.
45
46
1. Check NE alarms
2. Associate emergencies
4. Associate indicators
No No
No
No
Solved or not
RTWP
No
Counters related to inter-RAT handover failures C301130667 C301130681 C301130668 C301130682 C301130669 C301130690 C301130670 C301130691 C301130671 C301130692 C301130672 C301130693 C301130673 C301130694 C301130674 C301130696 C301130675 C301130697 C301130677 C301130698 C301130678 C301130699 C301130679 C301130700 C301130680
Solved or not
Yes
Yes
End
47
Table 4-5 Resource KPIs and Alarm Thresholds Resource type KPI No. and name C301320150 Number of rejected services, DCH downlink TCP limit TCP PI30167 Average non-HS (SOA) (SOA)DPA TCP PI30092 Maximum Cell TCP (%) PI30093 Average Cell TCP UL Traffic PI30029 Handover Blocking Rate KPI RTWP (in busy hour) C301320153 Number of rejected services, DCH no code Code PI30205 Average Cell HSUPA Users PI30172 Cell Average HSDPA Users PI301830006 Maximum use Ratio of Uplink NodeB CE PI301830010 Maximum use Ratio of Downlink NodeB CE Alarm threshold 50 40% 100% 70% 0.5% -98 dBm 50 12 16 60% 60%
CE
48
49
1. Check NE alarms
2. Associate emergencies
4. Associate indicators
No No
No
No
Solved or not
CE resource indicators No
Yes
End
50
Table 4-6 Adjusting Code Resource Allocation Abbreviation Parameter name DPCH Code Hysteresis Code Update Hysteresis A Range and step 0..512 Current value 16 Update value 28 Remark To decrease the number of rejected services for DCH no code To decrease the number of rejected services for DCH no code
DpchCodeHy
CodeUptHyA
0..512
16
28
51
Table 4-7 Example of Parameter Modification for Rate Downgrade Abbreviation UlDnMaxStg Parameter name Maximum Number of Degraded Uplink Load Steps Every Time Maximum Number of Degraded Downlink Load Steps Every Time Range and step [1, 8] Current value 1 Update value 2 Remark Downgrade from 384 Kbps to 16 Kbps Downgrade from 384 Kbps to 8 Kbps
DlDnMaxStg
[1, 8]
52
Table 4-8 Example of Power Control Parameter Modification Abbreviation ULINITSIR Parameter name Uplink Initial SIR target (dB) Maximum Uplink SIR target (dB) Uplink SIR Target Down Step Size (dB) Range UL 3.4 K/13.6 K signaling, UL 12.2 K AMR UL 3.4 K/13.6 K signaling, UL 12.2 K AMR UL 3.4 K/13.6 K signaling, UL 12.2 K AMR Current value 4-5 Update value 3.5
ULMAXSIR
15
10
UlSirTargDnStep
0.1
0.2
Table 4-9 Example of Power Control Parameter Modification for Heavy-Traffic Cell SRVTYPE 0 104 28 50 54 ULINITSIR 3.5->1.5 4.0->2.0 3.5->1.5 6.0->2.0 6.0->2.0 ULMAXSIR 10.0->3.5 15.0->5.0 10.0->3.5 15.0->5.0 15.0->5.0 ULMINSIR 2.0->1.0 0.5->0 0.5->0 2.0->0 2.0->0
53