Professional Documents
Culture Documents
3 HSS9860 Maintenance and Troubleshooting (LTE)
3 HSS9860 Maintenance and Troubleshooting (LTE)
and Troubleshooting
(LTE)
www.huawei.com
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page1
Objectives
Upon completion of this course, you will be able to:
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page2
Contents
1. HSS9860 Maintenance and Troubleshooting Flow
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page3
Troubleshooting Flow
The troubleshooting flow is:
Information Collection
Fault Classification
Fault Location
Fault Removal
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page4
Information Fault Fault Fault
Collection Classification Location Removal
Troubleshooting Flow(Cont.)
Information to be collected Information collection means
Specific fault symptoms (such as
Fault report from the subscribers or
subscriber perception and system
prompts) customer center
Time, place, and frequency of the fault Fault report from the maintenance
Scope and impact of the fault personnel in the neighboring office
Equipment running status before the Alarm report from the alarm system
fault occurs
Abnormalities found in daily
Operations performed on the
equipment before the fault occurs and maintenance or inspection
the results of the operations
Measures that are taken after the fault
occurs and the results after the
measures are taken
Equipment alarms when the fault
occurs and the relevant or associated
alarms
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page5
Information Fault Fault Fault
Collection Classification Location Removal
Troubleshooting Flow(Cont.)
Determining the fault scope Determining the fault type
To correctly determine the Service fault - It is the fault
fault scope is to determine showing that the service is
the troubleshooting direction. directly affected, for
It is the most important example, failure to access
element for quickly removing the network.
the fault. Non-service fault - It is the
In terms of fault symptoms fault showing that the
and impacts, faults can be service is indirectly affected,
classified into two types: for example, the disk array
service faults and non- fault and OMU cluster fault.
service faults. Each type of
faults can be further
classified.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page6
Information Fault Fault Fault
Collection Classification Location Removal
Troubleshooting Flow(Cont.)
Original Information Analysis Check of Logs & Other NEs Alarm Analysis
After the fault occurs, collect Check the operations before the fault
Try to remove the fault based
the related information (such occurs. Ask the on-site person and
query the operation logs. Determine on the suggestions in the alarm
as fault time, symptom, and whether the fault is caused by incorrect information.
place) through various means. operations. When multiple alarms are
Determine the scope and type Know about the running state (such as reported on the local alarm
of the fault. version upgrade and failure) of the console, you can firstly check
Analyze the information such neighboring NEs before the fault occurs. the high-level alarms based on
Check whether there is some special the alarm level. The event
as complaining subscriber event (such as important holiday) that
numbers, IP addresses. alarms can be handled finally.
affects the network.
Performance Measurement
Information Analysis Interface Tracing Analysis Data Configuration Check
Check measurement success rate. Perform message tracing on the Service configuration: including
Compare the rate with that in the services of the complaining EPS service registration
same time segment of the recent subscriber.
several days. Check whether there is Signaling configuration: including
Analyze the message streams, Diameter link configuration
obvious fluctuation and analyze the
trace the abnormal interruption DS configuration: including the
failure cause (through comparison of
all modules). points in the message streams, verification switch,
Based on the failure cause, check and compare the abnormal active/standby work mode, and
whether the local data configuration message stream with the IP addresses
is modified recently, and whether the message streams in normal PGW configuration: including
IP addresses of the neighboring NEs cases. Check the data redundancy in service layer,
are modified recently. configuration of the subscriber parallel/serial mode, and buffer
The performance measurement is and determine the fault cause. length.
suitable for locating the service faults
and signaling faults.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page7
Information Fault Fault Fault
Collection Classification Location Removal
Troubleshooting Flow(Cont.)
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page8
Contents
2. HSS9860 Fault Information Collection Methods
2.1 Collecting the Alarm Information
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page9
Querying Alarm Logs
Querying alarm logs helps to identify the cause of a fault and for rectifying the fault.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page10
Querying Alarm Logs(Cont.)
You can also query alarm logs by running LST ALMLOG in the
MML Command - CGP window.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page11
Saving Alarms
Save the alarm information to the specified files for query.
Right-click in the alarm display
pane, A shortcut menu is
displayed.
Click Save.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page12
Contents
2. HSS9860 Fault Information Collection Methods
2.1 Collecting the Alarm Information
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page13
Collecting the Operating System Log
Information
The operating system logs are stored in the files named in the format of
messages*, boot.*, and mail.*. The log files are located in the /var/log directory on
each board.
You can download the operating system log files from the OMU server through the
FTPS function of the OMU client.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page14
Collecting the Operating System Log
Information(Cont.)
You can log in to the board through KVM over IP to view the logs.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page15
Contents
2. HSS9860 Fault Information Collection Methods
2.1 Collecting the Alarm Information
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page16
Collecting the Board Hardware
Information
To collect the hardware information of the boards, open the MML
Command - CGP window and run DSP BRD commands.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page17
Contents
2. HSS9860 Fault Information Collection Methods
2.1 Collecting the Alarm Information
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page18
Collecting the Performance
Measurement Information
Performance measurement collects the running information
of the system in real time. The performance measurement
information reflects the running status of the system. It can
be used for fault identification when the system experiences
a fault.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page19
Exporting Measurement Results on a
Real-Time Basis
You can set export conditions to export measurement results in real time and then
save the measurement results to a local terminal.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page20
Exporting Measurement Results on a
Scheduled Basis
You can set exporting conditions to export measurement results according to the
schedule and then save the measurement results to a local terminal.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page21
Contents
2. HSS9860 Fault Information Collection Methods
2.1 Collecting the Alarm Information
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page22
Collecting the Message Tracing
Information
Message tracing provides dynamic and
real-time monitoring on the call
connection process, resource usage, and
service flow over ports and signaling
links. The traced messages can be
saved for future view.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page23
Contents
2. HSS9860 Fault Information Collection Methods
2.1 Collecting the Alarm Information
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page24
Using the NIC Tool
You can use the Network Information
Collection (NIC) tool to collect required
information.
To conduct the health check by using
network management system, In this
case, the HSS9860 can be deployed as
an SAE-HSS, a GU-HLR, or an
HSS9860. Therefore, install the
adaptation package for related NE
before the health check.
To conduct the health check by using
the VTS tool, In this case, the HSS9860
can be deployed only as the HSS9860.
Therefore, install the HSS9860
adaptation package before the health
check.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page25
Contents
3. HSS9860 Common LTE Fault Troubleshooting and
Troubleshooting Cases
3.1 HSS9860 Common LTE Fault Troubleshooting
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page26
Attach Procedure
Function of attach procedure:
The UE register to the EPS network.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page27
Attach Procedure(Cont.)
2. Identification req/rsp
3. Identity req/rsp
massages involved
4. Security function with HSS in the
EPS Attach
5. Update location request
procedure
6. Cancel location /Ack
7. Update location Ack
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page28
Attach Procedure(Cont.)
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page29
TAU Procedure
S-GW2
S-GW1
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page30
Inter TAU with SGW change
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page31
Inter TAU with SGW change(Cont.)
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page32
Inter TAU without SGW change
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page33
Inter TAU without SGW change(Cont.)
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page34
EPS Attach/TAU Fault
(unknownEpsSubscriber)
Symptom:
In the EPS attach/TAU procedure, HSS sends the message ULA which
contains the failure cause unknownEpsSubscriber to MME.
Fault Analysis:
The subscriber is not EPS subscriber.
The subscriber has LOCK service.
Possible The subscriber doesn’t register EPSAPN.
Causes
The subscriber registers the ODB BAPOS service, and the
value of the parameter ODBPOS_REJ_ULR is REJECT in
MAPSERV
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page35
EPS Attach/TAU Fault
(unknownEpsSubscriber)
Symptom Procedure Fault Analysis Fault Diagnosis
The subscriber is not EPS
subscriber.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page36
EPS Attach/TAU Fault
(roamingNotAllowed)
Symptom:
In the EPS attach/TAU procedure, HSS sends the message ULA which
contains the failure cause roamingNotAllowed to MME.
Fault Analysis:
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page37
EPS Location Update Fault
(roamingNotAllowed)
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page38
Contents
3. HSS9860 Common LTE Fault Troubleshooting and
Troubleshooting Cases
3.1 HSS9860 Common LTE Fault Troubleshooting
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page39
Attach failure Case 1:
Problem Description:
The Diameter links between HSS and MME are all normal.
In the Diameter link tracing, we can find HSS return error code
3002 to MME.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page40
Attach failure Case 1:
Possible reasons:
The subscription data in HSS is incorrect
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page41
Attach failure Case 1:
Handling Process:
Check the Diameter link status, normal.
Check the Diameter link message, we can find the error code in
the AIA message to MME: DIAMETER_UNABLE_TO_DELIVER
(3002).
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page42
Attach failure Case 1:
Handling Process:
According to RFC3588:
DIAMETER_UNABLE_TO_DELIVER 3002
This error is given when Diameter can not deliver the message to
the destination, either because no host within the realm supporting
the required application was available to process the request, or
because Destination-Host AVP was given without the associated
Destination-Realm AVP.
That means the HSS cannot process the message from MME
because the Destination-Host in the message is inconsistent
with the HSS side.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page43
Attach failure Case 1:
Handling Process:
Check the Destination-Host in the MME request message:
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page44
Attach failure Case 1:
Solution:
Modify the Host name in the HSS and the attach procedure
is successful.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page45
Attach failure Case 2:
Problem Description:
In the new LTE network, we define some test subscribers.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page46
Attach failure Case 2:
Handling Process:
The problem happened after we modify the Maximum
bandwidth in the new EPSQOSTPL, so the Maximum bandwidth
may be the cause.
But in the message between USN and UGW, after the UGW
received SM_MM_CTRL_CREATE_DEFAULT_BEARER_REQ,
UGW rejected to create the bearer with the error code: no-
resource-available .
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page47
Attach failure Case 2:
Handling Process:
There are two possible reasons if UGW rejected to create the
bearer with the error code: no-resource-available:
No available IP address;
No available bandwidth.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page48
Attach failure Case 2:
Solution:
Run the MOD EPSQOSTPL and modify the Maximum
bandwidth to 200,000,000.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page49
Attach failure Case 3:
Problem Description:
During the test of the new
HSS in one LTE network,
the subscribers attach
failed.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page50
Attach failure Case 3:
Handling Process:
Run LST DYNSUBDATA to check the dynamic information of
the subscriber, no subscriber data found;
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page51
Attach failure Case 3:
Solution:
Run MOD OPTGPRS to change the optimized GPRS/EPS
data, and set APN type to EPS_APN, then the subscriber
attach is normal.
MOD OPTGPRS: IMSI="460018888888888",
PROV=ADDPDPCNTX, APN_TYPE=EPS_APN, APNTPLID=1,
DEFAULTCFGFLAG=TRUE, EPS_QOSTPLID=1,
PDPTYPE=IPV4, ADDIND=DYNAMIC, VPLMN=TRUE,
CHARGE=NORMAL;
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page52
Case 4:No IDR or DSR when Modify
PLMNRSZI
Problem Description:
During the test of the new HSS in one LTE network, when we
run MOD PLMNRSZI, HSS does not send DSR or IDR to the
MME.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page53
Case 4:No IDR or DSR when Modify
PLMNRSZI
Handling Process:
Check the link status between HSS and MME, it’s normal.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page54
Case 4:No IDR or DSR when Modify
PLMNRSZI
Handling Process:
Run MOD PLMNRSZI:IMSI=“XX”,PROV=FALSE; to delete
the PLMN Roaming service, HSS does not send DSR to
MME, and Run LST PLMNRSZI to check, the service is
deleted in HSS.
Run MOD PLMNSRI in HSS again, now HSS can send DSR
or IDR to the MME normally.
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page56
Summary
This course introduces:
Copyright © 2012 Huawei Technologies Co., Ltd. All rights reserved. Page57
Thank you
www.huawei.com