Hitachi Solutions

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

SRDB ID Synopsis Date

43621 SE99x0: Multiple Disks Failing in same HDU; Same disk slot fails consistently 26 Apr 2002

Status Issued

Description

On the SE 9960 and 9910, sometimes a [SIM] Service Information Message will be generated for multiple
failing disks. Several SIMs may be generated on different disks, which can make it difficult to diagnose exactly
which piece of hardware is failing.

Another symptom that has been seen is a single HDUdisk slot consistently fails

A third symptom that has been seen is disks which fail sporadically but then come back online. In this situation a
disk will fail, generating a SIM if Hitrack is set up, but will remedy itself. Then a separate disk will fail and
come back online. This will continue until the bad piece of hardware has been replaced.

What these three situations have in common, which you need to look for, is that all the disks which are failing
are in the same HDU. The SIM will show the failing disk location and the [DKA] Disk Adapter connected to the
HDU in which the disk resides. There is also a third component which doesn't show up in every SIM. This
component is the FSW boards which are located in the HDU to the right side of the unit. The following SIMs
illustrate the situation where multiple disks are failing in the same HDU. Notice in each SIM the HDD
referenced is a different number, which means that Hitrack is reporting a different disk failing for various SIMs.
The DKA is the same for all the SIMs and it would be a common reaction to replace this part since it's the first
component listed under the Action Codes. Since all the disks are in the same HDU you would want to replace a
hardware component which controls those disks specifically. This would be the FSW board!
HI-TRACK SIMs:

CASE_START
CASE_SUMMARY: Hi-Track 9900 SIM
CASE_DESCRIPTION:
<ht_mail_id_2002031808383060.16000> Error information follows:

_System Type: 9960


_Site ID: X100737
_System S/N: 31518
_Microcode: DKCMAIN=01-16-40-00/00

The following SIMs have been transferred from this site by Hi-Track:

------- SIM 01 Follows: ----------------------------------------------------


Severity: Service SIM Time: Mar 17, 2002 23:37:10
SIM Type: Device HDD S/N: 30306173 Type: DKR2D-J72

Reference Code: DF7F02


Type: DRIVE ERROR(NORMAL R/W)
Description: DRIVE PORT TEMPORARY ERROR(PATH 1)

Remarks: HDD: L172

Action Codes:
Code: 80000000
Location:
Function: SEE MANUAL
Additional: TROUBLESHOOT SECTION

Code: 1010B000
Location: DKA-2L
Function: DKA PCB
Additional: Option

Code: 10668720
(Not in Dictionary)

Code: 10C28700
Location: FSW-L17L
Function: FSW PCB
Additional: FSW PCB for HDU-L17

------- SIM 01 Follows: ----------------------------------------------------


Severity: Moderate SIM Time: Mar 17, 2002 23:32:08
SIM Type: Device HDD S/N: 302X8251 Type: DKR2D-J72

Reference Code: DF9F01


Type: DRIVE ERROR(NORMAL R/W)
Description: DRIVE PORT BLOCKADE(PATH 1)

Remarks: HDD: L171

Action Codes:
Code: 80000000
Location:
Function: SEE MANUAL
Additional: TROUBLESHOOT SECTION

Code: 1010B000
Location: DKA-2L
Function: DKA PCB
Additional: Option

Code: 10668710
(Not in Dictionary)

Code: 10C28700
Location: FSW-L17L
Function: FSW PCB
Additional: FSW PCB for HDU-L17

------- SIM 01 Follows: ----------------------------------------------------


Severity: Moderate SIM Time: Mar 17, 2002 23:40:03
SIM Type: Device HDD S/N: 30306173 Type: DKR2D-J72

Reference Code: DF9F02


Type: DRIVE ERROR(NORMAL R/W)
Description: DRIVE PORT BLOCKADE(PATH 1)

Remarks: HDD: L172

Action Codes:
Code: 80000000
Location:
Function: SEE MANUAL
Additional: TROUBLESHOOT SECTION

Code: 1010B000
Location: DKA-2L
Function: DKA PCB
Additional: Option

Code: 10668720
(Not in Dictionary)

Code: 10C28700
Location: FSW-L17L
Function: FSW PCB
Additional: FSW PCB for HDU-L17

------- SIM 01 Follows: ----------------------------------------------------


Severity: Service SIM Time: Mar 17, 2002 23:42:35
SIM Type: Device HDD S/N: 302Z3752 Type: DKR2D-J72

Reference Code: DF7F03


Type: DRIVE ERROR(NORMAL R/W)
Description: DRIVE PORT TEMPORARY ERROR(PATH 1)

Remarks: HDD: L173

Action Codes:
Code: 80000000
Location:
Function: SEE MANUAL
Additional: TROUBLESHOOT SECTION

Code: 1010B000
Location: DKA-2L
Function: DKA PCB
Additional: Option

Code: 10668730
(Not in Dictionary)

Code: 10C28700
Location: FSW-L17L
Function: FSW PCB
Additional: FSW PCB for HDU-L17

------- SIM 01 Follows: ----------------------------------------------------


Severity: Moderate SIM Time: Mar 17, 2002 23:52:05
SIM Type: Device HDD S/N: 302S4630 Type: DKR2D-J72

Reference Code: DF9F04


Type: DRIVE ERROR(NORMAL R/W)
Description: DRIVE PORT BLOCKADE(PATH 1)

Remarks: HDD: L174

Action Codes:
Code: 80000000
Location:
Function: SEE MANUAL
Additional: TROUBLESHOOT SECTION

Code: 1010B000
Location: DKA-2L
Function: DKA PCB
Additional: Option

Code: 10668740
(Not in Dictionary)

Code: 10C28700
Location: FSW-L17L
Function: FSW PCB
Additional: FSW PCB for HDU-L17

------- SIM 01 Follows: ----------------------------------------------------


Severity: Service SIM Time: Mar 18, 2002 01:19:44
SIM Type: Device HDD S/N: 302S4630 Type: DKR2D-J72
Reference Code: 461F04
Type: DRIVE ERROR(NORMAL R/W)
Description: DYNAMIC SPARING(DRIVE COPY)START

Remarks: HDD: L174

Action Codes:
Code: F0000000
Location:
Function: NO ACTION
Additional: -

------- SIM 01 Follows: ----------------------------------------------------


Severity: Service SIM Time: Mar 18, 2002 02:40:02
SIM Type: Device HDD S/N: 302S4630 Type: DKR2D-J72

Reference Code: EF2F04


Type: DRIVE ERROR(NORMAL R/W)
Description: DRIVE BLOCKADE(EFFECT OF DRIVE COPY NORMAL END)

Remarks: HDD: L174

Action Codes:
Code: 10668740
(Not in Dictionary)

SIM Bytes:
Byte: 00 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15
Data: 00 00 00 00 00 00 CF D2 11 00 00 80 01 04 05 0C

Byte: 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
Data: 22 00 7B 1E 00 00 EF 2F 01 00 00 00 FE 00 00 00

----------------------------------------------------------------------------

------- SIM 02 Follows: ----------------------------------------------------


Severity: Service SIM Time: Mar 18, 2002 02:40:03
SIM Type: Device HDD S/N: 302S4630 Type: DKR2D-J72

Reference Code: 462F04


Type: DRIVE ERROR(NORMAL R/W)
Description: DYNAMIC SPARING(DRIVE COPY)NORMAL END

Remarks: HDD: L174

Action Codes:
Code: F0000000
Location:
Function: NO ACTION
Additional: -

------- SIM 01 Follows: ----------------------------------------------------


Severity: Service SIM Time: Mar 18, 2002 03:37:53
SIM Type: Device HDD S/N: 302X8251 Type: DKR2D-J72

Reference Code: DF7F01


Type: DRIVE ERROR(NORMAL R/W)
Description: DRIVE PORT TEMPORARY ERROR(PATH 1)

Remarks: HDD: L171

Action Codes:
Code: 80000000
Location:
Function: SEE MANUAL
Additional: TROUBLESHOOT SECTION

Code: 1010B000
Location: DKA-2L
Function: DKA PCB
Additional: Option

Code: 10668710
(Not in Dictionary)

Code: 10C28700
Location: FSW-L17L
Function: FSW PCB
Additional: FSW PCB for HDU-L17

SOLUTION SUMMARY:

What these three situations have in common, which you need to look for, is that all the disks which are failing
are in the same HDU. The SIM will show the failing disk location and the DKA connected to the HDU in which
the failing disk[s] reside. There is also a third component which doesn't show up in every SIM. This component
is the FSW board. The FSW board is like a controller for an individual HDU. There are 2 FSW boards located in
the HDU to the right of the disk slots. The addressing for the FSW board FSW-xyzL or FSW-xyzR. The L and
R stands for whether the FSW is the left board or the right board in the HDU. The x will either be an L or an
R, which locates the frame to the right or left side of the DKU. The y is the frame number. For example L1 is
the frame directly to the left of the DKU. L2 would be the second frame to the left of the DKU. The z is the
HDU number. For HDU 7 in the frame the z will be a 7.

In this situation the replacement of the FSW board remedied the problem. If the disks that are reporting are in
different HDUs then this SRDB doesn't apply, since we are targeting a hardware replacement to remedy disks
failing in the same HDU.

INTERNAL SUMMARY:

glenn.thoren@sun.com
storage TSE
http://storage.east/hitachi

SUBMITTER: Glenn Thoren APPLIES TO: AFO Vertical Team Docs/Storage ATTACHMENTS:

Copyright (c) 1997-2003 Sun Microsystems, Inc.

You might also like