Professional Documents
Culture Documents
Known Issues On Cisco 7600 Router ES
Known Issues On Cisco 7600 Router ES
VERSION 2
Introduction
ES+ linecards on Cisco 7600 Series Routers are using highly programmable components. Some
of the issues observed on these cards had a symptom that would normally be interpretted as a
hardware faiure, e.g. double-bit or repeated single-bit parity errors.
1. increase awareness of the fact that the old-time notion of what is a HW and what is a SW
failure may not be applicable any more
2. help Cisco customers and partner evaluate issues observed on ES+ card
This is not an exhaustive list. If your symptom does not match any of the ddtses listed in this
document, please do make an additional search in the Bug Toolkit before opening a TAC Service
Request.
On-board log may show isolated occurrence(s) of Single-bit parity errors. This should not be a
concern becase:
1. isolated single-bit parity errors can be considered soft-parity errors, caused by sources
external to the memory chip
2. ECC logic on ES+ linecards corrects single-bit errors
IMPORTANT NOTE: There were multiple issues related to ECC parity errors on ES+ linecard.
All of the known issues are fixed in latest release, the recommendation for customers who have
ES+ deployment is to upgrade software to 12.2(33)SRE5 or 15.0(1)S5 or future releases.
Customer who have deployed 12.2(33)SRD release and if they cannot upgrade to 12.2(33)SRE5
for some reason then the recommendation is to have them upgrade to latest rebuild -
12.2(33)SRD6.
========================================
CSCsv05515
x40g: Improve the message wordings for recoverable tcam errors
----------------------------------------
If this error message is encountered, please contact Cisco TAC for further support.
========================================
CSCsw31515
ES+: %DEV_SELENE-DFCx-3-SRAM_ECC: Selene SRAM ECC Errors
----------------------------------------
If this error message is encountered, please contact Cisco TAC for further support.
========================================
CSCtb76621
ES+ ROMMON: MPC8548 DDR20 errata fix for Multi-bit ECC errors
----------------------------------------
Symptom:
Conditions:
Workaround:
There is no workaround.
========================================
CSCtb78538
ES+ ROMMON: controller setting changes to prevent Multi-bit ECC errors
----------------------------------------
If this error message is encountered, please contact Cisco TAC for further support.
========================================
CSCtc17311
ES+: TCAM_MGR_HW_ERR: TCAM device had corrupted data errors
----------------------------------------
Symptoms: TCAM device is reporting corrupted data:
Conditions: Observed on ES+ linecards of Cisco 7600 Series Routers, by a background TCAM
consistency checker.
Further Problem Description: These messages can safely be ignored as the entries are already
corrected.
========================================
CSCtd66014
ES+: ECC_DOUBLE: Double-bit ECC error detected on NP - High T, Normal V
----------------------------------------
Symptoms: ES+ line card crashes at powerup of a Cisco 7600 router that is
running Cisco IOS 12.2SRE image if either the Traffic Manager or Frame
memories in the ES+ Network processors report a double bit ECC error. The ES+
line card crashinfo will have the following string:
========================================
CSCtd99244
ES+: ECC_SINGLE or ECC_DOUBLE error detected on NP
----------------------------------------
Symptoms:
7600 series router with ES+ line card crashes reporting single bit or double bit ECC error.
Conditions:
Symptom observed on ES+ linecard of C7600 series routers, usually in the initial phases of line
card
bootup, but this has also been reported after a few hours of traffic through the ES+ line card
ports.
Workaround:
There is no workaround.
========================================
CSCtd99248
ES+: ECC_DOUBLE: Double-bit ECC error detected on NP
----------------------------------------
Symptoms:
7600 series routers with ES+ line cards there could be occasional double bit ECC errors for the
traffic manager and other metadata memories that are reported on the Network processor on the
ES+ line card.
Conditions:
This symptom is observed when the router reloads, OIR of ES+ cards, system environment
temperatures that slowly vary around an ambient temperature of about 30 degreesC. This
happens at system power up. The double bit ECC errors reported after a few hours of traffic.
========================================
CSCte14535
Invalid LinkFPGA or LINKFPGA Bus Error
----------------------------------------
Symptom:
Conditions:
Observed during boot/reload of ES+ line card in Cisco 7600 Series Routers. Rare in normal
working ES+ cards.
Workaround:
This fix is an enhancement which adds an additional recovery cycle for reading the LinkFPGA.
========================================
CSCtg31984
DBUS-HDR error in ES/ES+ Modules
----------------------------------------
Symptom:
7600 with ES/ES+ module may report error EARL_L2_ASIC-DFC2-4-DBUS_HDR_ERR on
after boot up. There is no function impact to the switch due to this error.
Conditions:
7600 with ES/ES+ modules present. The problem can happen up to a few hours
after boot up.
Workaround:
No workaround. Problem has been resolved in 12.2(33)SRD5 and 12.2(33)SRE2.
========================================
CSCth11714
ES+ ECC_DOUBLE: Double-bit ECC error or reset due to eznp_ecc_err_isr
----------------------------------------
Symptom:
7600 Series router with ES+ line card crashes reporting error:
Conditions:
Workaround:
None.
========================================
CSCth15790
Low-queue ES+: ECC_DOUBLE: Double-bit ECC error detected on NP, Mem 16
----------------------------------------
Symptoms:
Conditions:
Symptom observed on Low-queue ES+ line cards (ES+T) of C7600 series routers, in NP Mem
16.
Workaround:
There is no workaround.
========================================
CSCth20868
Link FPGA Update Failures with Different signatures
----------------------------------------
Symptom:
ES+ card crashes with different failure messages during production. In Most of the cases the
initial message for reload will be FPD upgrade failure for multiple attempts.
The crash messages in this case will be different at different bootup attempts. These messages
can be System Exception, FPD upgrade failure, IOFPGA bus error. Message Examples are
Conditions:
Workaround:
None.
========================================
CSCth25959
ENV-4-MINORTEMPALARM - updating the new temperature thresholds for ES+
----------------------------------------
Symptom:
Temperature alarm (ENV-4-MINORTEMPALARM) is reported, with AMBER LED on the line
card faceplate.
Conditions:
7600 series router with any model of the ES+ line card.
Workaround:
No workaround.
--------------------------------------------
Sensor Minor Major
ID Threshold Threshold
--------------------------------------------
BB Outlet 0 65 80
BB Outlet 1 70 85
--------------------------------------------
========================================
CSCti80887
Temperature 128 degC reported when sensor is Not_Operational
----------------------------------------
Symptom:
Faceplate LED on the linecard is red. Temperature sensor is reporting 128 degC.
In addition, following I2C error may be reported by the linecard, confirming that the temperature
sensor can not be read:
I2C Read Error READ bus=0x1 addr=0x4D port_sel=0x0 flags = 0x0 cmd=0x0 size=2
Conditions:
Workaround:
None.
This SW fix is correcting the reporting of an invalid sensor. Under same circumstances, 'NO'
(Not Operational) will be reported instead of 128 degC.
========================================
CSCtn41667
IOS fix for handling the Power calcuation issues with ES+ Combo cards
----------------------------------------
Symptom:
Following ES+ PIDS consume more power than the expected values.
76-ES+XC-20G3C
76-ES+XC-20G3CXL
76-ES+XC-40G3C
76-ES+XC-40G3CXL
This might lead to situation of other modules getting powered down due to "power deny" .
Conditions:
Specific to ES+XC variants (Combo cards) of Cisco 7600 Series Routers.
Workaround:
Configure power redundancy-mode combined until the IOS is upgraded to a release with
correct power settings.
========================================
CSCtn68668
Fix LC inlet temp issue (ES+XC) and Alarm handling issues (All ES+)
----------------------------------------
Symptoms: The following symptoms are observed:
----------------------------------------------------------
Temperature and Threshold Table
----------------------------------------------------------
Sensor Minor Major Current
ID Threshold Threshold Temperature
----------------------------------------------------------
BB Outlet 0 60 75 47
BB Inlet 0 50 65 27
BB Outlet 1 75 85 54
BB Inlet 1 50 65 32
PE Outlet 60 75 53
PE Inlet 50 65 34
LC Outlet 60 75 49
LC Inlet 50 65 50 <<<<<<<<
Conditions: This issue is specific to the following Cisco 7600 ES+ combo
cards:
76-ES+XC-20G3C
76-ES+XC-20G3CXL
76-ES+XC-40G3C
76-ES+XC-40G3CXL
Conditions: Observed on ES+ linecards of C7600 Series Routers when heavy configuration
changes are applied to the linecard. In addition, there are other unknown race conditions that can
cause this. This bug-fix is specific to Double-bit errors on Mem 17.
========================================
CSCto55567
ES+: FABRICCRCERRS after SSO due to Metropolis lockup
----------------------------------------
Symptoms: line card reports fabric errors:
Conditions: Symptom is observed on ES+ line cards of C7600 Series Routers after SSO with
multicast traffic flowing through the line card.
Workaround: Soft reload the line card using the hw-module module module reset exec
command.
========================================
CSCtq07626
ES+: DEV_SELENE XAUI_LEN, FIFO_FULL, XAUI_GNT and XAUI_MIN errors
----------------------------------------
Symptom:
Errors detected by selene ASIC:
%DEV_SELENE-DFC1-3-XAUI_LEN
%DEV_SELENE-DFC1-3-FIFO_FULL
%DEV_SELENE-DFC1-3-XAUI_GNT
%DEV_SELENE-DFC1-3-XAUI_MIN
Conditions:
Observed on ES+ linecards of Cisco 7600 Series Routers.
Workaround:
None.
========================================
CSCtr37182
ES+: single occurrence of DEV_SELENE XAUI_CODE error
----------------------------------------
Symptoms: Single occurrence of XAUI_CODE and XAUI_RX_RDY message in the syslog:
Conditions: This symptom is observed on ES+ linecards of Cisco 7600 series router.
Further Problem Description: Single occurrence of this error can safely be ignored.
========================================
CSCtr74529
ES+: LONGBUSYREAD: C2W Interface busy for long time reading temp sensor
----------------------------------------
Symptoms:
%ENVM-4-LONGBUSYREAD: C2W Interface busy for long time reading temperature sensor
========================================
CSCtr74953
ES+: Watchdog resets fail to write crashinfo, causing Keep Alive failure
----------------------------------------
Symptom:
%OIR-SP-3-PWRCYCLE: Card in module 1, is being power-cycled off (Module not responding
to Keep Alive polling)
%C7600_PWR-SP-4-DISABLED: power to module in slot 1 set off (Module not responding to
Keep Alive polling)
Conditions:
Observed on ES+ linecards of Cisco 7600 Series Routers. This bug is specific to a condition
where no other explanations exist for the failure of Keep Alive polling.
Workaround:
There is no workaround.
========================================
CSCts25729
ES+: PCI read hang causes Keep Alive failure, fails to write crashinfo
----------------------------------------
Symptom:
Conditions:
Observed on ES+ linecards of Cisco 7600 Series Routers. This bug is specific to a condition
where no other explanations exist for the failure of Keep Alive polling.
Workaround:
There is no workaround.
Traffic will not pass with greater than 7091 byte packet size.
Conditions:
When MTU is set greater than 7091, sending packet size with > 7092 bytes may hit the issue.
There is no specific trigger for this. But when issue is hit , ifdma_status register last byte reads
"C0".
Workaround:
========================================
CSCsy88170
----------------------------------------
Symptom:
Conditions:
Observed on the console or syslog of ES+ linecards of Cisco 7600 Series Routers.
Workaround:
None.
Issue is cosmetic. Some registers are not meant to be read by the firmware on the chip. When the
chip tries to read these registers, it prints the error.
========================================
CSCsz04660
----------------------------------------
Symptom:
On bootup or normal operations, a few ES+ cards might show the following traceback.
Conditions:
Workaround:
None
Further Problem Description:
This message indicates that the TCAM consistency checker has detected a few TCAM entries
that were not in the initialized states. The TCAM consistency checker has already corrected these
TCAM entries.