MDS Slow Drain Device Training: © 2009 Cisco Systems, Inc. All Rights Reserved. Cisco Confidential
Today almost all data traffic between end devices is carried by FC Class 3
service, which uses link-level, per-hop, buffer-to-buffer (B2B) flow control.
There is no end-to-end flow control with Class 3 service, so in the presence of
slow devices that do not accept frames at the rate the originating device sends
them, FC networks are prone to ISL credit starvation and the links may choke.
Frames destined to slow devices have to be carefully isolated in separate
queues and switched over to egress queues without congesting the backplane.
Eventually a decision has to be made on whether the frames are considered stuck
and when to drop them.
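To make the credit starvation described above concrete, here is a minimal sketch of two hops with buffer-to-buffer credits: an ISL between two switches and an edge link to a slow device. The `Link` class, the `simulate` function, the credit counts, and the tick-based timing are all hypothetical and chosen only for illustration; they do not model real MDS hardware.

```python
from collections import deque


class Link:
    """One hop with buffer-to-buffer flow control (hypothetical model)."""

    def __init__(self, credits):
        self.credits = credits      # B2B credits the sender currently holds
        self.rx_buffers = deque()   # frames held in the receiver's buffers

    def send(self, frame):
        """Send one frame if a credit is available; otherwise hold it."""
        if self.credits == 0:
            return False
        self.credits -= 1
        self.rx_buffers.append(frame)
        return True

    def drain(self):
        """Receiver frees one buffer and returns one credit to the sender."""
        if not self.rx_buffers:
            return None
        self.credits += 1
        return self.rx_buffers.popleft()


def simulate(ticks, isl_credits=4, edge_credits=2, drain_every=5):
    """Return (remaining ISL credits, number of ticks the sender was blocked)."""
    isl = Link(isl_credits)     # upstream switch -> downstream switch
    edge = Link(edge_credits)   # downstream switch -> end device
    blocked = 0
    for t in range(ticks):
        # The end device frees a buffer only every `drain_every` ticks.
        if t % drain_every == 0:
            edge.drain()
        # The downstream switch forwards one frame if the edge link has credit.
        if isl.rx_buffers and edge.credits > 0:
            edge.send(isl.drain())
        # The upstream switch tries to send one frame per tick over the ISL.
        if not isl.send(f"frame-{t}"):
            blocked += 1
    return isl.credits, blocked


if __name__ == "__main__":
    print("slow device:", simulate(50, drain_every=5))
    print("fast device:", simulate(50, drain_every=1))
```

With a slow device the ISL runs out of credits and the upstream sender is blocked on most ticks, even though the ISL itself is healthy; with a device that drains every tick the sender is never blocked.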
[Figure: a Server connected through an FC Switch and an ISL to a Disk (the slow drain device). Numbered steps 1 to 6 show frames moving between transmit and receive buffers and B2B credits being returned. On the slow-drain side the buffers are full and no B2B credit is returned.]
MDS Solution to ISL Blockage Problem
[Figure: sources S1, S2, S3 and destinations D1, D2, D3 share an ISL, carrying flows S1-D1, S2-D1, S3-D2, and S3-D3. One destination is a slow drain device that returns no B2B credit, so buffers fill on both sides of the ISL. Successive panels show: no B2B credit; frames held up in the buffers; frames dropped after "Twait" expires.]
Gen-2:
module-x# debug fc-mac port <y> reg-wr 0x566 <to-val-1>
module-x# debug fc-mac port <y> reg-wr 0x567 <to-val-1>
Gen-3:
module-x# debug fc-mac port <y> reg-wr 0x3cb <to-val-1>
[Figure: Sys Log/Trap (Switch/Interface) timeline starting at 0 sec. A syslog/trap is raised when credits are unavailable for a threshold share of the interval, e.g. 10% (100 ms).]
[Figure: No-Credit Timeout timeline starting at 0 sec, e.g. No-Credit Timeout = 300 ms. After 300 ms with no credit, start dropping the frames at the line rate until credits are received. Frees up credits.]
No-Credit Timeout range: 100 ms to 1000 ms, in 100 ms increments.
* No-Credit Preemptive Drops is disabled by default.
The timeout value and the decision to enable it are based on topology and
customer requirements.
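The no-credit timeout rule above can be sketched as a small decision function. The function names and the use of `None` to represent "disabled" are hypothetical; the only values taken from the slide are the 100 ms to 1000 ms range, the 100 ms step, and the disabled-by-default behavior. Whether the comparison at exactly the timeout is inclusive is an assumption.

```python
def validate_no_credit_timeout(timeout_ms):
    """Accept only 100..1000 ms in 100 ms increments, as stated on the slide."""
    if timeout_ms < 100 or timeout_ms > 1000 or timeout_ms % 100 != 0:
        raise ValueError("timeout must be 100-1000 ms in 100 ms increments")
    return timeout_ms


def should_drop(zero_credit_ms, timeout_ms=None):
    """Decide whether frames queued to a port should be dropped.

    zero_credit_ms: how long the port has continuously had zero Tx credits.
    timeout_ms: configured no-credit timeout, or None when the feature is
    disabled (the default, per the slide).
    """
    if timeout_ms is None:
        return False
    # Assumption: dropping starts once the timeout has fully elapsed.
    return zero_credit_ms >= validate_no_credit_timeout(timeout_ms)


if __name__ == "__main__":
    print(should_drop(299, 300))   # below the timeout
    print(should_drop(300, 300))   # timeout reached
    print(should_drop(5000))       # feature disabled
```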
[Figure: link reset (LR) timeline. Labels: 0 sec, No Credits (Dead), 1 sec, LR, No Response, shut / no shut.]
The interface should run out of credits for 1 s (F) or 1.5 s (E) continuously
for LR to kick in.
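As a rough illustration of the LR condition above, the check below compares the continuous zero-credit time against the per-port-type threshold. The dictionary and function names are hypothetical; only the 1 s (F) and 1.5 s (E) values come from the slide, and treating the threshold as inclusive is an assumption.

```python
# Continuous zero-credit time before a link reset (LR), from the slide.
LR_THRESHOLD_MS = {"F": 1000, "E": 1500}


def link_reset_due(port_type, zero_credit_ms):
    """Return True when the port has been out of credits long enough for LR.

    port_type: "F" or "E".
    zero_credit_ms: continuous time with zero credits; any credit received
    in between is assumed to reset this counter to 0.
    """
    return zero_credit_ms >= LR_THRESHOLD_MS[port_type]


if __name__ == "__main__":
    print(link_reset_due("F", 1000))
    print(link_reset_due("E", 1000))
```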
[Figure: Error Disable timeline with Sys Log/Trap (Switch/Interface). Labels: 0 sec, credits, No Credits (Dead), 1 sec, LR, * Error Disable.]
How to check for issues queuing frames to a port on the ingress port
switch# attach module <module_number>
module-x# show hardware internal up-xbar 0 queued-packet-info
...
+----------+--------+--------+--------+--------+
| GI (Hex) | Prio 0 | Prio 1 | Prio 2 | Prio 3 |
+----------+--------+--------+--------+--------+
| c        | 000000 | 000000 | 000000 | 001005 |
| 1c       | 000000 | 000000 | 000000 | 00200a |
  ^                                     ^
  |                                     |
  egress port (slow)                    ingress port
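When this output is long, a small script can pick out the rows with queued frames. The sketch below parses the table rows shown above; the function names are hypothetical, the GI column is read as hex (per its header), and the priority cells are kept as opaque strings on the assumption that an all-zero cell means nothing is queued.

```python
def parse_queued_packet_info(text):
    """Return a list of (gi, [prio0..prio3 cell strings]) from the table."""
    rows = []
    for line in text.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if len(cells) != 5:
            continue                   # rulers, arrows, captions
        try:
            gi = int(cells[0], 16)     # GI column is labeled "(Hex)"
        except ValueError:
            continue                   # header row
        rows.append((gi, cells[1:]))
    return rows


def nonzero_entries(rows):
    """List (gi, priority index, cell) for every cell that is not all zeros."""
    return [
        (gi, prio, cell)
        for gi, cells in rows
        for prio, cell in enumerate(cells)
        if cell.strip("0")
    ]


SAMPLE = """
| GI (Hex) | Prio 0 | Prio 1 | Prio 2 | Prio 3 |
| c | 000000 | 000000 | 000000 | 001005 |
| 1c | 000000 | 000000 | 000000 | 00200a |
"""

if __name__ == "__main__":
    for gi, prio, cell in nonzero_entries(parse_queued_packet_info(SAMPLE)):
        print(f"GI 0x{gi:x} priority {prio}: {cell}")
```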
switch# attach module <module_number>
module-x# show process creditmon event-history | include CREDIT_LOSS next 1
Event:CREDITMON_EVENT_CREDIT_LOSS, length:1, at 818948 usecs after Sat Nov 21 06:33:33 2010 port = 18
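To see which ports lose credits most often, the event lines can be tallied per port. This is a sketch that assumes each event looks like the sample line above, with `port = N` following the timestamp (possibly on the next line); the regular expression and function names are hypothetical, and the timestamp is kept as a plain string.

```python
import re
from collections import Counter

# Pattern modeled on the single sample event shown above (an assumption).
EVENT_RE = re.compile(
    r"Event:(?P<event>\w+), length:(?P<length>\d+), "
    r"at (?P<usecs>\d+) usecs after (?P<when>.+?)\s+port\s*=\s*(?P<port>\d+)",
    re.DOTALL,
)


def parse_credit_events(text):
    """Return one dict per event found in the captured output."""
    events = []
    for m in EVENT_RE.finditer(text):
        events.append({
            "event": m.group("event"),
            "usecs": int(m.group("usecs")),
            "when": m.group("when").strip(),
            "port": int(m.group("port")),
        })
    return events


def credit_loss_by_port(events):
    """Count CREDIT_LOSS events per port."""
    return Counter(e["port"] for e in events if "CREDIT_LOSS" in e["event"])


SAMPLE = (
    "Event:CREDITMON_EVENT_CREDIT_LOSS, length:1, at 818948 usecs after "
    "Sat Nov 21 06:33:33 2010 port = 18"
)

if __name__ == "__main__":
    print(credit_loss_by_port(parse_credit_events(SAMPLE)))
```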
http://bock-bock/wiki/User:Smarkovi:Troubleshooting:Identify_Slow_Draining_Device