Professional Documents
Culture Documents
Brksan 2821
Brksan 2821
Cisco Public
BRKSAN-2821
BRKSAN-2821 14572_05_2008_c1
Cisco Public
Agenda
Section 1: What is I/O Consolidation Section 2: Enabling Technologies Section 3: FCoE (Fibre Channel over Ethernet) Section 4: I/O Consolidation Use Cases Challenges Closing Remarks
BRKSAN-2821 14572_05_2008_c1
Cisco Public
BRKSAN-2821 14572_05_2008_c1
Cisco Public
I/O consolidation supports all three types of traffic onto a single network Servers have a common interface adapter that supports all three types of traffic
Processor Memory
Processor Memory
I/O
Storage
I/O
I/O
I/O Subsystem
Storage
LAN
IPC
LAN
IPC
FC Traffic FC Traffic
CNA
Enet Traffic Enet Traffic Enet Traffic IPC Traffic IPC Traffic
Cisco Public
CNA
BRKSAN-2821 14572_05_2008_c1
Cisco Public
FCoE FCoE
FCoE
SAN A SAN B
FCoE
FCoE SAN
FC Storage
FC Switch
FCoE Switch
Server
No Storage Gateway
BRKSAN-2821 14572_05_2008_c1
Cisco Public
Storage
Must follow the Fibre Channel model Losing frames is not an option
IPC
(Inter-Process Communication)
BRKSAN-2821 14572_05_2008_c1
Cisco Public
10
Before PCI-Express there was not enough I/O bandwidth in the servers It needs to be Ethernet, but
1 GE didnt have enough bandwidth
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
11
PCI-Express
PCI Express (PCI-E or PCIe)
A computer expansion card interface format designed to replace PCI, PCI-X, and AGP
PCIe 1.1
Serial links at 2.5 Gbps (2 Gbps at the Datalink) Speeds from 2 Gbps (1x) to 32 Gbps (16x) 8x is required for 10 GE
12
10 GE
2008 will be the year of 10GE 10 GE has enough bandwidth Merging example
2 x 1 GE Ethernet NIC 1 x 4 Gbps FC (really 3.2 Gbps) Total 5.2 Gbps over a 10 Gbps link
BRKSAN-2821 14572_05_2008_c1
Cisco Public
13
SFP+ to SFP+
Early-2000s
Late-2000s 10 Gb
SFP+ 1 Gb Cu
UTP Cat 5 SFP Fiber
1W
0 0
2.5 s 2.5 s 1.5 s
SFP+ SR
Short Reach
1W
8W 8W 4W
10GBASE-T
BRKSAN-2821 14572_05_2008_c1
Cisco Public
14
SAN A
LAN
SAN B
Application Server Application Server Application Server Application Server Application Server Application Server Application Server Application Server
Easier to manage cabling solution reduces deployment time All copper cables are contained within rack
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
16x10 GE Cables
Application Server
15
Server Virtualization Driving the Need for More Bandwidth per Server Due to Server Consolidation
Growing Need for Network Storage Driving the Demand for Higher Network Bandwidth to the Server
Multicore CPUs and Server Virtualization Driving the Demand for Higher Bandwidth Network Connections
BRKSAN-2821 14572_05_2008_c1
Cisco Public
16
BRKSAN-2821 14572_05_2008_c1
Cisco Public
17
BRKSAN-2821 14572_05_2008_c1
Cisco Public
18
Transmission Error
Very rare in the data center
Congestion
Most common cause Congestion is a switch issue, not a link issue
A full duplex IEEE 802.3 link does not lose frames
BRKSAN-2821 14572_05_2008_c1
Cisco Public
19
STOP
Switch A
PAUSE
Queue Full
Switch B
20
10
A PAUSE Frame Start Sending Stop Frames for This Frames Again Interval of Time
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
21
A
R_RDY
BRKSAN-2821 14572_05_2008_c1
Cisco Public
22
11
PAUSE
PAUSE
S1
S2
S3
BRKSAN-2821 14572_05_2008_c1
Cisco Public
23
A standard Ethernet frame, not tagged EtherType = 0x8808 means MAC Control Frame Opcode = 0x0101 means PAUSE Pause_Time is the time the link needs to remain paused in Pause Quanta (512-bits time) There is a single Pause_Time for the whole link
Pad 42 Bytes
CRC
BRKSAN-2821 14572_05_2008_c1
Cisco Public
24
12
BRKSAN-2821 14572_05_2008_c1
Cisco Public
25
16
EtherType = IEEE 802.1Q
12 Bits
VLAN ID
Priority CFI
26
13
Receive Queues
One Two Three Four Five Six Seven Eight
Eight Priorities
Switch A
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
Switch B
27
Similar to the PAUSE frame Opcode = 0x0101 is used to distinguish PFC from PAUSE Class vector indicates for which priorities the frame carries valid Pause information There are eight Time fields, one per priority
Pad 28 Bytes
CRC
BRKSAN-2821 14572_05_2008_c1
Cisco Public
28
14
Is Lossless Better?
TCP relies on losses We can run it on a priority where we do not enable Pause
BRKSAN-2821 14572_05_2008_c1
Cisco Public
29
BRKSAN-2821 14572_05_2008_c1
Cisco Public
30
15
Discovery Protocol
DCBX: Data Center Bridging eXchange
Data Center Ethernet Links DCBX Data Center Ethernet Links with Partial Enhancements
DCBX
BRKSAN-2821 14572_05_2008_c1
Cisco Public
31
DCBX
Hop-by-hop negotiation for:
Priority Flow Control (PFC) Bandwidth management Congestion management (BCN/QCN) Applications Logical link-down
32
16
Bandwidth Management
IEEE 802.1Q defines priorities, but not a simple, effective, and consistent scheduling mechanism Products typically implement some form of Deficit Weighted Round Robin (DWRR)
Configuration and interworking is problematic
BRKSAN-2821 14572_05_2008_c1
Cisco Public
33
Priority Groups
Priority Groups Are Then Scheduled LAN
SAN
34
17
Goals
BW assignment for each Priority Group
Example: 40% LAN, 40% SAN, 20% IPC
BRKSAN-2821 14572_05_2008_c1
Cisco Public
35
10 GE Link Realized Traffic Utilization (30%) HPC Traffic (30%) LAN Traffic (40%) (20%)
3 Gbs
4 Gbs
6 Gbs
(30%)
(50%)
3 Gbs
3 Gbs
3 Gbs
(30%)
(30%)
T1
T2
T3
T1
T3
HPC TrafficPriority Class High20% Guaranteed Bandwidth LAN TrafficPriority Class Medium50% Guaranteed Bandwidth Storage TrafficPriority Class Medium-High30% Default Bandwidth
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
36
18
Congestion Management
Layer 2, end-to-end congestion management Standards track in IEEE 802.1Qau a.k.a. BCN (Backward Congestion Notification) or QCN (Quantized Congestion Notification)
Switch Switch
Thro ttle
Receive Buffer
e ottl Thr
BRKSAN-2821 14572_05_2008_c1
Cisco Public
37
Inspired by:
TCP AIMD (Additive Increase, Multiplicative Decrease) rate control TCP window increases linearly in absence of congestion Decreases exponentially (gets halved) at every congestion indication (either implicit or explicit) FCC (Fibre Channel Congestion Control) A feature on Cisco MDS switches
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
38
19
RP
CP
BRKSAN-2821 14572_05_2008_c1
Cisco Public
39
BRKSAN-2821 14572_05_2008_c1
Cisco Public
40
20
BRKSAN-2821 14572_05_2008_c1
Cisco Public
41
Layer 2 Multipathing
Increase bandwidth of L2 networks via multiple active links
BRKSAN-2821 14572_05_2008_c1
Cisco Public
42
21
L2 Multipathing
Multiple paths are used, reclaiming network bandwidth L3 multipathing is common in IP networks Important when there is limited or no differentiation in speed between access links and backbone links Reduces latency L2 multipathing
Eliminates Spanning Tree from the backbone No packet flooding Small forwarding tables
BRKSAN-2821 14572_05_2008_c1
Cisco Public
43
Layer 2 Multipathing
Cisco DCE is:
A precursor of TRILL, an IETF project for Layer 2 multipath Inspired to FSPF (Fibre Channel Shortest Path First)
Cisco DCE
Computes topology and forwarding via IS-IS Provides optimal pair-wise unicast forwarding Provides multipathing for unicast and multicast frames Provides seamless interoperability with existing devices
BRKSAN-2821 14572_05_2008_c1
Cisco Public
44
22
SAN
LAN
SAN
BRKSAN-2821 14572_05_2008_c1
Cisco Public
45
Delayed Drop
Delayed Drop is a mechanism that:
Allows a switch buffer to virtually extend to previous hop This reduces packet drop for transient congestions Is enabled per priority
BRKSAN-2821 14572_05_2008_c1
Cisco Public
46
23
Proxy Queue
UNPAUSE
During short-term congestion, both queues drain fast enough that the actual queue releases the PAUSE on its own During long-term congestion, the proxy queue fills to its high-water mark, and it releases the PAUSE; the actual queue begins dropping packets, and the congestion is managed through higherlevel protocols
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
47
BRKSAN-2821 14572_05_2008_c1
Cisco Public
48
24
BRKSAN-2821 14572_05_2008_c1
Cisco Public
49
1, 10Gbps
BRKSAN-2821 14572_05_2008_c1
Cisco Public
50
25
FCoE Benefits
FCoE benefits are the same of any I/O consolidation solution
Fewer cables Both block I/O and Ethernet traffic coexist on same cable Fewer adapters needed Overall less power
BRKSAN-2821 14572_05_2008_c1
Cisco Public
51
FCoE Benefits
LAN
4
SAN-A
SAN-B
LAN
4
SAN-A
SAN-B
Enet 16 2 36 2
FC 16 2 36 2
Total 32 4 72 4
Cisco Public
Enet 16 2 36 2
FC 0 0 4 0
Total 16 2 40 2
52
26
BRKSAN-2821 14572_05_2008_c1
Cisco Public
53
Protocol Organization
FCoE Itself Is the data plane protocol It is used to carry most of the FC frames and all the SCSI traffic
FIP (FCoE Initialization Protocol) It is the control plane protocol It is used to discover the FC entities connected to an Ethernet cloud It is also used to login to and logout from the FC fabric
54
27
FC Header
24 Bytes
Tota l:
2180 Byte s
EOF FCS
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
FC Payload
ET = FCoE
55
Source MAC Address IEEE 802.1Q Tag Ver Reserved Reserved Reserved Encapsulated FC Frame (Including FC-CRC) Reserved SOF Reserved
56
28
Enet Port
Enet Port
BRKSAN-2821 14572_05_2008_c1
Cisco Public
57
Ethernet Bridge
Eth Port
Eth Port
Eth Port
Eth Port
Eth Port
BRKSAN-2821 14572_05_2008_c1
Cisco Public
58
29
FC-BB-5 Terminology
Unchanged from previous FC standard
VN_Port: Virtual N_Port VF_Port: Virtual F_Port VE_Port: Virtual E_Port
BRKSAN-2821 14572_05_2008_c1
Cisco Public
59
FC-3/FC-4s
FC-3/FC-4s
FC-3/FC-4s
FC-3/FC-4s
VN_Port
FC Entity
VN_Port
FC Entity
VN_Port
FC Entity
VN_Port
FC Entity
FCoE_LEP
FCoE_LEP
FCoE Entity
FCoE_LEP
FCoE_LEP
FCoE Entity
Ethernet_Port
Ethernet_Port
SC DI FLOGI F
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
60
30
Means Optional
F_Port
FC Switching Element
VE_Port
FC Entity
VF_Port
FC Entity
VE_Port
FC Entity
VF_Port
FC Entity
FCoE_LEP
FCoE_LEP
Ethernet_Port
Ethernet_Port
Ethernet_Port
Ethernet_Port
Ethernet_Port
Ethernet_Port
BRKSAN-2821 14572_05_2008_c1
Cisco Public
61
VF_Ports VN_Ports
10 GE 4/8 Gbps FC
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
62
31
VF_Ports
VN_Ports
10 GE 4/8 Gbps FC
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
63
VN_Ports VF_Ports
VN_Ports
10 GE 4/8 Gbps FC
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
64
32
VE_Ports VF_Ports
VN_Ports
10 GE 4/8 Gbps FC
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
65
FC Storage
FCID 7.1.1 FC Fabric FC Domain 7 FC Fabric FC Domain 3 MAC A Ethernet Fabric FC Domain 1 MAC B Ethernet Fabric
Dest. = MAC B Srce. = MAC A D_ID = FC-ID (1.1.1) S_ID = FC-ID (7.1.1)
Dest. = MAC C Srce. = MAC B D_ID = FC-ID (1.1.1) S_ID = FC-ID (7.1.1)
FC Frame
FC Frame
FCoE Frame
BRKSAN-2821 14572_05_2008_c1
Cisco Public
66
33
MAC Addresses are negotiated in FIP Initial deployment will use FPMA only
BRKSAN-2821 14572_05_2008_c1
Cisco Public
67
MAC Address
BRKSAN-2821 14572_05_2008_c1
FC-ID 7.8.9
Cisco Public
68
34
FCoE Switch
Discovery FIP: FCoE Initialization Protocol
FLOGI/FDISC
FLOGI/FDISC Accept
FC Command
FC Command Responses
FCOE Protocol
BRKSAN-2821 14572_05_2008_c1
Cisco Public
69
Encapsulated FIP Operation (Self-Describing Length) PAD to Minimum Length or Mini-Jumbo Length Ethernet FCS
BRKSAN-2821 14572_05_2008_c1
Cisco Public
70
35
Reserved
Priority
Reserved
Reserved
Reserved
71
Type = 7
Len = XX
Reserved
FLOGI Request, FLOGI LS_ACC/LS_RJT NPIV FDISC Request, FDISC LS_ACC/LS_RJT Fabric LOGO Request, LOGO LS_ACC/LS_RJT (No SOF/EOF / FC-CRC?)
BRKSAN-2821 14572_05_2008_c1
Cisco Public
72
36
FIP
Multicast Solicitation from H2
MAC (H1) FCF-MAC (A)
H2
Solicitation identifies VF_Port capable FCF-MACs with compatible addressing capabilities Other parameters may include ENodes Port_Name for optional duplicate MAC address detection
Cisco Public
BRKSAN-2821 14572_05_2008_c1
73
FIP
Unicast Advertisements from A and B
MAC (H1) FCF-MAC (A)
H2
MAC(H2) FCF-MAC(B)
74
37
FIP
FLOGI Request
MAC (H1) FCF-MAC (A)
H2
BRKSAN-2821 14572_05_2008_c1
Cisco Public
75
FIP
FLOGI LS_ACC
MAC (H1) FCF-MAC (A)
H2
ENode uses MAC address in FIP FLOGI LS_ACC as the VN_Port MAC address for the FC-ID contained in the FLOGI data for subsequent FCoE frames
76
BRKSAN-2821 14572_05_2008_c1
Cisco Public
38
FCoE
Data Transfer
MAC (H1) FCF-MAC (A)
H2
All subsequent FCoE frames use granted MAC address and assigned FC-ID
FIP frames continue to use MAC(H2) For SPMA, MAC(H2) = MAC(H2) = MAC(H2) For FPMA, MAC(H2) and MAC(H2) use FC-IDs as low order 24 bits and FC-MAP for upper 24 bits
77
Cisco Public
FCoE
1, 2, 4, (8), 10 Gbps
1, 10 . . . Gbps
10, 20 Gbps
BRKSAN-2821 14572_05_2008_c1
Cisco Public
78
39
Is FCoE Routable?
Most folks mean Is FCoE IP-routable
The answer is NO, there is no IP layer in FCoE This was a design goal to keep FCoE simple FC-BB-5 contains FCIP that is IP-routable
FCoE is FC-routable
FCoE switches may forward FC frames across different Ethernet clouds FCoE switches may forward FC frames over the Internet using FCIP
FCoE
FCIP
IP Cloud
FCIP
FCoE
BRKSAN-2821 14572_05_2008_c1
Cisco Public
79
HBA
HBA HBA
CNA
10GbEE 10GbEE
BRKSAN-2821 14572_05_2008_c1
10GbE
Link
Link
PCIe
Ethernet Drivers
Ethernet
PCIe
PCIe
Ethernet Drivers
Operating System
2008 Cisco Systems, Inc. All rights reserved. Cisco Public
Operating System
80
40
BRKSAN-2821 14572_05_2008_c1
Cisco Public
81
Open-FCoE Software
HBA HBA Mgmt Plane Linux Kernel File System layers SCSI Layer Linux Kernel File System layers SCSI Layer OpenFC Layer HBA Driver FCoE Layer Net Device HBA Ethernet Driver Ethernet FCoE FCoE Mgmt Plane
Fibre
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved.
Server
Cisco Public
Ethernet
Server
82
41
BRKSAN-2821 14572_05_2008_c1
Cisco Public
83
Wireshark
Once known as Ethereal Captures and displays network traffic Available from: http://wireshark.org/ Sample trace file
/common/openfc/traces/fcoe-t11.cap
Screenshots/demo
BRKSAN-2821 14572_05_2008_c1
Cisco Public
84
42
Wireshark Screenshot
BRKSAN-2821 14572_05_2008_c1
Cisco Public
85
BRKSAN-2821 14572_05_2008_c1
Cisco Public
86
43
Distribution
4 MDS 9500 Cisco Catalyst 6509
4 x 4G FC
1 x 10 GE
STP BLK
POD 1
POD N
Access
MDS 9500 Cisco Catalyst 6509 NIC Teaming Active/Standby Discrete 1 GE NICs and FC HBA
Server Cabinet 1
Server Cabinet N
Server Cabinet 1
Server Cabinet N
BRKSAN-2821 14572_05_2008_c1
Cisco Public
87
Distribution
MDS 9500 Cisco Catalyst 6509
STP BLK
POD 1
POD N
Access
Nexus 5000
44
Distribution
MDS 9500 Cisco Catalyst 6509
Access
Nexus 5000
VSS
8
Distribution
MDS 9500 Cisco Catalyst 6509
Access
Nexus 5000
45
4x10 GE Ports
4x10 GE Ports
6x4 GFC
6x4 GFC
BRKSAN-2821 14572_05_2008_c1
Cisco Public
91
4x10 GE Ports
6x4 GFC
BRKSAN-2821 14572_05_2008_c1
Cisco Public
92
46
Distribution
8 MDS 9500 Cisco Catalyst 6509
POD 1
POD N
Access
Nexus Family
Blade Server
Copper Pass-Through
Blade Server 1
BRKSAN-2821 14572_05_2008_c1
Blade Server N
Cisco Public
Blade Server 1
Blade Server N
93
Distribution
8 MDS 9500 Cisco Catalyst 6509
POD 1
POD N
Access
Nexus Family
Blade Server
Ethernet-Only Switch
Blade Server 1
BRKSAN-2821 14572_05_2008_c1
Blade Server 1
Cisco Public
Blade Server 1
Blade Server 1
94
47
2x10 GE Ports
2x10 GE Ports
4x4 GFC
4x4 GFC
Nexus 5000
BRKSAN-2821 14572_05_2008_c1
Cisco Public
95
BRKSAN-2821 14572_05_2008_c1
Cisco Public
96
48
Distribution
8 MDS 9500 Nexus 7000
Access
Nexus 5000
Distribution
8 MDS 9500 Nexus 7000
Access
Nexus 5000
49
Conclusions
BRKSAN-2821 14572_05_2008_c1
Cisco Public
99
Challenges
FCoE redefines consolidated scenarios
Ethernet switch manufacturers will try to enter the FC switching market FC switch manufacturers will try to enter the Ethernet switching market HBA manufacturers will try to enter the NIC market NIC manufacturers will try to enter the HBA market
BRKSAN-2821 14572_05_2008_c1
Cisco Public
100
50
Web Pointers
PCI Express
http://en.wikipedia.org/wiki/Pci_express
IEEE 802.3
http://standards.ieee.org/getieee802/802.3.html
Improvements to Ethernet
http://www.nuovasystems.com/EthernetEnhancements-Final.pdf
FCoE
http://www.fcoe.com/ http://www.t11.org/ http://www.open-fcoe.org/ http://www.fibrechannel.org/OVERVIEW/FCIA_SNW_FCoE_WP_Final.pdf
TRILL
http://www.ietf.org/html.charters/trill-charter.html
BRKSAN-2821 14572_05_2008_c1 2008 Cisco Systems, Inc. All rights reserved. Cisco Public
101
Thank You
BRKSAN-2821 14572_05_2008_c1
Cisco Public
102
51
Recommended Reading
Continue your Cisco Live learning experience with further reading from Cisco Press Check the Recommended Reading flyer for suggested books
103
BRKSAN-2821 14572_05_2008_c1
Cisco Public
104
52
BRKSAN-2821 14572_05_2008_c1
Cisco Public
105
53