Professional Documents
Culture Documents
Implementation and Performance Evaluation of Load Balanced Routing in SDN Based Fat Tree Data Center
Implementation and Performance Evaluation of Load Balanced Routing in SDN Based Fat Tree Data Center
Abstract—As the number of data intensive and distributed corresponding to precomputed paths to be stored in the
applications increase, data centers are highly utilized for their forwarding element at the time of flow creation and avoids
storage, processing and accessing. These applications cause per packet computation [2]. So selection of a path from
throughput sensitive incast traffic in the data center. multiple available paths can be done proactively.
Traditional shortest path routing algorithms cause traffic
hotspots in the shortest path links and lead to congestion. POX [4], Beacon [6], Floodlight [5], and Ryu [7] are the
Therefore, load balanced routing strategies need to be major controller platforms for implementing Software
employed to spread the traffic across multiple paths. This work Defined Networking. None of these controllers provide
employs the idea of multi path routing to achieve load inbuilt module for multi path support. They provide
balancing in SDN based Fat tree Data center. It implements Spanning Tree Protocol(STP) to deal with ARP broadcast
Equal Cost Multi Path routing algorithm on Fat tree Data streams occurring in the topologies having loops. But STP
Center Network in Ryu SDN controller and evaluates its load result in the selection of a single path and load balancing
balancing performance against traditional Shortest Path along multiple available paths cannot be performed. This
Routing algorithm under different data center benchmark paper implements a load balanced routing strategy for Fat
traffic scenarios. We have conducted an exhaustive study about tree based data center that works on software defined
the load balancing effect of ECMP on Ryu based SDN Fat tree network architecture. Here, Mod-N based Equal Cost Multi
data center and found that ECMP increases Average network
Path routing algorithm is implemented on top of the Ryu
throughput by 68% and Average Link utilization by 60%. Also
SDN controller. Performance of this algorithm in terms of
packet drop is decreased by 44% with ECMP.
average network throughput, average link utilization, and
Keywords—Data Center, Multi path routing, Software packet drop are compared with the implementation of
Defined Networking, Load Balancing, Fat Tree traditional Shortest Path Routing algorithm under different
benchmark traffic patterns to observe its load balancing
effect in the data center. This work utilizes parallel TCP
I. INTRODUCTION flows to conduct an exhaustive study of the load balancing
Data Centers Networks(DCN) host thousands of effect of ECMP Mod-N strategy in Ryu SDN controller
networked servers to provide storage and computing to based Fat tree data canter.
business organizations. They are used to store, process and
access data intensive and distributed applications like Map The rest of the paper is organized as follows: Section II
Reduce applications, social networking platforms, large scale explains the background topics and Section III gives related
web search etc. These applications create high volume of works. The proposed ECMP routing framework for SDN
throughput intensive east west traffic in the DCN. To provide based Fat tree DCN is given in section IV. Section V
network efficiency and to satisfy the service requirement of describes the implementation details and section VI gives
diverse flows in the data center, congestion possibility should results and a discussion on it. Paper concludes in section VII.
be avoided by sharing the traffic load across available
alternative links. Traditional shortest path routing in a heavy II. BACKGROUND
traffic scenario results in the creation of congestion hot spots Data Centers are highly utilized for their storage,
due to single best path selection. This demands for load processing and accessing by different applications.
balanced routing techniques in the data center. Traditional TCP mechanisms are not suitable for data center
Inefficiency of traditional network technologies in traffic and fails to prevent the occurrence of congestion [15].
handling the large network of thousands of servers in the Also commonly used shortest path routing strategy results in
data center network introduced Software Defined the selection of the same least cost path in all the
Networking into it. SDN uses centralized software communications between a source-destination pair. Thus
controllers to connect with switches in the network and shift Data centers need to avoid the creation of traffic hot spots
the routing logic implementation from the switch to the through appropriate load balance based network layer or
controller. Also, in SDN paradigm, switching/forwarding transport layer mechanisms.
element can store flow state in flow table as flow entries. Fat tree is a symmetric, non-blocking, fault tolerant, and
Thus each forwarding element can store flow state without switch-centric data center topology. Fat tree connect the
any additional overhead. This allows next hops
2
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on December 11,2023 at 03:39:45 UTC from IEEE Xplore. Restrictions apply.
this packet is to be forwarded, so that it will reach the for the interested events and can handle it through its own
destination. Source to destination path computation is done at instruction.
this point to find multiple equal cost shortest paths between
them. We emulate a 4-ary Fat tree that has 16 hosts and 20
switches to make Data center topology. Emulation
Selection of one path from these multiple equal cost parameters are given in the Table I. Each link is given 1GB
paths are done using Hash based Mod-N operation. bandwidth. The Data center topology is connected with Ryu
Algorithm 1 describes it. Here G indicates the graph to implement SDN based architecture. An application is
representation of the network, H is the set of hosts, S is the written to compute equal cost shortest paths among a
set of switches and L denotes the set of links. particular source destination pair. Path selection module
selects a path based on the ECMP Hash Mod-N algorithm
discussed in the design section. Newly selected path
information is installed among the switches along the path.
In Mod-N hash based selection, flow is identified at the Switch to controller protocol Open Flow V1.3
micro level using source and destination ip addresses, source
and destination port numbers and protocol fields. A unique
hash value is generated for each such flow. A mod-N Two types of traffic can be seen in the data centers, east-
operation on this hash value, where N is the number of equal west traffic and north-south traffic. East-west traffic is the
cost paths available between the source and the destination, traffic between server racks in a data center and north-south
is performed to get the choice number for the selection of traffic is the communication coming and going from internet
path. Path corresponding to this choice number is selected to the data center. Studies[12] have shown that east-west
from the list of all paths. traffic contributes about seventy seven percentage of total
data center traffic. Due to the difficulty in getting real data
Major steps in the load balanced routing framework center traffic traces, we have generated data center
design are listed below: benchmark communication patterns as per[13] namely
stride(i), staggered(Pe,Pp), and random. Stride pattern with
1) Collect network topology information into a graph G
argument value 1, 2, 4 and 8 generates east-west traffic
2) Start network status monitoring to collect performance between the server hosts. When 1 is used as the argument,
evaluation parameters each server sends to the next neighbour in the fat tree
topology and the traffic pattern is h1 to h2, h2 to h3...., h15
3) Initiate parallel network traffic between source to h16 and h16 to h1. With arguments 2, 4 and 8
destination pairs, that can test the effectiveness of load communication distance between the hosts increases. Stride
balanced routing strategy pattern generates both intra pod and inter pod
4) For an incoming flow with no matches in the flow communications.
table of a switch, find the list of alternative paths between the With staggered pattern a server sends traffic to a server
source-destination of that flow within the same edge switch with a probability of Pe and to
5) Select a path based on Modulo-N Hash based ECMP the server in the same pod with a probability Pp and to the
load balancing route selection strategy, as explained in the remaining servers in the network with 1 − Pe − Pp. In
Algorithm 1 Random pattern, for each source host, the destination host is
selected randomly. Iperf tool [8] is used to generate TCP
6) Install flow entry on all switches along the selected traffic flows between hosts in the network. Parallel flows are
path applied to create multiple flow between a source destination
ECMP Hash Mod-N algorithm assigns each flow to a pair. All the traffic patterns are implemented and applied
particular path between the source destination pair. So it do over the hosts in the fat tree data center multiple times to take
not require packet reordering at the receiver side. average measurements.
3
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on December 11,2023 at 03:39:45 UTC from IEEE Xplore. Restrictions apply.
Mod-N give high throughput compared to single shortest
path routing in all the patterns.
Fig. 5. Average Network Throughput comparison of SPR and ECMP Mod-N algorithms on different traffic patterns in SDN Fat tree Data Center
4
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on December 11,2023 at 03:39:45 UTC from IEEE Xplore. Restrictions apply.
Fig. 6. Average Link Utilization comparison of SPR and ECMP Mod-N algorithms on different traffic patterns in SDN Fat tree Data Center
Fig. 7. Packet Drop comparison of SPR and ECMP Mod-N algorithms on different traffic patterns in SDN Fat tree Data Center
5
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on December 11,2023 at 03:39:45 UTC from IEEE Xplore. Restrictions apply.
[18] X. Shi, L. Yangyang, X. Haiyong, Y. Tengfei, Z. Linchao, L. Panyu,
Z. Heng, and L. Zhiyao, ”An openflow-based load balancing strategy
in SDN.” Comput. Mater. Contin 62, no. 1 (2020): 385-398.
6
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on December 11,2023 at 03:39:45 UTC from IEEE Xplore. Restrictions apply.