Professional Documents
Culture Documents
5G Ready Multi-Failure
5G Ready Multi-Failure
(Invited Paper)
Abstract—Data explosion and higher service level agreements degree of network availability. Via network simulation, the paper
(SLA) are common terminologies in modern communication net- demonstrates that multiple failure resiliency and cost effectiveness
works. Recent advancements toward 5G networks add to the do not necessarily represent a contradiction in transport networks
complexity where billions of devices, both static and mobile, are as long as an appropriate network architecture is chosen and that
connected generating trillions of megabits of data flooding the efficient network planning and optimization are applied.
network and where service providers scuffle daily to deliver key
performance indicators: Lower latency, longer battery life, higher Index Terms—Networks, network optimization, network surviv-
data rates, ultra-high reliability, and more connected devices. The ability, protection and restoration.
optical-fiber world simultaneously witnesses a denser, fiber-rich
network infrastructure which is critically challenged in fast eco-
nomically developing countries by heavy construction of roads and
I. INTRODUCTION
other infrastructures that inflict repeated and unpredictable cuts to 5G network trials and commercial launches are all set to hap-
the extent that traffic is simultaneously interrupted in up to 10% of pen as the telecommunication world gears up towards the fifth
the optical links. Thus, network resiliency to fiber failures is already
and will continue to be a top priority for operators. Depending on generation of cellular mobile communications, succeeding 4G
the SLA, operators choose from various resiliency options, e.g., at (LTE/WiMax), 3G (UMTS) and 2G (GSM) systems [1]. 5G
single network layers or across multiple ones, with dedicated pro- networks will have to efficiently support billions of devices and
tection or shared restoration, depending on decisive factors like different key performance indicators (KPIs), ranging from ultra-
network availability and total cost of ownership (TCO). With tra- low latency, very high data rates and ultra-high reliability [2].
ditional network architectures, high resiliency comes with signifi-
cantly increased TCO involving optical transport network (OTN) Traffic forecasts already hint that by 2023 networks are expected
switches, L1 protection switching, and L1 restoration. This trans- to generate monthly 107 exabytes of mobile data traffic [3], [4].
lates into high CAPEX and excessive space and power consump- Though generally one envisions 5G network of being an intri-
tion, hampering the desirable seamless scalability with the contin- cate wireless technology, integral to the success of 5G deploy-
uously increasing demand for more capacity. This paper explores ment and operation is the extensive optic fiber cable network
an alternative hyperscale network architecture, which is adapt-
able and resilient to multiple fiber failures, thus delivering com- that will sustain it [5]. As mentioned before, there are strin-
mitted service levels to the end users and which, at the same time, gent requirements for the 5G network ranging from ultra-low la-
significantly reduces network cost compared to a traditional all tency to very high data rates and ultra-high reliability [2]. Higher
OTN-switched network. The hyperscale architecture exploits low data rates per cell will require more transmission capacity while
footprint carrier grade, data center interconnect type high-density higher transmission rate in the cloud radio access networks will
transponders and cost-effective optical protection switching and
restoration via software defined networking control. The paper demand higher capacity in the underlying (fixed) network in-
also describes a routing and grooming algorithm to dimension a frastructure. In addition, a 3 ms latency constraint in the cloud
shared resource pool for optical restoration and a statistical simu- radio access network can constrain the distance between radio
lation method for time effective execution, simulating failures over head and baseband unit–which will have a direct impact on net-
a carefully selected sub-set of failure scenarios to guarantee a high work architecture [6]. Cell densification will dictate the fiber
point of presence and ultralow latency (<1 ms) would push de-
Manuscript received January 3, 2019; revised May 20, 2019; accepted May ployment of fiber into 5G mobile base stations thus resulting in
27, 2019. Date of publication May 31, 2019; date of current version July
31, 2019. This work was supported in part by the H2020 METRO-HAUL deep fiber penetration (in the range of 500 m maximum), mo-
project under Grant 761727, in part by FCT/MEC through national funds, tivating the adoption of a new fronthaul/backhaul fiber based
and in part by FEDER—PT2020 partnership agreement under the Project network. Overall, this will imply deploying significantly more
UID/EEA/50008/2019. (Corresponding author: Bodhisattwa Gangopadhyay.)
B. Gangopadhyay is with Infinera Portugal, 2790-078 Carnaxide, Portugal optical fiber and likely adopting a new flavor of optical transport
(e-mail: BGangopadhya@infinera.com). technologies, characterized by high capacity and flexibility. The
J. Pedro is with Infinera Portugal, 2790-078 Carnaxide, Portugal, and also focus of this paper is the transport network architecture that will
with the Instituto de Telecomunicações (IT), Instituto Superior Técnico (IST),
1049-001 Lisbon, Portugal (e-mail: JPedro@infinera.com). support future 5G and cloud requirements.
S. Spaelter is with Infinera GmbH, D-81541 Munich, Germany (e-mail: A highly interconnected backbone catering 5G fiber-optic
SSpaelter@infinera.com). backhaul means seamless streaming of bandwidth-intensive ap-
Color versions of one or more of the figures in this paper are available online
at http://ieeexplore.ieee.org. plications because only fiber will be able to provide capacity
Digital Object Identifier 10.1109/JLT.2019.2920305 for the most demanding Peak Data Rates bandwidth required
0733-8724 © 2019 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
GANGOPADHYAY et al.: 5G-READY MULTI-FAILURE RESILIENT AND COST-EFFECTIVE TRANSPORT NETWORKS 4063
which employs DCI-oriented muxponders with ROADM traffic frame-by-frame on every port at any layer (ranging be-
based optical switching, CD add/drop terminations, a virtualized tween L1 and L2.5) and change the way traffic is managed
control plane for multi-layer protection and restoration, as well dynamically without the need to deploy new protocol specific
as with a shared pool of 3R regenerators. Section III describes hardware. Figure 2 illustrates the concept of employing a uni-
design algorithms to: (i) dimension the network with dedicated versal switching fabric.
resources for working and protection; (ii) dimension a shared The advantages of universal OTN switching include: (i) elim-
regenerator resource pool used for restoration and (iii) cover inating all switching boundaries and restrictions by allowing na-
maximum number of failure scenarios used for regenerator pool tive switching and interworking on every universal module port;
dimensioning. Section IV deals with the network simulation re- (ii) maximizing network resources by dynamically adapting to
sults and comparison, which validate the proposed transport ar- changing conditions through a universal matrix; (iii) improving
chitecture. Finally, Section V presents concluding remarks. overall network efficiency by providing aggregated handoffs to
routers and maximizing utilization of both metro and core wave-
II. TRANSPORT NETWORK ARCHITECTURE lengths; (iv) providing traffic grooming at any layer irrespective
of the physical interface and enable grooming of multiple pro-
As we move towards the 5G-era, technological bottlenecks
tocols on a single port; and (v) reducing CAPEX and OPEX
and operational rigidities are to be eradicated to scale up to the
with a minimum number of highly flexible multi-service uni-
traffic volume and quality expected. Optical networks are no
versal interface cards that support all applications [18]. Due to
exception to the same. Developments in the transport network
this, OTN switching is still seen as paramount in future network
specifically in the optical communication world have seen dif-
architectures [14].
ferent network architectures being proposed and opted for. OTN
Taking advantage of grooming in electrical layer, optical
switch based, Muxponder based, Router based, with or without
channel data unit (ODU) protection switching in sub-50 ms and
optical bypass and the list goes on. Obviously, each has their pros
restoration switching within few hundreds of milliseconds could
and cons. However, studies over the last years made it very clear
be achieved in an all OTN switch based architecture contrary to a
that, there is no single winner in all conditions and a mix and
pure transponder/muxponder solution. However, up to recently,
match of a few helps to define the ideal one for the network being
traffic based on 10G or lower client port speeds were being
examined. Recent year’s development around the fast growing
aggregated into 100G wavelengths and thus traffic collection,
DCI space have focused on high density low footprint muxpon-
grooming, switching, protection and restoration were naturally
ders [13]. This forms an important part of the network architec-
executed at (electrical) L1 at the OTN nodes. While there is a
ture proposed in this paper [12]. Alongside, the universal OTN
significant difference between the client and line rate and ex-
switch deserves mention being one of the only kind capable
ists a mesh network with fragmented traffic distribution, OTN
to handle all traffic in a single switching domain, thus enable
switching is a must taking advantage of traffic grooming and
to maximize optical channel fill ratio [14]. In addition, higher
delivering a leaner TCO. But, explosive traffic growth, hinting
order modulation formats and higher symbol rates enabled by
at 100G becoming yesterday’s 10G, is a matter of concern [19].
next-generation line interfaces also have a relevant role in re-
As the client rate (100 GbE, 400 GbE, …) is gearing up to
ducing TCO [15]. In the following sections, we overview each
match with line rates, grooming might deliver less significant
of the key transport network architectures.
savings in TCO and a leaner network element might make more
Previous works have established the benefits of optical layer
economic sense. Particularly, an OTN switch might not be the
restoration in terms of resource savings [16]. Still, concerns exist
best choice at every transport node, as the larger the cross-
regarding the failure recovery times when using this approach.
connect, the larger is space and power consumption [20]. In-
It is noteworthy that, centralized virtualized control plane is a
dustry trends shows already 25 Tbps switching matrix in the
cornerstone to the cost-effective solution delivering the highly
horizon, but given its expected cost, footprint and power con-
resilient network where the benefits of the distributed gener-
sumption, it is not sustainable to deploy this functionality at ev-
alized multiprotocol label switching (GMPLS) are preserved
ery node. Still, OTN switches could be useful at the network edge
and the challenges of future networks are addressed and im-
for multi-service and low-rate aggregation onto an OTN infras-
proved resilience is provided through multi-layer L0-L3 co-
tructure and today’s unpredictable bandwidth demands require
ordination in multi-service and multi-domain networks. Com-
flexible network architectures to quickly adapt to changing traf-
pared to traditional distributed automatically switched optical
fic patterns. For example, fabric-based switching solutions are
network (ASON)-GMPLS, this L0 restoration based on cen-
well-suited for metro enterprise networks that require flexible,
tralized ASON delivers a sub-50 ms switching for restoration
on-demand, any-to-any core site connectivity.
once the failure is triggered, by searching and setting up a new
maximum disjoint restoration path after any 50-ms protection
switching event. This scheme is further detailed in Section II.C. B. DCI-Cloud Oriented High-Density Muxponder
Based Architecture
A. OTN Switch Based Architecture
Recent trends in DCI-cloud oriented muxponders saw the in-
Innovations in OTN switches are advancing by leaps and troduction of disaggregated platform based muxponders deliv-
bounds, offering an innovative approach to networking solutions ering up to 9.6 Tbps in 1 rack unit (RU) as per industry standards
by providing completely protocol agnostic switching capabili- [21]. These muxponders simplified the provisioning of 10G,
ties [17]. With these solutions, network operators can groom 40G, and 100G Wide Area Network (WAN) cloud connectivity
GANGOPADHYAY et al.: 5G-READY MULTI-FAILURE RESILIENT AND COST-EFFECTIVE TRANSPORT NETWORKS 4065
and online operation. For simplicity, the description assumes combinations by expected importance and then shortlist them
that line interfaces can be operated at either 100 G or 200 G, but according to a given budget (i.e., maximum number of failure
it can be easily extended to support a wider array of optical chan- combinations that can be explicitly considered) is described. Al-
nel formats. Grooming of a given traffic demand into a sequence though it cannot guarantee in advance that all the required shared
of existing and/or new lightpaths is modelled by representing in resources are identified, especially if the failure simulation bud-
the auxiliary graph both existing lightpaths with enough free ca- get is very small, it has been recently shown that in a reference
pacity to accommodate the demand and possible (i.e., feasible) network it was possible to approximately dimension the shared
new lightpaths that can be created to carry the demand. This al- resources when the failure simulation budget covers at least 10%
gorithm is used to route all demands and determine the resources of the total number of possible failure combinations [27].
required for working and protection paths. The restoration re-
sources (e.g., number of shared 3Rs at each node) can only be
determined taking into account specific failure scenarios. For
that purpose, the following algorithm was developed.
Importantly, this algorithm must be executed for all the mean- IV. NETWORK SIMULATION STUDIES
ingful link failure combinations in order to determine the total This section aims at quantitatively validate the effectiveness
number and location of the shared resources. For instance, to of the HCA in reference transport networks. The algorithms have
ensure survivability against up to three link failures and assum- been implemented in C language and were run on a computer
ing protection is used to recover from the first link failure, all with an Intel Core i7 and 16 GB RAM. Execution time was found
combinations of two link failures have to be considered and for to be in 10 s of minutes for all the three networks. In this study, all
each one the design algorithm is executed. traffic demands have high resiliency requirements making use of
The effort required to compute the shared resources will grow a 1 + 1 + Restoration scheme with client traffic signals of 100
with the network size, number of traffic demands and number GbE. Under these assumptions, two architectural choices – a
of simultaneous link failures to be considered. This can imply conventional and the HCA – are compared with each other: The
a very long process, especially when the number of simultane- first one is based on OTN switches at all ROADM nodes with
ous link failures the traffic demands must survive to increases. 1 + 1 ODU layer protection for the first failure followed by ODU
Consequently, in order to guarantee that it is possible to design restoration. The second one relies on high-density muxponders
a transport network based on the proposed HCA architecture, at all ROADM nodes (for traffic aggregation or electrical signal
and more generally to design a transport network relying on regeneration) with 1 + 1 client side protection for the first failure
optical restoration, it is useful to investigate methods that can followed by OCh restoration. For both architectures, DWDM
reduce the number of failure combinations that need to be explic- ports (i.e., line interfaces) are dimensioned such that link failure
itly simulated. In the following, an algorithm to sort the failure survivability is about the same in both cases.
GANGOPADHYAY et al.: 5G-READY MULTI-FAILURE RESILIENT AND COST-EFFECTIVE TRANSPORT NETWORKS 4069
Fig. 13. GBN traffic distribution. Fig. 15. Comparative analysis OTN vs HCA for SPN.
TABLE I
RELATIVE POWER CONSUMPTION, SPACE OCCUPATION
AND CAPEX OF OTN SWITCH AND DCI MUXPONDER
[31] A. Eira, M. Quagliotti, and J. Pedro, “Impact of client-and line-side flex- João Pedro received the M.Sc. and the Ph.D. degrees in electrical and computer
ibility in the lifecycle of next generation transport networks [Invited],” engineering from Instituto Superior Técnico (IST), University of Lisbon, Lisbon,
IEEE/OSA J. Opt. Commun. Netw., vol. 8, no. 7, pp. A101–A115, Jul. Portugal. He was a Research Engineer and a System Architect for Network Plan-
2016. ning Tools with Nokia Siemens Networks from 2008 to 2013 and with Coriant
[32] Sparkle. [Online]. Available: http://www.tisparkle.com/ from 2013 to 2018. He currently leads the Multilayer Performance Optimization
[33] C. Politi et al., “Integrated design and operation of a transparent optical team with Infinera, Carnaxide, Portugal, which investigates future-looking op-
network: A systematic approach to include physical layer awareness and tical and multi-layer network architectures and design algorithms and supports
cost function,” IEEE Commun. Mag., vol. 45, no. 2, pp. 40–47, Feb. 2007. both algorithmic development and optical performance for planning tools and
[34] J. Pedro, “Designing transparent flexible-grid optical networks for max- SDN controller. He has also been a lecturer of courses on network planning
imum spectral efficiency [Invited],” IEEE/OSA J. Opt. Commun. Netw., and transport networks. He holds 6 patent applications and has co-authored over
vol. 9, no. 4, pp. C35–C44, Apr. 2017. 150 publications in international conferences and journals, participated in EU
projects FUTON, IDEALIST, and METRO-HAUL. Currently, his research in-
terests include high capacity optical networks, node architectures, routing and
spectrum assignment, multi-layer optimization, disaggregated networks, and
machine learning applications.
Dr. Pedro served as a reviewer for IEEE/OSA JOURNAL OF OPTICAL COM-
MUNICATIONS AND NETWORKING, JOURNAL OF LIGHTWAVE TECHNOLOGY, and
Bodhisattwa Gangopadhyay was born in Haldia, India, in 1981. He received IEEE PHOTONICS TECHNOLOGY LETTERS. He has also served as a TPC member
the B.E. degree in instrumentation technology from Siddaganga Institute of
of OFC from 2015 to 2018.
Technology, Visvesvaraya Technological University, Belgaum, India, in 2005
and the M.Tech degree in digital electronics and advanced communication from
Manipal University, Manipal, India, in 2007. After his initial years as an In-
tern, an Engineer, and a Ph.D. student (2006–2011) with Siemens (2006–2007)
and with Nokia Siemens Networks (2007–2011), he served as Research Engi-
neer, System Architect, and Solutions Consultant with Nokia Siemens Networks
(2011–2013) and Coriant (2013–2018) for network planning tools, network plan-
ning services, network migrations, multi-layer planning, and optimization of cus-
tomer networks. At present, he is a Principal Solution Consultant with Infinera, Stefan Spaelter received the Ph.D. degree in the field of quantum fiber optics.
Carnaxide, Portugal, where he is involved in multi-layer planning and optimiza- He has held various leadership positions in research, system engineering, op-
tion of transport networks, focusing toward 5G. He has authored several book tical engineering, and technical sales with Lucent Bell Labs, Siemens, Nokia
chapters, journals, and conference papers in these fields. He has participated in Siemens Networks, and Coriant. He is currently the Vice President of Solutions
EU project FUTON. His research interests include transport networks for 5G Product Line Management with Infinera, Munich, Germany, where he is globally
roll outs and planning/operation of network protection/restoration in multi-layer responsible for defining innovative and competitive solutions to packet optical
networks (L3-L0). transport networks. To his role, he brings more than 25 years of experience in
Mr. Gangopadhyay has served as a reviewer for JOURNAL OF LIGHTWAVE optical communications. He has published more than 75 reviewed conference
TECHNOLOGY. papers and journal articles.