Download as pdf or txt
Download as pdf or txt
You are on page 1of 11

4062 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 37, NO.

16, AUGUST 15, 2019

5G-Ready Multi-Failure Resilient


and Cost-Effective Transport Networks
Bodhisattwa Gangopadhyay , João Pedro , and Stefan Spaelter

(Invited Paper)

Abstract—Data explosion and higher service level agreements degree of network availability. Via network simulation, the paper
(SLA) are common terminologies in modern communication net- demonstrates that multiple failure resiliency and cost effectiveness
works. Recent advancements toward 5G networks add to the do not necessarily represent a contradiction in transport networks
complexity where billions of devices, both static and mobile, are as long as an appropriate network architecture is chosen and that
connected generating trillions of megabits of data flooding the efficient network planning and optimization are applied.
network and where service providers scuffle daily to deliver key
performance indicators: Lower latency, longer battery life, higher Index Terms—Networks, network optimization, network surviv-
data rates, ultra-high reliability, and more connected devices. The ability, protection and restoration.
optical-fiber world simultaneously witnesses a denser, fiber-rich
network infrastructure which is critically challenged in fast eco-
nomically developing countries by heavy construction of roads and
I. INTRODUCTION
other infrastructures that inflict repeated and unpredictable cuts to 5G network trials and commercial launches are all set to hap-
the extent that traffic is simultaneously interrupted in up to 10% of pen as the telecommunication world gears up towards the fifth
the optical links. Thus, network resiliency to fiber failures is already
and will continue to be a top priority for operators. Depending on generation of cellular mobile communications, succeeding 4G
the SLA, operators choose from various resiliency options, e.g., at (LTE/WiMax), 3G (UMTS) and 2G (GSM) systems [1]. 5G
single network layers or across multiple ones, with dedicated pro- networks will have to efficiently support billions of devices and
tection or shared restoration, depending on decisive factors like different key performance indicators (KPIs), ranging from ultra-
network availability and total cost of ownership (TCO). With tra- low latency, very high data rates and ultra-high reliability [2].
ditional network architectures, high resiliency comes with signifi-
cantly increased TCO involving optical transport network (OTN) Traffic forecasts already hint that by 2023 networks are expected
switches, L1 protection switching, and L1 restoration. This trans- to generate monthly 107 exabytes of mobile data traffic [3], [4].
lates into high CAPEX and excessive space and power consump- Though generally one envisions 5G network of being an intri-
tion, hampering the desirable seamless scalability with the contin- cate wireless technology, integral to the success of 5G deploy-
uously increasing demand for more capacity. This paper explores ment and operation is the extensive optic fiber cable network
an alternative hyperscale network architecture, which is adapt-
able and resilient to multiple fiber failures, thus delivering com- that will sustain it [5]. As mentioned before, there are strin-
mitted service levels to the end users and which, at the same time, gent requirements for the 5G network ranging from ultra-low la-
significantly reduces network cost compared to a traditional all tency to very high data rates and ultra-high reliability [2]. Higher
OTN-switched network. The hyperscale architecture exploits low data rates per cell will require more transmission capacity while
footprint carrier grade, data center interconnect type high-density higher transmission rate in the cloud radio access networks will
transponders and cost-effective optical protection switching and
restoration via software defined networking control. The paper demand higher capacity in the underlying (fixed) network in-
also describes a routing and grooming algorithm to dimension a frastructure. In addition, a 3 ms latency constraint in the cloud
shared resource pool for optical restoration and a statistical simu- radio access network can constrain the distance between radio
lation method for time effective execution, simulating failures over head and baseband unit–which will have a direct impact on net-
a carefully selected sub-set of failure scenarios to guarantee a high work architecture [6]. Cell densification will dictate the fiber
point of presence and ultralow latency (<1 ms) would push de-
Manuscript received January 3, 2019; revised May 20, 2019; accepted May ployment of fiber into 5G mobile base stations thus resulting in
27, 2019. Date of publication May 31, 2019; date of current version July
31, 2019. This work was supported in part by the H2020 METRO-HAUL deep fiber penetration (in the range of 500 m maximum), mo-
project under Grant 761727, in part by FCT/MEC through national funds, tivating the adoption of a new fronthaul/backhaul fiber based
and in part by FEDER—PT2020 partnership agreement under the Project network. Overall, this will imply deploying significantly more
UID/EEA/50008/2019. (Corresponding author: Bodhisattwa Gangopadhyay.)
B. Gangopadhyay is with Infinera Portugal, 2790-078 Carnaxide, Portugal optical fiber and likely adopting a new flavor of optical transport
(e-mail: BGangopadhya@infinera.com). technologies, characterized by high capacity and flexibility. The
J. Pedro is with Infinera Portugal, 2790-078 Carnaxide, Portugal, and also focus of this paper is the transport network architecture that will
with the Instituto de Telecomunicações (IT), Instituto Superior Técnico (IST),
1049-001 Lisbon, Portugal (e-mail: JPedro@infinera.com). support future 5G and cloud requirements.
S. Spaelter is with Infinera GmbH, D-81541 Munich, Germany (e-mail: A highly interconnected backbone catering 5G fiber-optic
SSpaelter@infinera.com). backhaul means seamless streaming of bandwidth-intensive ap-
Color versions of one or more of the figures in this paper are available online
at http://ieeexplore.ieee.org. plications because only fiber will be able to provide capacity
Digital Object Identifier 10.1109/JLT.2019.2920305 for the most demanding Peak Data Rates bandwidth required
0733-8724 © 2019 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
GANGOPADHYAY et al.: 5G-READY MULTI-FAILURE RESILIENT AND COST-EFFECTIVE TRANSPORT NETWORKS 4063

network catering to the 5G world: (1) a cost-effective network


architecture with least TCO to conclude upon; (2) a proper
routing and grooming algorithm to dimension the network re-
sources and (3) a time-effective method to cover a maximum
number of failure scenarios, which is essential when not all pro-
tection/restoration mechanisms are pre-planned.
Coming to the first point, in the pre-5G era, traditional
network architectures involving OTN switches accomplish
high resiliency at the expense of significantly increased TCO,
hampering scalability [13]. To elaborate, conventional highly
resilient network architectures involve OTN switches, L1
protection switching and L1 restoration. It has been shown
that, in these conditions, multi-failure survivability requires
a very large amount of additional resources, which translate
into high capital expenditures (CAPEX) and excessive power
Fig. 1. 5G network architecture. and space consumption. As a result, conventional architectures
lack seamless scalability with the continuously increasing
for 5G at a reasonable TCO, while guaranteeing room for future demand for more capacity and higher resiliency. Hence, there
capacity needs (i.e., beyond 5G). With per channel data rates is strong motivation to develop scalable and more cost-effective
above 100 Gbps and a very low latency this transmission medium architectures, featuring a higher capability to share restoration
will need to be deployed increasingly closer to the user (i.e., to resources. This leads to the second point, since state-of-the-art
the antenna in the case of 5G networks) [7]. With wide scale 5G routing and resource allocation algorithms are not tailored to
deployments expected to start already in 2020, it is paramount address the additional requirement to dimension this shared
that transport networks evolve to become an enabler of 5G net- resource pool. Finally, dimensioning a network considering
works and its KPIs, as illustrated in Fig. 1. Therefore, guaran- all possible failure scenarios is a computationally intensive
teeing that transport networks are an integral part of the solution task. For example, simulating a minimum of all two-fiber cut
is a current focus of extensive research [8], [9]. combinations in a network of 71 links, already results in sim-
Measurable KPIs in the domain of optical networks can be ulating 2485 restoration scenarios, each potentially involving
defined in terms of capacity, spectral efficiency, cost, reach, con- the rerouting of several hundreds of traffic demands. However,
nectivity, flexibility and resiliency and these are vital in deliv- dimensioning being one of the key aspects when it comes to
ering cost-effective and efficient network design and operations minimizing TCO, a fast and scalable simulation method is a
[10]. Network availability or network resiliency represented and valuable tool to estimate resulting failure resiliency.
will continue to be an utmost important KPI for many network A suitable alternative to the conventional network archi-
operators both in fixed and mobile networks and optical net- tecture exploits data center interconnection (DCI) optimized
works are no exception to it [11]. Especially in fast developing interface technology complemented by reconfigurable optical
countries, heavy construction of roads and other infrastructures add/drop multiplexer (ROADM)-based optical switching, col-
can critically challenge network availability, posing a serious orless/directionless (CD) add/drop terminations and a virtual-
threat to the communications infrastructure. In particular, opti- ized control plane for multi-layer protection and restoration.
cal fiber suffers from repeated and unpredictable cuts inflicted To make the architecture more cost-effective, protection and
by the construction industry to the extent that traffic is simulta- restoration switching can be moved from Layer 1 to Layer 0,
neously interrupted in up to 10% of the optical links [12]. As thus requiring a shared pool of regenerators (3Rs) besides the
a consequence, network resilience to fiber failures represents a colorless/directionless add/drop terminations [12]. This paper
top priority for operators. Thus, depending on the SLA oper- extends the work reported in [12] and (1) explores this alterna-
ators choose from various resiliency options – e.g., single- vs tive hyperscale network architecture (built upon carrier-grade,
multi-layer, dedicated protection vs shared restoration – taking DCI focused high-density transponders and cost-effective opti-
into account fundamental factors such as the resulting network cal protection switching and restoration) adaptable and resilient
availability and TCO. However, it brings us to a few questions: to multiple fiber failures, delivering committed service levels
how does cost relate to network resiliency and how does it scale to end-users and significantly reducing network TCO compared
with the forecasted increase in required capacity? Or, what com- to traditional all OTN-switched network; (2) proposes a rout-
bination of network elements should be considered in the net- ing and grooming algorithm to dimension the network and the
work architecture such that in any given scenario it can translate shared resource pool for optical restoration and (3) proposes a
into cost-effective optical networks that achieve certain func- statistical simulation method for time effective execution, sim-
tionality or an augmented resiliency? And furthermore, what ulating failures over a carefully selected sub-set to guarantee a
are the required algorithms or algorithmic extensions that can high degree of network availability.
efficiently dimension a network with least TCO but addressing The paper is organized as follows. Section II outlines the
all the inputs and constraints? network architecture depicting the conventional OTN switches,
Typically, network operators are thus looking at a three dimen- high density DCI-oriented muxponders (a key component of
sional problem while constructing a state-of-the-art transport the hyperscale architecture), and the hyperscale architecture
4064 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 37, NO. 16, AUGUST 15, 2019

which employs DCI-oriented muxponders with ROADM traffic frame-by-frame on every port at any layer (ranging be-
based optical switching, CD add/drop terminations, a virtualized tween L1 and L2.5) and change the way traffic is managed
control plane for multi-layer protection and restoration, as well dynamically without the need to deploy new protocol specific
as with a shared pool of 3R regenerators. Section III describes hardware. Figure 2 illustrates the concept of employing a uni-
design algorithms to: (i) dimension the network with dedicated versal switching fabric.
resources for working and protection; (ii) dimension a shared The advantages of universal OTN switching include: (i) elim-
regenerator resource pool used for restoration and (iii) cover inating all switching boundaries and restrictions by allowing na-
maximum number of failure scenarios used for regenerator pool tive switching and interworking on every universal module port;
dimensioning. Section IV deals with the network simulation re- (ii) maximizing network resources by dynamically adapting to
sults and comparison, which validate the proposed transport ar- changing conditions through a universal matrix; (iii) improving
chitecture. Finally, Section V presents concluding remarks. overall network efficiency by providing aggregated handoffs to
routers and maximizing utilization of both metro and core wave-
II. TRANSPORT NETWORK ARCHITECTURE lengths; (iv) providing traffic grooming at any layer irrespective
of the physical interface and enable grooming of multiple pro-
As we move towards the 5G-era, technological bottlenecks
tocols on a single port; and (v) reducing CAPEX and OPEX
and operational rigidities are to be eradicated to scale up to the
with a minimum number of highly flexible multi-service uni-
traffic volume and quality expected. Optical networks are no
versal interface cards that support all applications [18]. Due to
exception to the same. Developments in the transport network
this, OTN switching is still seen as paramount in future network
specifically in the optical communication world have seen dif-
architectures [14].
ferent network architectures being proposed and opted for. OTN
Taking advantage of grooming in electrical layer, optical
switch based, Muxponder based, Router based, with or without
channel data unit (ODU) protection switching in sub-50 ms and
optical bypass and the list goes on. Obviously, each has their pros
restoration switching within few hundreds of milliseconds could
and cons. However, studies over the last years made it very clear
be achieved in an all OTN switch based architecture contrary to a
that, there is no single winner in all conditions and a mix and
pure transponder/muxponder solution. However, up to recently,
match of a few helps to define the ideal one for the network being
traffic based on 10G or lower client port speeds were being
examined. Recent year’s development around the fast growing
aggregated into 100G wavelengths and thus traffic collection,
DCI space have focused on high density low footprint muxpon-
grooming, switching, protection and restoration were naturally
ders [13]. This forms an important part of the network architec-
executed at (electrical) L1 at the OTN nodes. While there is a
ture proposed in this paper [12]. Alongside, the universal OTN
significant difference between the client and line rate and ex-
switch deserves mention being one of the only kind capable
ists a mesh network with fragmented traffic distribution, OTN
to handle all traffic in a single switching domain, thus enable
switching is a must taking advantage of traffic grooming and
to maximize optical channel fill ratio [14]. In addition, higher
delivering a leaner TCO. But, explosive traffic growth, hinting
order modulation formats and higher symbol rates enabled by
at 100G becoming yesterday’s 10G, is a matter of concern [19].
next-generation line interfaces also have a relevant role in re-
As the client rate (100 GbE, 400 GbE, …) is gearing up to
ducing TCO [15]. In the following sections, we overview each
match with line rates, grooming might deliver less significant
of the key transport network architectures.
savings in TCO and a leaner network element might make more
Previous works have established the benefits of optical layer
economic sense. Particularly, an OTN switch might not be the
restoration in terms of resource savings [16]. Still, concerns exist
best choice at every transport node, as the larger the cross-
regarding the failure recovery times when using this approach.
connect, the larger is space and power consumption [20]. In-
It is noteworthy that, centralized virtualized control plane is a
dustry trends shows already 25 Tbps switching matrix in the
cornerstone to the cost-effective solution delivering the highly
horizon, but given its expected cost, footprint and power con-
resilient network where the benefits of the distributed gener-
sumption, it is not sustainable to deploy this functionality at ev-
alized multiprotocol label switching (GMPLS) are preserved
ery node. Still, OTN switches could be useful at the network edge
and the challenges of future networks are addressed and im-
for multi-service and low-rate aggregation onto an OTN infras-
proved resilience is provided through multi-layer L0-L3 co-
tructure and today’s unpredictable bandwidth demands require
ordination in multi-service and multi-domain networks. Com-
flexible network architectures to quickly adapt to changing traf-
pared to traditional distributed automatically switched optical
fic patterns. For example, fabric-based switching solutions are
network (ASON)-GMPLS, this L0 restoration based on cen-
well-suited for metro enterprise networks that require flexible,
tralized ASON delivers a sub-50 ms switching for restoration
on-demand, any-to-any core site connectivity.
once the failure is triggered, by searching and setting up a new
maximum disjoint restoration path after any 50-ms protection
switching event. This scheme is further detailed in Section II.C. B. DCI-Cloud Oriented High-Density Muxponder
Based Architecture
A. OTN Switch Based Architecture
Recent trends in DCI-cloud oriented muxponders saw the in-
Innovations in OTN switches are advancing by leaps and troduction of disaggregated platform based muxponders deliv-
bounds, offering an innovative approach to networking solutions ering up to 9.6 Tbps in 1 rack unit (RU) as per industry standards
by providing completely protocol agnostic switching capabili- [21]. These muxponders simplified the provisioning of 10G,
ties [17]. With these solutions, network operators can groom 40G, and 100G Wide Area Network (WAN) cloud connectivity
GANGOPADHYAY et al.: 5G-READY MULTI-FAILURE RESILIENT AND COST-EFFECTIVE TRANSPORT NETWORKS 4065

Fig. 2. Universal OTN switching concept.

Fig. 4. State-of-the-art DCI-oriented high density optical system.

the flexibility to operate at modes with lower-order modulation


Fig. 3. Coherent "pizza box" architecture.
(from QPSK to 64QAM) that provide a software-programmable
tradeoff between reach and data rate to use the same equipment
services for metro, regional, and long-haul DWDM transport for metro, regional and long-haul links. The programmability
applications (Fig. 4). Designed to meet the scalability require- also allows the interfaces to be used as shared 3R resource pool.
ments of today’s cloud and data center networks, these muxpon- There are many factors contributing to the introduction of
ders feature the industry’s most compelling pay-as-you-grow ap- such a high throughput system, including the latest innovations
proach that enables the lowest initial costs, reduced equipment in high speed optics, photonic/electrical integration, and silicon
sparing costs, and cost-effective scalability. photonics. Modularity is a key feature. State-of-the-art consists
With data centers rapidly evolving, network equipment man- of a three tier modular architecture where four service slots in
ufacturers are developing these disaggregated interface port 1RU chassis support up to four single-slot sleds/modules or two
shelves, thus delivering high data volume across point-to-point double-slot sleds/modules that are field replaceable, individually
links between data centers at reasonably low cost. The intro- configurable, and hot swappable. Supporting 1.6 Tbps client and
duction of these high density disaggregated network elements 1.6 Tbps line traffic in a compact 1RU high density and scal-
(muxponders and transponders) to deliver data between node- able platform cost effectively scales DC transport capacity up
pairs is revolutionizing the optical networking space due to: (i) to 25.6 Tbps per fiber and up to 134 Tbps per 42RU rack. This
their simplified model and architecture; (ii) lowest footprint in enables lowest first cost 10G, 40G and 100G services on the
terms of both space and power. Industry leading examples exist- client side and 100G up to 800G on the line side. Complementing
ing today boasts of delivering the said 9.6 Tbps of capacity in a the pay-as-you-grow, the feature of power-as-you-grow guaran-
compact 1RU form factor with an ultra-low power consumption tees that there is no phantom power drain for unused capacity.
of 0.2 W per Gbps of data. Another key aspect lies in the fact that these network elements
Figure 3 shows an example of disaggregated implementa- are designed as an open platform for rapid introduction and in-
tion in a standardized “pizza box” form factor [22]. This is tegration within any DC or telecom operating environment and
a 2.4 Tbps coherent “pizza box” internal architecture where seamless interoperability with existing metro or long-haul line
24-port client interface card on left goes under line interface system [23].
card on right so that connectors are aligned and both client and
C. Optical Hyperscale Carrier Architecture
line ports are on the same edge of the box. Next generation coher-
ent interfaces, including both DSPs and optics, offers significant Around 2022, both 5G and cloud traffic will push traffic
improvements in performance, density, and power, and will have growth to 4.8 ZB/yr from all connected devices, contextual
4066 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 37, NO. 16, AUGUST 15, 2019

Fig. 5. Hyperscale Carrier Architecture optical transport solution.

automated experiences, 8K video and user-generated content


Fig. 6. Traditional architecture based on OTN switches with L1 shared restora-
from cloud hosting [24]. Client traffic rates are pushing from tion vs hyperscale carrier architecture with L0 shared restoration.
10G towards 100G and 400G. As the market grew from 10G
towards 100G channels, anything over a 40% filling ratio was
switched to a higher data rate (example, 4 × 10G was revised can switch within 50 ms to this already built path. A traditional
to be deliver from 100G router port). To process such a volume OCh restoration scheme cannot comply with this strict recovery
of data in the traditional all OTN based network architecture, time, since the path is only provisioned after the failure occurs
non-blocking switching capacity requirements of up to 25 Tbps and the process is triggered. This scheme benefits operators by:
will be required today in the OTN switch and the same will (i) delivering a high network availability at the expense of mini-
be needed to scale to 100 Tbps in the next few years to provide mum network resources; (ii) delivering high optical performance
dedicated working and protection paths and augmented network margins and/or exploiting higher order modulation formats by
resiliency at the electrical layer with fast recovery time in the using the shared 3R regenerators during a restoration operation
event of multiple and unpredictable network failures. However which often requires a longer path; and (iii) operating the net-
due to operational constraints (e.g., space and power consump- work with low blocking probability by also using the shared
tion), augmented resiliency and the operator’s imperative of min- 3Rs as wavelength converters aided by flexi-grid capable, CD
imizing TCO, an alternative architecture is clearly required. or colorless/directionless/contentionless (CDC) ROADMs [25].
The hyperscale carrier architecture (HCA) is a disruptive The Transport Controller residing within the SDN controller
and alternative solution that relies on the aforementioned forms the core of the virtualized Control plane and virtual-
pizza box architecture (which originally only targeted the DCI ized/centralized ASON service. Based on the requirement of
segment) complemented by a centralized virtual solution for being a multi-domain multi-layer control plane, a northbound
multi-layer protection and restoration, as exemplified in Fig. 5. REST API is an absolute necessity allowing service provi-
HCA promises carrier-grade, high throughput and multi-failure sioning over single or multiple domains. During operation, the
resilience with significantly reduced TCO when compared to network state is continuously supervised and updated in the
traditional architecture based on OTN switches. Firstly, TCO transport controller. With the variety of services running across
is reduced by replacing bulky OTN switches by cost-effective, different domains and layers, a service manager is required to co-
high-density DCI-type port shelves. Secondly, as illustrated in ordinate, monitor and update the status of these services which
Fig. 6, protection and restoration switching are moved from resides within the transport controller. During a failure in the
Layer 1 to Layer 0, resulting in an optimized cost structure. network, alarms are raised and the affected services are identi-
As discussed in earlier work [12], this enables to make a better fied. Restoration paths for these services are calculated in the
usage of additional line interfaces deployed in shared pools, path computation engine which is always updated with the ac-
thus reducing their number. tual network state with detailed fault status and other factors
A key ingredient of HCA is the virtualized control plane. It such as utilization, domain specific parameters (resource class),
ensures permanent Optical Channel (OCh) restoration by search- trunk costs, etc. The restoration path computed can thus avoid
ing for a new maximum disjoint restoration path after any 50-ms the failed network resources while stay on an optimized end to
protection switching event. For an equivalent resiliency level end route and further being disjoint from the nominal working
with OTN switching, line interfaces need to be placed in ev- and protection paths, assuming a service is having a “permanent
ery possible direction and, as a consequence, the required port 1 + 1” SLA. The service manager thus can instruct a provision-
count increases significantly (see Fig. 6). In detail, a centralized ing engine to proceed with the path configuration according to
SDN-based platform assists in switching within 50 ms to an the calculated restoration path in a time effective manner where
already built disjoint protection/restoration path when a failure the provisioning engine can execute the path set up with min-
is encountered and at the same instance search and provision a imum network configuration actions. The provisioning engine
new (third or more) path which is disjoint to the current path takes a two-step approach where in (i) compares failed route to
of the traffic. This ensures that, on the next failure, the service new restoration route to identify the commonalities in each of
GANGOPADHYAY et al.: 5G-READY MULTI-FAILURE RESILIENT AND COST-EFFECTIVE TRANSPORT NETWORKS 4067

of having appropriate design algorithms, in order to be able to


materialize the objectives of HCA in each specific network and
traffic scenario. In the following, a set of three algorithms is
considered. Particularly, these algorithms target to: (i) optimize
resource dimensioning for working and protection paths, min-
imizing the number of line interfaces deployed; (ii) optimize
resource dimensioning for restoration paths and (iii) speed up
failure simulation by explicitly simulating only a sub-set of fail-
ure combinations.
The first algorithm is based on the multi-layer optimization
framework [14] compatible with flexible-rate line interfaces and
is utilized to perform traffic grooming, optical channel routing
and wavelength assignment tasks, which are key to minimize
the number of expensive line interfaces.
Fig. 7. Centralized virtualized control plane benefits. BASELINE ROUTING, GROOMING AND SPECTRUM ASSIGNMENT

them to identify where it must tear down network configurations


and rig up new ones for the restoration route and in (ii) issues
commands for tear down actions followed by rig up actions. This
parallel processing is key to speed up the restoration path set-up.
Further insight can be gained by a qualitative comparison
between the centralized virtualized control plane and the tradi-
tional distributed GMPLS control plane (see Fig. 7). In a dis-
tributed GMPLS control plane, all the network elements need
detailed network information along with computational abili-
ties to calculate optical performance and perform path compu-
tation. Introducing a new restoration scheme requires a longer
time as this is closely tied to the telco development cycle.
Further in this traditional scheme, during operation if there
would be a contention among two services, this would only
be discovered after signaling and path set-up and a crank back
would be necessary for re-calculation adding to the delay. Con-
trary to this, the centralized virtualized ASON solves many of
these issues and preserves the benefits. To list a few of them,
improved resilience can be achieved through multi-layer L0-L3
coordination in multi-service and IP-Optical networks. Better
network resource utilization is attained through optimized multi-
layer and multi-domain routes and operations became simplified
through simpler protocols/functions [26]. As any new function-
ality/feature can be introduced through centralized software up-
grade, this allows shorter time-to-market following software de-
velopment cycle.
In summary, HCA for optical transport networks is a cost-
effective alternative to guarantee scalability when supporting
multi-failure resiliency. It achieves this by taking advantage of
high density high capacity cost-effective compact DCI-oriented
muxponders, parsimonious usage of universal OTN switch for
sub-rate <10G, CD/CDC ROADMs, sub-50 ms protection, op-
tical restoration reusing same line interface and shared pool of
3Rs and a centralized virtualized SDN control plane for efficient
multi-layer protection and restoration.

III. NETWORK DESIGN ALGORITHMS


The first part of the problem being covered with the scalable The baseline design algorithm described above is sequential in
and cost-effective HCA architecture, the next challenge consists nature and, therefore, can be utilized both for network planning
4068 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 37, NO. 16, AUGUST 15, 2019

and online operation. For simplicity, the description assumes combinations by expected importance and then shortlist them
that line interfaces can be operated at either 100 G or 200 G, but according to a given budget (i.e., maximum number of failure
it can be easily extended to support a wider array of optical chan- combinations that can be explicitly considered) is described. Al-
nel formats. Grooming of a given traffic demand into a sequence though it cannot guarantee in advance that all the required shared
of existing and/or new lightpaths is modelled by representing in resources are identified, especially if the failure simulation bud-
the auxiliary graph both existing lightpaths with enough free ca- get is very small, it has been recently shown that in a reference
pacity to accommodate the demand and possible (i.e., feasible) network it was possible to approximately dimension the shared
new lightpaths that can be created to carry the demand. This al- resources when the failure simulation budget covers at least 10%
gorithm is used to route all demands and determine the resources of the total number of possible failure combinations [27].
required for working and protection paths. The restoration re-
sources (e.g., number of shared 3Rs at each node) can only be
determined taking into account specific failure scenarios. For
that purpose, the following algorithm was developed.

RESTORATION ROUTING, GROOMING AND SPECTRUM ASSIGNMENT

Importantly, this algorithm must be executed for all the mean- IV. NETWORK SIMULATION STUDIES
ingful link failure combinations in order to determine the total This section aims at quantitatively validate the effectiveness
number and location of the shared resources. For instance, to of the HCA in reference transport networks. The algorithms have
ensure survivability against up to three link failures and assum- been implemented in C language and were run on a computer
ing protection is used to recover from the first link failure, all with an Intel Core i7 and 16 GB RAM. Execution time was found
combinations of two link failures have to be considered and for to be in 10 s of minutes for all the three networks. In this study, all
each one the design algorithm is executed. traffic demands have high resiliency requirements making use of
The effort required to compute the shared resources will grow a 1 + 1 + Restoration scheme with client traffic signals of 100
with the network size, number of traffic demands and number GbE. Under these assumptions, two architectural choices – a
of simultaneous link failures to be considered. This can imply conventional and the HCA – are compared with each other: The
a very long process, especially when the number of simultane- first one is based on OTN switches at all ROADM nodes with
ous link failures the traffic demands must survive to increases. 1 + 1 ODU layer protection for the first failure followed by ODU
Consequently, in order to guarantee that it is possible to design restoration. The second one relies on high-density muxponders
a transport network based on the proposed HCA architecture, at all ROADM nodes (for traffic aggregation or electrical signal
and more generally to design a transport network relying on regeneration) with 1 + 1 client side protection for the first failure
optical restoration, it is useful to investigate methods that can followed by OCh restoration. For both architectures, DWDM
reduce the number of failure combinations that need to be explic- ports (i.e., line interfaces) are dimensioned such that link failure
itly simulated. In the following, an algorithm to sort the failure survivability is about the same in both cases.
GANGOPADHYAY et al.: 5G-READY MULTI-FAILURE RESILIENT AND COST-EFFECTIVE TRANSPORT NETWORKS 4069

Fig. 10. SPN topology.

Fig. 8. IBN topology.

Fig. 11. SPN traffic distribution.

Fig. 9. IBN traffic distribution.

The first network topology considered is a reference opti-


cal transport network defined by Telecom Italia/TIM. Particu-
larly, this Italian national backbone network (IBN) has 44 nodes
and 71 fiber links (mainly G.655, with some G.652 and G.653
fiber types) and has already been used in previous studies [14],
[28], [29]. It meets the needs of circuits at a national level,
mainly for IP router interconnection and for connectivity of Fig. 12. GBN topology.
large size clients. With its shortest routing paths under 2200
km and backup routing paths (disjoint from the shortest one)
under 2600 km, it can be classified as a long-haul transport net- of the traffic among the 5% node pairs in the entire network (69
work. Figure 8 depicts the topology of IBN. More detailed in- node pairs have traffic out of 1176 node-pairs) , as defined in
formation is available in a FP7 IDEALIST project deliverable [30].
[30] and in [31]. In the study reported here, the 100 GbE client The third network covers Germany and is based on the na-
traffic distribution, as illustrated in Fig. 9, amounts to a total tional backbone network operated by Deutsche Telekom. The
traffic load of 27 Tbps. Fig. 9 displays the spatial distribu- German national backbone network (GBN) [33], shown in
tion of the traffic among 13% node pairs in the entire network Fig. 12, has 17 nodes and 26 fiber links, with the longest rout-
(123 node pairs have traffic out of 946 node-pairs), as defined ing path under 2000 km. The 100 GbE client traffic distribution
in [30]. illustrated in Fig. 13 corresponds to an approximate total load of
The second network, whose topology is depicted in Fig. 10, is 23 Tbps. Fig. 13 displays spatial distribution of the traffic among
based on TIM Sparkle Pan-European backbone network (SPN) 85% node pairs in the entire network (116 node pairs have traffic
[31], [32], a geographically expanding network that currently out of 136 node-pairs), as defined in [30].
covers Central, Southern and Eastern Europe with 49 nodes and In this study, the line interfaces utilized operate at 34 Gbaud
72 fiber links. It classifies as an ultra-long-haul network (shortest and support two modulation formats to realize two line rates
paths under 5500 km and backup paths under 7000 km). with different reach: quadrature phase-shift keying (QPSK),
For the purpose of this study, the 100 GbE client traffic dis- used to create 100 Gbps lightpaths, and 16 quadrature amplitude
tribution assumed for SPN, as illustrated in Fig. 11, amounts to modulation (16QAM), enabling 200 Gbps lightpaths. In order
a total load of 18 Tbps. Fig. 11 displays the spatial distribution to estimate the maximum reach with each format, the simplified
4070 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 37, NO. 16, AUGUST 15, 2019

Fig. 13. GBN traffic distribution. Fig. 15. Comparative analysis OTN vs HCA for SPN.

TABLE I
RELATIVE POWER CONSUMPTION, SPACE OCCUPATION
AND CAPEX OF OTN SWITCH AND DCI MUXPONDER

Fig. 16. Comparative analysis OTN vs HCA for GBN.

In the IBN, the HCA provides about 80% savings in DWDM


port count (i.e., line interfaces) for restoration, about 33% sav-
ings in overall DWDM port count, 80% savings in CAPEX, 78%
savings in typical power consumption and 90% savings in foot-
print. It should be noted that the routing solution obtained in this
network mostly relies on 200G lightpaths.
Fig. 14. Comparative analysis OTN vs HCA for IBN.
The dimensioning results from SPN also show consistent sav-
ings when comparing the HCA with a traditional OTN-based ar-
chitecture. Particularly, the hyperscale architecture enables 27%
optical performance model described in [34] is used. This model savings in overall port count, around 50% savings in restoration
is based on the well-known Gaussian noise (GN) for nonlinear port counts, 66% savings in CAPEX, 73% savings in power
interference (NLI) caused by the Kerr effect and includes ad- consumption and 90% savings in footprint. The slightly smaller
ditional penalties and margins to account for optical filtering, advantage of the HCA in this network, when compared to the
crosstalk, aging, polarization dependent losses, among other ef- same results obtained in IBN, can be a consequence of having a
fects not considered in the GN model. smaller total traffic load in SPN. In addition, it should be noticed
The architectures are compared in terms of key metrics such that the routing solution of SPN requires significantly longer
as CAPEX, power consumption and footprint. Table I shows the routing paths. As a result, this solution comprises a majority
elemental values, which have been normalized. Part of the actual of 100G lightpaths. Further details on the number of dedicated
values used are available in [18] for the OTN switch and in [23] and shared lightpaths and their rate can be observed for all three
for the DCI Muxponder. reference networks in Fig. 17.
The dimensioning results are shown in Fig. 14, Fig. 15 and For the GBN, the results are also encouraging, with around
Fig. 16 for the IBN, SPN and GBN, respectively. The figures 75% savings in CAPEX, approximately 90% savings in power
show a comparison between HCA and conventional OTN ar- consumption and 95% savings in footprint. Noteworthy, there
chitecture and the mentioned savings are with respect to OTN are smaller savings in the overall port count (8%) and in the
architecture. In addition to CAPEX, footprint and power con- restoration port count (43%). This may be explained by the fact
sumption, the total number of ports (i.e., line interfaces) required that routing paths are typically small and there is little need for
and the number of ports required for restoration are also shown. intermediate regeneration, decreasing the potential for savings
In both cases, the values are also normalized. in the number of shared 3R devices.
GANGOPADHYAY et al.: 5G-READY MULTI-FAILURE RESILIENT AND COST-EFFECTIVE TRANSPORT NETWORKS 4071

[10] D. King et al., “The dichotomy of distributed and centralized control:


METRO-HAUL, when control planes collide for 5G networks,” Opt.
Switching Netw., vol. 33, pp. 49–55, 2019.
[11] W. Wang and J. Doucette, “On the availability and spare capacity require-
ments of path-restorable transport networks,” in Proc. 10th Int. Workshop
Resilient Netw. Des. Model., Longyearbyen, Norway, 2018, pp. 1–7.
[12] B. Gangopadhyay, J. Pedro, J. Kivimaa, and S. Spaelter, “Multi-failure
resiliency and cost-effectiveness in transport networks: A contradiction?”
presented at the OSA Adv. Photon. Congr.: Photonic Netw. Devices, Zurich,
Switzerland, 2018, Paper NeM2F.2.
[13] B. Gangopadhyay, J. Pedro, and S. Spaelter, “Minimizing the cost and
augmenting the resilience of vulnerable optical transport networks,” pre-
sented at the Opt. Fiber Commun. Conf., Anaheim, CA, USA, 2018, Paper
M1A.6.
Fig. 17. Port count analysis OTN vs HCA. [14] B. Gangopadhyay, J. Pedro, and S. Spaelter, “Cost-effective next-
generation information highways leveraging universal OTN switching and
flexible-rate,” presented at the Opt. Fiber Commun. Conf., Los Angeles,
CA, USA, 2017, Paper Th4F.5.
V. CONCLUSION [15] D. Moniz, J. Pedro, and J. Pires, “Network design framework to optimally
The overriding purpose of this paper is to depict the relative provision services using higher-symbol rate line interfaces,” IEEE/OSA J.
Opt. Commun. Netw., vol. 11, no. 2, pp. A174–A185, Feb. 2019.
importance of the hyperscale architecture complemented by cen- [16] J. Pedro and B. Gangopadhyay, “On the trade-offs between ODU and OCh
tralized virtual control plane in the context of different network preplanned shared restoration in transport networks,” in Proc. 16th Int.
scenarios and traffic conditions. It is evident that DCI concepts Telecommun. Netw. Strategy Planning Symp. (Netw.), Funchal, Portugal,
2014, pp. 1–6.
and technologies will penetrate into traditional carrier network [17] V. Hutcheon, “OTN to enable flexible networks,” presented at the Opt.
driven by 5G and to exploit the benefits from a disaggregated Fiber Commun. Conf. Expo Nat. Fiber Optic Engineers Conf., San Diego,
network architecture. The proposed HCA solution is a similar CA, USA, 2011, Paper NMC1.
[18] Infinera, “mTera Universal transport platform,” Sunnyvale, CA, USA, In-
concept built on the building blocks of DCI-oriented muxpon- finera, White Paper. [Online]. Available: https://cdn.extranet.coriant.com/
ders, a flexible optical line system and a virtualized (central) resources/Data-Sheets/DS_mTera_UTP_74C0035.pdf
control plane for ASON. By means of network simulations over [19] M. Sabelhaus, “100G Metro transformation,” [Online]. Available: http://
www.lightwaveonline.com/articles/print/volume-31/issue-3/features/
reference transport networks, it has been shown that there are 100g-metro-transformation.html
significant savings in CAPEX, footprint and power consumption [20] M. Carroll, J. Roese, and T. Ohara, “The operator’s view of OTN evolu-
when compared to the traditional all OTN switch based architec- tion,” IEEE Commun. Mag., vol. 48, no. 9, pp. 46–52, Sep. 2010.
[21] B. Gangopadhyay, J. Pedro, and S. Spaelter, “Augmented and cost-
ture while maintaining similar network resilience. This provides optimized network resilience exploiting DCI oriented muxponders, OTN
evidence that is feasible to conciliate augmented multi-failure switching and network clustering,” in Proc. 19th ITG-Symp.: Photon.
resiliency and cost-effectiveness in transport networks. An av- Netw., Leipzig, Germany, 2018, pp. 12–17.
[22] Driving down the cost of metro networks using next-generation co-
erage of around 65% savings in CAPEX, around 70% savings in herent technology. [Online]. Available: https://www.lightwaveonline.
power consumption and around 85% savings in space utilization com/optical-tech/electronics/article/16654834/driving-down-the-cost-of-
could be attained delivering a similar degree of failure resiliency. metro-networks-using-nextgeneration-coherent-technology
[23] Infinera, “Coriant groove G30 network disaggregation platform—
Muxponder,” Sunnyvale, CA, USA, Infinera, White Paper. [On-
line]. Available: https://cdn.extranet.coriant.com/resources/Data-Sheets/
REFERENCES DS_Groove_G30_NDP_Muxponder_74C0157.pdf
[1] A. Gupta and R. K. Jha, “A survey of 5G network: Architecture and emerg- [24] Cisco visual networking index (VNI) 2017-2022. [Online].
ing technologies,” IEEE Access, vol. 3, pp. 1206–1232, 2015. Available: https://www.cisco.com/c/en/us/solutions/service-provider/
[2] M. Vaezi and Y. Zhang, Radio Access Network Evolution. in: Cloud Mobile visual-networking-index-vni/index.html
Networks. Wireless Networks. Cham, Switzerland: Springer, 2017. [25] J. Pedro and S. Pato, “Impact of add/drop port utilization flexibility in
[3] Y. Fu, S. Wang, C. Wang, X. Hong, and S. McLaughlin, “Artificial intelli- DWDM networks [Invited],” IEEE/OSA J. Opt. Commun. Netw., vol. 4,
gence to manage network traffic of 5G wireless networks,” IEEE Network, no. 11, pp. B142–B150, Nov. 2012.
vol. 32, no. 6, pp. 58–64, Nov./Dec. 2018. [26] D. Levin, A. Wundsam, B. Heller, N. Handigol, and A. Feldmann, “Log-
[4] G. Song, W. Wang, D. Chen, and T. Jiang, “KPI/KQI-driven coordinated ically centralized?: State distribution trade-offs in software defined net-
multipoint in 5G: Measurements, field trials, and technical solutions,” works,” in Proc 1st Workshop Hot Topics Softw. Defined Netw. New York,
IEEE Wireless Commun., vol. 25, no. 5, pp. 23–29, Oct. 2018. NY, USA, 2012, pp. 1–6.
[5] N. J. Gomes et al., “Boosting 5G through Ethernet: How evolved Fronthaul [27] B. Gangopadhyay, J. Pedro, and S. Spaelter, “Multi-failure resilient and
can take next-generation mobile to the next level,” IEEE Veh. Technol. cost-effective hyper-scale transport networks for the 5G-era,” presented
Mag., vol. 13, no. 1, pp. 74–84, Mar. 2018. at the Opt. Fiber Commun. Conf., Anaheim, CA, USA, 2019, Paper
[6] J. Kani, S. Kuwano, and J. Terada, “Options for future mobile backhaul NeM2F. 2.
and fronthaul,” Opt. Fiber Technol., vol. 26, pp. 42–49, Dec. 2015 [28] N. Costa, J. Pedro, M. Quagliotti, and L. Serra, “Preplanning frame-
[7] R. Muñoz et al., “SDN control and monitoring of SDM/WDM and packet work to evaluate the potential of different modulation formats in
transport networks for 5G fronthaul/backhaul,” in Proc. IEEE Photon. Soc. DWDM networks,” in Proc. 10th Conf. Telecommun., Aveiro, Portugal,
Summer Topical Meeting Ser., Waikoloa Village, HI, USA, 2018, pp. 151– 2015, pp. 1–4.
152. [29] M. Schiano and M. Quagliotti, “Lambda switched future photonic net-
[8] K. Mullaley, “5G networks’ impact on fiber-optic cabling require- work development,” presented at the Opt. Fiber Commun. Conf. and Expo
ments,” Cabling Installation Maintenance, Aug. 2018. [Online]. Avail- Nat. Fiber Optic Engineers Conf., Los Angeles, CA, USA, 2012, Paper
able: https://www.cablinginstall.com/wireless-5g/article/16468593/5g- OW4A.4.
networks-impact-on-fiberoptic-cabling-requirements [30] FP7 IDEALIST, “Elastic optical network architecture: Reference sce-
[9] J. Baranda et al., “Orchestration of end-to-end network services in the nario, cost and planning,” Deliverable D1.1. [Online]. Available: http://
5G-crosshaul multi-domain multi-technology transport network,” IEEE cordis.europa.eu/docs/projects/cnect/9/317999/080/deliverables/001-
Commun. Mag., vol. 56, no. 7, pp. 184–191, Jul. 2018. D11ElasticOpticalNetworkArchitecture.doc
4072 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 37, NO. 16, AUGUST 15, 2019

[31] A. Eira, M. Quagliotti, and J. Pedro, “Impact of client-and line-side flex- João Pedro received the M.Sc. and the Ph.D. degrees in electrical and computer
ibility in the lifecycle of next generation transport networks [Invited],” engineering from Instituto Superior Técnico (IST), University of Lisbon, Lisbon,
IEEE/OSA J. Opt. Commun. Netw., vol. 8, no. 7, pp. A101–A115, Jul. Portugal. He was a Research Engineer and a System Architect for Network Plan-
2016. ning Tools with Nokia Siemens Networks from 2008 to 2013 and with Coriant
[32] Sparkle. [Online]. Available: http://www.tisparkle.com/ from 2013 to 2018. He currently leads the Multilayer Performance Optimization
[33] C. Politi et al., “Integrated design and operation of a transparent optical team with Infinera, Carnaxide, Portugal, which investigates future-looking op-
network: A systematic approach to include physical layer awareness and tical and multi-layer network architectures and design algorithms and supports
cost function,” IEEE Commun. Mag., vol. 45, no. 2, pp. 40–47, Feb. 2007. both algorithmic development and optical performance for planning tools and
[34] J. Pedro, “Designing transparent flexible-grid optical networks for max- SDN controller. He has also been a lecturer of courses on network planning
imum spectral efficiency [Invited],” IEEE/OSA J. Opt. Commun. Netw., and transport networks. He holds 6 patent applications and has co-authored over
vol. 9, no. 4, pp. C35–C44, Apr. 2017. 150 publications in international conferences and journals, participated in EU
projects FUTON, IDEALIST, and METRO-HAUL. Currently, his research in-
terests include high capacity optical networks, node architectures, routing and
spectrum assignment, multi-layer optimization, disaggregated networks, and
machine learning applications.
Dr. Pedro served as a reviewer for IEEE/OSA JOURNAL OF OPTICAL COM-
MUNICATIONS AND NETWORKING, JOURNAL OF LIGHTWAVE TECHNOLOGY, and
Bodhisattwa Gangopadhyay was born in Haldia, India, in 1981. He received IEEE PHOTONICS TECHNOLOGY LETTERS. He has also served as a TPC member
the B.E. degree in instrumentation technology from Siddaganga Institute of
of OFC from 2015 to 2018.
Technology, Visvesvaraya Technological University, Belgaum, India, in 2005
and the M.Tech degree in digital electronics and advanced communication from
Manipal University, Manipal, India, in 2007. After his initial years as an In-
tern, an Engineer, and a Ph.D. student (2006–2011) with Siemens (2006–2007)
and with Nokia Siemens Networks (2007–2011), he served as Research Engi-
neer, System Architect, and Solutions Consultant with Nokia Siemens Networks
(2011–2013) and Coriant (2013–2018) for network planning tools, network plan-
ning services, network migrations, multi-layer planning, and optimization of cus-
tomer networks. At present, he is a Principal Solution Consultant with Infinera, Stefan Spaelter received the Ph.D. degree in the field of quantum fiber optics.
Carnaxide, Portugal, where he is involved in multi-layer planning and optimiza- He has held various leadership positions in research, system engineering, op-
tion of transport networks, focusing toward 5G. He has authored several book tical engineering, and technical sales with Lucent Bell Labs, Siemens, Nokia
chapters, journals, and conference papers in these fields. He has participated in Siemens Networks, and Coriant. He is currently the Vice President of Solutions
EU project FUTON. His research interests include transport networks for 5G Product Line Management with Infinera, Munich, Germany, where he is globally
roll outs and planning/operation of network protection/restoration in multi-layer responsible for defining innovative and competitive solutions to packet optical
networks (L3-L0). transport networks. To his role, he brings more than 25 years of experience in
Mr. Gangopadhyay has served as a reviewer for JOURNAL OF LIGHTWAVE optical communications. He has published more than 75 reviewed conference
TECHNOLOGY. papers and journal articles.

You might also like