Download as pdf or txt
Download as pdf or txt
You are on page 1of 9

830 Vol. 15, No.

11 / November 2023 / Journal of Optical Communications and Networking Research Article

Building a digital twin of an EDFA for optical


networks: a gray-box modeling approach
Yichen Liu,1 Xiaomin Liu,1 Yihao Zhang,1 Meng Cai,1 Mengfan Fu,1 Xueying Zhong,1
Lilin Yi,1 Weisheng Hu,1,2 AND Qunbi Zhuge1,2, *
1
State Key Laboratory of Advanced Optical Communication Systems and Networks, Department of Electronic Engineering,
Shanghai Jiao Tong University, Shanghai 200240, China
2
Peng Cheng Laboratory, Shenzhen 518000, China
*qunbi.zhuge@sjtu.edu.cn

Received 3 July 2023; revised 26 September 2023; accepted 26 September 2023; published 12 October 2023

High-accuracy physical layer models enable intelligent, self-driving optical networks. The dynamic wavelength-
dependent gain characteristics of erbium-doped fiber amplifiers (EDFAs) remain a crucial problem in terms of
modeling. The gain model directly determines the power spectrum and is therefore important for estimating
the optical signal-to-noise ratio as well as the magnitude of fiber nonlinearities. Black-box data-driven models
have been widely studied, but they require a large size of data for training and suffer from poor generalizability.
In this paper, we derive the gain spectra of EDFAs as a simple univariable linear function; then, based on it, we
propose a gray-box EDFA gain modeling scheme. Experimental results show that, for automatic gain control
(AGC) and automatic power control (APC) EDFAs, our model built with 8 data samples can achieve better per-
formance than the neural network (NN) based model built with 900 data samples, which means the required data
size for modeling can be reduced by at least 2 orders of magnitude. Moreover, in the experiment, the proposed
model demonstrates superior generalizability to unseen scenarios since it is based on the underlying physics
of EDFAs. With the proposed scheme, building a customized digital twin of each EDFA in optical networks
becomes more feasible, which is essential, especially for next-generation multiband network operations. © 2023
Optica Publishing Group

https://doi.org/10.1364/JOCN.499530

1. INTRODUCTION because the behavior of EDFAs directly determines the opti-


To support various emerging Internet applications, such as cal signal-to-noise ratio (OSNR) of a transmitted signal
virtual/augmented reality, cloud services, and video streaming, [12]. Moreover, the optical power at the output of an EDFA
optical networks are facing an explosive growth in capacity determines the magnitude of fiber nonlinearities [13] in the
demand. Over the past decade, technologies such as modu- following fiber span. These are the major impairments in a
lation and coding [1] and digital signal processing (DSP) long-haul optical transmission system. Therefore, the accuracy
[2] have been developed rapidly. The capacity of optical of EDFA models largely determines whether the quality of
transmission systems is now close to the Shannon limit [3]. transmission can be accurately estimated.
In order to further exploit the network capacity, more effi- On the other hand, to provide more spectrum resources,
cient network control and management is required. In this multiband optical transmission systems are being commercial-
context, novel network operation paradigms including the ized [14]. In a multiband system, multiple rare-earth-doped
software-defined network (SDN) [4], elastic optical network fiber amplifiers are utilized to amplify signals on differ-
(EON) [5], and self-driving optical network [6] have been ent wave bands [15]. The operating conditions of these
investigated and developed. These control and management amplifiers should be jointly optimized to ensure the trans-
strategies bring out the necessity to acquire accurate and fine mission performance in the presence of interband stimulated
models of the physical layer [7]. The most significant challenge Raman scattering [16]. In this scenario, the modeling of fiber
in constructing such physical layer models lies in evaluating amplifiers become even more critical.
the complex physical processes as signals pass through optical We focus on the modeling of an EDFA’s gain. In submarine
components including fibers, optical amplifiers, wavelength systems, EDFAs often work in an automatic current control
selective switches (WSSs), and optical transceivers [8,9]. mode, maintaining a constant pump power. The gain models
Among the physical layer models, the model of the erbium- for these EDFAs have been derived in [17–20]. In terrestrial
doped fiber amplifier (EDFA) is critical [10,11]. This is optical fiber systems, EDFAs often work in an automatic gain

1943-0620/23/110830-09 Journal © 2023 Optica Publishing Group


Research Article Vol. 15, No. 11 / November 2023 / Journal of Optical Communications and Networking 831

control (AGC) or automatic power control (APC) mode. In a digital twin model for each individual EDFA. The perform-
these cases, the pump power is dynamically adjusted to main- ance of this model is verified on our experimentally measured
tain an average gain or total output power over the loaded dataset and a public dataset [35]. Compared with the pure
channels [21]. In this paper, EDFAs operated under AGC or NN-based model, the proposed model reduces the required
APC mode are considered. The EDFA gain characteristics are data size by at least 2 orders of magnitude while achieving
wavelength- and input-dependent. First, the gain of an EDFA better accuracy and generalizability.
is a nonflat spectrum [22,23]. Multiple underlying physics The remaining part of the paper is structured as follows. In
such as the homogeneous and inhomogeneous broadening Section 2, the underlying physics of the EDFA is introduced,
[23] result in a wavelength-dependent gain characteristic. and the EDFA gain characterization is analyzed. In Section 3,
Second, the gain spectrum varies significantly on different the proposed gray-box modeling scheme is described. In
input signal configurations and device settings [24]. For AGC Section 4, the experimental setup and dataset are presented. In
or APC EDFAs, the pump power varies with input, leading Section 5, the superior performance of the modeling scheme
to a dynamic gain characteristic. Additionally, the character- is demonstrated on our experimental dataset and the public
istics of various physical devices are usually inconsistent due dataset.
to the diversity in physical design and variance in fabrication
[25]. This further necessitates the requirement for customized 2. EDFA GAIN CHARACTERISTICS
modeling of each individual EDFA.
Many investigations on modeling EDFAs operated with The EDFA gain spectrum is a complex function of input signal
nonconstant pump powers have been reported. Analytical spectrum, EDFA settings, and multiple underlying parameters,
models for erbium-doped fibers (EDFs) are proposed in which are generally difficult to acquire. In this section, we first
[22,23]. These models can be accurate, but the underlying describe the EDFA’s underlying physics and mathematics by
parameters are difficult to obtain. Moreover, as the structure introducing its structure and the process of optical amplifica-
of an EDFA, the cross-section parameters of the EDF, and the tion. Afterward, we derive a simple expression with only one
pump powers are usually inaccessible in real networks, these dynamic variable to describe the EDFA gain behaviors. The
models are difficult to deploy in practice. Another analytical simplified expression will serve as the basic framework of the
model for AGC EDFA gain estimation is proposed in [26]. proposed gray-box EDFA gain model.
Based on only a few experimental measurements, a simple
equation is used to obtain the gain spectrum. However, the A. Typical Structure of an EDFA
model is derived on the basis of several ideal assumptions,
The structure of a typical single-stage EDFA [36] is shown in
and it is shown that the model’s accuracy is not satisfactory in
Fig. 1. The key component is an EDF, in which the signal light
practical applications [10]. In recent years, data-driven EDFA is amplified through the process of the stimulated emission
gain models [10,27,28] have attracted increasing attention transition with pump light. At the input and output of the
for their ability to provide accurate estimations. For instance, EDF, isolators are adopted to eliminate the effects of reflected
with a relatively large dataset (about 40,000 data samples in light, such as self-excited oscillation. A gain flattening filter
[27] and 12,000 data samples in [10]) for training, neural (GFF) is usually used to reduce the wavelength-dependent
networks (NNs) are used to estimate the gain or noise power variation of gain [37]. A control circuit is employed to monitor
spectrum with a root mean square error (RMSE) below 0.1 dB. the input and output power of the EDFA and then adjust the
Some other NN-based models are further proposed to improve powers of the pumps. When the EDFA is set to work in an
the model’s generalizability [25], fasten the training process AGC or APC mode, the powers of the pumps are dynamically
[10], reduce the required training dataset [29–31], or improve adjusted to maintain a total gain for AGC or total output
the modeling accuracy [32,33]. Data-driven models have power for APC, respectively.
also found applications in optical line system modeling. For To achieve a larger gain and/or a lower noise figure (NF),
instance, in [34], a system model is established using telemetry a multistage structure of an EDFA is often commercially
data, thus enabling accurate estimation of signal spectral pow- adopted [38]. In such a structure, EDFs cascade and each stage
ers and OSNR. Nevertheless, implementing black-box models may contain GFFs and optical attenuators.
in a system is not straightforward. This is primarily because In the following, the gain characteristics of the multistage
the large dataset required to train these models is typically too EDFAs under different input signals and amplifier settings are
costly to obtain in real systems. In addition, the NN-based analyzed. When the input signal and amplifier setting vary,
model is generally a black-box model with poor interpretabil- most of the internal structures such as the isolator, GFF, and
ity and generalizability, making it difficult to guarantee its coupler, are generally considered to provide a static response.
performance in complex and dynamic systems. The EDFA gain variation is mainly caused by the dynamic
In this work, we aim at providing an easy-to-implement gain
model of an EDFA by greatly reducing the required data size
Input Output
and improving the model’s generalizability. We analyze the
Isolator Coupler EDF Coupler Isolator GFF
physics of a typical class of EDFAs and show that their gain
spectra are functions of a single independent variable. Then, Control Circuit
based on this physical derivation, we propose an accurate Forward Pump Backward Pump

gray-box model, which only needs 8 data samples to customize Fig. 1. Structure of a typical single-stage EDFA.
832 Vol. 15, No. 11 / November 2023 / Journal of Optical Communications and Networking Research Article

pump power adjustment and the EDF response variation all the terms in bold are vectors representing spectra over the
with different signals and pumps. Next, we will introduce the operating waveband. Each element in the vectors is the value
analytical expression of the EDF’s response. for a specific channel.
The previous models [22,23] have drawn the conclusion
B. Gain Characterization of the EDF
that knowing the value of a single variable is sufficient to deter-
mine the gain spectrum. Additionally, the conclusion enables
A widely applied model of Saleh et al. [22] accurately predicts the applications such as determining the gain coefficient of an
the evolution of signal power along the EDF, which can be EDFA [40]. Some simplified EDFA models are also provided
expressed as [41] based on this conclusion; however, ideal assumptions such
d Pλ (z) as a constant gain tilt may induce inaccuracy. In this paper,
= ρ0λ σe ,λ N2 (z) − σa ,λ N1 (z) − αs ,
 
(1) we would further extend this conclusion to AGC/APC EDFA
d z · Pλ (z) modeling through mathematical derivation.
where ρ is the Er3+ ion concentration, and αs is the back-
ground loss of the signal. σa ,λ and σe ,λ are the absorption and C. Gain Characterization of an EDFA
emission cross-sections at wavelength λ, respectively. 0λ is the
signal overlap factor at wavelength λ. N1 (z) and N2 (z) are Next, the gain spectrum of a complete multistage EDFA
the lower and upper levels’ fractional population of the Er3+ is derived, considering all the internal components of an
ion concentration at fiber length z, respectively, which are EDFA. The response of the ith stage of GFFs, couplers, etc., is
related to the signal and pump power at fiber length z [22]. regarded as a constant insertion loss spectrum, represented by
For a two-level EDFA [23], only the two energy levels of the Lossi . The overall gain spectrum of an n-stage EDFA can be
Er3+ ions are used, which means N1 (z) + N2 (z) = 1. In fact, expressed as
the two-level system model is a simplification neglecting the n
X
population in the third energy level. The simplification is rea- Goverall = ( A i · hN2 i L i + B i · L i + Lossi ). (4)
sonable because of the rapid thermalization in the metastable i=1
manifold. The model described in Eq. (1) is the homogeneous
model of the EDF. Inhomogeneous broadening effects, such For some EDFAs, the EDFs of each stage are of the same
as spectral hole burning [39], are not included. Research [17] specification, which is the case with the EDFA used in our
has shown that the homogeneous model can provide good experiment. In this case, the parameters A i and B i are the
accuracy (RMSE = 0.05 dB). In this paper, we focus only on same constants for each stage, denoted as A const and B const ,
the homogeneous gain effects. The inhomogeneous effects may respectively. Then, the equation can be simplified to
be significant for some EDFAs, and recent researches [39] have Goverall = A const · k + C const , (5)
been investigating this issue. The consideration of these effects
Pn Pn Pn
in our approach is left for future investigation. where C const = B const · i=1 L i + i=1 Lossi , k = i=1
By substituting the relationship N1 (z) = 1 − N2 (z) into hN2 i L i . It should be noted that there may be a variable optical
Eq. (1) and integrating both sides of the equation, the gain on a attenuator (VOA) between each pair of EDFs, but we assume it
logarithmic scale can be obtained as holds a constant attenuation when the EDFA is operated under
a given gain/output power setting. In this case, A const and
ln[Pλ (L)] − ln[Pλ (0)] C const are constant spectra, and k is a dynamic variable related
to input signals. For some other EDFAs, the specification of
= ρ0λ σe ,λ + σa ,λ · hN2 i L − 0λ · σa ,λ + αs · L, (2)
  
each stage may not be the same. Assuming the variation of
where hN2 i L is the integration of N2 (z) over fiber length hN2 i L i in each stage are of similar extent, the same equation
RL
L: hN2 i L = 0 N2 (z)dz. as shown in Eq. (5) can be derived. Equation (5) holds under
In Eq. (2), only the value of term hN2 i L is related to the sig- arbitrary input signals and pump configurations, indicat-
nals and pumps. All the other terms are constant parameters of ing that for a certain n-stage EDFA, the gain spectrum is a
the EDF. By combining the constant terms, a simplified expres- univariate linear function.
sion can be derived as On the other hand, for AGC/APC EDFAs, the control
circuit dynamically adjusts the pump power to meet the
GEDF (L) = A · hN2 i L + B · L, (3) gain/power settings. It should be noted that, in the controlling
process, only the total gain/power, which is the gain averaged
where GEDF (L) is the EDF’s gain spectrum on the dB over the full wavelength range or power summed over the
scale, and its value at wavelength λ is G EDF (L, λ) = full wavelength range, is maintained. When a wavelength set
4.34 · ln[Pλ (L)/Pλ (0)]. A and B are constant wavelength- {λ1 , λ2 , ... , λm } is input into the EDFA, the relationship of
dependent spectra for a certain type of EDF, and their values the input signal, the gain/power setting, and the corresponding
at wavelength λ are A(λ) = 4.34 · ρ0λ [(σe ,λ + σa ,λ )] and gain spectrum is
B(λ) = −4.34 · (ρ0λ · σa ,λ + σs ). Equation (3) indicates
m
that, for arbitrary signals and pumps, the logarithmic gain of
10(G overall (λi )+Pin (λi ))/10 = G
X
an EDF experienced by each wavelength is only linearly related set + Pin, total , (AGC)
d
to a single variable hN2 i L . Note that, throughout this paper, i=1

the gain spectrum will appear on the dB scale. In this paper, m


10(G overall (λi )+Pin (λi ))/10 = Pc
X
set , (APC), (6)
i=1
Research Article Vol. 15, No. 11 / November 2023 / Journal of Optical Communications and Networking 833

where G overall (λi ) denotes the value of overall gain Goverall this equation, parameters 1G and G0 show the wavelength-
at wavelength λi . Pin (λi ) denotes the input power at wave- dependency of EDFA gain characteristics, and the variable x
length λi . Pin, total is the total power of the input signal. Due shows the input-dependency.
to the inaccuracy of the power monitoring process, there A set of parameters (1G, G0 ) can be determined by using
may be a difference between the target gain/power and the a measured EDFA gain spectrum dataset. A small dataset con-
real gain/power, which can be calibrated. For a single EDFA, taining n EDFA gain spectra is given as G1∼n . Each spectrum
especially when controlled in AGC mode, the variation in in the dataset corresponds to a value of the variable k, and the
gain spectrum caused by pump dynamics may not be strong. gain spectrum dataset corresponds to a set of n values of the
However, it is still necessary to be considered to provide an variable k. There must exist a linear transformation to trans-
accurate model for system performance estimation. form the variable k into a variable x such that the n values of
x have an average of 0 and a range of 1. Each gain spectrum in
the dataset can be expressed as Eq. (7), and a total of n equa-
3. GRAY-BOX EDFA GAIN MODEL tions with different values of the variable x can be set up. As the
The above mathematical framework proves that, for an EDFA distribution of the values of variable x is known, we can solve
in which the EDFs are of the same specification, the gain spec- the equations to obtain the parameters (1G, G0 ). 1G can be
trum is a univariate linear function, and the total gain/power regarded as the variation of the gain spectra, which is solved by
is related to the AGC/APC setting. Based on this inference, taking the difference of the highest and lowest gain spectra’s
we propose an accurate gray-box EDFA gain model based corresponding equations. As the range of the values of x is 1,
on a minuscule dataset. The steps of the modeling scheme 1G can be represented as
are as follows: First, a minuscule dataset of an EDFA gain 1G = Gmax − Gmin , (8)
spectrum is collected. Second, the gain spectra are written as
a univariate linear function: Goverall = 1G · x + G0 , which where Gmax and Gmin are the gain spectra with the highest and
will be explained in detail in the following part of this section. lowest total gain, respectively. To avoid random errors and pro-
The parameters in the function, namely (1G, G0 ), can be vide a more accurate result, Gmax and Gmin can be replaced by
easily determined through simple processing of the dataset the average of several gain spectra with the largest and smallest
by calculating the variation and average of the gain spectra. total gains. When calculating the parameter 1G, gain spectra
Third, the variable x in the linear function is determined based with a larger gain difference are preferred. G0 can be regarded
on Eq. (6), which will also be detailed later. Finally, with the as the average of the gain spectra, which is solved by calculating
results of the second and third steps, the EDFA gain spectrum the average of all the n gain spectra’s corresponding equations.
can be constructed. The framework of the modeling scheme As the average of the values of x is 0, G0 can be represented as
is shown in Fig. 2. In the following, the detailed procedures of n
1X
the second and third steps are respectively described. G0 = Gi . (9)
n i=1

A. Determination of the Gain Spectrum Function


In the second step, a simple data processing method is pro- B. Estimation of the Variable
posed to determine the univariable equation of the EDFA gain In the third step, on determining the univariate linear function
spectrum. Equation (5) presents the relationship between the of the gain spectrum, we only need to estimate the variable x to
gain spectrum and a single variable k. However, the equation obtain the model. Equation (6) provides the relationship of the
is difficult to obtain for the unavailability of parameters, but input signal, gain/power setting, and the corresponding gain
it can be equivalently transformed to other expressions. Let spectrum, which can be substituted into Eq. (6), showing as a
a variable x be an arbitrary linear transformation to the vari- function of variable x :
able k, which is x = a · k + b, where a 6 = 0, and b is arbitrary m
constant. Equation (5) can be rewritten as X
10(1G(λi )·x +G 0 (λi )+Pin (λi ))/10 = G set + Pin, total , (AGC),
d
Goverall = 1G · x + G0 , (7) i=1

m
where 1G, G0 are constant spectra, which are, respectively, 10(1G(λi )·x +G 0 (λi )+Pin (λi ))/10 = Pd
X
set , (APC),
denoted as G0 = − A const · b/a + C const , 1G = A const /a . In i=1
(10)
where 1G 0 (λi ) and G 0 (λi ) are the values of 1G and G0 at
wavelength λi .
This is a monotonic equation about the variable x .
Therefore, the value of the variable can be uniquely
determined.
Finally, based on the gain spectrum function determined in
the second step and the value of the variable estimated in the
Fig. 2. Proposed gray-box EDFA gain modeling scheme. third step, the gain spectrum can be constructed by Eq. (7).
834 Vol. 15, No. 11 / November 2023 / Journal of Optical Communications and Networking Research Article

There are also some considerations for applying the model Table 1. Configuration of Our Experimental Dataset
in real systems. For the systems in which optical channel mon- Parameters Value
itors (OCMs) are available, training and calibration can be
done after deployment. EDFA input and output spectra under EDFA setting mode APC, AGC
different input channel loading conditions and/or different APC power setting (dBm) 15
input power can be collected online, and a model for device AGC gain setting (dB) 18
Input spectrum shape Even-number channel: empty
under the current condition can be obtained. For the systems
Odd-number channel:
without OCMs, training and calibration should be done before
load/unload randomly
deployment. The data samples collected offline should be used Average input power per Loaded channels: about
to establish a model, and, due to the drift of environment and channel (dBm) −18 : −14
the aging of the device, the accuracy of model may be limited. Unloaded channels: about −28
Using these collected data, the calibrated real gain/output Power perturbation per −1.5:1.5
power can be represented by the average of several measured channel (dB)
values, and the training process is described as Eqs. (8)–(10). Data size for each setting 1500

4. DATASETS FOR MODELING PERFORMANCE measured output power spectrum. It is measured based on
EVALUATION the empty even-number channels and then subtracted from
the odd-number channels. The ASE noise power spectrum
The performance of the proposed modeling scheme is evalu- on the total 80 channels can be obtained by the interpolation
ated on our experimentally measured dataset and a public of the output on the even-number channels. Subsequently,
dataset. In this section, we first introduce our experimental the gain spectrum on odd-number channels can be obtained
setup and the measured dataset and then briefly introduce the by Gain = ( P out − P noise )/ P in , where P out , P noise and P in
public dataset. indicates the output power spectrum, noise power spectrum,
and input power spectrum, respectively.
A. Experimental Setup and Measured Dataset The data collection is implemented using an automatic
controller with the following three steps: (1) set the target
The experimental system for data collection is shown in Fig. 3.
gain/output power of the EUT and the output power of the
An amplified spontaneous emission (ASE) source is utilized
ASE source before the experiment, (2) set the programmable
to generate a wideband signal. The wideband signal is then
filter to generate the input signal power spectrum, and (3) read
spectrally shaped by a programmable filter to generate signals
and store the data from the two OSAs after the system reaches
with different channel loading conditions. The filtered signal,
a stable status. These steps are repeated until the whole target
ranging in frequency from 192.1 to 196.1 THz, is composed
input signal spectra are all measured.
of 80 channels with 50 GHz bandwidth. Two optical spectrum
For each AGC/APC setting, there is an inaccuracy in the
analyzers (OSAs) are utilized to measure the input and output
process of the EDFA’s power monitoring, and the difference
spectra of the EDFA under test (EUT). For our experiment,
between the target gain/power and the real gain/power should
the EUT is set to work in APC mode with a target output
be calibrated.
power of 15 dBm and AGC mode with a target gain of 18 dB.
The total 80 channels are divided into odd-number and
even-number channels. The even-number channels are set to B. Public Dataset
empty to measure ASE noise. The odd-number channels are The public dataset [35] contains datasets of three EDFAs of
set to be loaded or unloaded in a random manner. Further, the same make (i.e., the same type from the same company), as
a power perturbation is applied to the loaded channels. The introduced in [25]. The EDFAs are set to work in APC mode.
average power of the loaded channels in an input spectrum A total of four output power settings are considered, covering
ranges from about −18 to about −14 dBm. The power of the the range of target output power from 15 to 18 dBm with a
unloaded channels is about −28 dBm. The power perturba- step size of 1 dB. Further, a total of nine input power condi-
tions are randomly generated for each loaded channel with a tions are considered, covering the range of total input power
range of −1.5 to 1.5 dB. To avoid generating redundant data, from about −6.5 to 1.5 dBm. The input signal is a random
we removed the duplicated channel loading conditions. For continuous spectrum, in which the power of each channel is
each setting mode, a total of 1500 data samples are measured. generated to be close to the power of adjacent channels. In each
The configuration of the dataset is summarized in Table 1. output power setting and input power scenario, about 2000
In this experiment, to obtain an accurate gain spectrum, sets of input spectra and output spectra are measured. In this
the ASE noise generated by the EUT is excluded from the paper, the dataset of one EDFA under 15 dBm target output
power containing 16,497 data samples is investigated. The
ASE Source Programmable Filter EDFA configuration of the public dataset is summarized in Table 2.
The gain spectrum is directly calculated from the output
OSA OSA spectrum and input spectrum by Gain = P out − P in . It is
worth mentioning that the presence of ASE noise can lead to
Controller
inaccuracy in the calculation of the gain spectrum. However,
Fig. 3. Experimental data-collection system. considering the fact that the typical ASE noise power is usually
Research Article Vol. 15, No. 11 / November 2023 / Journal of Optical Communications and Networking 835

Table 2. Configuration of the Adopted Data from the 200 data samples are collected with less than 12 channels
Public Dataset loaded, and all of them are used as the testing dataset. The
Parameters Value RMSE of the proposed model utilizing 8 data samples reaches
0.087 dB, whereas the RMSE of the traditional NN model
EDFA setting mode APC utilizing 900 data samples is 0.262 dB. The comparison of
APC power setting (dBm) 15
error distribution is shown in Fig. 4(c). The absolute value
Input spectrum shape Continuous power spectrum
of error is 0.149 dB for the proposed modeling scheme and
Total input power (dBm) About −6.5:1:1.5
0.522 dB for the traditional NN modeling scheme when the
Data size for each setting About 2000
CDF reaches 90%. This result illustrates that the black-box
much lower than the amplified signal power, the degree of NN model suffers from poor generalizability, and the perform-
inaccuracy will be acceptable. ance of the model is strongly related to the distribution of the
input dataset. An NN model should be retrained on another
large dataset when utilized in new scenarios. On the contrary,
5. EXPERIMENTAL RESULTS AND the proposed gray-box model benefits from the knowledge of
DISCUSSIONS the underlying physics and can be easily generalized to other
The performance of the proposed gray-box model for an AGC scenarios. This is because, as long as all the data fit an analytical
EDFA and an APC EDFA is evaluated. For comparison, the model, no matter which of them are selected as training sam-
performance of the traditional NN modeling scheme is also ples, equivalent model parameters would be calculated through
shown. The configuration of the NN is similar to the one mathematical equations. In this way, the bias of the training
proposed in [25]. The input of the NN model is the input dataset will not influence the model.
power spectrum, the total input power, and the EUT setting;
the output of the NN model is the gain spectrum. The NN B. Modeling Performance for an APC EDFA
model has two hidden layers. For our dataset with 40 channels,
First, the data efficiency of the proposed modeling scheme
the neuron numbers of the two hidden layers are 128 and 64,
respectively. For the public dataset with 83 channels, they are for an APC EDFA is demonstrated on our experimentally
256 and 128, respectively. measured dataset; 400 data samples are used as the testing
The modeling accuracy is evaluated by the RMSE of the pre- dataset. RMSEs of the proposed model and the NN model
dicted gain spectrum, and 400 data samples are used as the test- utilizing different data sizes are plotted in Fig. 4(d). The result
ing dataset. shows that the proposed modeling scheme achieves an average
RMSE result of 0.070 dB when the model is established using
only 8 data samples. The performance is stable in multiple
A. Modeling Performance for an AGC EDFA rounds of modeling, as the maximum RMSE is 0.085 dB.
First, for an AGC EDFA, RMSEs of the proposed model and On the contrary, the NN model requires 900 training data
the NN model utilizing different training data sizes are plotted samples to reach an average RMSE of 0.112 dB. The proposed
in Fig. 4(a). Multiple rounds of modeling are conducted, in scheme requires only 0.9% training data samples compared
which the training data samples are randomly selected from the with the NN scheme, while achieving better accuracy. The
dataset, and the obtained fluctuations of RMSEs are shown as error distribution of the proposed scheme utilizing 8 data
the error bars; 400 data samples are used as the testing dataset samples; the traditional NN scheme trained on 900 data sam-
for performance evaluation. In the training dataset, it should ples is compared in Fig. 4(e). The absolute value of error is
be ensured that the selected gain spectra are not similar. The 0.109 dB for the proposed modeling scheme and 0.247 dB for
result shows that the proposed modeling scheme achieves the traditional NN modeling scheme when the CDF reaches
an average RMSE of 0.062 dB using only 8 data samples. 90%.
The performance is stable in the 50 rounds of modeling, as The evaluation of the model’s generalizability is con-
the maximum RMSE is 0.070 dB. On the contrary, the NN ducted with the same channel loading conditions as the AGC
model requires 900 training data samples to reach an average EDFA. The 200 data samples collected with less than 12
RMSE of 0.109 dB. We can see that the proposed scheme channels loaded are used as the testing dataset. The RMSE of
requires only 0.9% training data samples compared with the the proposed model with 8 data samples reaches 0.093 dB,
NN scheme while still achieving much better accuracy. The whereas the RMSE of the NN model with 900 data samples
error distribution of the proposed scheme utilizing 8 data is 0.438 dB. The error distributions are shown in Fig. 4(f ).
samples and the traditional NN scheme trained on 900 data For a CDF of 90%, the absolute value of error is 0.157 and
samples is compared in Fig. 4(b). Based on this, the cumulative 0.772 dB for the two models, respectively. For the NN model,
distribution function (CDF) for the absolute value of error is an offset of about 0.3 dB exists in the results. The reason is that,
calculated. When the CDF reaches 90%, the absolute value of besides the channel loading condition, the APC EDFA’s pump
error is 0.104 dB for the proposed modeling scheme, while it is adjustment process is also sensitive to the total input power.
0.219 dB for the traditional NN modeling scheme. The data utilized to establish the model and the data utilized to
Next, the proposed model’s superior generalizability is verify the performance are totally different in the distribution
demonstrated. The model is established on the data samples of total input power. For the NN model, the difference in
with more than 12 loaded channels; then, the performance is distribution is not considered, leading to poor generalization
verified on the data samples with less than 12 loaded channels; performance. For the proposed gray-box model, the difference
836 Vol. 15, No. 11 / November 2023 / Journal of Optical Communications and Networking Research Article

(a) (b) (c)

(d) (e) (f)

Fig. 4. Modeling performance on our experimental dataset. (a) Modeling accuracy versus the utilized data size for an AGC EDFA.
(b) Distribution of the modeling error for an AGC EDFA. (c) Generalizability for an AGC EDFA. (d) Modeling accuracy versus the utilized
data size for an APC EDFA. (e) Distribution of the modeling error for an APC EDFA. (f ) Generalizability for an APC EDFA.

(a) (b) (c)

Fig. 5. Modeling performance for an APC EDFA on the public dataset. (a) Modeling accuracy versus the utilized data size. (b) Distribution of
the modeling error. (c) Generalizability.

in distribution is taken into account by the embedded physics, are established on the data samples with a total input power
so the performance is not seriously deteriorated. ranging from −6.5 to −4.5 dBm; then, the performance is
For the public dataset, similar results are observed. As shown verified on the data samples with a total input power ranging
in Fig. 5(a), our model with 8 data samples achieves an average from −3.5 to 1.5 dBm. The RMSE is 0.163 and 4.825 dB for
RMSE result of 0.099 dB and a maximum RMSE result of the two models, respectively. Figure 5(c) plots the error distri-
0.112 dB, whereas the NN model with 900 data achieves an butions. The results demonstrate that the proposed modeling
average RMSE result of 0.120 dB and a maximum RMSE scheme performs well on different physical devices.
result of 0.253 dB. Figure 5(b) plots the error distributions.
For a CDF of 90%, the absolute value of error is 0.186 and 6. CONCLUSION
0.330 dB for the two models, respectively. It is worth men- A gray-box EDFA gain modeling scheme is proposed to pro-
tioning that the public dataset considers a fully loaded spectra vide accurate estimation based on small datasets. We first
scenario, which is different from our own experiment, where analyze the typical structure of an EDFA and the physics of
only odd-number channels are loaded with signals. The mod- optical amplification process and then derive the EDFA gain
eling performance of the public dataset further demonstrates spectrum as a univariable linear function. Then, based on the
that our approach is applicable to various signal loading sce- principles of the control circuit’s dynamic pump adjustment,
narios. For the evaluation of the generalizability, the models the function of the gain spectrum can be uniquely solved, and
Research Article Vol. 15, No. 11 / November 2023 / Journal of Optical Communications and Networking 837

the gain spectrum can be further obtained. Experiments are 16. M. Ibrahimi, G. S. Sticca, F. Musumeci, A. Castoldi, R. Pastorelli,
carried out to verify the modeling performance. The proposed and M. Tornatore, “Selective hybrid EDFA/Raman amplifier
placement to avoid lightpath degradation in (C+L) networks,” in
scheme is proved to be capable of constructing a model using
European Conference on Optical Communication (ECOC) (2022),
only 8 data samples while still outperforming the traditional paper We1B.3.
NN model with 900 data samples on accuracy and generaliz- 17. A. C. Meseguer, J.-C. Antona, A. Bononi, J. Cho, S. Grubb, P.
ability. In addition, the performance is also verified in a public Pecci, O. Courtois, and V. Letellier, “Highly accurate measurement-
dataset, and similar results are achieved. This implies that the based gain model for constant-pump EDFA for non-flat WDM
inputs,” in Optical Fiber Communication Conference (OFC) (2021),
modeling scheme is applicable to a variety of physical devices paper M5C.4.
and can be used to build a digital twin of each EDFA in optical 18. V. M. C. Mathias, A. C. Meseguer, J.-C. Antona, A. Bononi,
networks. S. Grubb, O. Courtois, and V. Letellier, “Extension of the
measurement-based gain model for non-flat WDM inputs and
Funding. The Shanghai Pilot Program for Basic Research–Shanghai Jiao various pump currents,” in European Conference on Optical
Tong University (21TQ1400213); National Natural Science Foundation of Communication (ECOC) (2021).
China (62175145). 19. Y. Li, M. Zhang, and M. Tang, “A data-effective black-box EDFA gain
model with singular value decomposition,” in Asia Communications
and Photonics Conference (ACP) (2022), pp. 985–987.
REFERENCES 20. M. Zhang, Y. Li, W. Li, J. Chen, Y. Chen, and M. Tang, “Data-
effective and accurate EDFA gain prediction black-box model,”
1. P. J. Winzer, D. T. Neilson, and A. R. Chraplyvy, “Fiber-optic trans- in Asia Communications and Photonics Conference (ACP) (2021),
mission and networking: the previous 20 and the next 20 years paper T4A.98.
[Invited],” Opt. Express 26, 24190–24239 (2018). 21. M. Hashimoto, M. Yoshida, and H. Tanaka, “The characteristics
2. S. J. Savory, “Digital coherent optical receivers: algorithms and of WDM systems with hybrid AGC EDFA in the photonics net-
subsystems,” IEEE J. Sel. Top. Quantum Electron. 16, 1164–1179 work,” in Optical Fiber Communication Conference (OFC) (2002),
(2010). pp. 517–518.
3. K. Roberts, Q. Zhuge, I. Monga, S. Gareau, and C. Laperle, “Beyond 22. A. A. M. Saleh, R. M. Jopson, J. D. Evankow, and J. Aspell,
100 Gb/s: capacity, flexibility, and network optimization,” J. Opt. “Modeling of gain in erbium-doped fiber amplifiers,” IEEE Photon.
Commun. Netw. 9, C12–C24 (2017). Technol. Lett. 2, 714–717 (1990).
4. P. N. Ji, “Software defined optical network,” in International 23. C. R. Giles and E. Desurvire, “Modeling erbium-doped fiber ampli-
Conference on Optical Communications and Networks (ICOCN) fiers,” J. Lightwave Technol. 9, 271–283 (1991).
(2012), pp. 31–34. 24. J. Junio, D. C. Kilper, and V. W. S. Chan, “Channel power excur-
5. O. Gerstel, M. Jinno, A. Lord, and S. J. B. Yoo, “Elastic optical net- sions from single-step channel provisioning,” J. Opt. Commun.
working: a new dawn for the optical layer?” IEEE Commun. Mag. Netw. 4, A1–A7 (2012).
50(2), s12–s20 (2012). 25. F. da Ros, U. C. de Moura, and M. P. Yankov, “Machine learning-
6. X. Chen, R. Proietti, C. Liu, and S. J. B. Yoo, “Towards self-driving based EDFA gain model generalizable to multiple physical devices,”
optical networking with reinforcement learning and knowledge in European Conference on Optical Communication (ECOC) (2020),
transferring,” in International Conference on Optical Network Design paper Tu1A.4.
and Modeling (ONDM) (2020). 26. K. Ishii, J. Kurumida, and S. Namiki, “Experimental investigation
7. W. Mo, C. L. Gutterman, Y. Li, S. Zhu, G. Zussman, and D. C. Kilper, of gain offset behavior of feedforward-controlled WDM AGC EDFA
“Deep-neural-network-based wavelength selection and switching in under various dynamic wavelength allocations,” IEEE Photon. J. 8,
ROADM systems,” J. Opt. Commun. Netw. 10, D1–D11 (2018). 7901713 (2016).
8. E. Seve, J. Pesic, and Y. Pointurier, “Accurate QoT estimation by 27. Y. You, Z. Jiang, and C. Janz, “Machine learning-based EDFA gain
means of a reduction of EDFA characteristics uncertainties with model,” in European Conference on Optical Communication (ECOC)
machine learning,” in International Conference on Optical Network (2018), paper Mo3E.5.
Design and Modeling (ONDM) (2020). 28. S. Zhu, C. L. Gutterman, W. Mo, Y. Li, G. Zussman, and D. C. Kilper,
9. Q. Zhuge, X. Liu, Y. Zhang, M. Cai, Y. Liu, Q. Qiu, X. Zhong, J. Wu, “Machine learning based prediction of erbium-doped fiber WDM
R. Gao, L. Yi, and W. Hu, “Building a digital twin for intelligent opti- line amplifier gain spectra,” in European Conference on Optical
cal networks [Invited Tutorial],” J. Opt. Commun. Netw. 15, C242– Communication (ECOC) (2018).
C262 (2023). 29. Z. Wang, D. Kilper, and T. Chen, “Transfer learning-based ROADM
10. J. Yu, S. Zhu, C. L. Gutterman, G. Zussman, and D. C. Kilper, EDFA wavelength dependent gain prediction using minimized data
“Machine-learning-based EDFA gain estimation [Invited],” J. Opt. collection,” in Optical Fiber Communication Conference (OFC)
Commun. Netw. 13, B83–B91 (2021). (2023), paper Th2A.1.
11. Y. Liu, X. Liu, L. Liu, Y. Zhang, M. Cai, L. Yi, W. Hu, and Q. Zhuge, 30. X. Liu, Y. Chen, Y. Zhang, Y. Liu, L. Yi, W. Hu, and Q. Zhuge,
“Modeling EDFA gain: approaches and challenges,” Photonics 8, “Physics-informed EDFA gain model based on active learning,”
417 (2021). arXiv, arXiv:2206.06077 (2022).
12. Y. You, Z. Jiang, and C. Janz, “OSNR prediction using machine 31. J. Lin, X. Lin, and Z. Jiang, “Auxiliary neural network assisted
learning-based EDFA models,” in European Conference on Optical machine learning EDFA gain model,” in Optical Fiber Communica-
Communication (ECOC) (2019). tion Conference (OFC) (2023), paper M2E.2.
13. Q. Zhuge, X. Zeng, H. Lun, M. Cai, X. Liu, L. Yi, and W. Hu, 32. Z. Jiang, J. Lin, and H. Hu, “Machine learning based EDFA channel
“Application of machine learning in fiber nonlinearity modeling in-band gain ripple modeling,” in Optical Fiber Communication
and monitoring for elastic optical networks,” J. Lightwave Technol. Conference (OFC) (2022), paper W4I.2.
37, 3055–3063 (2019). 33. S. Zhu, C. Gutterman, A. D. Montiel, J. Yu, M. Ruffini, G. Zussman,
14. A. Ferrari, A. Napoli, J. K. Fischer, N. Costa, A. D’Amico, J. Pedro, and D. Kilper, “Hybrid machine learning EDFA model,” in Optical
W. Forysiak, E. Pincemin, A. Lord, A. Stavdas, J. P. F.-P. Gimenez, Fiber Communication Conference (OFC) (2020), paper T4B.4.
G. Roelkens, N. Calabretta, S. Abrate, B. Sommerkorn-Krombholz, 34. A. D’Amico, S. Straullu, A. Nespola, I. Khan, E. London, E. Virgillito,
and V. Curri, “Assessment on the achievable throughput of multi- S. Piciaccia, A. Tanzi, G. Galimberti, and V. Curri, “Using machine
band ITU-T G.652.D fiber transmission systems,” J. Lightwave learning in an open optical line system controller,” J. Opt. Commun.
Technol. 38, 4279–4291 (2020). Netw. 12, C1–C11 (2020).
15. L. Rapp and M. Eiselt, “Optical amplifiers for multi-band opti- 35. M. P. Yankov and F. da Ros, “Input-output power spectral den-
cal transmission systems,” J. Lightwave Technol. 40, 1579–1589 sities for three C-band EDFAs and four multi-span inline EDFAd
(2022). fiber optic systems of different lengths,” DTU Data (2020),
838 Vol. 15, No. 11 / November 2023 / Journal of Optical Communications and Networking Research Article

https://data.dtu.dk/articles/dataset/Input-output_power_spectral_ gain flattening,” in International Conference on Image and Signal


densities_for_three_C-band_EDFAs_and_four_multi-span_inline_ Processing and Their Applications (ISPA) (2022).
EDFAd_fiber_optic_systems_of_different_lengths/13135754. 39. J. T. de Araujo, A. C. Meseguer, and J.-C. Antona, “Experimental
36. B. Pedersen, M. L. Dakss, B. A. Thompson, W. J. Miniscalco, T. Wei, model of EDFA spectral hole burning for WDM transmissions sys-
and L. J. Andrews, “Experimental and theoretical analysis of effi- tems,” in Optical Fiber Communication Conference (OFC) (2023),
cient erbium-doped fiber power amplifiers,” IEEE Photon. Technol. paper M2E.1.
Lett. 3, 1085–1087 (1991). 40. R. di Muro, “The Er3+ fiber gain coefficient derived from a dynamic
37. R. Sommer, R. M. Fortenberry, B. Flintham, and P. C. Johnson, gain tilt technique,” J. Lightwave Technol. 18, 343–347 (2000).
“Multiple filter functions integrated into multi-port GFF compo- 41. A. D’Amico, “Telecominfraproject/oopt-gnpy: Optical Route
nents,” in Optical Fiber Communication Conference and Exposition Planning Library, Based on a Gaussian Noise Model,” GitHub
and the National Fiber Optic Engineers Conference (OFC/NFOEC) (2023), https://github.com/Telecominfraproject/oopt-gnpy.
(2007), paper JWA24.
38. B. Belmahdi and K. Mazighi, “Implementation efficiency of a
two-stage EDFA amplifier with an inverted trapezoidal filter on

You might also like