Professional Documents
Culture Documents
Neural Lyapunov Control For Power System Transient Stability A Deep Learning-Based Approach
Neural Lyapunov Control For Power System Transient Stability A Deep Learning-Based Approach
Abstract—Power system control and transient stability analysis the transient, frequency and voltage stability timescales [1]. This
play essential roles in secure system operation. Control of power work aims to design a novel and general control approach to
systems typically involves highly nonlinear and complex dynamics. ensure stability in various power system applications.
Most of the existing works address such problems with additional
assumptions in system dynamics, leading to a requirement for a The dynamics of power systems are usually highly complex
complete and general solution. This paper, therefore, proposes a and nonlinear. Numerous power system control methods have
novel control framework for various power system control and been reported in transient stability, frequency control and voltage
stability problems leveraging a learning-based approach. The pro- control [4]–[9]. The transient stability is to maintain synchro-
posed framework includes a two-module structure that iteratively nism and usually assessed by conventional eigenvalue analysis
and jointly learns the candidate Lyapunov function and control
law via deep neural networks in a learning module. Meanwhile, and direct methods [1], or emerging passivity-based methods [6].
it guides the learning procedure towards valid results satisfying However, these approaches require assumptions such that the
Lyapunov conditions in a falsification module. The introduced power system is governed by large SGs, and needs to be dissi-
termination criteria ensure provable system stability. This control pative. Frequency control ensures the power system frequency
framework is verified through several studies handling different at its nominal value when the power balance fluctuates. It is gen-
types of power system control problems. The results show that the
proposed framework is generalizable and can simplify the control erally implemented in a hierarchical architecture [10]. Various
design for complex power systems with the stability guarantee and methods have been proposed to improve the frequency stability
enlarged region of attraction. in the presence of load-side participation [9], uncertainties [5],
Index Terms—Machine learning, power system stability,
and operational constraints [4]. Lastly, voltage control aims to
Lyapunov function, neural Lyapunov control. prevent power system voltages from excessive fluctuations under
either normal or disturbed conditions. It is typically analyzed
by Lyapunov-based excitation control [8] or direct feedback
I. INTRODUCTION linearization [11]. The effectiveness of these above methods is
TABILITY is the primary objective in the power system verified with additional assumptions on nonlinear and complex
S control and operation, concerning the capability to maintain
the operating equilibrium after severe disturbances [1]. The
system dynamics for different forms of power system stability
and control.
increased penetration of renewable energy sources brings a va- Furthermore, the concept of microgrids has been identified as
riety of undesirable impacts and formidable challenges to power a key element for future power systems. It brings both changes
system control [2]. In this context, future power systems would and challenges to traditional power systems since microgrids are
be dominated by power electronic-interfaced small devices with usually dominated by inverter-based dynamics [12]. In current
different characteristics rather than some large synchronous practice, droop methods have been widely adopted for microgrid
generators (SGs) [3]. When a system is operated in such stressed stability analysis using different kinds of droop-based con-
conditions, different instability phenomena have emerged across trol [12]–[14]. In [12], a Lyapunov-based method is introduced
to assess the droop-based microgrid stability. The performance
Manuscript received September 17, 2020; revised March 24, 2021 and June of droop-based methods is further investigated and improved
22, 2021; accepted August 2, 2021. Date of publication August 5, 2021; date in [14]. The key element in droop-based control is the droop
of current version March 28, 2022. This work was supported by the U.S. gain, where a higher gain leads to increased sensitivity to power
Department of Energy’s Office of Energy Efficiency and Renewable Energy
(EERE) under the Solar Energy Technologies Office Award number 34230. injections but may induce system instability due to large fre-
Paper no. TPWRS-01600-2020. (Corresponding author: Tianqiao Zhao.) quency and voltage deviations from disturbances.
Tianqiao Zhao is with the Brookhaven National Laboratory, Upton, NY 11973 Overall, the control methodology and stability analysis need
USA (e-mail: tzhao1@bnl.gov).
Jianhui Wang is with the Department of Electrical and Computer En- to be further developed for future power systems. However,
gineering, Southern Methodist University, Dallas, TX 75275 USA (e-mail: these model-based approaches have additional requirements on
jianhui@smu.edu). the system, e.g., system positivity [6], lossless lines [7] and
Xiaonan Lu and Yuhua Du are with the College of Engineering, Tem-
ple University, Philadelphia, PA 19122 USA (e-mail: xiaonan.lu@ieee.org; a special form of bus dynamics [12]. More importantly, their
yuhua.du@temple.edu). effectiveness highly relies on the system model to design an
Color versions of one or more figures in this article are available at energy function that evaluates system stability. Therefore, any
https://doi.org/10.1109/TPWRS.2021.3102857.
Digital Object Identifier 10.1109/TPWRS.2021.3102857 model inaccuracy would degrade system stability and control.
0885-8950 © 2021 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
956 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 37, NO. 2, MARCH 2022
The emerging learning-based approaches have recently gained in the use of machine learning but also in the combination
attention in power system research, leveraging the advantage of discovering Lyapunov functions and learning control laws
of the model-free feature and automatic knowledge extrac- simultaneously. It is more flexible to ensure the performance
tion. The authors in [15] introduced an analysis tool for tran- of control laws by adding simpler terms in the loss function
sient stability prediction based on deep learning. It solves a to enlarge the region of attraction (ROA) on demand. The pro-
regression problem to predict the stability margin that eval- posed framework consists of two modules: 1) Learning Module
uates system stability. The authors in [16] utilized deep re- leveraging neural networks discovers parameters for control and
inforcement learning (DRL) to prevent a power system from Lyapunov functions jointly; 2) Falsification Module utilizing a
ultra-low-frequency oscillations, which highly relies on a lin- δ-complete-based algorithm [23] validates the candidate control
earized system model [17] embeds deep neural networks in a law and Lyapunov function obtained from Learning Module and
frequency-constrained scheduling problem in microgrid island- ensures the correctness of results. The main contributions are
ing events, admitting a mixed-integer formulation. However, summarized as follows:
the proposed framework is restricted to a specific problem with r A novel control framework is proposed by exploiting the
less adaptability. Although [18] and [19] address frequency and advantage of neural networks and the completeness of
voltage control based on DRL respectively, their frameworks are satisfying nonlinear constraints to guarantee Lyapunov
sensitive to problem settings (e.g., control rewards). conditions fully. The model-free feature can deal with the
Lyapunov function-based methods have been widely adopted stability problem in power systems with heterogeneous
for power systems for both stability analysis and control synthe- interface dynamics.
sis [6]–[8]. However, finding a valid Lyapunov function remains r The combination of the learning and falsification modules
a challenge. Recently, learning-based methods show their effec- produces controllers with performance guarantees, i.e., the
tiveness in handling nonlinear control problems in robotics [20] validated function holds the Lyapunov conditions in the
by learning a parametric control Lyapunov function through verified state domain.
interacting with an unknown environment in an episodic learning r Introducing regulator costs of a ROA to the learning pro-
framework. However, there still exist problems that impede their cess results in a significantly enlarged ROA for power
immediate use in power systems due to the numerical sensitivity, system stability assessment.
complex nonlinearity and extra operational requirements. r The feasibility and scalability of the proposed framework
To summarize, existing literature faces emergent challenges are demonstrated through several power control problems,
when applied to future power systems. First, although a lin- i.e., transient stability and islanded microgrid control
earized small-signal model can guarantee asymptotic stability
using a Lyapunov function, the design of these controllers based II. PRELIMINARIES AND PROBLEM FORMULATION
on the linearized model becomes unsuccessful when a large
disturbance occurs. Furthermore, constructing a quadratic Lya- This section introduces preliminaries and the problem formu-
punov function is widely adopted to address transient stability lation to facilitate the control design.
problems. It could be conservative and restrict themselves to
transmission networks ignoring the high R/X ratio in distribution A. Lyapunov Function and Stability Conditions
networks, and meanwhile, they require specific system dynamics
to achieve asymptotic stability. Lastly, most of the existing Consider a controlled system governed by
approaches focused on the transient stability of the system where
ẋ = f (x, u), (1)
the buses have the same dynamics. The authors in [21] developed
stability conditions for power systems with heterogeneous bus where f : X → Rn is locally Lipschitz-continuous with X be-
dynamics. It still relies on the system passivity, and the stability ing a path-connected state-space and 0 ∈ X . x(t) ∈ X is the
condition requires the passivity index of each bus that is greater state vector, and K(x) : Rn → Rm is the feedback control law.
than a network index. Calculating this network index depends on Then the control synthesis problem is to design a feedback
the power flow and system operating conditions. It may provide law K(x) so that all traces x(·) of the closed-loop system are
conservative results and be sensitive to the modeling accuracy. asymptotically stable. To this end, we introduce the following
Therefore, it is necessary further to develop approaches for definition.
power systems with heterogeneous bus dynamics. Definition 1: For the system (1), a continuously differentiable
This paper focuses on filling the existing gaps in designing function V is a Lyapunov function if the following condition
feedback laws for various stability problems in power systems, holds for all x ∈ X ,
capturing the core difficulty in power system transient stability
and control. Inspired by [22], a deep learning-based control V is positive definite and V (0) = 0
(2)
framework, Neural Power System Control, is developed for minu∈R (f (x,u) V ) < 0
modern power systems. Our proposed approach utilizes neural
networks to capture nonlinearity, which lifts dependence on where f (x,u) V is the Lie derivative of V over the vector field
linear or specific models. In addition, our approach directly f (x, u). If there exists a Lyapunov function V , it guarantees the
handles various interface dynamics and rigorously satisfies the existence of K(x) stabilizing the trajectories of a system to the
Lyapunov conditions by using neural networks. It is novel not equilibrium [24].
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
ZHAO et al.: NEURAL LYAPUNOV CONTROL FOR POWER SYSTEM TRANSIENT STABILITY: A DEEP LEARNING-BASED APPROACH 957
Definition 2: A controlled system (1) with equilibrium at the III. LEARNING NEURAL POWER SYSTEM CONTROL
origin is asymptotically stabilizable, if there exists a continu- As outlined before, a Lyapunov function is needed in order
ously differentiable function V satisfying the conditions in (2). to ensure the stability of (5) at hand. Discovering a Lyapunov
function is generally computationally expensive subjecting to
B. Power System Model hard nonlinear constraints [20]. As shown in [26], [27], the
1) Bus Dynamic Model: A network-reduced power system nonlinearity and uncertainty can be handled by generic function
has n buses connected by transmission lines which is represented approximators. This paper utilizes neural networks to embed
by the undirected graph G = (V, E), where V is the bus set and the power system dynamics and provide a data-based solution
E is the line set. Each bus includes a phasor voltage Vi ∠θi and to address the nonlinearity and uncertainty in solving power flow
a complex power injection Pi + jQi , where Vi and θi are the problems. To deal with this problem, this section presents a deep
voltage magnitude and angle of bus i, respectively; Pi and Qi learning-based approach to search for a candidate Lyapunov
are active and reactive power injections of bus i, respectively. function over a hypothesis space. Specifically, the Lyapunov
For a bus with a dynamic power resource, its dynamics can be function is parameterized by a parameter vector and defined
described a dynamical relation which maps the complex power by V . For the sake of simplicity, we further denote Ψ for both
injection to the complex voltage [21]: the feedback law and the corresponding parameters which define
the law.
ẋi = f (xi , ui ) Before presenting the main result, the following definitions
i∈V (3)
yi = Ci x i are introduced.
where xi = [ηi , θi , Vi ] and yi = Ci xi = [θi , Vi ] with ηi being Definition 3: (Compatible Sample) Given a training set S :
an auxiliary state indicating the corresponding dynamics of each {x}, x ∈ X , a set of compatible samples for an iteration is
device. The introduction of ηi facilitates modeling bus dynamics defined by
with heterogeneous devices. ui is the control input to be designed C : {(x, Ψ)} ⊂ X × U , (6)
later. Here, the load bus has no dynamics. However, the proposed
solution can easily address the load-side frequency stability as which includes both training and falsifier samples defined later.
in [25] following the control framework in Section III. Definition 4: (Falsifier Compatibility) A candidate V is com-
2) Power Network Model: The interconnection between patible with C iff V respects the conditions in Definition 1:
each bus is represented by the admittance matrix. The power
injections at bus i are denoted by ∃x : x = 0 ∧ V (x) > 0 ∧ f (x,Ψ) V < 0. (7)
Piinj = Gii Vi2 + Vi Vj (Bij sin(θij ) + Gij cos(θij )) (4a) Note that the condition of V (0) = 0 is not included in Defini-
j∈N j
tion 4, and it will be guaranteed in advance using the ICNN fea-
ture in Learning Module. Given a power system governed by (5),
2
Qinj
i = −Bii Vi − Vi Vj (Bij cos(θij ) − Gij sin(θij )), the objective of neural power system control is to discover that
j∈N j ensures the existence of V s.t. Definition 1, and synchronously
(4b) returns a feedback control law stabilizing the system.
Fig. 2 outlines the developed architecture. It includes
where N j is the set of adjacent buses to bus i; Gij and Bij are Learning Module and Falsification Module interacting in a
the real and imaginary elements of the admittance matrix. training loop. Given a set of random samples S ⊂ X at iter-
Combining (3) and (4), the power system is then modeled by ation k, Learning Module optimizes the weights of neural
a feedback loop between the network and dynamical behavior of networks by back-propagating the Lyapunov risks (defined in
each bus as in Fig. 1, which is written as the following compact Section III.A) and discovers a parametric V to satisfy the condi-
form, tions in Definition 1. The returned V satisfies all the conditions,
⎧
⎨ ẋ = f (x, u) i.e., V (0) = 0, V (s) > 0, or (f (s,u) V (s)) < 0, ∀s ∈ S but
y = Cx (5) not necessarily over the entire region of X . In sequence, Falsifi-
⎩
u = K(x) cation Module reformulates the Lyapunov conditions following
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
958 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 37, NO. 2, MARCH 2022
Fig. 2. The control diagram. To address this issue, we adopt a smoothed ReLU as used in [29]
to make the Lyapunov function continuously differentiable.
Since g is convex in x [28], a single optimum of V is guaranteed,
and V is defined by
V = i+1 (g(x) − g(0)) (10)
where we shift V to ensure (1) V (0) = 0. The learned Lya-
punov function needs to satisfy all conditions in (2). Learning
Module will update both and Ψ to improve the likelihood of
Fig. 3. Input convex neural networks [28]. constraint satisfaction. The design problem can be written as the
minimization of the following min-max cost,
inf sup α max(0, −V (x)) + β max(0, f (x,Ψ) V (x))) .
,Ψ x∈X
satisfiability modulo theories (SMT) and adopts an SMT solver (11)
(i.e., dReal [23]) to check whether the resulting V violates where α and β are the tunable parameters. (11) iis the cost
the conditions over X . If any violation exists, the SMT solver conceptually indicating any violation of Lyapunov conditions
in Falsification Module generates the counterexamples xviolated over the state space. The first term on RHS of (11) is the penalty
that violate either V (xviolated ) > 0, (f (xviolated ,u) V (xviolated )) < 0. of the violation of the second condition in Lyapunov conditions
These samples are added to the sample set S for the next learning (i.e., V > 0), and the second term is to penalize the violation
iteration k + 1. With the updated sample set, Learning Module of the third condition (i.e., f (x,Ψ) V (x) < 0). The minimal
is enforced to discover a new candidate. The new V is fed zero-loss of (11) indicates that the learned function satisfies all
to Falsification Module again, forming a training loop. The the Lyapunov conditions with a valid controller.
loop repeats until Falsification Module certifies that no violated Note that the difficulty in calculating (11) is to estimate
samples exist, which proves the correctness of V . the violation of the Lyapunov conditions and meanwhile, to
ensure the estimation over the entire state space. Discovering
A. Learning Module all the estimations may be computationally intractable for a
large-dimensional space or deep networks during the learning
With a training set S, the objective is to construct a candidate
process. Therefore,inspired by [22], we propose the Lyapunov-
Lyapunov function V satisfying the conditions in Definition
like cost that defines the expected cost with the learned Lyapunov
1. Here, we will present a technique to discover both control
function and control law, evaluated by the sampled state over the
and Lyapunov functions at the same time, by which the system
state space at each learning step.
stability can be ensured rigorously by the Lyapunov conditions.
Definition 5: (Lyapunov-like Cost) For a candidate V , the
The Lyapunov conditions can be described by:
Lyapunov-like cost is a defined by
1) V (0) = 0
2) V > 0 L(, Ψ)ρ = Ex∼ρ(X ) (α max(0, −V (x))
3) f (x,Ψ) V (x) < 0.
+β max(0, f (x,Ψ) V (x)) , (12)
In addition, the property of a Lyapunov function requires
only the local optima at 0. To enforce this constraint, we where x is sampled over X with a distribution ρ. (12) can be
represent V (x) by utilizing an input convex neural network estimated through Monte Carlo simulation, i.e.,
(ICNN) function g [28] in Learning Module, and the structure N
of ICNN is given in Fig. 3. It updates the parameters for and 1
L(, Ψ)N,ρ = α (max(0, −V (xk ))
Ψ simultaneously to improve the likelihood of non-violations, N
k=1
such as
+β max(0, f (xk ,Ψ) V (xk )) , (13)
z1 = 0 (w0 x + b0 ), where xk , k = 1, . . . , N , are the samples of X over ρ.
zi+1 = i (Ui zi + wi x + bi ), i = 1, . . . , l − 1, One important metric for controller performance is its ROA.
The ROA is a forward invariant set containing the origin, in
g(x) = zl , (8) which all possible system trajectories will never leave. When a
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
ZHAO et al.: NEURAL LYAPUNOV CONTROL FOR POWER SYSTEM TRANSIENT STABILITY: A DEEP LEARNING-BASED APPROACH 959
disturbance occurs, the ROA for a power system is a region in always satisfied [23]. The detailed effects of the δ selection on
which the operational status (e.g., rotor angles and frequencies) the algorithm can be found in [31].
can converge to the stable equilibrium [30]. Therefore, one can
enhance the power system robustness to a disturbance through
C. Termination
maximizing the ROA of a power system. Similar to [22], two
loss items are added to the Lyapunov-like cost. The loss items Recall that in the proposed framework, Learning Module
measure the increasing rate of the Lyapunov function subject provides a learned Lyapunov function, and Falsification Module
to the radius of the level sets. The ROA-Lyapunov-like cost is refutes this function by reporting any violated states and appends
given by them into the training set. For a given Compatible set C, the
learning process is terminated when the dReal solver computes
N
1 2 ϕ(x) that is unsatisfying δ-complete in Falsification Module. A
L(, Ψ)N,ρ + xk − cV (xk ), (14)
N successful learning process returns a valid Lyapunov function
k=1
which satisfies (2) and stabilizes (5). In other words, the learn-
where c is a positive parameter. Learning Module learns a pair ing algorithm succeeds if Falsification Module can not return
of Ψ and V . However, the learned V needs not necessarily be any counterexample that violates Falsifier Compatibility. Thus,
a Lyapunov function since the limited samples do not cover all Theorem 3.1 concludes that the candidate function is a valid
states in X . To overcome this problem, the following falsification Lyapunov function.
module is introduced. Theorem 3.1: Considering the system (5) with a control
policy u with f being continuously differentiable, and the initial
B. Falsification Module states x0 ⊂ X , V (x) is a Lyapunov function, and then Ψ is the
corresponding control policy iff Falsification Module returns an
With the pair (V , Ψ) obtained in Learning Module, this empty set.
module implements a falsification process. It includes three Proof: See Appendix A.
steps: An SMT problem is formulated over real numbers for Note that when the learning process is terminated, the returned
formal verification. In particular, V is a Lyapunov function if solution will be rechecked by a much smaller δ to verify its
the following first-order logic formula is satisfied, correctness and robustness further. If there is a violation using
V (x) > 0 ∧ f (x,Ψ) V (x) < 0, x ∈ X \ 0, this smaller δ, these counterexamples will be added to the
training set, and the learning process is restarted until meeting
However, solving large and quantified formulas can be time- the termination condition.
consuming generally. To this end, we rewrite this formula to Remark 3.1: The proposed solution deploys neural networks
smaller satisfiability queries that can be solved efficiently by in Learning Module to equip our solution with the model-free
an SMT solver in practice. Specifically, we consider the dual feature. In sequence, Falsification Module ensures the correct-
falsification problem as ness of the learned results building on the δ-completeness of an
SMT solver [31]. Combining the above two modules equips our
(V (x) ≤ 0, x ∈ X \ 0) ∨ (f (x,Ψ) V (x) ≥ 0, x ∈ X \ 0), solution with the robustness to model uncertainties. Note that
ϕ1 ϕ2 the uncertainty would not only from the model development,
(15) but also from external disturbances. The influence of external
which is a nonlinear constraint satisfaction problem to determine disturbances will be investigated in our future work.
the existence of counterexamples. Therefore, we only need to Remark 3.2: The time-delay effect on the system stability has
verify two independent satisfiability queries, i.e., if either ϕ1 or been investigated in [32]. The influence of time-delay on system
ϕ2 is true. If any of them is true, an SMT solver returns the stability could be potentially integrated in our framework by 1)
violated samples for either of them, and the results of each of changing the Lyapunov-like cost representing Lyapunov condi-
formulas constitute the violated sample set, i.e., Sk :=
the two tions for a system with no delay to a cost representing Lyapunov
Sk−1 {(xa,k )}, where (xa,k ) is the violated sample at iteration conditions for a time-delayed system and 2) generating sample
k. On the other hand, if both of them have unsatisfied results, data from the time-delayed system.
no samples will be returned, and the corresponding Lyapunov
function is validated ∀x ∈ X . With the repression of the original
D. Solution Implementation
formula, we can speed up the verification process.
Note that solving an SMT formula with nonlinear constraints The flowchart of the proposed framework is shown in Fig. 4.
is NP-hard and computationally expensive. The recently devel- The learning process works until not satisfying the δ-complete
oped SMT solver over reals, dReal [31], is adopted in this work or exceeding the maximum learning iteration Kmax . At iteration
to handle this problem. The correctness and completeness of k, Learning Module contains a training set
violation reporting are guaranteed by the “δ - complete” defined
in [23]. A δ-complete algorithm returns a δ-weakening answer Sk := {x1 , . . . , xk } ⊂ X . (16)
such as 1) (15) has no solution; 2) a solution satisfies (15). Note
that δ-weakening is a syntactic relaxation of the original problem Accordingly, k defines the set of candidate parameters for
where if the original problem is satisfied, the δ-weakening is Vk (x) similar to Ψk . The aim is to learn a set of parameters
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
960 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 37, NO. 2, MARCH 2022
TABLE I
HYPERPARAMETERS
respecting Falsifier Compatibility, i.e., as well. These samples do not need to be the solution of (15).
⎧ ⎫ The introduced expedient does not hinder the overall sound-
⎪
⎪ V (xk ) > 0∧ ⎪
⎪
⎨ ⎬ ness of the solution but further enhances its performance and
k : | f (xk ,Ψk ) V (xk ) < 0 . (17) generalization.
⎪
⎪ ⎪
⎪
⎩ (xk ,Ψk )∈Ck ⎭ Remark 3.3: It should be noted that our framework could
potentially facilitate the assessment of small-signal stability.
Starting from an initialized sample set S0 = ∅ and 0 , the In particular, we consider linearized power system dynamics
learning procedure includes the following steps. that are widely adopted in analyzing small-signal stability [3],
1) Find the candidate Lyapunov function and control law: [7], and our framework can discover a corresponding Lyapunov
given a learning period Tl and Sk , Learning Module function. With the learned Lyapunov function, we can estimate
computes the candidate VTl (x) and Ψk , i.e., for j = the security region to certify system small-signal stability fol-
[1, . . . , Tl ], lowing [7].
Vj (x), Ψj (x) ← Neural networks(x), ∀x ∈ Sk
IV. CASE STUDIES
j , Ψj ← Lyapunov-likecostminimization (18)
In this section, we demonstrate the proposed solution that can
2) Verify the candidate: Falsification Module tests whether be potentially adapted to various power system control objec-
VTl (x) is a validated Lyapunov function over Ck , i.e., tives. In particular, we illustrate two widely adopted examples
a) Reformulate the condition (15) in the dReal format and in power system control and stability: i) power system transient
utilize the SMT solver to check condition violation stability; ii) islanded microgrid control.
b) Return the satisfiability, results and all violated samples The hyperparameters used in each case are listed in Table I.
if they exist. The performance of our solution is mainly decided by the pa-
3) Check the termination condition: rameters of α and β. We provide an empirical procedure to tune
a) If true, it terminates the learning process and return these parameters. Given an initial guess of α and β, let V be the
VTl (x) and Ψk . candidate function at iteration k. We can visualize the results of
b) Otherwise, it reports all violated samples {xviolated } to V and its derivative numerically, by which we may determine the
Learning Module. direction to tune some of the parameters based on the violation(s)
4) Update the training set: all violated samples are added to of Lyapunov conditions. Specifically, after the k iterations, we
the training set to produce the next learning iteration: visualize the learned k and its derivative respecting X . Then, we
can tune the parameters based on the visualization. For example,
Sk+1 : Sk {xviolated }. (19)
suppose the visualization of k returns a positive V , but the
The δ - complete guarantees the obtained VTl (x) is a derivative of V is non-negative, we will fix the other parameters
Lyapunov function and holds for all states in the state and increase β to recheck their selections.
space X , along with a stabilizer ΨTl . In the first two cases, the neural networks have 5 layers, and the
Note that the interaction frequency between Leaning Module number of nodes in layers is set as xn − 4xn − 4xn − 4xn − 1,
and Falsification Module is one of the essentials that affects where xn is the dimension of states. In the last case, it has
the convergence speed. Adding violated samples too frequently 6 layers, and the number of nodes in layers is set as xn −
might result in overfitting learning results since these samples 6xn − 6xn − 6xn − 6xn − 1. The input features are system
resemble each other. To overcome this issue, additional random states, and the output feature is V . The parametric variables of
samples from a neighborhood of xviolated are appended to Sk+1 neural networks are trained using subgradient-based methods.
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
ZHAO et al.: NEURAL LYAPUNOV CONTROL FOR POWER SYSTEM TRANSIENT STABILITY: A DEEP LEARNING-BASED APPROACH 961
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
962 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 37, NO. 2, MARCH 2022
Fig. 7. The results of SG 1 in Application I: V > 0 in the upper figure and Fig. 10. The estimated ROA for SG 1 in Application I.
f (x,Ψ) V < 0 in the lower figure.
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
ZHAO et al.: NEURAL LYAPUNOV CONTROL FOR POWER SYSTEM TRANSIENT STABILITY: A DEEP LEARNING-BASED APPROACH 963
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
964 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 37, NO. 2, MARCH 2022
TABLE II
STATISTICS OF LEARNING DIFFERENT TEST MODELS
V. CONCLUSION
Fig. 16. The results in Application III: V > 0 in the upper figure and
f (x,Ψ) V < 0 in the lower figure.
This paper proposes a neural power system control framework
leveraging a deep learning approach to closing the existing
gaps in power system stability. It can deal with a variety of
problems in power system control and stability analysis. The
proposed framework can produce the valid Lyapunov function
and control law simultaneously. The combination of learning and
falsification modules guarantees the system stability and further
demonstrates the performance of the obtained function over the
verified domain. With the flexible structure of the framework,
the control design for a typical power system problem is sig-
nificantly simplified while providing a larger ROA. Several case
studies are carried out to verify the effectiveness of the proposed
framework in different problems.
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
ZHAO et al.: NEURAL LYAPUNOV CONTROL FOR POWER SYSTEM TRANSIENT STABILITY: A DEEP LEARNING-BASED APPROACH 965
REFERENCES [24] Z. Arstein, “Stabilization with relaxed controls,” Nonlinear Anal., vol. 7,
no. 11, pp. 1163–1173, 1983.
[1] P. Kundur et al., “Definition and classification of power system stability [25] A. Kasis, E. Devane, C. Spanias, and I. Lestas, “Primary frequency
IEEE/CIGRE joint task force on stability terms and definitions,” IEEE regulation with load-side participation-Part I: Stability and optimality,”
Trans. Power Syst., vol. 19, no. 3, pp. 1387–1401, Aug. 2004. IEEE Trans. Power Syst., vol. 32, no. 5, pp. 3505–3518, Sep. 2017.
[2] C. Mishra, A. Pal, J. S. Thorp, and V. A. Centeno, “Transient stability [26] H. R. Baghaee, M. Mirsalim, and G. B. Gharehpetian, “Power calculation
assessment of prone-to-trip renewable generation rich power systems using using RBF neural networks to improve power sharing of hierarchical
Lyapunov’s direct method,” IEEE Trans. Sustain. Energy, vol. 10, no. 3, control scheme in multi-der microgrids,” IEEE Trans. Emerg. Sel. Topics
pp. 1523–1533, Jul. 2019. Power Electron., vol. 4, no. 4, pp. 1217–1225, Dec. 2016.
[3] Y. Zhang and L. Xie, “Online dynamic security assessment of microgrid [27] H. R. Baghaee, M. Mirsalim, G. B. Gharehpetan, and H. A. Talebi,
interconnections in smart distribution systems,” IEEE Trans. Power Syst., “Nonlinear load sharing and voltage compensation of microgrids based
vol. 30, no. 6, pp. 3246–3254, Nov. 2015. on harmonic power-flow calculations using radial basis function neural
[4] Z. Wang, F. Liu, S. H. Low, C. Zhao, and S. Mei, “Distributed frequency networks,” IEEE Syst. J., vol. 12, no. 3, pp. 2749–2759, Sep. 2018.
control with operational constraints, Part I: Per-node power balance,” IEEE [28] B. Amos, L. Xu, and J. Z. Kolter, “Input convex neural networks,” in Proc.
Trans. Smart Grid, vol. 10, no. 1, pp. 40–52, Jan. 2019. 34th Int. Conf. Mach. Learn., 2017, pp. 146–155.
[5] Y. Mi, Y. Fu, C. Wang, and P. Wang, “Decentralized sliding mode load [29] J. Z. Kolter and G. Manek, “Learning stable deep dynamics models,” in
frequency control for multi-area power systems,” IEEE Trans. Power Syst., Proc. Adv. Neural Inf. Process. Syst., 2019, pp. 11128–11136.
vol. 28, no. 4, pp. 4301–4309, Nov. 2013. [30] C. Zhai and H. D. Nguyen, “Region of attraction for power systems using
[6] T. Stegink, C. De Persis, and A. van der, Schaft, “A unifying energy-based Gaussian process and converse Lyapunov function-Part I: Theoretical
approach to stability of power grids with market dynamics,” IEEE Trans. framework and off-line study,” 2019, arXiv:1906.03590.
Autom. Control, vol. 62, no. 6, pp. 2612–2622, Jun. 2017. [31] S. Gao, S. Kong, and E. M. Clarke, “dReal: An SMT solver for nonlinear
[7] T. L. Vu and K. Turitsyn, “Lyapunov functions family approach to theories over the reals,” in Proc. Int. Conf. Automated Deduction, 2013,
transient stability assessment,” IEEE Trans. Power Syst., vol. 31, no. 2, pp. 208–214.
pp. 1269–1277, Mar. 2016. [32] H. R. Baghaee, M. Mirsalim, G. B. Gharehpetian, and H. A. Talebi, “A gen-
[8] H. Liu, Z. Hu, and Y. Song, “Lyapunov-based decentralized excitation con- eralized descriptor-system robust H∞ control of autonomous microgrids
trol for global asymptotic stability and voltage regulation of multi-machine to improve small and large signal stability considering communication
power systems,” IEEE Trans. Power Syst., vol. 27, no. 4, pp. 2262–2270, delays and load nonlinearities,” Int. J. Elect. Power Energy Syst., vol. 92,
Nov. 2012. pp. 63–82, 2017.
[9] E. Mallada, C. Zhao, and S. Low, “Optimal load-side control for frequency [33] M. Ayar, S. Obuz, R. D. Trevizan, A. S. Bretas, and H. A. Latchman, “A
regulation in smart grids,” IEEE Trans. Autom. Control, vol. 62, no. 12, distributed control approach for enhancing smart grid transient stability
pp. 6294–6309, Dec. 2017. and resilience,” IEEE Trans. Smart Grid, vol. 8, no. 6, pp. 3035–3044,
[10] A. J. Wood, B. F. Wollenberg, and G. B. Sheblé, Power Generation, Nov. 2017.
Operation, and Control. Hoboken, NJ, USA: Wiley, Inc., 2013. [34] F. Dorfler and F. Bullo, “Synchronization and transient stability in power
[11] Y. Guo, D. J. Hill, and Y. Wang, “Global transient stability and voltage networks and nonuniform Kuramoto oscillators,” SIAM J. Control Optim.,
regulation for power systems,” IEEE Trans. Power Syst., vol. 16, no. 4, vol. 50, no. 3, pp. 1616–1642, 2012.
pp. 678–688, Nov. 2001. [35] P. W. Sauer and M. A. Pai, Power System Dynamics and Stability. New
[12] Y. Zhang and L. Xie, “A transient stability assessment framework in York, NY, USA: Wiley Online Library, 1998, vol. 101.
power electronic-interfaced distribution systems,” IEEE Trans. Power [36] P. M. Anderson and A. A. Fouad, Power System Control and Stability.
Syst., vol. 31, no. 6, pp. 5106–5114, Nov. 2016 Hoboken, NJ, USA: Wiley, Inc., 2008.
[13] Y. Pan, L. Chen, X. Lu, J. Wang, F. Liu, and S. Mei, “Stability region of [37] F. Dörfler, M. R. Jovanović, M. Chertkov, and F. Bullo, “Sparsity-
droop-controlled distributed generation in autonomous microgrids,” IEEE promoting optimal wide-area control of power networks,” IEEE Trans.
Trans. Smart Grid, vol. 10, no. 2, pp. 2288–2300, Mar. 2019. Power Syst., vol. 29, no. 5, pp. 2281–2291, Sep. 2014.
[14] J. W. Simpson-Porco, F. Dörfler, and F. Bullo, “Voltage stabilization in [38] H. Kwakernaak and R. Sivan, Linear Optimal Control Systems. New York,
microgrids via quadratic droop control,” IEEE Trans. Autom. Control, NY, USA: Wiley-interscience, 1972, vol. 1.
vol. 62, no. 3, pp. 1239–1253, Mar. 2017. [39] J. W. Simpson-Porco, Q. Shafiee, F. Dörfler, J. C. Vasquez, J. M. Guer-
[15] L. Zhu, D. J. Hill, and C. Lu, “Hierarchical deep learning machine for rero, and F. Bullo, “Secondary frequency and voltage control of islanded
power system online transient stability prediction,” IEEE Trans. Power microgrids via distributed averaging,” IEEE Trans. Ind. Electron., vol. 62,
Syst., vol. 35, no. 3, pp. 2399–2411, May 2020. no. 11, pp. 7025–7038, Nov. 2015.
[16] G. Zhang et al., “Deep reinforcement learning-based approach for propor- [40] Y. Du, X. Lu, J. Wang, and S. Lukic, “Distributed secondary control
tional resonance power system stabilizer to prevent ultra-low-frequency strategy for microgrid operation with dynamic boundaries,” IEEE Trans.
oscillations,” IEEE Trans. Smart Grid, vol. 11, no. 6, pp. 5260–5272, Smart Grid, vol. 10, no. 5, pp. 5269–5282, Sep. 2019.
Nov. 2020. [41] P. Yang, F. Liu, Z. Wang, and C. Shen, “Distributed stability conditions for
[17] Y. Zhang, C. Chen, G. Liu, T. Hong, and F. Qiu, “Approximating trajectory power systems with heterogeneous nonlinear bus dynamics,” IEEE Trans.
constraints with machine learning microgrid islanding with frequency Power Syst., vol. 35, no. 3, pp. 2313–2324, May 2020.
constraints,” IEEE Trans. Power Syst., vol. 36, no. 2, pp. 1239–1249, [42] W. M. Haddad and V. Chellaboina, Nonlinear Dynamical Systems and
Mar. 2021. Control: A Lyapunov-Based Approach. Princeton, NJ, USA: Princeton
[18] Z. Yan and Y. Xu, “Data-driven load frequency control for stochastic power Univ. Press, 2011.
systems: A deep reinforcement learning method with continuous action
search,” IEEE Trans. Power Syst., vol. 34, no. 2, pp. 1653–1656, Mar. 2019.
[19] Y. Zhang, X. Wang, J. Wang, and Y. Zhang, “Deep reinforcement learn-
ing based volt-var optimization in smart distribution systems,” 2020, Tianqiao Zhao (Member, IEEE) received the B.Eng.
arXiv:2003.03681. degree in automatic control from North China Electric
[20] A. J. Taylor, V. D. Dorobantu, H. M. Le, Y. Yue, and A. D. Ames, “Episodic Power University, Hebei, China, in 2013 and the
learning with control Lyapunov functions for uncertain robotic systems,” Ph.D. degree in electrical and electronic engineering
2019, arXiv:1903.01577. from the University of Manchester, Manchester, U.K.,
[21] P. Yang, F. Liu, Z. Wang, and C. Shen, “Distributed stability conditions for in 2019. From September 2018 to August 2019, he
power systems with heterogeneous nonlinear bus dynamics,” IEEE Trans. was a Postdoctoral Associate with the Department
Power Syst., vol. 35, no. 3, pp. 2313–2324, May 2020. of Electrical & Electronic Engineering, University of
[22] Y.-C. Chang, N. Roohi, and S. Gao, “Neural Lyapunov control,” Manchester. From September 2019 to February 2021,
pp. 3245–3254, 2019. [Online]. Available: http://papers.nips.cc/paper/ he was a Postdoctoral Fellow with the Department
8587-neural-lyapunov-control.pdf of Electrical and Computer Engineering, Southern
[23] S. Gao, J. Avigad, and E. M. Clarke, “δ-Complete decision procedures for Methodist University, Dallas, TX, USA. He is currently with Brookhaven
satisfiability over the reals,” in Proc. Int. Joint Conf. Automated Reasoning, National Laboratory, as a Research Associate. His research interests include
2012, pp. 286–300. distributed optimization and control, microgrid, and energy storage systems.
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.
966 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 37, NO. 2, MARCH 2022
Jianhui Wang (Fellow, IEEE) is currently a Profes- Yuhua Du (Member, IEEE) received the B.S degree
sor with the Department of Electrical and Computer in electrical engineering from Xi’an Jiaotong Uni-
Engineering, Southern Methodist University, Dallas, versity, Xi’an, China, in 2013 and the Ph.D. degree
TX, USA. He has authored and/or coauthored more in electrical and computer engineering from North
than 300 journal and conference publications, which Carolina State University, Raleigh, NC, USA in 2019.
have been cited for more than 30000 times by his peers In 2018, he was a Research Aide with Argonne
with an H-index of 87. He has been invited to give National Laboratory. He is currently a Postdoctoral
tutorials and keynote speeches at main conferences, Fellow with Temple University, Philadelphia, PA,
including IEEE ISGT, IEEE SmartGridComm, IEEE USA. His research interests include voltage source
SEGE, IEEE HPSC, and IGEC-XI. converter modeling and control, microgrid distributed
He is the past Editor-in-Chief of the IEEE TRANS- secondary control, and development of microgrid
ACTIONS ON SMART GRID and an IEEE PES Distinguished Lecturer. He is also a hardware-in-the-loop testbed.
Guest Editor of a Proceedings of the IEEE special issue on power grid resilience.
He was the recipient of the IEEE PES Power System Operation Committee
Prize Paper Award in 2015, the 2018 Premium Award for Best Paper in IET
Cyber-Physical Systems: Theory & Applications, the Best Paper Award in IEEE
TRANSACTIONS ON POWER SYSTEMS in 2020, and the IEEE PES Power System
Operation, Planning, and Economics Committee Prize Paper Award in 2021. Dr.
Wang is a Clarivate Analytics highly cited Researcher for production of multiple
highly cited papers that rank in the top 1% by citations for field and year in Web
of Science (2018-2020).
Authorized licensed use limited to: ASU Library. Downloaded on July 04,2023 at 10:32:27 UTC from IEEE Xplore. Restrictions apply.