UAV Model-Based Flight Control With Artificial Neural - Survey

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 23

Journal of Intelligent & Robotic Systems (2020) 100:1469–1491

https://doi.org/10.1007/s10846-020-01227-8

UAV Model-based Flight Control with Artificial Neural


Networks: A Survey
Weibin Gu1 · Kimon P. Valavanis1 · Matthew J. Rutherford1 · Alessandro Rizzo2

Received: 5 February 2020 / Accepted: 23 June 2020 / Published online: 21 September 2020
© Springer Nature B.V. 2020

Abstract
Model-Based Control (MBC) techniques have dominated flight controller designs for Unmanned Aerial Vehicles (UAVs).
Despite their success, MBC-based designs rely heavily on the accuracy of the mathematical model of the real plant and they
suffer from the explosion of complexity problem. These two challenges may be mitigated by Artificial Neural Networks
(ANNs) that have been widely studied due to their unique features and advantages in system identification and controller
design. Viewed from this perspective, this survey provides a comprehensive literature review on combined MBC-ANN
techniques that are suitable for UAV flight control, i.e., low-level control. The objective is to pave the way and establish a
foundation for efficient controller designs with performance guarantees. A reference template is used throughout the survey
as a common basis for comparative studies to fairly determine capabilities and limitations of existing research. The end-
result offers supported information for advantages, disadvantages and applicability of a family of relevant controllers to
UAV prototypes.

Keywords Model-based control (MBC) · Artificial neural network (ANN) · Flight control · Hybridization · Unmanned
aerial vehicle (UAV)

1 Introduction Inversion (NDI) [1], Adaptive Control [2], Model Predictive


Control (MPC) [3], Sliding Mode Control (SMC) [4],
1.1 Motivation and Rationale Backstepping Control [5] and H∞ Robust Control [6].
Despite promising results, a challenge MBC-based
Model-Based Control (MBC) techniques have found great designs suffer from, is their dependence on the accuracy
applicability in the controller design for Unmanned of the mathematical model of the real plant [7]. A
Aerial Vehicles (UAVs). This is because control-oriented poorly derived or defined model, due to imprecise system
modeling, being the core of MBC techniques, allows for knowledge and ubiquitous exogenous disturbances, may
a systematic way to controller synthesis that basically adversely impact subsequent controller synthesis that leads
facilitates the design and development process by reducing to unacceptable performance or even instability. Such
the effort to the tuning and calibration processes. Moreover, uncertainties and disturbances may be classified as:
the focus of MBC techniques is on guaranteeing system
– Parametric uncertainties: These are normally induced
stability and performance, improving robustness with
by miss-modeling and/or system degradation (e.g.,
respect to uncertainties and disturbances, and finding the
mass and inertia changes, etc).
optimally designed controller. Recent findings are presented
– Unmodeled dynamics: These refer to difficult-to-
in Feedback Linearization (FL), a.k.a. Nonlinear Dynamic
model, ill-defined and intentionally ignored aspects of a
nonlinear model, which include advanced aerodynamic
This work is partially supported by an NSF Grant, CMMI-DCSD- effects such as blade flapping [8], effect of airflow [9],
1728454, Compagnia di San Paolo, and by an Amazon Research
Award granted to Dr. A. Rizzo.
ground and ceiling effect [10, 11], etc.
– Disturbances and noise: Disturbances may include
 Weibin Gu wind gusts and turbulence, while noise mainly refers
weibin gu@yeah.net
to sensor noise. Considered assumptions may not be
realistic as the statistical properties of sensor noise is
Extended author information available on the last page of the article. mostly non-Gaussian in practice.
1470 J Intell Robot Syst (2020) 100:1469–1491

To tackle such challenges, a wide spectrum of modern time; relatively easy processing procedure and hardware
control techniques has been proposed, each with advan- implementation.
tages, limitations and drawbacks. For example, although a Such advantages allow for ANNs to offer a perfect tool to
widely-used technique is Gain Scheduling (GS) [12] that build the system underlying model with high accuracy and
demonstrates good capabilities of coping with parameter low complexity, even when corrupted with uncertainties and
variations and nonlinearities, frequent and fast changes of disturbances, and to facilitate the implementation process
the controller gains may drive the system unstable [13]. In and to enhance real-time performance. Regardless, there
addition, the cost of design and implementation increases still exist challenges due to their data-driven nature that
with the number of operating points as stated in [14]. On precludes applicability in industry, to some extent, because
the other hand, robust control is effective when considering of: the requirement for large sets of training data; being
bounded parametric uncertainties but it has limitations when prone to learn spurious relationships that may lead to poor
considering either unbounded ones or unmodeled dynam- generalization capabilities [25]; lack of interpretability due
ics [6, 15]. Adaptive control, as a viable solution (due to to their black-box characteristics; no systematic approach
its strength of real-time adaptation) is suitable to manage for related ANN architecture designs (i.e., given a selected
parametric uncertainties; however, there barely exist widely ANN architecture, the number of layers and neurons, the
accepted solutions to the robust adaptive control problem, type of activation functions, weight updating mechanisms
so far [16]. Sliding mode control has been shown to be and so forth, are generally arbitrarily determined, rather than
robust with respect to modeling errors and parameter uncer- in a systematic way).
tainties, but chattering is an issue occurred by frequent When focusing on UAVs, a flight control system,
controller switches. Besides, characteristics of insensitive to being a safety-critical system, requires focus and priority
parameter changes may pose problems to self-stabilization on safety and reliability. As such, the ANN black-box
when exogenous disturbances appear. Last, but not least, characteristics have contributed to their acceptance and
Model-Free Control (MFC) techniques that have emerged use, at least from the practical point of view. The
to overcome unmodeled dynamics and uncertainties of non- same problems can be tackled and handled by MBC
linear systems, have demonstrated great adaptation and techniques if such techniques are incorporated properly
estimation capabilities due to the use of a continuously into the controller design. For example, the explicit
updated model, namely ultra-local model [17, 18]. However, system knowledge that MBC techniques heavily rely on,
this methodology is currently limited to system dynamics can be leveraged to facilitate the training process (by
that can be converted to Single-Input Single-Output (SISO) generating training data and/or speeding up the training
subsystems. Besides, there are questions related to analytic process, etc.) and to improve performance, generalization
stability and convergence proofs. capabilities, and interpretability of ANNs. This results in
Another important challenge of MBC techniques relates a complementary relation between MBC techniques and
to the explosion of complexity issue [19]. This arises ANNs, the combination of which, even by intuition, is
from, but it is not limited to: the mathematical inverse expected to overcome stated limitations.
model required in feedback linearization; the repeated
differentiation of virtual controllers in backstepping control; 1.2 Problem Statement and Contributions
the prediction over a future horizon of the plant behavior
as well as the online optimization process in MPC. All The objective of this paper is to provide the comprehensive
three limitations may result in a hard-to-implement in real- background to design and implement a combined controller,
time controller due to the computational burden, particularly focusing on UAV applications. More specifically, the
when it comes to highly nonlinear complex systems, such as controller aims to achieve stable and robust autonomous
UAVs, in which the complexity of the controller increases flight under uncertainties and distances with reduced design
drastically as the system order increases. and implementation complexity by combining MBC flight
To overcome the previously stated challenges of MBC- controller with ANNs, i.e., MBC-ANN flight controller. To
based designs, Artificial Neural Networks (ANNs) are being achieve this goal, the following issues need to be addressed
exploited to study complex control systems. This is mostly and solved:
attributed to the perceived advantages of ANNs in system
identification and controller design, to say the least [20–22], i. (Stability): How can system stability and weight
including: their capability to identify nonlinear and multi- convergence of ANNs is guaranteed when combining
variable systems [23, 24]; ability to learn and adapt in real MBC techniques with ANNs?
J Intell Robot Syst (2020) 100:1469–1491 1471

ii. (Robustness): How ANNs may be exploited during – In [21], deep learning techniques and their applications
modeling or controller synthesis along with MBC for UAV-based solutions are surveyed, mainly focusing
techniques to cope with uncertainties and distur- on high-level control including path planning, situation
bances? awareness, etc. Nonetheless, there is a lack of
iii. (Paucity of data): How sufficient and suitable training elaboration on low-level control design.
data becomes available, particularly when online – In [27], a general review on advances of modeling and
training (or adaption) is engaged? control of UAV maneuvering flights is given, whereas
iv. (Interpretability): How interpretability of ANNs is MBC techniques and ANNs are separately discussed.
improved to make them trustworthy to the users?
v. (Real-time performance): What is the computational There is no literature review on how ANNs have been
complexity of the designed controller? Is it imple- used in the MBC systems either. This survey, although built
mentable in real-time? Is the ANN architecture suit- on previous studies [28], is the first comprehensive review
able for hardware implementation? on model-based flight control combined with ANNs.
Next, a unified framework is presented, namely the pro-
Technically justified and supported answers to the above
posed reference template to facilitate comparative studies of
questions require a comprehensive literature review of MBC
existing research on UAV model-based flight control with
techniques combined with ANNs, with emphasis on UAV
ANNs. Specifically, the comprehensive literature review is
low-level flight control. This will pave the way to efficient
conducted based on set objectives in the reference tem-
controller designs with performance guarantees, resulting in
plate and discussed along with strengths and limitations in
the following contributions:
Section 3 and 4. The set of objectives elaborated in the
– A thorough review of existing literature related sequel include: (i) UAV type, (ii) ANN functionalities, (iii)
to different techniques, which are compared in a ANN architecture, (iv) ANN training methods, (v) com-
unified framework (i.e., reference template) to analyze putational complexity, (vi) control techniques, (vii) flight
advantages and limitations. maneuvers, (viii) type of obtained results, and (ix) other
– Study and determination of the computational com- problems to be addressed.
plexity of several frequently used ANNs in UAV flight
control applications to provide a-priori knowledge of 2.1 UAV Type
real-time performance during the design phase.
This refers to the general UAV type under control, e.g.,
The remainder of this paper is organized as follows.
rotary wing, fixed-wing, tilt wing, tail sitter, etc. Since
Section 2 presents a reference template to provide a
any low-level flight controller design is tightly associated
common basis for comparative studies and to fairly
with the UAV dynamic model, the type of UAV makes a
determine the capabilities and limitations of existing work.
significant difference. Hence, it is identified for comparison
An analysis on computational complexity is also carried out
purposes, separately from other components like hardware,
for several frequently used ANNs, therein. Section 3 and
software, sensors, etc.
4 conduct the comparison and classification of approaches
based on a thorough literature review on dynamic modeling
2.2 ANN Functionalities
and control techniques, respectively. Section 5 concludes
this survey and sheds light on the future work.
ANNs are composed of layers, of which applications
fall into two mathematical problems: regression and
classification. From the control perspective, however, ANN
2 Reference Template
functionalities may be defined based on the role the ANN
plays in the control system, e.g., UAV dynamic modeling,
It is essential to provide the rationale for the adopted
adaptive control augmentation, observer and estimator, fault
reference template that will be followed. As such, related
detection component, etc.
published surveys on MBC techniques and ANNs for
UAV applications have been studied to determine their
2.3 ANN Architecture
limitations.
– In [26], hardware implementation of ANNs for aero- It is considered essential to provide a brief summary of
nautical applications is reviewed. Some key issues on ANN mathematical models. With this knowledge in mind, it
FPGA (Field-Programmable Gate Array)-based imple- will be easier for the reader to comprehend classifications of
mentation are posed. However, the design of the flight different ANN architecture as well as their advantages and
control system is not treated. limitations for flight controller design.
1472 J Intell Robot Syst (2020) 100:1469–1491

2.3.1 Historical perspective be hard-limit, linear, log-sigmoid or tan-sigmoid function


[34], chosen by the designers considering the requirements
The smallest element in ANNs is named as neuron by of network performance.
analogy with neurophysiology, whose simplified model An ANN is a typical network (e.g., see Fig. 2), which is
was firstly studied by McCulloch and Pitts in the 1940s made up of a single input layer, single or multiple hidden
[29]. Few years later, the very first simple model of layers, and a single output layer. Each layer (except the
ANN, namely perceptron, was proposed by Rosenblatt input) comprises a number of neurons as introduced earlier,
[30], which was a supervised learning model receiving possibly governed by different activation functions. To draw
wide attention. However, subsequent research progress was consistent analyses in the sequel, we follow the convention
somehow impeded at that time until Back-Propagation throughout this survey that the total number of layers of
(BP) algorithm appeared in 1986 [31], a powerful tool for an ANN is only associated with hidden and output layers,
training feed-forward networks proposed by Rumelhart et regardless of input layer. For example, an ANN as depicted
al. Coupled with Multi-Layer Perceptron (MLP) network, a in Fig. 2 with single hidden layer is considered to have two
number of problems on classification and regression could layers in total (i.e., one hidden, one output). In the rest of
be practically solved with satisfactory results. Nonetheless, this survey, we denote the size of an ANN by nL ×nN , where
the thriving development of Support Vector Machine (SVM) nL , nN ∈ Z+ represent the number of layers and neurons,
starting from 1992 witnessed AI (Artificial Intelligence) respectively. Therefore, given a particular two-layer ANN
winter for the next decade. In 2006, Hinton et al. with 50 hidden neurons, we can simply specify its size by
proposed the concept of Deep Learning (DL) as opposed 2 × 50. Furthermore, we use the punctuation mark — colon
to shallow learning [32]. Given many more layers, Deep (:), along with a number ahead to denote the total number
Neural Networks (DNNs) are endowed with much powerful of a specified ANN being used. With 2 : 2 × 50, it means
potential, especially in pattern recognition and object that this particular two-layer ANN with 50 hidden neurons
detection [33]. Motivated by DL, an increasing number is used twice in the design (in other words, two 2 × 50
of ANNs with more complicated architecture have been networks).
popping up since then. Figure 1 depicts the brief history of
ANN indicated with big events happened. 2.3.3 Feed-forward and recurrent ANNs

2.3.2 Mathematical model of neuron and ANN In terms of neuron connections, ANNs can be categorized
into one of the following two types: feed-forward and
An artificial neuron (or simply, a neuron) behaves like recurrent. Neurons in a feed-forward network only receives
a function in principle. The simple model of a neuron input from previous layer, while feedback connections exist
generally consists of a scalar-valued activation function in a recurrent network. Due to the additional connections,
f : R → R and two training parameters, namely input recurrent networks are endowed with large dynamical
weight matrix W ∈ R1×N and bias weight b ∈ R1×1 , memory which feed-forward networks generally do not
where N is the number of elements in the input vector possess, however, at the cost of increasing complexity.
p ∈ RN×1 . The output a ∈ R1×1 is generated through a = In terms of dynamic characteristics, recurrent networks
f (Wp+b) where the engaged activation function f (·) could may also be named as dynamic networks as opposed to

Fig. 1 Historical development


of ANN
J Intell Robot Syst (2020) 100:1469–1491 1473

Fig. 2 A typical architecture of


feed-forward neural networks

static networks referring to feed-forward networks. Note networks were proposed which are normally two-layer
that with the use of Tapped Delay Lines (TDLs), feed- recurrent networks having a feedback connection from the
forward networks can also be regarded as dynamic net- output of the hidden layer to its input[40]. Due to this
works. For instance, Focused Time-Delay Neural Network additional connection, Elman networks have the capabilities
(FTDNN)1 (as part of a general class of dynamic networks, of detecting and generating time-varying patterns. Echo
namely focused networks) refers to feed-forward networks State Networks (ESNs) are a special type of recurrent
with tapped delay lines only at the input. networks, initially introduced by Jaeger in 2001 [41].
Some typical and widespread ANNs are presented Thanks to the concept of Reservoir Computing (RC), the
in Table 1, classified based on neuron connections. training of such recurrent networks becomes conceptually
Only a brief introduction to these ANNs is given here, simple and computationally inexpensive. Long Short-Term
mainly recalling their origin and key features. Multi- Memory (LSTM) networks are a typical type of recurrent
Layer Perceptron (MLP) networks [30] and Radial Basis networks, firstly proposed by Hochreiter and Schmidhuber
Function Neural Networks (RBFNNs) [35] are the two well- in 1997 [42], capable of learning long-term dependencies
studied architecture of feed-forward networks originated by solving the problem of vanishing gradients [43]. Similar
in the late 1900s, mainly differing in the underlying to LSTM units, Gated Recurrent Units (GRUs) are a gating
activation functions. Subsequently, many variations have mechanism particularly used in recurrent networks while
appeared with different functions taking on the role of involving fewer parameters, firstly introduced by Cho et al.
activation function. For instance, Wavelet Neural Network in 2014 [44]. At last, we stress the importance of a series
(WNN) was proposed by Zhang et al. in 1992 to decrease of biologically plausible neuron models, namely spiking
the number of nodes required in the network by using neuron models, which describe the properties of biological
Mexican hat wavelet as activation function [36]; Rectified- neurons to communicate via voltage spikes. The earliest
Linear Unit (ReLU) was firstly introduced to a dynamical spiking neuron model is called Hodgkin-Huxley model [45]
network by Hahnloser et al. in 2000 to solve gradient while the Leaky Integrate-and-Fire (LIF) model [46] is more
vanishing issues [37]. On the other hand, motivated by preferable in controller design nowadays due to its simpler
cerebellum neurophysiological model, Cerebellar Model structure. Such bio-inspired neural networks are known as
Articulation Controller (CMAC, a.k.a. cerebellar model Spiking Neural Networks (SNNs). All these representatives
arithmetic computer) was initially proposed by Albus for are also marked in Fig. 1 for visualization purposes.
robotic manipulator control in 1975 [38]. As for recurrent
networks, Hopfield networks serve as content-addressable 2.4 ANN Training Methods
memory systems with saturated linear transfer functions,
popularized by Hopfield in 1982 [39]. Later in 1990, Elman Back-Propagation (BP) algorithm is a gradient-based
approach belonging to supervised learning. The procedures
1 More details please refer to MathWorks Documentation for updating network weights by using BP algorithm are as
on “Design Time Series Time-Delay Neural Networks”: follows:
https://ww2.mathworks.cn/help/deeplearning/ug/design-time-series-
time-delay-neural-networks.html. Step 1: Take a batch of training data.
1474 J Intell Robot Syst (2020) 100:1469–1491

Table 1 Classification of ANNs computationally expensive and may suffer from divergence
owing to a paucity of data.
Type of connectionANNs

Feed-forward Multi-Layer Perceptron (MLP) network 2.5 Computational Complexity


Radial Basis Function Neural Network (RBFNN)
Rectified-Linear Unit (ReLU) network Despite the parallel processing capability that ANNs
Wavelet Neural Network (WNN) uniquely possess, not all the implementations can rely
Cerebellar Model Articulation Controller (CMAC) on it if not supported by the suitable hardware such
as FPGAs. This is not uncommon in practice since
Recurrent Elman network many ANN-based applications are still developed on
Hopfield network CPU due to simplicity reasons (e.g., relatively simple
Echo State Network (ESN) development and deployment in computer systems), where
Long Short-Term Memory (LSTM) network the data cannot be processed in parallel unfortunately. More
Gated Recurrent Unit (GRU) network importantly, the real-time performance should be stressed
Spiking Neural Network (SNN) from the perspective of controller design when ANNs are
incorporating into MBC system design. Hence, no matter
the hardware foundation, there is always a point to carry out
analyses on computational complexity of ANNs being used
Step 2: Perform forward-propagation to obtain the loss
beforehand.
(i.e., the difference between actual/predicted and desired
Computational complexity refers to the computational
output).
resources such as computational time and amount of
Step 3: Perform back-propagation on the loss to derive
memory required to solve a computational problem. In the
gradients.
context of this paper, the main focus has been placed on
Step 4: Update network weights using the gradients.
the computational time for ANNs, also known as Time
Complexity (TC). Motivated by [49], one of the very few
However, BP algorithm works only for feed-forward net- works that have treated computational issues meticulously,
works and requires modifications to be applied for recurrent an analysis on TC of ANNs is detailedly addressed in the
networks. For instance, Back-Propagation Through Time sequel, offering a clear mind on the required computational
(BPTT) [47] and Real Time Recurrent Learning (RTRL) resources during system design.
[48] are the two alternatives for the use of recurrent network The TC of an ANN can be attributed to three phases:
training. (i) offline training, (ii) online training, and (iii) feed-
Apart from gradient-based approach, linear regression is forward propagation. For offline ANNs, TC can be simply
also an efficient and effective way for some typical ANN associated with feed-forward propagation, given that the
architecture (e.g., ESN) due to its one-shot training process. time for offline training is not critical for the applications.
Additionally, network weights can also be adjusted by using For online ANNs, the time for online training has to be
Lyapunov stability theory which provides mathematical considered along with that for feed-forward propagation.
proof of stability when network is employed in the control Similarly, the time for offline training can be negligible
design. depending on applications. Since there is a wide variety
Based upon the mode of network training, ANNs can of ANN architecture and online training techniques, the
also be categorized into offline and online networks. analysis of TC for online training phase can be very
Offline networks refer to those with pre-trained and fixed complicated. Hence, we will leave this topic as one of
network parameters. For applications where offline training our future research topics. In the following, we show that
time is not critical, such networks are prevalent as they TC with regard to feed-forward propagation can be easily
offer higher accuracy models in general. On the contrary, expressed as a function of input nodes, hidden neurons
online networks refer to those with online adaptation of and output nodes. We denote the number of input nodes,
network parameters. Due to this unique learning ability, output nodes and hidden neurons by N, K, and L ∈ Z+ ,
online networks are of great interest to applications where respectively, in the formulation.
online adaptation is essential. For example, a dynamic The time required for running algorithms, generally
model of UAV constructed by an online network would expressed as the number of elementary operations included,
be more ideal. As such, all the dynamic changes can be can be varying with respect to the computer being used.
automatically captured by adjusting the network weights However, since the elementary operations are normally
in real time. However, since online networks involves assumed to take constant execution time on a specific
online adaptation, the resulting training process can be computer, executing on different computers can be hence
J Intell Robot Syst (2020) 100:1469–1491 1475

regarded as changing the execution time by a constant factor SISO methods while MIMO methods include Linear
determined by the computer configurations. Motivated by Quadratic Regulator (LQR), Linear Quadratic Gaussian
[49], Table 2 presents TC for different operations that might (LQG), H∞ control, etc. Nonlinear controllers can be
be used in the subsequent ANNs. It is noted that the choice further divided into linearized and fully nonlinear methods.
of multiplication algorithm directly affects the complexity Linearized methods start with a nonlinear model and
of division, exponentiation, exponential function and square proceed with linearization process to derive a linear model
root. In our case, we select 3-way Toom-Cook algorithm for subsequent control design, which include adaptive
for multiplication in spite of the fact that there are other control, Model Predictive Control (MPC), Sliding Mode
variations of algorithm available such as Schoolbook long Control (SMC), etc. Fully nonlinear methods, for instance,
multiplication and Karatsuba algorithm [50]. The scalar backstepping control, can be directly applied on nonlinear
constants Ki ∈ R+ with i = A, E, M, φ represent the dynamics without the need of linearization process. As
ratios between the computational time cost for different this survey is focused on the combination of MBC
operations calculated with algorithms detailed in Table 2 techniques and ANNs, model-free approaches will not be
and the unit time cost (which is exactly defined as the discussed in detail. Table 4 summarizes the advantages and
time cost for addition/subtraction operation), hence it disadvantages of linear and nonlinear controllers presented
always holds that Ki  1 [49]. Since the constants Ki in Fig. 3.
for multiplication, division and square root are identical
which are all determined by multiplication algorithm, 2.7 Flight Maneuvers
these constants are equivalently denoted by KM with a
slight abuse of notation. Additionally, subscript A denotes Flight maneuvers of UAVs can be categorized into basic and
addition/subtraction operation, E denotes exponentiation, aggressive maneuvers. Basic maneuvers, including hover,
and φ denotes exponential function. Similarly, Ni ∈ R level flight, turns, and climbs or descents, are associated
with i = A, E, M, φ denote the number of each operation with a single operating condition, thereby can be normally
involved in a network. To make the results more numerical, realized using a linear controller. On the other hand, more
all the computations are assumed to be performed on complex controllers such as gain scheduling or nonlinear
a 32-bit microcontroller, although industrial embedded methods are required for aggressive maneuvers that go
controllers vary in number of bits. Therefore, n = k = through a wider flight envelope, for example, Figure-∞,
32 in Table 2, where n denotes the input size in units of split-S and other aerobatics.
bits needed to represent the input and k is the number of
digits of the exponent. Table 3 concludes the TC of several 2.8 Type of Obtained Results
frequently-used (fully-connected) ANNs in terms of feed-
forward propagation, of which derivations can be found in Verification and validation results of the design of a
our previous study [28]. flight control system can be categorized into theoretical,
simulation, and experimental. Theoretical results provide
2.6 Control Techniques rigorous mathematical proofs demonstrating the control
design can fulfill the set of control objectives. Simulation
Control techniques can be generally classified into model- results can be achieved coupled with the nonlinear aircraft
based (namely, linear and nonlinear models) and model- model, for example, either in MATLAB/Simulink or in X-
free, as illustrated in Fig. 3. Linear controllers can be further Plane Simulator. Experimental results are the most desired
classified into Single-Input Single-Output (SISO) and ones as they explicitly validate the effectiveness of the
Multi-Input Multi-Ouput (MIMO) methods. For instance, system. Nonetheless, they are also the most difficult to
Proportional-Integral-Derivative (PID) control belongs to obtain due to technical and practical reasons.

Table 2 Time complexity of different operations

Operation Algorithm Complexity Constant Ki (n = k = 32)

Addition and subtraction Basic O(n) KA = 1


Multiplication 3-way Toom-Cook multiplication O(n1.465 ) KM = n0.465 = 5.01
Division Newton-Raphson division O(n1.465 ) KM = n0.465 = 5.01
Exponentiation Exponentiation by squaring O(k · n1.465 ) KE = k · n0.465 = 160.34
Exponential function Taylor series O(n0.5 · n1.465 ) Kφ = n0.5 · n0.465 = 28.34
Square root Newton’s method O(n1.465 ) KM = n0.465 = 5.01
1476 J Intell Robot Syst (2020) 100:1469–1491

Table 3 Time complexity of feed-forward propagation of different ANNs

ANN architecture Time complexity (TC = NA · KA + NM · KM + Nφ · Kφ )

Feed-forward network TC = L(N + K + 1) + L(N + K + 1)KM + LKφ


RBFNN TC = L(2N + K − 1) + L(N + K + 3)KM + LKφ
ReLU network TC = L(N + K + 1) + L(N + K)KM
ESN TC = ((N + K + L)L + (N + L − 1)K) + ((N + K + L + 3)L + (N + L)K)KM + 2LKφ

2.9 Problems that Remain Unsolved which greatly eases the modeling process and reduces the
computational complexity of mathematical formulation.
During the review, we will examine to what extent the Apart from data-driven approach, there is another
problems listed in Section 1.2 are well addressed. The branch of dynamic modeling called hybrid approach.
problems that remain unsolved will be indicated by the The core idea of hybrid modeling can be described
corresponding Roman Numerals in the final analysis table as one of the two following: (i) Besides ANN-based
to reveal the limitations of each previous work as well as model, a physical model is augmented (either in series
clarifying the future research direction. or in parallel) to improve the overall performance; (ii)
Explicit or implicit physical knowledge is incorporated
into ANN-based model to enhance its interpretability.
3 Dynamic Modeling Consequently, hybrid modeling tends to yield more accurate
model comprehending physical knowledge, thereby having
As one of the most ubiquitous methods for years, improved generalization capability and interpretability.
mathematical modeling such as First Principle Model In the sequel, we first separately review the existing
(FPM) is efficient and effective for plant dynamics that literature on dynamic modeling based on data-driven and
are well studied and easy to model. However, modeling hybrid approach. Then, comparison and classification of
of complex systems such as UAVs, normally engaged approaches are conducted, followed with possible open
with identifying nonlinearities and uncertainties, remains a problems.
challenging topic in system identification. To handle these
issues, there has been explosive growth in developing ANN- 3.1 Literature Review
based models in recent years thanks to their abilities of
approximating nonlinear functions and real-time learning. 3.1.1 Data-driven approach
Such modeling technique is also known as data-driven or
black-box approach. Instead of building the model from Data-driven approach, which builds an ANN-based model
explicit physical laws, ANN-based models are trained based entirely from data without the need of a-priori physical
on data science techniques given sufficient informative data, knowledge, can be classified into one of the three categories

Fig. 3 Classification of control


techniques (reprinted from [51])
Table 4 Advantages and disadvantages of control techniques

Control techniques Advantages Disadvantages


J Intell Robot Syst (2020) 100:1469–1491

PID Easy implementation. (i) Lack of robustness; (ii) Coupling of dynamics is ignored.
LQR/LQG Guaranteed stability margins for LQR. (i) Limited to certain operating conditions; (ii) Required full state
feedback for LQR; (iii) Performance degradation of Kalman filter
for LQG; (iv) Output limitations are not considered.
H∞ Parametric uncertainties and unmodeled dynamics can be handled. (i) Conservative design for structural uncertainties; (ii) Good
knowledge of system model is required.
GS Large range of flight envelope and operating conditions can be (i) Unsteady transition issues between switches; (ii) Cumbersome
covered. design procedures.
Adaptive control Bounded unmodeled dynamics, parametric uncertainties, and (i) Complex analysis and good knowledge of system dynamics is
disturbances can be handled. required; (ii) Parameter drift caused by process noise; (iii) Large
adaptation rates are non-implementable.
Feedback linearization Nonlinearities can be handled. (i) High computational complexity; (ii) High requirement for model
accuracy.
MPC (i) Prediction capabilities; (ii) Abilities of explicitly handling (i) High requirement for accuracy of prediction model; (ii) Time-
constraints on control input. consuming for online optimization.
SMC Control performance is insensitive to modeling errors and (i) Chattering is induced by frequent controller switches; (ii)
parameter uncertainties. Highly nonlinear sliding mode surface poses difficulties to stability
analysis.
Backstepping control Good for underactuated systems. (i) Nonlinear model of a lower triangle form (known as pure-
feedback form) is required; (ii) High computational complexity for
computing the derivative of pseudo control inputs.

The top half of the table belongs to linear controllers, while the bottom part of the table belongs to nonlinear controllers.
1477
1478 J Intell Robot Syst (2020) 100:1469–1491

based on network architecture: feed-forward, recurrent, and used associated with 60 hidden neurons for MLP and 5
mixed. In Section 2.3, feed-forward and recurrent networks radial neurons for RBFNN. For both subsystems, number
have already been discussed in detail. Mixed networks refer of outputs are 3 (attitude and position, respectively).
to the combination of feed-forward and recurrent networks Simulation results show that mixed model with RBFNN
in one single model. Noticed that these networks may have has better performance in terms of local approximation
other names such as “hybrid networks” or “complementary (especially for takeoff and landing phase affected by the
networks” used in different literature, however, we find ground effect) while mixed model with MLP demonstrates
it more appropriate to use the term “mixed modeling” to better capability of global approximation as lower error is
distinguish from the concept of hybrid modeling as defined achieved during the flight. Moreover, it is founded that the
earlier. offline training time required for RBFNN (∼ a few minutes)
In [52] (1997), feed-forward network models are used is much less than that for MLP (∼ 18hrs) to obtain the final
for inverse modeling of a fixed-wing aircraft for feedback results of same level.
linearization. Representing the inverse longitudinal and In [57] (2007), a comparison is carried out between an
lateral dynamics, both of the two networks comprise two online ANN-based model and an offline ANN-based model
layers using a combination of RBF and sigma-pi units [53] for modeling of a fixed-wing UAV. The offline model is
(polynomial representations). The training data, consisting trained as a single block with control inputs as 4 inputs
of angular accelerations (second derivatives of Raw-Pitch- and linear velocities along with angular rates as 6 outputs.
Yaw angles) and control inputs, are generated by simulation The online model, on the other hand, consists of two
software at different flight conditions. Furthermore, angular individually trained ANN subsystems connected in parallel
accelerations are converted into moment coeffcients as which represent longitudinal and lateral dynamics. To take
input data so as to facilitate a inverse mapping that is into account the coupling effects between two dynamics,
less sensitive to altitude and Mach number variation. Since roll rate and yaw rate (for longitudinal ANN-based model)
network weights appear linearly, standard linear Least and pitch rate (for lateral ANN-based model) are augmented
Square (LS) approximation method is adopted for offline as additional inputs apart from control inputs in online
learning algorithm. Simulation results show that the offline model. Both offline and online ANN are feed-forward
trained networks provide satisfactory average performance networks with size of 2 × 8 and 2 × 4 for each subsystem,
over a given flight envelope. However, they might also respectively, and with a maximum of three past inputs used
exhibit large errors within a portion of envelope having a for the prediction of the present output. All the training is
high degree of mapping complexity. performed on real flight data using Levenberg-Marquardt
In [54] (2006), a feed-forward network is used for online (LM) algorithm. HIL simulation shows that offline model
estimation of plant dynamics of a damaged aircraft to avoid demonstrates smoother approximation of flight dynamics
aggressive learning induced by significant discrepancy while online model exhibits better adaptability.
between the model and real flight dynamics. Motivated by In [58] (2012), an offline feed-forward network is trained
[52], single-hidden-layer sigma-pi network is adopted to for learning heave (or vertical) dynamics of an unmanned
estimate incremental plant matrices in dynamic inversion helicopter. The network has a single hidden layer consisting
control. The network parameters are updated by an adaptive of 4 hidden neurons. Vertical flight data are collected for
law derived from Lyapunov stability theory and a recursive network training using LM algorithm. Simulation shows
LS formula using gradient method for comparative studies. consistent results on prediction of vertical acceleration.
In [55, 56] (2006), a mixed model of a mini-helicopter Therefore, the resulting network model is subsequently used
is proposed through offline training. Due to the flight for heave control design.
dynamics of helicopter, the model can be divided into two In [59] (2014), the inverse model of a fixed-wing UAV
subsystems (namely, radio-signal-to-attitude and attitude- is trained by a two-layer feed-forward network, specifically
to-position) in cascade form. For each subsystems, a MLP, for the use in feedback linearization. The model
mixed architecture is adopted, consisting of a series of is first trained offline with training data generated from
contextual neurons [40] to memorize previous states and the validated nonlinear aircraft model using sinusoids as
a two-layer feed-forward network (MLP or RBFNN). The inputs. Then, the network weights are adjusted online using
training patterns include attitude, position and 4 control the real-time data from flight simulator in Software-In-
inputs, which all coming from real flight data. For radio- the-Loop (SIL) simulation. Different from offline model,
signal-to-attitude subsystem, 54 input neurons are selected momentum term is added for online model. Despite the
for two different types of feed-forward networks (MLP satisfactory results achieved, the network architecture is
and RBFNN) for comparative studies, each including not presented in details and more investigations would be
60 hidden neurons and 500 radial neurons, respectively. necessary on online model to be effectively used along with
For attitude-to-position subsystem, 48 input neurons are real flight data.
J Intell Robot Syst (2020) 100:1469–1491 1479

In [60] (2015), an offline ESN model is trained for tion function is used in the hidden layer. Since the proposed
modeling of a quadrotor, a system with 4 control inputs model is non-convex, LS regression and Stochastic Gradient
and 6 outputs (position and attitude), using real flight Descent (SGD) with momentum term and variable learn-
data. Thanks to RC techniques, the training of ESN is ing rate are applied separately for training, which can be
conceptually simpler and less time-consuming than that of also regarded as one iteration of Block-Coordinate Descent.
other recurrent networks. Initially, 100 dynamic reservoirs Real-world data across 19 maneuvers are used for both
and spectral radius of 0.1 are chosen empirically. Then, an training and testing. However, no label in terms of maneu-
evolutionary algorithm for parameter optimization, namely vers is given during training, resulting in a global model
Covariance Matrix Adaptation Evolution Strategy (CMA- which is applicable across all maneuvers. With 2500 hid-
ES), is applied, yielding the finalized network with 127 den neurons for ReLU network, it takes nearly 1 hour for
dynamic reservoirs and spectral radius of 0.9191. The 200,000 iterations of training on a 6-core Intel i7 server with
optimized ESN shows imperceptible results in simulation 32GB RAM. Simulation results show the proposed ReLU
compared with real flight data. model improves 58% overall in Root Mean Square (RMS)
In [61] (2015), an offline recurrent architecture is acceleration prediction over state-of-the-art methods.
employed for MIMO modeling of quadrotor in the presence In [67] (2016), two ReLU networks are employed for
of noise and ground effect, namely Modular Deep Recurrent learning quadrotor dynamics (translational and rotational,
Neural Network (MODERNN) [62], which is proposed respectively) through offline training. The paper mainly
to address the problem of vanishing/exploding gradient investigates whether a simple feed-forward network model
in deep networks [63]. Thanks to the presence of feed- can be used for controller synthesis for unspecified
forward connections throughout MODERNN, the proposed trajectories. To this end, two-layer ReLU networks only
recurrent network can be trained using ordinary gradient fed by current state and control input with 100 hidden
descent approach such as LM algorithm instead of BPTT neurons are trained using resilient back-propagation with
or RTRL as frequently used for recurrent networks. The a momentum term and fixed learning rate. Training data
training data (including 4 control inputs as inputs, positions consist of linear velocities, angular rates, Euler angles and
and heading angle as outputs) are sampled from simulated control inputs, all from real flight data of only translational
trajectory considering the sum of 10 sinusoids with and only rotational motion (e.g., sinusoids in XY plane
uniformly random frequencies ranging from 0 to 10Hz. To without yaw motion). Experimental results show good
reduce the computational load, three separate MODERNNs control performance of tracking sinusoid-yaw trajectory
are trained with size of 3 × 10, 4 × 20, and 3 × 8. Consistent (i.e., the quadrotor undergoes a sinusoidal motion in XY
modeling results are achieved after approximately 24-hour plane while yawing) when coupled with LQR and PD
training for each MODERNN on an i7 core machine with controller. It can be hence concluded that ANN-based model
16GB of RAM. is capable of generalizing the dynamics to learn nonlinear
In [64] (2015), a novel offline ReLU network model is couplings between translational and rotational motions,
proposed along with parameter initialization and optimiza- even though these are not explicitly captured by the training
tion methods to solve challenging issues of helicopter mod- data.
eling by extracting information about the hidden (difficult- In [68] (2016), an offline ESN with output feedback
to-measure) state such as airflow. Motivated by Takens The- is used for modeling the inverse dynamics of a fixed-
orem [65, 66], the proposed model consists of a quadratic wing UAV for feedback linearization. As an extension
lag model (with control lag) and a two-layer ReLU network work of [59], the paper investigates the feasibility of
(with control lag and time-lagged output including linear using a different ANN architecture following a similar
velocities and angular rates). The use of quadratic lag model design procedure. The ESN is trained using Weiner-Hopf
is to account for nonlinear dependence on current state and method for linear regression with sinusoidal simulated data
control, while ReLU network is employed to learn the cor- from flight simulator as training data. However, detailed
rection to quadratic lag model to better approximate the specifications and training procedure for ESN with output
helicopter dynamics locally. Noted that the ReLU network feedback are not given. As output feedback has been used,
with delayed units used in the paper has a similar archi- stability issues are not addressed as well [69].
tecture as nonlinear autoregressive network with exogenous In [70] (2017), quadrotor dynamics are decoupled into
inputs (NARX)2 in which ReLU rather than sigmoid activa- 6 subsystems (i.e., roll/pitch/yaw and X/Y/Z position),
each of which is modeled by a single feed-forward net-
2 More
work. Different from conventional two-layer ANN, there
details refer to MathWorks Documentation on
“Design Time Series NARX Feedback Neural Networks”:
are only one input layer which receives 5 inputs (state
https://www.mathworks.com/help/deeplearning/ug/design-time- and control input with TDLs) and one output layer which
series-narx-feedback-neural-networks.html. generates 1 output (state) for each feed-forward network.
1480 J Intell Robot Syst (2020) 100:1469–1491

Such architecture can be simply regarded as a two-layer (ANFIS)) in series. Using real flight data, the best para-
feed-forward network having linear output layer with metric model is first determined. Then, soft computing
identity weight matrix and no bias. All the feed- technique is applied for training the offline data-driven
forward networks are trained online using BP algorithm. model using simulated data as input and real flight data
Simulation results demonstrate satisfactory learning as output. Specifically, LM algorithm is adopted for
ability. RBFNN training. Unlike the conventional training pro-
cess where the loss function is only an MSE of out-
3.1.2 Hybrid approach put error, a novel loss function is defined as the aver-
age of the sum of MSE of three error signals, that are
Hybrid approach refers to modeling methods that either system output, step response, and the gain of the out-
use ANNs in conjunction with a mathematical model put sinusoidal signal of the system given a sinusoidal
(e.g., FPM) or incorporate physical knowledge directly signal of unitary amplitude and frequency as input. More-
into the training process of ANNs. As we will see in the over, computational complexity is analyzed for both para-
following review, all the existing work in flight control is metric and data-driven model to estimate the complex-
focused on the former idea. ity of different identification technique. Simulation results
In [49] (2018), a hybridization of analytical and empir- show improved performance of hybridization in terms of
ical techniques is proposed for linear velocity estimation model accuracy despite a low rising of computational
of quadrotor to improve the performance in terms of cost. In the end, online MLP is also discussed for online
model precision and computational time. The hybrid modeling taking consecutive samples as input. However, it
model is made up of a parametric model (e.g., ARX, is not investigated how the online ANN will influence the
ARMAX or OE) and a data-driven model (e.g., MLP, model accuracy and computational complexity in hybridiza-
RBFNN, or Adaptive Neural Fuzzy Inference System tion.
In [71] (2018), Multi-Step (MS) prediction problem
Table 5 Acronyms and abbreviations used in Table 6 of quadrotor dynamics is studied, as opposed to a
majority of literature focusing on Single-Step (SS) pre-
Abbreviations Explanations
diction. An offline hybrid model is proposed, consisting
OFF Offline modeling of three modules, namely input modeler, motion model
ON Online modeling and output modeler, in which motion model is an FPM
INV Inverse modeling while the rest two adopt LSTM networks. The hybrid
MS Multi-step prediction model receives 4 control inputs and predicts linear veloc-
SINGLE Single model architecture ities as well as angular rates. Real flight data of vari-
DE Decoupled model architecture ous flight regimes are collected and used for training by
TensorFlow. Comparison results are drawn through simula-
FF Feed-forward network tion, indicating the hybrid model with hybrid-parallel con-
TDL Tapped delay line figuration outperforms either FPM or black-box approach.
LS Least square However, the paper does not provide detailed information
SGD Stochastic gradient descent on LSTM networks (e.g., their size and training details)
BP Back-propagation algorithm (Table 5).
SDATA Simulated data for training
RDATA Real flight data for training 3.2 Comparison of Approaches
SIM Simulation results
To fairly determine the capabilities and limitations
THE Theoretical results
of existing work on UAV dynamic modeling, refer-
HIL Hardware-in-the-loop
ence template, as elaborated in Section 2, is applied
δ Control inputs which yields a detailed review as summarized in Table 6.
pos Position (X, Y, Z) For better visualization and understanding purposes,
att Attitude (φ, θ, ψ) acronyms and abbreviations are used and explained with
vel Linear velocities (u, v, w) their full name in Table 5. Specifically, the ANN
rpy Angular rates (p, q, r) architecture is briefly specified in the form of “AA-BB-
CC-DD”, where “AA=OFF/ON”, “BB=INV/(feed-forward
N/A or ? Not applicable modeling by default)”, “CC=MS/(Single-step prediction by
w/ with default)”, and “DD=SINGLE/DE” (which will be explained
Table 6 Summary of literature review of ANN-based dynamic modeling for UAV applications (Section 3.1)

REF UAV ANN FUNC ANN ARCH ANN TRAIN TC RESULT ISSUE

[52] (1997) Fixed-wing OFF-INV-DE 2 : 2×? RBF+sigma-pi LS, SDATA: N/A SIM ii
(φ̈, θ̈, ψ̈) → δ
[54] (2006) Fixed-wing Online plant estimation 2×? sigma-pi Lyapunov/recursive LS N/A THE, SIM ii
[55, 56] (2006) Helicopter OFF-DE Contextual neurons w/ RDATA: δ → (att,pos) A few minutes for SIM i, ii, iv
2 : 2 × 60 MLPs or 2 × RBFNNs and ∼ 18hrs
500, 2 × 5 RBFNNs for MLPs
[57] (2007) Fixed-wing OFF-SINGLE; ON-DE LM algorithm, RDATA: N/A HIL i, ii
J Intell Robot Syst (2020) 100:1469–1491

2 × 8 FF w/ TDLs for
offline; 2 : 2 × 4 FF w/ δ → (vel,pqr) for
TDLs for online offline, RDATA: δ +
(p, r/q) → (vel,pqr) for
online
[58] (2012) Helicopter OFF-SINGLE 2 × 4 FF LM algorithm, RDATA: N/A SIM i, ii, iv
δ→w
[59] (2014) Fixed-wing OFF-INV-SINGLE; ON- 2×? MLP Gradient descent, N/A SIM i, ii, iv
INV-SINGLE SDATA
[60] (2015) Quadrotor OFF-SINGLE 2 × 127 ESN CMA-ES, RDATA: δ → N/A SIM i, ii, iv
(pos,att)
[62] (2014), [61] (2015) Quadrotor OFF-DE 3 × 10, 4 × 20, 3 × 8 LM algorithm, SDATA: 24hrs (i7, 16GB RAM) SIM iv
MODERNNs δ → (pos, ψ)
[64] (2015) Helicopter OFF-SINGLE 2 × 2500 ReLU w/ TDLs LS and SGD, RDATA: 1hr (i7, 32GB RAM) SIM i, ii, iv
δ → (vel,rpy)
[67] (2016) Quadrotor OFF-DE 2 : 2 × 100 ReLUs Resilient BP, RDATA: N/A SIM i, ii, iv
(δ, pos,vel,att,rpy) →
(vel,rpy)
[68] (2016) Fixed-wing OFF-INV-SINGLE ESN w/ output feedback Weiner-Hopf method N/A SIM i, ii, iv
for linear regression,
SDATA
[70] (2017) Quadrotor ON-DE 6 : 1 × 1 FFs w/ TDLs BP algorithm, δ → N/A SIM i, ii, iv
(pos,att)
[49] (2018) Quadrotor OFF-SINGLE RBFNN LM algorithm, RDATA: Analytical results SIM i, ii
δ → vel for feed-forward
propagation
[71] (2018) Quadrotor OFF-MS-DE LSTMs w/ TDLs TensorFlow, RDATA: N/A SIM i, ii
δ → vel,pqr

The columns represent the referenced work and the year published (REF), the type of UAV under control (UAV), the functionalities of ANN (ANN FUNC), the architecture of ANN (ANN ARCH),
the training methods for ANN (ANN TRAIN), time complexity (TC), achieved results (RESULT), and problems that remain unsolved (ISSUE), as early defined in the reference template in
Section 2
1481
1482 J Intell Robot Syst (2020) 100:1469–1491

in the sequel). Moreover, the input-output specifications of only about one-fifth of literature works on online networks
trained model are also specified in “ANN TRAIN” column and all of them fall under feed-forward category. This,
along with the type of training data using right arrow and again, shows the preference for feed-forward networks
tuple. For example, “RDATA: δ → (vel, pqr)” denotes that over the rest thanks to their easy training process and
the model is trained by real flight data with control inputs as implementation. However, the small amount of work on
input and linear velocities and angular rates as output. online networks also indicates that most of the existing
modeling approaches do not possess learning ability to deal
3.3 Classification of Approaches with dynamic changes during the flight. Lastly, it can be
concluded that hybrid approach has been actually very little
As the first step of dynamic modeling, one should clarify studied so far, thereby remaining an open research topic for
the requirements for the model based on the specific UAV dynamic modeling.
application. For example, is the model going to be used
in feedback linearization? If the answer is yes, then an 3.4 Challenges and Opportunities
inverse model is required and it will definitely not work if
one ends up with a feed-forward model. Apart from that, We recall the five questions thrown in Section 1.2 about
there are two more questions that need to be identified, the integration between MBC techniques and ANNs, which
i.e., what kind of prediction is expected from the model, are stability, robustness, paucity of data, interpretability,
and are subsystems preferable? The answer to the former and real-time performance. Based on the classification
question will determine whether a single-step or a multi-step and comparison as carried out in the previous sections,
prediction model is desired, whereas the answer to the latter we investigate to what extent these problems have been
question will settle the model architecture. For cases where resolved as well as identifying what are the current
longitudinal and lateral dynamics can be approximately challenges and opportunities in the sequel.
separated, one might prefer to have decoupled model
architecture (either in parallel or in series); otherwise, a i. (Stability): In the context of dynamic modeling, stabil-
single block of model might be preferred. The design flow ity issue refers to the convergence of network weights
of selecting an appropriate model architecture based on within specified time interval. Despite being widely
specific requirements is depicted in Fig. 4. recognized as one of most powerful tools for train-
As soon as a suitable model architecture is determined ing feed-forward networks, BP algorithm (especially
to meet the requirements for specific task, it is then the Batch Gradient Descent (BGD)) suffers from slow
turn to choose ANN architecture and corresponding training convergence and sometimes yields suboptimal solu-
methods for building the ANN-based model. Based on tions due to the utilization of steepest descent method
the definitions of ANN categorization in Section 2.3 and [72]. To achieve faster convergence and better per-
2.4, the existing work on UAV dynamic modeling can formance, there are currently a dozen of superior
be classified as shown in Table 7 with distribution given options available for network training, for example,
by pie chart for visualization purposes as illustrated in SGD, Mini-Batch Gradient Descent (MBGD), con-
Fig. 5. It is straightforward to see that more than a half of jugate gradient algorithms such as Fletcher-Reeves
literature is focused on feed-forward networks due to their and Polak-Ribiére update, BFGS (Broyden–Fletcher–
simple architecture and working principle. However, almost Goldfarb–Shanno) algorithm, LM algorithm, etc.
all the feed-forward networks are used along with TDLs, Besides, one can also use some simple tricks including
resulting in dynamic networks. This is simply because adaptive learning rate and momentum term to obtain
static networks are incapable of capturing complex flight the desired performance. Fortunately, all the afore-
dynamics of UAV from time-series flight data. Moreover, mentioned approaches have already been supported

Fig. 4 Design flow of selecting


an appropriate model
architecture for UAV dynamic
model
J Intell Robot Syst (2020) 100:1469–1491 1483

Table 7 Literature classification of UAV dynamic modeling. Column: ANN architecture. Row: Training methods

Data-driven Hybrid
Feed-forward Recurrent Mixed

Offline [52, 57–59, 64, 67] [60–62, 68] [55, 56] [49, 71]
Online [49, 54, 57, 59, 70] N/A N/A N/A

and well documented by quite a few open source plat- result in large control input that would be practically
forms such as TensorFlow, PyTorch, and MATLAB, unrealizable. Hence, online networks are of great
which greatly ease the design and implementation pro- value on developing a real-time adaptive model thanks
cess for developers. However, it should be noted that to their learning abilities. To this end, however,
there is still no formula to guarantee the convergence stability issue and paucity of data problem should be
of the network and all the approaches we have men- all taken into considerations.
tioned only increase the likelihood of finding a better iii. (Paucity of data): Preparing the dataset for training,
model in a faster speed. The only way to guarantee testing and validation is a delicate task for training
the convergence, perhaps, is to use adaptive control an accurate ANN-based model. Foremost, one has to
theory as shown in [54] rather than using purely data- identify the appropriate input and output variables.
driven training methods (which are the mainstream in For example, what kind of aircraft states should
the literature). Under such circumstance, the network be engaged in training, position (x, y, z) or linear
weights are updated through Lyapunov stability theory velocity (u, v, w), Roll-Pitch-Yaw angle (φ, θ, ψ) or
and all the signals can be proved bounded analytically. angular rate (p, q, r)? According to our review, it
ii. (Robustness): The majority of existing work aims seems that the use of linear and angular velocity
to model flight dynamics under nominal conditions. will result in model of better performance. Next,
That being said, no uncertainties and disturbances are since we aim to train a model that fully describes
taken into account explicitly. Despite the inclusion the nonlinear flight dynamics over the entire flight
of noise data in network training as demonstrated regimes, collecting sufficient and suitable data is
in some literature, the objective is focused on a laborious work. Table 8 shows different type of
achieving better generalization capability as well training data that are used in the literature. Among
as avoiding overfitting. From control perspectives, them, real flight data for all possible maneuvers
there is no doubt that a robust dynamic model required for mission are mostly desired, but an
is desired in most occasions which should remain experienced UAV pilot is a must. Using simulated data
valid even though there are some small variations can remove this requirement and reduce the cost as
in parameters or modeling mismatch. In addition, a simulator software will be employed rather than a
substantial discrepancy between model and real flight real UAV. However, the quality of data is tightly linked
dynamics should be avoided because this might to the fidelity of the simulator. Some literature also

Fig. 5 Pie chart distribution of


literature on UAV dynamic
modeling. Left: Distribution of
literature categorized by ANN
architecture. Right: Distribution
of literature using feed-forward
networks categorized by training
methods
1484 J Intell Robot Syst (2020) 100:1469–1491

Table 8 Different type of training data and their priorities From the above analysis, we may conclude that an
online hybrid model is of the most preferable to dynamic
Priority Type of data Data information
modeling of real-time complex systems such as UAVs.
1 Real flight data All possible maneuvers required According to Table 7, there is no research on this direction
for mission so far. It is remarked that the hybrid approach should
2 Simulated data All possible maneuvers put emphasis not only on improving the model accuracy
required for mission but also on enhancing the intertepretability by leveraging
3 Simulated data Steps with different amplitude the complementary strengths of physics-based model and
as input signals ANN. Recent work has proposed two frameworks, namely
Physics-Guided Neural Network (PGNN) and Physics-
Guided Recurrent Neural Network (PGRNN), for offline
modeling of lake temperature, which seek models to have
uses step signals as inputs to generate training data. physical consistency and good generalization capability
This can be even simpler to extract the maximum apart from superior accuracy by incorporating explicit
possible information from the dynamic system at the physical knowledge into network training [73, 74]. The
expense of less representative data if not carefully theory behind these two frameworks is called Theory-
dealt with. For offline networks, it is usually not a Guided Data Science (TGDS) [75], which deserves further
big deal for dataset preparation given enough time investigations for UAV applications. On the other hand,
and experts following what we have just mentioned. specific considerations on both dataset preparation (pre-
However, things are completely different for online processing) and training algorithms are also required from
networks where real flight data are the main source implementation point of view.
for network training. Due to the limited data available
at each sampling period, the trained model would
most probably not accurate enough for subsequent 4 Control Techniques
control synthesis. This issue is referred to as “paucity
of data”, which occurs mainly in online training. To In this section, we will review different flight control tech-
counter this issue, one might find TDLs useful (e.g., niques combined with ANNs for flight control design.
see applications [57, 70]) because TDLs form a larger Due to the intrinsic learning ability of ANN, such com-
input vector consisting of the current time-step data bination can be substantially regarded as adaptive con-
instance and the past time-step data instances. trol. Hence, some literature coins the terms such as
iv. (Interpretability): It is apparent that modeling with “feedback-linearization-based adaptive control” to repre-
data-driven approach suffers from interpretability sent the control architecture that combines feedback lin-
issue, i.e., the internal working principle of the earization technique with ANN-based adaptive augmenta-
black box is hard to come by. As the only two tion. The review in the sequel is classified based on the
papers ([49, 71]) researching on modeling with main control techniques adopted for flight control so as to
hybrid approach, the objectives lie in improving avoid those lengthy terminologies for better categorization
the performance, particularly in terms of model purposes.
precision and computational time, rather than the
interpretability. Hence, how to open the black box 4.1 Literature Review
of ANN remains an ongoing challenge and it is
worthwhile devoting much effort to this. 4.1.1 Proportional-integral-derivative control
v. (Real-time performance): Generally speaking, offline
networks do not have issues with real-time applica- In [76] (2009), an online CMAC is proposed for baseline
tion. This is because the time complexity of feed- PID controller under turbulence in aircraft automatic
forward propagation is generally negligible compared landing system. Lyapunov stability theory is applied to
to other process. Nonetheless, considerable attention determine the learning rate and weight updating rule of
should be paid to online networks. Specifically, the CMAC. Simulation shows the adaptive CMAC controller
real-time performance of online networks is associ- is capable of tracking a desired path under severe
ated with the speed of convergence of online adapta- turbulence environment. Nonetheless, network structure and
tion. parameters are not given in detail.
J Intell Robot Syst (2020) 100:1469–1491 1485

4.1.2 Feedback linearization to mimic the control inputs generated from an optimal
feedback control law for the heave control of a helicopter.
In [52] (1997), three feed-forward networks are used Both simulation and flight test results show satisfactory
as adaptive control augmentation for each channel of tracking performance. However, the offline ANN-based
control input of a fixed-wing aircraft. Each feed-forward controller does not possess learning ability. Hence, repeated
network consists of 21 RBF units (one-dimensional ANN training cannot be avoided if operating condition has
Gaussian functions) and 40 sigma-pi units (polynomial been changed significantly (e.g., see the initial flight test
representations), resulting in a total number of 840 weights. results in gusty conditions therein).
To ensure the uniform boundedness of all the signals in the
loop as well as the convergence of online network weights, a 4.1.4 Adaptive control
stable weights adjustment rule is also derived via Lyapunov
analysis. Through simulation results, it is shown that online In [81] (2016), a two-layer static structure RBFNN3
adaptive networks are capable of canceling out model with 125 centers is used to compensate the unknown
aerodynamic forces for an autonomous helicopter in
inversion error without full knowledge of uncertainties
real time. It is then exploited by a new structural
under a high-g, fixed throttle turn manueuver. A related
identifier to identify the unknown inertial matrix fac-
work on control of rotational dynamics of helicopter refers
ing unknown aerodynamic forces and external distur-
to [77].
bances. To reduce online computational burden, an opti-
In [78] (2002), a two-layer feed-forward network with
mized algorithm is incorporated in RBFNN mechanism.
5 hidden neurons is used as adaptive control augmentation
A barrier Lyapunov function is used to avoid kinematic sin-
for an autonomous helicopter to account for inversion
gularity problem in updating network weights. Simulation
error. A stable adaptive law is derived from Lyapunov
shows the designed controller guarantees a good tracking
stability analysis to update network parameters online,
performance for both fixed-point and dynamic trajectory
which guarantees signal boundedness. Although simulation
tracking.
and flight tests show satisfactory tracking performance for
In [82] (2018), a deep RNN combined with CNN network
various maneuvers, the potential of ANN-based adaptive
is developed for regression of actuator failure model from
element has not been investigated under uncertainties and
state trajectories for fault tolerant design of a fixed-wing
disturbances.
aircraft. The network has 2 CNNs, 2 LSTMs, and a dense
In [79] (2011), a two-layer feed-forward network
regression layer (For more details on network parameters,
with 8 hidden neurons is used as adaptive con-
please refer to the original paper). Offline training is
trol augmentation for a helicopter to reduce model
conducted on a database of different failure scenarios using
inversion error, similarly to [78]. Concurrent-learning
TensorFlow library and a NVIDIA GTX 1060 GPU. The
adaptive law is developed to improve the weight
controller adopts a control methodology that combines
convergence properties and tracking performance by allevi-
Structure Model Reference Adaptive Control (SMRAC)
ating the rank-1 condition of traditional BP-based training
[83] and Nonlinear Dynamic Inversion (NDI), namely
law that use merely the current data. With such mod-
Structured Adaptive Model Inversion (SAMI). Thanks
ification, the controller is able to adapt using current
to the use of deep neural network, faster convergence
and stored data without affecting the responsiveness of
of controller coefficent is achieved. Simulation provides
the adaptive law to current data. Simulation and flight
satisfactory recovery results under severe engine and
test results show improved tracking performance for both
elevator failure scenario.
repeated forward-step and aggressive trajectory tracking
manuevers.
4.1.5 Sliding mode control
In [80] (2015), a two-layer feed-forward network with 50
hidden neurons is used as adaptive control augmentation for
In [84] (2015), a two-layer static structure RBFNN with 6
a fixed-wing aircraft to cope with inversion error. Lyapunov
hidden nodes is adopted as adaptive uncertainty observer
stability analysis is applied for updating network weights.
for a coaxial 8-rotor UAV. Along with Backstepping
Simulation results show remarkable performance under
Sliding Mode Controller (BSMC), the lumped uncer-
mass changes and loss of an aileron.
tainties, which include external disturbance and iner-
tia matrix uncertainty, can be effectively estimated and
4.1.3 Optimal feedback control
3 Staticstructure RBFNN refers to a specific type of RBFNN of
In [58] (2012), a two-layer feed-forward network with
which only weights are updated while centers and widths of Gaussian
4 hidden neurons is trained offline using LM algorithm function (if selected as the receptive field function) are fixed.
1486 J Intell Robot Syst (2020) 100:1469–1491

handled without the need of knowing their bounds. understanding purposes, acronyms and abbreviations are
Lyapunov stability theory is applied to update network used and explained with their full name in Table 9.
weights of the observer and prove the signal boundedness.
Simulation results show robust performance under model 4.3 Classification of Approaches
uncertainties and external disturbances.
In [85] (2016), RBFNN is used along with a double- Table 11 presents the classification of existing literature
loop Integral Sliding Mode Control (IntSMC) for position on UAV flight control combined with ANNs. It is
and attitude tracking of a quadrotor subject to disturbances straightforward to see that online networks are used more
and parametric uncertainties. Lyapunov theory is applied to frequently than offline ones due to their intrinsic learning
update the network weights of RBFNN as well as providing ability. This is quite different from the situation where we
stability proof. Faster convergence of state variables to saw for dynamic modeling (as shown in Table 7) mainly
their desired values is noted through simulation. However, because either control augmentation or uncertainty observer
network structure and parameters of RBFNN are not given requires real-time adaptability which offline networks do
in detail. not possess.
As for the role that ANNs play in the controller design,
4.1.6 Backstepping control control augmentation is mostly associated with feedback
linearization technique whereas uncertainty observer usu-
In [86] (2008), two-layer MLPs with 15 hidden neurons ally works together with backstepping and sliding mode
are used to estimate unknown linearities as well as control techniques. Additionally, ANNs have also been used
physical parameters for flight control of a helicopter. to mimic static controller or identify the online controller
Network weights are updated online using Lyapunov theory. coefficient.
From simulation, it is demonstrated that the proposed
backstepping controller achieves good position tracking 4.4 Challenges and Opportunities
performance even with a sudden mass change perturbation.
Noted from Table 10 and Table 11, the main purpose
4.2 Comparison of Approaches of using ANNs in flight control proposed by most of
the literature is to increase the robustness against uncer-
Similar to Section 3.2, reference template is applied to tainties and disturbances, either by means of adaptive
summarize the comprehensive review of previous studies, control augmentation or uncertainty observers. It should
yielding Table 10, to fairly determine the advantages also be emphasized that Lyapunov stability theory is
and disadvantages of existing work on UAV flight adopted as the mainstream approach to update net-
control combined with ANNs. For better visualization and work weights, rather than using data-driven approaches,
to provide rigorous analytical proof of stability and
convergence. Hence, it can be concluded that through
Table 9 Acronyms and abbreviations used in Table 10 and Table 11
these efforts, i-(Stability) and ii-(Robustness) have been
Abbreviations Explanations intentionally addressed. However, iii-(Paucity of data)
and v-(Real-time performance) remain to be under investi-
OFF Offline network
gation since only very few work has presented experimental
ON Online network
results and little concentration has been placed on imple-
ADAUG Adaptive control augmentation mentation aspects.
OBSV Uncertainty observer
Moreover, there has emerged some recent research
CTRLID Controller coefficient identification
(not included in Section 4.1 and Table 10) work-
CTRLMM Controller mimicking
ing on the controller design using bio-inspired neural
FL Feedback linearization
networks, Spiking Neural Network (SNN). For example,
OP Optimal feedback
a turbulence-tolerant spiking controller is developed for a
AC Adaptive control
quadrotor in [87] and an SNN controller is proposed for a
SMC Sliding mode control flapping insect-scale robot in [88]. Compared to the other
BS Backstepping control ANNs we have discussed in the previous context, SNN
PID Proportional-integral-derivative has unique advantages in online learning and hardware
J Intell Robot Syst (2020) 100:1469–1491 1487

i, ii, iii, iv
i, ii, iii, iv

The columns represent the referenced work and the year published (REF), the type of UAV under control (UAV), the functionalities of ANN (ANN FUNC), the architecture of ANN (ANN ARCH),
deployment [89], which could be particularly attractive to

the training methods for ANN (ANN TRAIN), time complexity (TC), main control techniques adopted for flight control (CTRL), flight manueuvers (MANUEUVER), achieved results (RESULT),
ISSUE
onboard flight controller design to cope with uncertainties

iii, v
iii, v

iii, v

iii, v
iii, v

iii, v

iii, v
and disturbances. The existing studies have shownpromis-

ii
ing results from simulation. However, a solid consensus
THE, SIM, EXP

THE, SIM, EXP


is missing about encoding scheme and neuron model and
THE, SIM

THE, SIM
THE, SIM
THE, SIM
THE, SIM

THE, SIM

THE, SIM
experimental tests are expected for validating the controller

SIM, EXP
RESULT

design in realistic environment.

SIM
Command tracking
Attitude tracking 5 Conclusions
CTRL MANUEUVER

In this survey, we started with the investigation of the


Tracking

Tracking
Tracking
Tracking

Tracking
Landing
Various
High-g

possibility to combine the Model-Based Control (MBC)


Roll

and the Artificial Neural etwork (ANN) techniques for the


control of Unmanned Aerial Vehicles (UAVs). Specifically,
SMC

SMC

we analyzed the advantages of MBC-based design and


PID

AC
OP

BS
FL
FL
FL
FL

FL

ANNs for studying complex systems such as flight control


systems, also presenting their limitations. As a result,
N/A
N/A
N/A

N/A
N/A
N/A
N/A
N/A
N/A
N/A

N/A
ANN TRAIN TC

we identified the complementary relationship existing


LM algorithm

between MBC and ANN techniques, which motivated their


combination for facing the flight control problem of UAV.
Lyapunov
Lyapunov
Lyapunov
Lyapunov
Lyapunov
Lyapunov
Lyapunov

Lyapunov

Lyapunov

Nonetheless, for the success of developing an MBC-


N/A

ANN flight controller, a series of challenging issues need to


be addressed, which include stability, robustness, paucity of
2 × 125 RBFNN with Gaussian function (static structure)
2 × 6 RBFNN with Gaussian function (static structure)

data, interpretability, and real-time performance. To provide


Table 10 Summary of literature review of ANN-based flight control for UAV applications (Section 4.1)

technical justifications as well as supported answers to


and problems that remain unsolved (ISSUE), as early defined in the reference template in Section 2

these questions, we carried out a thorough review on the


2 CNNs + 2 LSTMs + 1 dense regression layer

existing literature focusing on the combination of MBC


and ANNs techniques. A reference template was proposed
and used as a unified framework to fairly determine the
capabilities and limitations of each approach of dynamic
3 : 2 × (21 RBF + 40 sigma-pi)

modeling and control techniques. Bearing in mind the real-


time implementation and performance, analysis on time
complexity was also conducted for several frequently-used
ANNs in the literature. Based upon classification and
comparison results, we also shed light on the direction of
ANN ARCH

2 × 15 MLPs

future development.
2 × 50 FF
2 × 5 FF

2 × 8 FF
2 × 4 FF

RBFNN
CMAC

Our future work will mainly focus on the design of


a UAV flight control system with combined MBC-ANN
techniques which aims to be equipped with stable and
OFF-CTRLMM

OFF-CONTID

robust autonomous flight capability under uncertainties and


ANN FUNC

ON-ADAUG
ON-ADAUG
ON-ADAUG

ON-ADAUG

ON-ADAUG

ON-Control
ON-OBSV
ON-OBSV

ON-OBSV

disturbances. Also, it is of our great interest to investigate


if such system can be extendedly used for the control of
reconfigurable UAVs to manage the transitions. On top of
that, we will explore methods to “open” the black box of
Coaxial 8-rotor

ANNs, for instance, by incorporating domain knowledge,


Fixed-wing
Fixed-wing

Fixed-wing

Fixed-wing
Helicopter
Helicopter

Helicopter

Helicopter

Helicopter
Quadrotor

to make the system more intelligible and trustworthy. From


implementation perspectives, we will continue our study on
UAV

the time complexity of online training for different ANN


architecture and research suitable hardware platforms for
[52] (1997)
[78] (2002)
[76] (2009)
[79] (2011)
[58] (2012)
[80] (2015)
[84] (2015)
[81] (2016)
[85] (2016)
[86] (2008)
[82] (2018)

deployment and experimental tests for MBC-ANN flight


REF

controller.
1488 J Intell Robot Syst (2020) 100:1469–1491

Table 11 Literature classification of UAV flight control. Left-Column: Training methods. Right-Column: ANN functionalities. Row: Control
techniques

Offline Online ADAUG OBSV Other

PID N/A [76] 


FL [82] [52, 78–80]  
OP [58] N/A 
AC [82] [81]  
SMC N/A [84, 85]  
BS N/A [84, 86] 

References 17. Younes, Y.A., Drak, A., Noura, H.N., Rabhi, A., hajjaji, A.E.:
Model-free control of a quadrotor vehicle. 2014 International
Conference on Unmanned Aircraft Systems (ICUAS), pp 1126–
1. Voos, H.: Nonlinear control of a quadrotor micro-uav using
1131 (2014)
feedback-linearization. 2009 IEEE International Conference on
18. Chand, A.N., Kawanishi, M., Narikiyo, T.: Non-linear model-
Mechatronics, pp 1–6 (2009)
free control of flapping wing flying robot using ipid. 2016 IEEE
2. Nicol, C., Macnab, C.JB., Ramirez-Serrano, A.: Robust adaptive
International Conference on Robotics and Automation (ICRA),
control of a quadrotor helicopter. Mechatronics 21(6), 927–938
pp 2930–2937 (2016)
(2011)
19. Yip, P.P., Hedrick, J.K.: Adaptive dynamic surface con-
3. Alexis, K., Nikolakopoulos, G., Tzes, A.: Experimental model
trol: a simplified algorithm for adaptive backstepping con-
predictive attitude tracking control of a quadrotor helicopter
trol of nonlinear systems. Int. J. Control. 71(5), 959–979
subject to wind-gusts. 18th Mediterranean Conference on Control
(1998)
and Automation, MED’10, pp 1461–1466 (2010)
20. Hunt, K.J., Sbarbaro, D., Żbikowski, R., Gawthrop, P.J.: Neural
4. Xu, R., Ozguner, U.: Sliding mode control of a quadrotor
networks for control systems—a survey. Automatica 28(6), 1083–
helicopter. Proceedings of the 45th IEEE Conference on Decision
1112 (1992)
and Control, pp 4957–4962 (2006)
21. Carrio, A., Sampedro, C., Rodriguez-Ramos, A., Campoy,
5. Das, A., Lewis, F.L., Subbarao, K.: Backstepping approach for
P.: A review of deep learning methods and applications
controlling a quadrotor using lagrange form dynamics. J. Intell.
for unmanned aerial vehicles. Journal of Sensors, 2017.
Robot. Syst. 56, 127–151 (2009)
https://doi.org/10.1155/2017/3296874 (2017)
6. Raffo, G.V., Ortega, M.G., Rubio, F.R.: An underactuated h
22. Jiang, Y., Yang, C., Na, J., Li, G., Li, Y., Zhong, J.:
infinity control strategy for a quadrotor helicopter. 2009 European
“A Brief Review of Neural Networks based Learning and
Control Conference (ECC), pp 3845–3850 (2009)
Control and their Applications for Robots,” 2017, pp 1–30.
7. Santoso, F., Garratt, M.A., Anavatti, S.G.: State-of-the-art
https://doi.org/10.1155/2017/1895897 (2017)
intelligent flight control systems in unmanned aerial vehicles.
IEEE Trans. Autom. Sci. Eng. 15(2), 613–627 (2018) 23. Hornik, K.M., Stinchcomb, M., White, H.: Multilayer feedforward
8. Prouty, R.W.: Helicopter performance, stability, and control networks are universal approximator. IEEE Trans. Neural Netw.,
(1986) vol. 2, 01 (1989)
9. Hoffmann, G., Huang, H., Waslander, S., Tomlin, C.: Quadrotor 24. Cybenko, G.: Approximations by superpositions of a sigmoidal
helicopter flight dynamics and control: Theory and experiment. In: function. Mathematics Control Signals Systems, vol. 2 (1989)
AIAA Guidance, Navigation and Control Conference and Exhibit, 25. Lazer, D., Kennedy, R., King, G., Vespignani, A.: The parable of
p. 6461 (2007) google flu: Traps in big data analysis. Science (New York, N.Y.)
10. Conyers, S.A., Rutherford, M.J., Valavanis, K.P.: An empirical 343, 1203–5 (2014). https://doi.org/10.1126/science.1248506
evaluation of ceiling effect for small-scale rotorcraft*. 2018 Inter- 26. Cheng, T., Wen, P., Li, Y.D.: Research status of artificial
national Conference on Unmanned Aircraft Systems (ICUAS), pp neural network and its application assumption in aviation. 2016
243–249 (2018) 12th International Conference on Computational Intelligence and
11. Conyers, S.A., Rutherford, M.J., Valavanis, K.P.: An empirical Security (CIS), pp 407–410 (2016)
evaluation of ground effect for small-scale rotorcraft. 2018 IEEE 27. Zhang, R., Zhang, J., Yu, H.: Review of modeling and control
International Conference on Robotics and Automation (ICRA), pp in uav autonomous maneuvering flight. 2018 IEEE International
1244–1250 (2018) Conference on Mechatronics and Automation (ICMA), pp 1920–
12. Shamma, J.S., Cloutier, J.R.: Gain-scheduled missile autopilot 1925 (2018)
design using linear parameter varying transformations. Journal of 28. Gu, W., Valavanis, K.P., Rutherford, M.J., Rizzo, A.: A survey
guidance, Control, and dynamics 16(2), 256–263 (1993) of artificial neural networks with model-based control techniques
13. Tsakalis, K.S., Ioannou, P.A.: Linear time-varying systems: for flight control of unmanned aerial vehicles. 2019 International
control and adaptation. Prentice-Hall, Inc. (1993) Conference on Unmanned Aircraft Systems (ICUAS), pp 362–371
14. Ioannou, P.A., Sun, J.: Robust adaptive control. PTR Prentice-Hall (2019)
Upper Saddle River, NJ, vol. 1 (1996) 29. McCulloch, W.S., Pitts, W.: A logical calculus of the ideas
15. Sadraey, M., Colgren, R.: Robust nonlinear controller design for immanent in nervous activity. 1943. Bulletin of mathematical
a complete uav mission. In: AIAA Guidance, Navigation, and biology 52(1-2), 99–115 (1990). discussion 73–97
Control Conference and Exhibit, p. 6687 (2006) 30. Rosenblatt, F.F.: The perceptron: a probabilistic model for
16. Fekri, S., Athans, M., Pascoal, A.: Issues , progress and new information storage and organization in the brain. Psychological
results in robust adaptive control (2006) review 65(6), 386–408 (1958)
J Intell Robot Syst (2020) 100:1469–1491 1489

31. Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning rep- 56. Martı́n, R.S., Barrientos, A., Gutiérrez, P., Cerro, J.D.: Neural
resentations by back-propagating errors. Nature 323, 533–536 networks training architecture for uav modelling. 2006 World
(1986) Automation Congress, pp 1–6 (2006)
32. Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality 57. Puttige, V.R., Anavatti, S.G.: Comparison of real-time online and
of data with neural networks. Science 313 5786, 504–7 (2006) offline neural network models for a uav. 2007 International Joint
33. Schmidhuber, J.: Deep learning in neural networks: An overview. Conference on Neural Networks, pp 412–417 (2007)
Neural networks : the official journal of the International Neural 58. Garratt, M., Anavatti, S.: Non-linear control of heave for
Network Society 61, 85–117 (2015) an unmanned helicopter using a neural network. Journal
34. Beale, M.H., Hagan, M.T., Demuth, H.B.: Neural network of Intelligent & Robotic Systems 66(4), 495–504 (2012).
toolboxTM user’s guide. The Mathworks Inc (1992) https://doi.org/10.1007/s10846-011-9634-9. https://doi.org/10.
35. Broomhead, D.S., Lowe, D.: Multivariable functional interpola- 1007/s10846-011-9634-9
tion and adaptive networks. Complex Systems, vol. 2 (1988) 59. Bhandari, S., Raheja, A., Tang, D., Ortega, K., Dadian, O.,
36. Zhang, Q., Benveniste, A.: Wavelet networks. IEEE transactions Bettadapura, A.: Nonlinear control of uavs using multi-layer
on neural networks 3(6), 889–98 (1992) perceptrons with off-line and on-line learning. 2014 American
37. Hahnloser, R.H.R., Sarpeshkar, R., Mahowald, M.A., Douglas, Control Conference, pp 2875–2880 (2014)
R.J., Seung, H.S.: Digital selection and analogue amplification 60. Vargas, A., Ireland, M., Anderson, D.: System identification of
coexist in a cortex-inspired silicon circuit. Nature 405, 947–951 multi-rotor uavs using echo state networks (2015)
(2000) 61. Mohajerin, N., Waslander, S.L.: Modelling a quadrotor vehicle
38. Albus, J.S.: I a new approach to manipulator control: The i using a modular deep recurrent neural network. 2015 IEEE
cerebellar model articulation controller (1975) International Conference on Systems, Man, and Cybernetics,
39. Hopfield, J.J.: Neural networks and physical systems with pp 376–381 (2015)
emergent collective computational abilities. Proc. Natl. Acad. Sci. 62. Mohajerin, N., Waslander, S.L.: Modular deep recurrent neural
U.S.A. 79(8), 2554–8 (1982) network: Application to quadrotors. 2014 IEEE International
40. Elman, J.L.: Finding structure in time. Cognitive science 14(2), Conference on Systems, Man, and Cybernetics (SMC), pp 1374–
179–211 (1990) 1379 (2014)
41. Jaeger, H.: The”echo state”approach to analysing and training 63. Bengio, Y., Simard, P., Frasconi, P.: Learning long-term dependen-
recurrent neural networks (2001) cies with gradient descent is difficult. IEEE transactions on neural
42. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural networks 5(2), 157–166 (1994)
Comput. 9, 1735–1780 (1997) 64. Punjani, A., Abbeel, P.: Deep learning helicopter dynamics
43. Hochreiter, S., Bengio, Y.: Gradient flow in recurrent nets: the models. 2015 IEEE International Conference on Robotics and
difficulty of learning long-term dependencies (2001) Automation (ICRA), pp 3223–3230 (2015)
44. Cho, K., van Merrienboer, B., Bahdanau, D., Bengio, Y.: On 65. Takens, F.: Detecting strange attractors in turbulence (1981)
the properties of neural machine translation: Encoder-decoder 66. Robinson, J.C.: A topological delay embedding theorem for
approaches. ArXiv:1409.1259 (2014) infinite-dimensional dynamical systems (2005)
45. Hodgkin, A.L., Huxley, A.F.: A quantitative description of 67. Bansal, S., Akametalu, A.K., Jiang, F.J., Laine, F., Tomlin, C.J.:
membrane current and its application to conduction and excitation Learning quadrotor dynamics using neural network for flight
in nerve. The Journal of physiology 117 4, 500–44 (1952) control. 2016 IEEE 55th Conference on Decision and Control
46. Gerstner, W., Kistler, W.M.: Spiking neuron models: Single (CDC), pp 4653–4660 (2016)
neurons, populations, plasticity (2002) 68. Dadian, O., Bhandari, S., Raheja, A.: A recurrent neural network
47. Werbos, P.J.: Backpropagation through time: What it does and how for nonlinear control of a fixed-wing uav. In: American Control
to do it (1990) Conference (ACC), 2016, pp. 1341–1346, IEEE (2016)
48. Williams, R.J., Zipser, D.: A learning algorithm for continually 69. Lukoševičius, M.: A practical guide to applying echo state
running fully recurrent neural networks. Neural Comput. 1, 270– networks. In: Neural networks: Tricks of the trade, pp. 659–686.
280 (1989) Springer (2012)
49. Sierra, J.E., Peñas, M.S.: Modelling engineering systems using 70. Teng, Y.-F., Hu, B., Liu, Z.-W., Huang, J., Guan, Z.-H.: Adaptive
analytical and neural techniques: Hybridization. Neurocomputing neural network control for quadrotor unmanned aerial vehicles.
271, 70–83 (2018) 2017 11th Asian Control Conference (ASCC), pp 988–992
50. Struik, D.J.: A source book in mathematics, 1200-1800, Harvard (2017)
University Press, vol. 11 (1969) 71. Mohajerin, N., Mozifian, M., Waslander, S.L.: Deep learning a
51. Alvarenga, J., Vitzilaios, N.I., Valavanis, K.P., Rutherford, M.J.: quadrotor dynamic model for multi-step prediction. 2018 IEEE
Survey of unmanned helicopter model-based navigation and International Conference on Robotics and Automation (ICRA),pp
control techniques. Journal of Intelligent & Robotic Systems 2454–2459 (2018)
80(1), 87–138 (2015) 72. Gori, M., Tesi, A.: On the problem of local minima in
52. Kim, B.S., Calise, A.J.: Nonlinear flight control using neural backpropagation. IEEE Trans. Pattern Anal. Mach. Intell. 14,
networks. J. Guid. Control. Dyn. 20(1), 26–33 (1997) 76–86 (1992)
53. Mel, B.W., Koch, C.: Sigma-pi learning: On radial basis functions 73. Karpatne, A., Watkins, W., Read, J., Kumar, V.: Physics-guided
and cortical associative learning. In: NIPS (1989) neural networks (pgnn): An application in lake temperature
54. Nguyen, N.T., Krishnakumar, K.S., Kaneshige, J., Nespeca, P.: modeling. arXiv:1710.11431 (2017)
Dynamics and adaptive control for stability recovery of damaged 74. Jia, X., Willard, J., Karpatne, A., Read, J.S., Zward, J., Steinbach,
asymmetric aircraft (2006) M., Kumar, V.: Physics guided rnns for modeling dynamical
55. Martin, R.S., Barrientos, A., Gutiérrez, P., del Cerro, J.: systems: A case study in simulating lake temperature profiles.
Unmanned aerial vehicle (uav) modelling based on supervised ArXiv:1810.13075 (2018)
neural networks. Proceedings 2006 IEEE International Confer- 75. Karpatne, A., Atluri, G., Faghmous, J.H., Steinbach, M., Banerjee,
ence on Robotics and Automation, 2006. ICRA 2006, pp 2497– A., Ganguly, A.R., Shekhar, S., Samatova, N.F., Kumar, V.:
2502 (2006) Theory-guided data science: A new paradigm for scientific
1490 J Intell Robot Syst (2020) 100:1469–1491

discovery from data. IEEE Trans. Knowl. Data Eng. 29, 2318– Kimon P. Valavanis is John Evans Professor and Director of
2331 (2016) Research and Innovation in the D. F. Ritchie School of Engineering
76. Yang, T.-C., Juang, J.-G.: Aircraft landing control based on and Computer Science at the University of Denver. He is also
adaptive cmac. 2009 IEEE/ASME International Conference on Director of the DU Unmanned Systems Research Institute (DU2SRI).
Advanced Intelligent Mechatronics, pp 791–796 (2009) He holds a Guest Professor appointment in the Department
77. Leitner, J.A., Calise, A.J., Prasad, J.V.R.: Analysis of adaptive of Telecommunications, Faculty of Electrical Engineering and
neural networks for helicopter flight control (1995) Computing at the University of Zagreb, Croatia. His research interests
78. Kannan, S.K., Johnson, E.N.: Adaptive trajectory based control span the areas of intelligent control, robotics and automation, and
for autonomous helicopters (2002) distributed intelligent systems, focusing on: integrated control and
79. Chowdhary, G.V., Johnson, E.N.: Theory and flight-test validation diagnostics of unmanned systems; modeling and formation control
of a concurrent-learning adaptive controller. J. Guid. Control. of cooperative robot teams; navigation/control of unmanned aerial
Dyn. 34(2), 592–607 (2011) vehicles; modeling, design and development of complex mechatronic
80. Ocaña, D.T., Shin, H.-S., Tsourdos, A.: Development of a systems; design of the next generation of unmanned systems;
nonlinear reconfigurable f-16 model and flight control systems mathematical theories for intelligent machines. He has published more
using multilayer adaptive neural networks. IFAC-PapersOnLine than 400 journal and conference papers, book chapters, including 19
48(9), 138–143 (2015) books. He is Fellow of the American Association for the Advancement
81. Lai, G., Liu, Z.D., Zhang, Y., Chen, C.L.P.: Adaptive posi- of Science (AAAS), Senior Member of IEEE, Editor-in-Chief of the
tion/attitude tracking control of aerial robot with unknown inertial Journal of Intelligent and Robotic Systems (Springer), and Fulbright
matrix based on a new robust neural identifier. IEEE Trans- Scholar.
actions on Neural Networks and Learning Systems 27, 18–31
(2016)
82. Eroglu, B., Sahin, C., Yuksek, B., Ure, N.K., Inalhan, G.: Deep Matthew J. Rutherford is an Associate Professor in the Department
recurrent and convolutional networks for accelerated fault tolerant of Computer Science with a joint appointment in the Department of
adaptive flight control under severe failures. In: 2018 Annual Electrical and Computer Engineering at the University of Denver. He
American Control Conference (ACC), pp. 6559–6565 (2018) is also a Deputy Director of the DU Unmanned Systems Research
83. Akella, M.R., Junkinst, J.L., Robinett, R.D.: Structured model Institute (DU2SRI). His research portfolio includes: the development
reference adaptive control with actuator saturation limits (1998) of advanced controls and communication mechanisms for autonomous
84. Peng, C., Bai, Y., Gong, X., Gao, Q., Zhao, C., Tian, Y.: Modeling aerial and ground robots; applications of real-time computer vision to
and robust backstepping sliding mode control with adaptive robotics problems using GPU-based parallel processing; testing and
rbfnn for a novel coaxial eight-rotor uav. IEEE/CAA Journal of dynamic evaluation of embedded, real-time systems; development of
Automatica Sinica 2, 56–64 (2015) complex mechatronic systems (mechanical, electrical, and software);
85. Li, S., Wang, Y., Tan, J., Zheng, Y.: Adaptive rbfnns/integral the development of software techniques to reduce the amount of
sliding mode control for a quadrotor aircraft. Neurocomputing energy being consumed by hardware; development of a high-precision
216, 126–134 (2016) propulsion system for underwater robots.
86. Madani, T., Benallegue, A.: Adaptive control via backstepping
technique and neural networks of a quadrotor helicopter. IFAC
Proceedings Volumes 41(2), 6513–6518 (2008) Alessandro Rizzo is an Associate Professor in the Department of
87. Howard, D., Elfes, A.: Evolving spiking networks for turbulence- Electronics and Telecommunications at Politecnico di Torino, Italy.
tolerant quadrotor control (2014) He received the Laurea degree (summa cum laude) in computer
88. Clawson, T.S., Ferrari, S., Fuller, S.B., Wood, R.J.: Spiking neural engineering and the Ph.D. degree in automation and electronics
network (snn) control of a flapping insect-scale robot. 2016 IEEE engineering from the University of Catania, Italy, in 1996, and 2000,
55th Conference on Decision and Control (CDC), pp 3381–3388 respectively. In 1998, he worked as EURATOM Research Fellow
(2016) at JET Joint Undertaking, Abingdon, U.K., researching on sensor
89. Lobo, J.L., Ser, J.D., Bifet, A., Kasabov, N.: Spiking neural validation and fault diagnosis for nuclear fusion experiments. In 2000
networks and online learning: An overview and perspectives. and 2001, he worked as Research Consultant to ST Microelectronics,
Neural networks : the official journal of the International Neural Catania Site, Italy, and as Industry Professor of Robotics at the
Network Society 121, 88–100 (2020) University of Messina, Italy. From 2002 to 2015 he was a tenured
Assistant Professor at Politecnico di Bari, Italy. In November 2015, he
joined Politecnico di Torino. From 2012, he has also been a Visiting
Publisher’s Note Springer Nature remains neutral with regard to Professor at New York University Tandon School of Engineering,
jurisdictional claims in published maps and institutional affiliations. Brooklyn NY, USA. Prof. Rizzo is engaged in conducting and
supervising research on complex networks and systems, modeling and
control of nonlinear systems, cooperative robotics. He is author of
Weibin Gu received the BSc in Mechanical Engineering from both two books, two international patents, and more than 150 papers on
Tongji University, China and Politecnico di Torino, Italy (dual degree international journals and conference proceedings. Prof. Rizzo has
program), and the MSc in Mechatronic Engineering from Politecnico been the recipient of the award for the best application paper at the
di Torino. He is currently a flight control engineer at the R&D IFAC world triennial conference in 2002 and of the award for the most
Department of Shanghai Foia Co., China, and actively collaborates read papers in Mathematics and Computers in Simulation (Elsevier)
with the University of Denver, USA, and Politecnico di Torino. His in 2009. He is also a Distinguished Lecturer of the IEEE Nuclear and
research interests are in modeling and control of unmanned aerial Plasma Science Society and one of the recipients of the 2019 Amazon
vehicles. Research Awards.
J Intell Robot Syst (2020) 100:1469–1491 1491

Affiliations
Weibin Gu1 · Kimon P. Valavanis1 · Matthew J. Rutherford1 · Alessandro Rizzo2

Kimon P. Valavanis
Kimon.Valavanis@du.edu
Matthew J. Rutherford
Matthew.Rutherford@du.edu
Alessandro Rizzo
alessandro.rizzo@polito.it
1 Department of Electrical and Computer Engineering, DU
Unmanned Systems Research Institute (DU2SRI), Daniel Felix
Ritchie School of Engineering and Computer Science, University
of Denver, 2155 E Wesley Ave, Denver, CO 80210, USA
2 Department of Electronics and Telecommunications, Politecnico
di Torino, Corso Duca degli Abruzzi, 24, 10129 Torino TO, Italy

You might also like